Patents by Inventor Ya-Qin Zhang

Ya-Qin Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20050074065
    Abstract: A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a wavelet transform is applied to produce coefficients within different sub-bands. The wavelet coefficients are coded independently for each sub-band to permit easy separation at a decoder, making resolution scalability and temporal scalability natural and easy. In particular, the coefficients are assigned various contexts based on the significance of neighboring samples in previous, current, and next frame, thereby taking advantage of any motion information between frames.
    Type: Application
    Filed: November 9, 2004
    Publication date: April 7, 2005
    Applicant: Microsoft Corporation
    Inventors: Jizheng Xu, Shipeng Li, Ya-Qin Zhang
  • Publication number: 20050069036
    Abstract: A scalable layered video coding scheme that encodes video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video, adds error resilience to the enhancement layer. Unique resynchronization marks are inserted into the enhancement layer bitstream in headers associated with each video packet, headers associated with each bit plane, and headers associated with each video-of-plane (VOP) segment. Following transmission of the enhancement layer bitstream, the decoder tries to detect errors in the packets. Upon detection, the decoder seeks forward in the bitstream for the next known resynchronization mark. Once this mark is found, the decoder is able to begin decoding the next video packet. With the addition of many resynchronization marks within each frame, the decoder can recover very quickly and with minimal data loss in the event of a packet loss or channel error in the received enhancement layer i bitstream.
    Type: Application
    Filed: October 29, 2004
    Publication date: March 31, 2005
    Applicant: Microsoft Corporation
    Inventors: Ya-Qin Zhang, Shipeng Li, Feng Wu, Rong Yan
  • Publication number: 20050063463
    Abstract: A scalable layered video coding scheme that encodes video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video, adds error resilience to the enhancement layer. Unique resynchronization marks are inserted into the enhancement layer bitstream in headers associated with each video packet, headers associated with each bit plane, and headers associated with each video-of-plane (VOP) segment. Following transmission of the enhancement layer bitstream, the decoder tries to detect errors in the packets. Upon detection, the decoder seeks forward in the bitstream for the next known resynchronization mark. Once this mark is found, the decoder is able to begin decoding the next video packet. With the addition of many resynchronization marks within each frame, the decoder can recover very quickly and with minimal data loss in the event of a packet loss or channel error in the received enhancement layer bitstream.
    Type: Application
    Filed: October 29, 2004
    Publication date: March 24, 2005
    Applicant: Microsoft Corporation
    Inventors: Ya-Qin Zhang, Shipeng Li, Feng Wu, Rong Yan
  • Publication number: 20050041745
    Abstract: A scalable layered video coding scheme that encodes video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video, adds error resilience to the enhancement layer. Unique resynchronization marks are inserted into the enhancement layer bitstream in headers associated with each video packet, headers associated with each bit plane, and headers associated with each video-of-plane (VOP) segment. Following transmission of the enhancement layer bitstream, the decoder tries to detect errors in the packets. Upon detection, the decoder seeks forward in the bitstream for the next known resynchronization mark. Once this mark is found, the decoder is able to begin decoding the next video packet. With the addition of many resynchronization marks within each frame, the decoder can recover very quickly and with minimal data loss in the event of a packet loss or channel error in the received enhancement layer bitstream.
    Type: Application
    Filed: September 20, 2004
    Publication date: February 24, 2005
    Applicant: Microsoft Corporation
    Inventors: Ya-Qin Zhang, Shipeng Li, Feng Wu, Rong Yan
  • Patent number: 6850564
    Abstract: An apparatus and method for dynamically varying the frame rate of an image sequence is disclosed. In one embodiment, the image sequence is encoded and stored at different frame rates (e.g., 30, 25, 20 fps and so on). Alternatively, only the motion information, e.g., motion vectors, for the other frame rates are stored.
    Type: Grant
    Filed: August 31, 1998
    Date of Patent: February 1, 2005
    Assignee: Sarnoff Corporation
    Inventors: Sassan Pejhan, Ya-Qin Zhang, Tihao Chiang
  • Publication number: 20050008076
    Abstract: A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a 3-D wavelet transform is applied in two parts. First, a temporal 1-D wavelet transform is applied to the corresponding pixels along the motion trajectories in a time direction. The temporal wavelet transform produces decomposed frames of temporal wavelet transforms, where the spatial correlation within each frame is well preserved. Second, a spatial 2-D wavelet transform is applied to all frames containing the temporal wavelet coefficients. The wavelet transforms produce coefficients within different sub-bands. The process then codes wavelet coefficients.
    Type: Application
    Filed: July 29, 2004
    Publication date: January 13, 2005
    Applicant: Microsoft Corporation
    Inventors: Jizheng Xu, Shipeng Li, Ya-Qin Zhang
  • Publication number: 20050002457
    Abstract: A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a 3-D wavelet transform is applied in two parts. First, a temporal 1-D wavelet transform is applied to the corresponding pixels along the motion trajectories in a time direction. The temporal wavelet transform produces decomposed frames of temporal wavelet transforms, where the spatial correlation within each frame is well preserved. Second, a spatial 2-D wavelet transform is applied to all frames containing the temporal wavelet coefficients. The wavelet transforms produce coefficients within different sub-bands. The process then codes wavelet coefficients.
    Type: Application
    Filed: July 29, 2004
    Publication date: January 6, 2005
    Applicant: Microsoft Corporation
    Inventors: Jizheng Xu, Shipeng Li, Ya-Qin Zhang
  • Publication number: 20040264567
    Abstract: A video encoding system and method utilizes a three-dimensional (3-D) wavelet transform and entropy coding that utilize motion information in a way to reduce the sensitivity to motion. In one implementation, the coding process initially estimates motion trajectories of pixels in a video object from frame to frame in a video sequence to account for motion of the video object throughout the frames. After motion estimation, a 3-D wavelet transform is applied in two parts. First, a temporal 1-D wavelet transform is applied to the corresponding pixels along the motion trajectories in a time direction. The temporal wavelet transform produces decomposed frames of temporal wavelet transforms, where the spatial correlation within each frame is well preserved. Second, a spatial 2-D wavelet transform is applied to all frames containing the temporal wavelet coefficients. The wavelet transforms produce coefficients within different sub-bands. The process then codes wavelet coefficients.
    Type: Application
    Filed: July 29, 2004
    Publication date: December 30, 2004
    Applicant: Microsoft Corporation
    Inventors: Jizheng Xu, Shipeng Li, Ya-Qin Zhang
  • Publication number: 20040258157
    Abstract: A video coding system and method utilizes a 3-D wavelet transform that is memory efficient and reduces boundary effect across frame boundaries. The transform employs a lifting-based scheme and buffers wavelet coefficients at intermediate lifting steps towards the end of one GOP (group of pictures) until intermediate coefficients from the beginning of the next GOP are available. The wavelet transform scheme does not physically break the video sequence into GOPs, but processes the sequence without intermission. In this manner, the system simulates an infinite wavelet transformation across frame boundaries and the boundary effect is significantly reduced or essentially eliminated. Moreover, the buffering is very small and the scheme can be used to implement other decomposition structures. The wavelet transform scheme provides superb video playback quality with little or no boundary effects.
    Type: Application
    Filed: August 6, 2004
    Publication date: December 23, 2004
    Applicant: Microsoft Corporation
    Inventors: Jizheng Xu, Shipeng Li, Zixiang Xiong, Ya-Qin Zhang
  • Publication number: 20040252886
    Abstract: Automatic video object extraction that defines substantially precise objects is disclosed. In one embodiment, color segmentation and motion segmentation are performed on a source video. The color segmentation segments the video by substantially uniform color regions thereof. The motion segmentation segments the video by moving regions thereof. The color regions and the moving regions are then combined to define the video objects. In varying embodiments, pre-processing and post-processing is performed to further clean the source video and the video objects defined, respectively.
    Type: Application
    Filed: July 13, 2004
    Publication date: December 16, 2004
    Applicant: Microsoft Corporation
    Inventors: Jinhui Pan, Shipeng Li, Ya-Qin Zhang
  • Patent number: 6816194
    Abstract: A scalable layered video coding scheme that encodes video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video, adds error resilience to the enhancement layer. Unique resynchronization marks are inserted into the enhancement layer bitstream in headers associated with each video packet, headers associated with each bit plane, and headers associated with each video-of-plane (VOP) segment. Following transmission of the enhancement layer bitstream, the decoder tries to detect errors in the packets. Upon detection, the decoder seeks forward in the bitstream for the next known resynchronization mark. Once this mark is found, the decoder is able to begin decoding the next video packet. With the addition of many resynchronization marks within each frame, the decoder can recover very quickly and with minimal data loss in the event of a packet loss or channel error in the received enhancement layer bitstream.
    Type: Grant
    Filed: February 16, 2001
    Date of Patent: November 9, 2004
    Assignee: Microsoft Corporation
    Inventors: Ya-Qin Zhang, Shipeng Li, Feng Wu, Rong Yan
  • Publication number: 20040190617
    Abstract: An accelerated video decoding system utilizes a graphics processing unit to perform motion compensation, image reconstruction, and color space conversion processes, while utilizing a central processing unit to perform other decoding processes.
    Type: Application
    Filed: March 28, 2003
    Publication date: September 30, 2004
    Applicant: MICROSOFT CORPORATION
    Inventors: Guobin Shen, Lihua Zhu, Shipeng Li, Ya-Qin Zhang, Richard F. Rashid
  • Patent number: 6795504
    Abstract: A video coding system and method utilizes a 3-D wavelet transform that is memory efficient and reduces boundary effect across frame boundaries. The transform employs a lifting-based scheme and buffers wavelet coefficients at intermediate lifting steps towards the end of one GOP (group of pictures) until intermediate coefficients from the beginning of the next GOP are available. The wavelet transform scheme does not physically break the video sequence into GOPs, but processes the sequence without intermission. In this manner, the system simulates an infinite wavelet transformation across frame boundaries and the boundary effect is significantly reduced or essentially eliminated. Moreover, the buffering is very small and the scheme can be used to implement other decomposition structures. The wavelet transform scheme provides superb video playback quality with little or no boundary effects.
    Type: Grant
    Filed: June 21, 2000
    Date of Patent: September 21, 2004
    Assignee: Microsoft Corporation
    Inventors: Jizheng Xu, Shipeng Li, Zixiang Xiong, Ya-Qin Zhang
  • Patent number: 6785329
    Abstract: Automatic video object extraction that defines substantially precise objects is disclosed. In one embodiment, color segmentation and motion segmentation are performed on a source video. The color segmentation segments the video by substantially uniform color regions thereof. The motion segmentation segments the video by moving regions thereof. The color regions and the moving regions are then combined to define the video objects. In varying embodiments, pre-processing and post-processing is performed to further clean the source video and the video objects defined, respectively.
    Type: Grant
    Filed: December 21, 1999
    Date of Patent: August 31, 2004
    Assignee: Microsoft Corporation
    Inventors: Jinhui Pan, Shipeng Li, Ya-Qin Zhang
  • Publication number: 20040071358
    Abstract: An image distribution system has a source that encodes digital images and transmits them over an error-prone channel to a destination. The source has an image coder that processes the digital images using vector transformation followed by vector quantization. This produces groups of vectors and quantized values that are representative of the images. The image coder orders the vectors in the codebooks and assigns vector indexes to the vectors such that a bit error occurring at a less significant bit in a vector index results in less distortion than a bit error occurring at a more significant bit. Depending upon the format and the capabilities of the source and destination, the image coder may allocate different numbers of bits to different groups of vectors according to a bit allocation map for this allocation process. The source also has a UEP (Unequal Error Protection) coder that layers the vector indexes according to their significance.
    Type: Application
    Filed: July 23, 2003
    Publication date: April 15, 2004
    Inventors: Jun Xin, Shipeng Li, Ya-Qin Zhang
  • Patent number: 6700933
    Abstract: A video encoding scheme employs progressive fine-granularity layered coding to encode video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video. Some of the enhancement layers in a current frame are predicted from at least one same or lower quality layer in a reference frame, whereby the lower quality layer is not necessarily the base layer. Use of multiple reference layers of different quality results in occasional fluctuations in the encoded image data. The video encoding scheme efficiently eliminates such fluctuations by predicting higher quality data from the lower quality data encoded in the base layer and a low quality enhancement layer.
    Type: Grant
    Filed: February 15, 2000
    Date of Patent: March 2, 2004
    Assignee: Microsoft Corporation
    Inventors: Feng Wu, Shipeng Li, Ya-Qin Zhang
  • Patent number: 6690833
    Abstract: A method and apparatus for selecting a quantizer scale for each macroblock within a frame to optimize the coding rate is presented. A quantizer scale is selected for each macroblock within each frame such that the target bit rate for the frame is achieved while maintaining a uniform visual quality over the entire frame.
    Type: Grant
    Filed: June 18, 1998
    Date of Patent: February 10, 2004
    Assignee: Sarnoff Corporation
    Inventors: Tihao Chiang, Hung-Ju Lee, Ya-Qin Zhang
  • Publication number: 20040005095
    Abstract: A video encoding scheme employs progressive fine-granularity layered coding to encode video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video. Some of the enhancement layers in a current frame are predicted from at least one lower quality layer in a reference frame, whereby the lower quality layer is not necessarily the base layer.
    Type: Application
    Filed: July 2, 2003
    Publication date: January 8, 2004
    Inventors: Feng Wu, Shipeng Li, Ya-Qin Zhang
  • Patent number: 6621935
    Abstract: An image distribution system has a source that encodes digital images and transmits them over an error-prone channel to a destination. The source has an image coder that processes the digital images using vector transformation followed by vector quantization. This produces groups of vectors and quantized values that are representative of the images. The image coder orders the vectors in the codebooks and assigns vector indexes to the vectors such that a bit error occurring at a less significant bit in a vector index results in less distortion than a bit error occurring at a more significant bit. Depending upon the format and the capabilities of the source and destination, the image coder may allocate different numbers of bits to different groups of vectors according to a bit allocation map for this allocation process. The source also has a UEP (Unequal Error Protection) coder that layers the vector indexes according to their significance.
    Type: Grant
    Filed: May 1, 2000
    Date of Patent: September 16, 2003
    Assignee: Microsoft Corporation
    Inventors: Jun Xin, Shipeng Li, Ya-Qin Zhang
  • Patent number: 6614936
    Abstract: A video encoding scheme employs progressive fine-granularity layered coding to encode video data frames into multiple layers, including a base layer of comparatively low quality video and multiple enhancement layers of increasingly higher quality video. Some of the enhancement layers in a current frame are predicted from at least one lower quality layer in a reference frame, whereby the lower quality layer is not necessarily the base layer.
    Type: Grant
    Filed: December 3, 1999
    Date of Patent: September 2, 2003
    Assignee: Microsoft Corporation
    Inventors: Feng Wu, Shipeng Li, Ya-Qin Zhang