Patents by Inventor Sanjeev Mehrotra

Sanjeev Mehrotra has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Frequency segmentation to obtain bands for efficient coding of digital media

Publication number: 20070016412

Abstract: Frequency segmentation is important to the quality of encoding spectral data. Segmentation involves breaking the spectral data into units called sub-bands or vectors. Homogeneous segmentation may be suboptimal. Various features are described for providing spectral data intensity dependent segmentation. Finer segmentation is provided for regions of greater spectral variance and coarser segmentation is provided for more homogeneous regions. Sub-bands which have similar characteristics may be merged with very little effect on quality, whereas sub-bands with highly variable data may be better represented if a sub-band is split. Various methods are described for measuring tonality, energy, or shape of a sub-band. These various measurements are discussed in light of making decisions of when to split or merge sub-bands to provide variable frequency segmentation.

Type: Application

Filed: July 15, 2005

Publication date: January 18, 2007

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Intra compression of pixel blocks using predicted mean

Patent number: 7162091

Abstract: An apparatus and method for encoding video frames is provided. The video frames are divided into blocks for encoding. Encoding of the video blocks utilizes motion detection, motion estimation and adaptive compression, to obtain the desired compression for a particular bit rate. Adaptive compression includes intra compression (without regard to other frames) and inter compression (with regard to other frames). Intra compression, inter compression with motion detection, and inter compression with motion estimation are performed on a block by block basis, as needed. Segmentation is provided to compare encoding of a block with encoding of its sub-blocks, and to select the best block size for encoding.

Type: Grant

Filed: March 31, 2003

Date of Patent: January 9, 2007

Assignee: Microsoft Corporation

Inventors: Albert S. Wang, Sanjeev Mehrotra
Text detection in continuous tone image segments

Patent number: 7085420

Abstract: For encoding of mixed-mode images containing text and continuous-tone content, the pixels in the image that form the text content are detected and separated. Text detection classifies pixels as text or continuous tone content by accumulating pixel counts for groups of contiguous, non-smooth pixels with the same color. Groups whose pixel count exceeds a threshold are classified as text. The text detection technique further reduces classification errors by testing for boundary dimensions and pixel density of the group characteristic of long straight lines or large borders. The text detection technique further searches the neighborhood of groups qualifying as text for pixels of the same color, so as to also detect pixels for isolated text marks like dots, accents or punctuation. The separated text and continuous-tone content can be encoded separately for efficient compression while preserving text quality, and the text again superimposed on the continuous tone content at decompression.

Type: Grant

Filed: June 28, 2002

Date of Patent: August 1, 2006

Assignee: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Segmentation of digital video and images into continuous tone and palettized regions

Patent number: 7072512

Abstract: For encoding of mixed-content images containing palettized and continuous-tone content, continuous tone content regions in the image are detected and separated. Continuous tone content segmentation classifies pixels as continuous tone content by counting a number of unique pixel values within a pixel neighborhood. Pixels whose count exceeds a threshold are classified as continuous tone content. The technique further scans the image for regions of high continuous tone pixel density. The segmented continuous-tone and palettized content can be encoded separately for efficient compression, and then reassembled at decompression.

Type: Grant

Filed: July 23, 2002

Date of Patent: July 4, 2006

Assignee: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Matrix multiplication in a Galois field for forward error correction

Patent number: 7047480

Abstract: Transreceived packages use forward error correction (FEC) with matrix multiplication in a Galois field of size P (GF(P)) and contain at least a portion of K rows of matrix B having elements Bk,m in M columns. Packages include matrix C having elements Cn,m for FEC for the K rows. Matrix C has 0 to (N?1)th rows redundant with matrix B data. Elements Cn,m are computed by XOR'ing GExp[(GLog[An,k]+GLog[Bk,m]) mod (P?1)] for k from 0 to (K?1). Matrix A has elements An,k with N rows and K columns. GExp and GLog are one-dimensional arrays. Matrix A is chosen so up to N rows of B and C (total) can be lost, and B can be recovered. An inverse matrix D is computed from A with the rows of B and C. B is reconstructed from D and the received rows of B and C using another matrix multiplication.

Type: Grant

Filed: November 12, 2002

Date of Patent: May 16, 2006

Assignee: Microsoft Corporation

Inventors: Andrew Kadatch, Sanjeev Mehrotra
Rate allocation for mixed content video

Publication number: 20060045368

Abstract: The present invention relates to regulating the quality and/or bitrate of content within mixed content video when the video is compressed subject to a bitrate constraint. For example, a screen capture encoder encodes palletized content within a frame of screen capture video. Subject to an overall bitrate constraint, the encoder then allocates bits for continuous tone content within the frame. By controlling the allocation of bits used to encode the continuous tone content, the encoder regulates bitrate for the continuous tone content. This in turn can allow the encoder to regulate spatial quality and/or overall temporal quality for the video. In one scenario, for screen capture video encoded to a (relatively) constant overall bitrate, the screen capture encoder reduces the bitrate (and quality) of the continuous tone content, instead spending bits to increase the overall frame rate of the video.

Type: Application

Filed: October 26, 2005

Publication date: March 2, 2006

Applicant: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Receiver-driven layered error correction multicast over heterogeneous packet networks

Patent number: 6996097

Abstract: A system and method for correcting errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet. This is accomplished by augmenting RLM with one or more layers of error correction information. This allows each receiver to separately optimize the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more multicasted error correction data streams (i.e., layers) associated therewith. Each of the error correction layers would contain information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission.

Type: Grant

Filed: May 21, 1999

Date of Patent: February 7, 2006

Assignee: Microsoft Corporation

Inventors: Philip A. Chou, Albert S. Wang, Sanjeev Mehrotra, Alexander E. Mohr
Rate allocation for mixed content video

Patent number: 6980695

Abstract: The present invention relates to regulating the quality and/or bitrate of content within mixed content video when the video is compressed subject to a bitrate constraint. For example, a screen capture encoder encodes palletized content within a frame of screen capture video. Subject to an overall bitrate constraint, the encoder then allocates bits for continuous tone content within the frame. By controlling the allocation of bits used to encode the continuous tone content, the encoder regulates bitrate for the continuous tone content. This in turn can allow the encoder to regulate spatial quality and/or overall temporal quality for the video. In one scenario, for screen capture video encoded to a (relatively) constant overall bitrate, the screen capture encoder reduces the bitrate (and quality) of the continuous tone content, instead spending bits to increase the overall frame rate of the video.

Type: Grant

Filed: June 28, 2002

Date of Patent: December 27, 2005

Assignee: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Intra compression of pixel blocks using predicted mean

Publication number: 20050259877

Abstract: An apparatus and method for encoding video frames is provided. The video frames are divided into blocks for encoding. Encoding of the video blocks utilizes motion detection, motion estimation and adaptive compression, to obtain the desired compression for a particular bit rate. Adaptive compression includes intra compression (without regard to other frames) and inter compression (with regard to other frames). Intra compression, inter compression with motion detection, and inter compression with motion estimation are performed on a block by block basis, as needed. Segmentation is provided to compare encoding of a block with encoding of its sub-blocks, and to select the best block size for encoding.

Type: Application

Filed: March 31, 2003

Publication date: November 24, 2005

Inventors: Albert Wang, Sanjeev Mehrotra
Receiver-driven layered error correction multicast over heterogeneous packet networks

Publication number: 20050249211

Abstract: A system and method for correcting errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet. This is accomplished by augmenting RLM with one or more layers of error correction information. This allows each receiver to separately optimize the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more multicasted error correction data streams (i.e., layers) associated therewith. Each of the error correction layers would contain information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission.

Type: Application

Filed: July 8, 2005

Publication date: November 10, 2005

Applicant: Microsoft Corporation

Inventors: Philip Chou, Albert Wang, Sanjeev Mehrotra, Alexander Mohr
Receiver-driven layered error correction multicast over heterogeneous packet networks

Publication number: 20050204242

Abstract: A system and method for correcting errors and losses occurring during a receiver-driven layered multicast (RLM) of real-time media over a heterogeneous packet network such as the Internet. This is accomplished by augmenting RLM with one or more layers of error correction information. This allows each receiver to separately optimize the quality of received audio and video information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM would have one or more multicasted error correction data streams (i.e., layers) associated therewith. Each of the error correction layers would contain information that can be used to replace lost packets from the associated source layer. More than one error correction layer is proposed as some of the error correction packets contained in the data stream needed to replace the packets lost in the associated source stream may themselves be lost in transmission.

Type: Application

Filed: April 18, 2005

Publication date: September 15, 2005

Applicant: Microsoft Corporation

Inventors: Philip Chou, Albert Wang, Sanjeev Mehrotra, Alexander Mohr
Efficient coding of digital media spectral data using wide-sense perceptual similarity

Publication number: 20050165611

Abstract: Traditional audio encoders may conserve coding bit-rate by encoding fewer than all spectral coefficients, which can produce a blurry low-pass sound in the reconstruction. An audio encoder using wide-sense perceptual similarity improves the quality by encoding a perceptually similar version of the omitted spectral coefficients, represented as a scaled version of already coded spectrum. The omitted spectral coefficients are divided into a number of sub-bands. The sub-bands are encoded as two parameters: a scale factor, which may represent the energy in the band; and a shape parameter, which may represent a shape of the band. The shape parameter may be in the form of a motion vector pointing to a portion of the already coded spectrum, an index to a spectral shape in a fixed code-book, or a random noise vector. The encoding thus efficiently represents a scaled version of a similarly shaped portion of spectrum to be copied at decoding.

Type: Application

Filed: June 29, 2004

Publication date: July 28, 2005

Applicant: Microsoft Corporation

Inventors: Sanjeev Mehrotra, Wei-Ge Chen
Entropy coding by adapting coding between level and run-length/level modes

Publication number: 20050015249

Abstract: An audio encoder performs adaptive entropy encoding of audio data. For example, an audio encoder switches between variable dimension vector Huffman coding of direct levels of quantized audio data and run-level coding of run lengths and levels of quantized audio data. The encoder can use, for example, context-based arithmetic coding for coding run lengths and levels. The encoder can determine when to switch between coding modes by counting consecutive coefficients having a predominant value (e.g., zero). An audio decoder performs corresponding adaptive entropy decoding.

Type: Application

Filed: August 25, 2003

Publication date: January 20, 2005

Inventors: Sanjeev Mehrotra, Wei-ge Chen
Matrix multiplication in a Galois field for forward error correction

Publication number: 20040093550

Abstract: Transreceived packages use forward error correction (FEC) with matrix multiplication in a Galois field of size P (GF(P)) and contain at least a portion of K rows of matrix B having elements Bk,m in M columns. Packages include matrix C having elements Cn,m for FEC for the K rows. Matrix C has 0 to (N−1)th rows redundant with matrix B data. Elements Cn,m are computed by XOR'ing GE xp[(G Log[An,k]+GLog[Bk,m]) mod (P−1)] for k from 0 to (K−1). Matrix A has elements An, k with N rows and K columns. G Exp and G Log are one-dimensional arrays. Matrix A is chosen so up to N rows of B and C (total) can be lost, and B can be recovered. An inverse matrix D is computed from A with the rows of B and C. B is reconstructed from D and the received rows of B and C using another matrix multiplication.

Type: Application

Filed: November 12, 2002

Publication date: May 13, 2004

Inventors: Andrew Kadatch, Sanjeev Mehrotra
Segmentation of digital video and images into continuous tone and palettized regions

Publication number: 20040017939

Abstract: For encoding of mixed-content images containing palettized and continuous-tone content, continuous tone content regions in the image are detected and separated. Continuous tone content segmentation classifies pixels as continuous tone content by counting a number of unique pixel values within a pixel neighborhood. Pixels whose count exceeds a threshold are classified as continuous tone content. The technique further scans the image for regions of high continuous tone pixel density. The segmented continuous-tone and palettized content can be encoded separately for efficient compression, and then reassembled at decompression.

Type: Application

Filed: July 23, 2002

Publication date: January 29, 2004

Applicant: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Text detection in continuous tone image segments

Publication number: 20040001634

Abstract: For encoding of mixed-mode images containing text and continuous-tone content, the pixels in the image that form the text content are detected and separated. Text detection classifies pixels as text or continuous tone content by accumulating pixel counts for groups of contiguous, non-smooth pixels with the same color. Groups whose pixel count exceeds a threshold are classified as text. The text detection technique further reduces classification errors by testing for boundary dimensions and pixel density of the group characteristic of long straight lines or large borders. The text detection technique further searches the neighborhood of groups qualifying as text for pixels of the same color, so as to also detect pixels for isolated text marks like dots, accents or punctuation. The separated text and continuous-tone content can be encoded separately for efficient compression while preserving text quality, and the text again superimposed on the continuous tone content at decompression.

Type: Application

Filed: June 28, 2002

Publication date: January 1, 2004

Applicant: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Motion estimation/compensation for screen capture video

Publication number: 20040001544

Abstract: The present invention relates to motion estimation and compensation. For example, a screen capture encoder performs motion estimation that is adapted to screen capture video in various respects. For example, the motion estimation uses a distortion measure based upon the count of equal/unequal pixels in two regions, sub-samples the distortion measure to speed up motion estimation, and/or uses a search pattern that prioritizes types of motion common in screen capture video. Or, a screen capture decoder performs motion compensation that is adapted to screen capture video in various respects. For example, the decoder performs the motion compensation for pixels with different values at corresponding locations in a current frame and a reference frame, but not for all pixels of the current frame. Alternatively, an encoder/decoder performs the motion estimation/compensation to compress/decompress other kinds of content.

Type: Application

Filed: June 28, 2002

Publication date: January 1, 2004

Applicant: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Rate allocation for mixed content video

Publication number: 20040001638

Abstract: The present invention relates to regulating the quality and/or bitrate of content within mixed content video when the video is compressed subject to a bitrate constraint. For example, a screen capture encoder encodes palletized content within a frame of screen capture video. Subject to an overall bitrate constraint, the encoder then allocates bits for continuous tone content within the frame. By controlling the allocation of bits used to encode the continuous tone content, the encoder regulates bitrate for the continuous tone content. This in turn can allow the encoder to regulate spatial quality and/or overall temporal quality for the video. In one scenario, for screen capture video encoded to a (relatively) constant overall bitrate, the screen capture encoder reduces the bitrate (and quality) of the continuous tone content, instead spending bits to increase the overall frame rate of the video.

Type: Application

Filed: June 28, 2002

Publication date: January 1, 2004

Applicant: Microsoft Corporation

Inventor: Sanjeev Mehrotra
Intra compression of pixel blocks using predicted mean

Publication number: 20030185452

Abstract: An apparatus and method for encoding video frames is provided. The video frames are divided into blocks for encoding. Encoding of the video blocks utilizes motion detection, motion estimation and adaptive compression, to obtain the desired compression for a particular bit rate. Adaptive compression includes intra compression (without regard to other frames) and inter compression (with regard to other frames). Intra compression, inter compression with motion detection, and inter compression with motion estimation are performed on a block by block basis, as needed. Segmentation is provided to compare encoding of a block with encoding of its sub-blocks, and to select the best block size for encoding.

Type: Application

Filed: March 31, 2003

Publication date: October 2, 2003

Inventors: Albert S. Wang, Sanjeev Mehrotra
Receiver-driven layered error correction multicast over heterogeneous packet networks

Patent number: 6594798

Abstract: Correction of errors and losses in a receiver-driven layered multicast (RLM) of real-time media over a network is augmented using one or more layers of error correction information. Each receiver separately optimizes the quality of received information by subscribing to at least one error correction layer. Ideally, each source layer in a RLM has one or more associated multicasted error correction data streams. Each error correction layer contains information for replacing lost packets from the associated source layer. More than one error correction layer is proposed to correct for lost packets in other error correction layers. Error correction streams are preferably generated using a pseudo-Automatic Repeat Request (ARQ) wherein a broadcaster sends both the source packets in a primary stream and delayed versions thereof in one or more redundant streams. A hybrid technique combines the psuedo-ARQ method with an adaptation of Forward Error Correction (FEC) techniques.

Type: Grant

Filed: May 21, 1999

Date of Patent: July 15, 2003

Assignee: Microsoft Corporation

Inventors: Philip A. Chou, Albert S. Wang, Sanjeev Mehrotra, Alexander E. Mohr

prev … 3 4 5 6 7 8 next