Patents by Inventor Shankar Regunathan

Shankar Regunathan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20120020409
    Abstract: Techniques and tools are described for compensating for rounding when estimating sample-domain distortion in the transform domain. For example, a video encoder estimates pixel-domain distortion in the transform domain for a block of transform coefficients after compensating for rounding in the DC coefficient of the block. In this way, the video encoder improves the accuracy of pixel-domain distortion estimation but retains the computational advantages of performing the estimation in the transform domain. Rounding compensation includes, for example, looking up an index (from a de-quantized transform coefficient) in a rounding offset table to determine a rounding offset, then adjusting the coefficient by the offset. Other techniques and tools described herein are directed to creating rounding offset tables and encoders that make encoding decisions after considering rounding effects that occur after an inverse frequency transform on de-quantized transform coefficient values.
    Type: Application
    Filed: September 29, 2011
    Publication date: January 26, 2012
    Applicant: Microsoft Corporation
    Inventors: Cheng Chang, Chih-Lung B. Lin, Shankar Regunathan
  • Publication number: 20110280303
    Abstract: Techniques and tools are described for flexible range reduction of samples of video. For example, an encoder signals a first set of one or more syntax elements for range reduction of luma samples and signals a second set of one or more syntax elements for range reduction of chroma samples. The encoder selectively scales down the luma samples and chroma samples in a manner consistent with the first syntax element(s) and second syntax element(s), respectively. Or, an encoder signals range reduction syntax element(s) in an entry point header for an entry point segment, where the syntax element(s) apply to pictures in the entry point segment. If range reduction is used for the pictures, the encoder scales down samples of the pictures. Otherwise, the encoder skips the scaling down. A decoder performs corresponding parsing and scaling up operations.
    Type: Application
    Filed: July 26, 2011
    Publication date: November 17, 2011
    Applicant: Microsoft Corporation
    Inventors: Shankar Regunathan, Sridhar Srinivasan, Jie Liang, Chih-Lung Lin, Minghui Xia
  • Patent number: 8059721
    Abstract: Techniques and tools are described for compensating for rounding when estimating sample-domain distortion in the transform domain. For example, a video encoder estimates pixel-domain distortion in the transform domain for a block of transform coefficients after compensating for rounding in the DC coefficient of the block. In this way, the video encoder improves the accuracy of pixel-domain distortion estimation but retains the computational advantages of performing the estimation in the transform domain. Rounding compensation includes, for example, looking up an index (from a de-quantized transform coefficient) in a rounding offset table to determine a rounding offset, then adjusting the coefficient by the offset. Other techniques and tools described herein are directed to creating rounding offset tables and encoders that make encoding decisions after considering rounding effects that occur after an inverse frequency transform on de-quantized transform coefficient values.
    Type: Grant
    Filed: April 7, 2006
    Date of Patent: November 15, 2011
    Assignee: Microsoft Corporation
    Inventors: Cheng Chang, Chih-Lung Lin, Shankar Regunathan
  • Patent number: 8054886
    Abstract: Rules for the signaling and interpretation of chroma position are described. One rule, called the short rule, defines fifteen discrete chroma centering positions and corresponding four-bit syntax element. Another rule, called the extended rule, defines 81 discrete chroma centering positions and corresponding seven-bit syntax elements. A described method includes receiving digital media data at a digital media encoder, determining chroma position information for the received digital media data, and representing the chroma position information with one or more syntax elements in an encoded bitstream. The one or more syntax elements are operable to communicate the chroma position information to a digital media decoder. The chroma position information facilitates an image rotation or flip.
    Type: Grant
    Filed: June 29, 2007
    Date of Patent: November 8, 2011
    Assignee: Microsoft Corporation
    Inventors: Sridhar Srinivasan, Chengjie Tu, Shankar Regunathan, Gary J. Sullivan
  • Patent number: 8014450
    Abstract: Techniques and tools are described for flexible range reduction of samples of video. For example, an encoder signals a first set of one or more syntax elements for range reduction of luma samples and signals a second set of one or more syntax elements for range reduction of chroma samples. The encoder selectively scales down the luma samples and chroma samples in a manner consistent with the first syntax element(s) and second syntax element(s), respectively. Or, an encoder signals range reduction syntax element(s) in an entry point header for an entry point segment, where the syntax element(s) apply to pictures in the entry point segment. If range reduction is used for the pictures, the encoder scales down samples of the pictures. Otherwise, the encoder skips the scaling down. A decoder performs corresponding parsing and scaling up operations.
    Type: Grant
    Filed: November 15, 2004
    Date of Patent: September 6, 2011
    Assignee: Microsoft Corporation
    Inventors: Shankar Regunathan, Sridhar Srinivasan, Jie Liang, Chih-Lung Lin, Minghui Xia
  • Patent number: 8009739
    Abstract: Techniques and tools for intensity compensation for interlaced forward-predicted fields are described. For example, a video decoder receives and decodes a variable length code that indicates which of two reference fields for an interlaced forward-predicted field use intensity compensation (e.g., both, only the first, or only the second). The decoder performs intensity compensation on each of the two reference fields that uses intensity compensation. A video encoder performs corresponding intensity estimation/compensation and signaling.
    Type: Grant
    Filed: September 2, 2004
    Date of Patent: August 30, 2011
    Assignee: Microsoft Corporation
    Inventors: Thomas W. Holcomb, Sridhar Srinivasan, Shankar Regunathan
  • Patent number: 7924921
    Abstract: A decoder receives an entry point header comprising plural control parameters for an entry point segment corresponding to the entry point header. The entry point header is in an entry point layer of a bitstream comprising plural layers. The decoder decodes the entry point header. The plural control parameters can include various combinations of control parameters such as a pan scan on/off parameter, a reference frame distance on/off parameter, a loop filtering on/off parameter, a fast chroma motion compensation on/off parameter, an extended range motion vector on/off parameter, a variable sized transform on/off parameter, an overlapped transform on/off parameter, a quantization decision parameter, and an extended differential motion vector coding on/off parameter, a broken link parameter, a closed entry parameter, one or more coded picture size parameters, one or more range mapping parameters, a hypothetical reference decoder buffer parameter, and/or other parameter(s).
    Type: Grant
    Filed: November 15, 2004
    Date of Patent: April 12, 2011
    Assignee: Microsoft Corporation
    Inventors: Regis J. Crinon, Chih-Lung Lin, Jie Liang, Shankar Regunathan, Shuo-Jen Wu, Timothy E. Onders, Thomas W. Holcomb
  • Patent number: 7852919
    Abstract: A decoder receives a field start code for an entry point key frame. The field start code indicates a second coded interlaced video field in the entry point key frame following a first coded interlaced video field in the entry point key frame and indicates a point to begin decoding of the second coded interlaced video field. The first coded interlaced video field is a predicted field, and the second coded interlaced video field is an intra-coded field. The decoder decodes the second field without decoding the first field. The field start code can be followed by a field header. The decoder can receive a frame header for the entry point key frame. The frame header may comprise a syntax element indicating a frame coding mode for the entry point key frame and/or a syntax element indicating field types for the first and second coded interlaced video fields.
    Type: Grant
    Filed: November 15, 2004
    Date of Patent: December 14, 2010
    Assignee: Microsoft Corporation
    Inventors: Regis J. Crinon, Thomas W. Holcomb, Shankar Regunathan, Sridhar Srinivasan
  • Patent number: 7839930
    Abstract: Techniques and tools for coding/decoding of digital video, and in particular, for determining, signaling and detecting entry points in video streams are described. Techniques and tools described herein are used to embed entry point indicator information in the bitstream that receivers, editing systems, insertion systems, and other systems can use to detect valid entry points in compressed video.
    Type: Grant
    Filed: June 30, 2004
    Date of Patent: November 23, 2010
    Assignee: Microsoft Corporation
    Inventors: Thomas W. Holcomb, Regis J. Crinon, Timothy E. Onders, Sridhar Srinivasan, Shankar Regunathan
  • Publication number: 20100280827
    Abstract: Embodiments for implementing a speech recognition system that includes a speech classifier ensemble are disclosed. In accordance with one embodiment, the speech recognition system includes a classifier ensemble to convert feature vectors that represent a speech vector into log probability sets. The classifier ensemble includes a plurality of classifiers. The speech recognition system includes a decoder ensemble to transform the log probability sets into output symbol sequences. The speech recognition system further includes a query component to retrieve one or more speech utterances from a speech database using the output symbol sequences.
    Type: Application
    Filed: April 30, 2009
    Publication date: November 4, 2010
    Applicant: Microsoft Corporation
    Inventors: Kunal Mukerjee, Kazuhito Koishida, Shankar Regunathan
  • Publication number: 20100246671
    Abstract: Described tools and techniques relate to signaling for DC coefficients at small quantization step sizes. The techniques and tools can be used in combination or independently. For example, a tool such as a video encoder or decoder processes a VLC that indicates a DC differential for a DC coefficient, a FLC that indicates a value refinement for the DC differential, and a third code that indicates the sign for the DC differential. Even with the small quantization step sizes, the tool uses a VLC table with DC differentials for DC coefficients above the small quantization step sizes. The FLCs for DC differentials have lengths that vary depending on quantization step size.
    Type: Application
    Filed: June 14, 2010
    Publication date: September 30, 2010
    Applicant: Microsoft Corporation
    Inventors: Chih-Lung Lin, Shankar Regunathan, Sridhar Srinivasan
  • Patent number: 7738554
    Abstract: Described tools and techniques relate to signaling for DC coefficients at small quantization step sizes. The techniques and tools can be used in combination or independently. For example, a tool such as a video encoder or decoder processes a VLC that indicates a DC differential for a DC coefficient, a FLC that indicates a value refinement for the DC differential, and a third code that indicates the sign for the DC differential. Even with the small quantization step sizes, the tool uses a VLC table with DC differentials for DC coefficients above the small quantization step sizes. The FLCs for DC differentials have lengths that vary depending on quantization step size.
    Type: Grant
    Filed: July 17, 2004
    Date of Patent: June 15, 2010
    Assignee: Microsoft Corporation
    Inventors: Chih-Lung Lin, Shankar Regunathan, Sridhar Srinivasan
  • Patent number: 7724827
    Abstract: Entropy coding and decoding techniques are described, which may be implemented separately or in combination. For example, a video encoder uses two-layer run level coding to reduce bitrate for frequency transform coefficients in a quick and efficient manner, and a video decoder uses corresponding two-layer run level decoding. This two-layer coding/decoding can be generalized to more than two layers of run level coding/decoding. The video encoder and decoder exploit common patterns in run level information to reduce code table size and create opportunities for early termination of decoding. Using zoned Huffman code tables helps limit overall table size while still providing a level of adaptivity in encoding and decoding. Using embedded Huffman code tables allows the encoder and decoder to reuse codes for 8×8, 8×4, 4×8, and 4×4 blocks.
    Type: Grant
    Filed: April 15, 2004
    Date of Patent: May 25, 2010
    Assignee: Microsoft Corporation
    Inventors: Jie Liang, Chih-Lung Lin, Shankar Regunathan, Sridhar Srinivasan
  • Publication number: 20100104025
    Abstract: A method is described for efficiently determining total end-to-end distortion of a pre-compressed data stream, such as video streams or other media streams, at the time of delivery over a lossy-network, and for providing adaptive error-resilient delivery schemes based on distortion estimates. The methods can be utilized with single or multilayer packet streams and are particularly well suited for video streams. By way of example, distortion estimates are performed by generating side-information at the time of data stream compression, wherein the side-information is used in conjunction with information about the network status to determine an estimated distortion for the group of packets when the data stream is transported over the network to a destination end. This estimation may be utilized within described resiliency techniques in which the error correction mechanism is selected in response to the estimated distortion, which may be additionally refined in reference to cost factors.
    Type: Application
    Filed: September 18, 2009
    Publication date: April 29, 2010
    Applicant: THE REGENTS OF THE UNIVERSITY OF CALIFORNIA
    Inventors: Kenneth Rose, Rui Zhang, Shankar Regunathan
  • Publication number: 20100092098
    Abstract: In certain embodiments, overlap operators are applied during encoding and/or decoding of digital media, where the overlap operators have reduced DC gain mismatch and/or DC leakage between interior overlap operators and overlap operators at the edge and/or corner. In other embodiments, information indicating a selected tile boundary option for overlap processing can be encoded and/or decoded. The selected tile boundary option indicates one of a hard tile boundary option and a soft tile boundary option for processing with overlap operators. Overlap transform processing can then be applied based at least in part on the selected tile boundary option.
    Type: Application
    Filed: September 30, 2009
    Publication date: April 15, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: DAN SCHONBERG, SHANKAR REGUNATHAN, SHIJUN SUN, GARY J. SULLIVAN, ZHI ZHOU, SRIDHAR SRINIVASAN
  • Publication number: 20100061447
    Abstract: Techniques and tools for skip modes in encoding and decoding of inter-layer residual video are described. For example, an encoder encodes multiple macroblocks of a picture of inter-layer residual video. For a current macroblock that is skipped, the encoder selects a skip mode from among multiple available skip modes and uses the selected skip mode when encoding the current macroblock. The skip modes can include intra skip mode and predicted-motion skip mode. A corresponding decoder, for the current macroblock, selects and uses the skip mode for the current macroblock during decoding. As another example, an encoder encodes multiple channels of a picture of inter-layer residual video. For each channel, the encoder determines whether to skip the channel. The encoder signals channel skip information to indicate which channels are skipped. A corresponding decoder parses the channel skip information and determines on a channel-by-channel basis whether to skip the respective channels.
    Type: Application
    Filed: September 5, 2008
    Publication date: March 11, 2010
    Applicant: Microsoft Corporation
    Inventors: Chengjie Tu, Shankar Regunathan, Shijun Sun, Chih-Lung Lin
  • Publication number: 20100046612
    Abstract: Techniques and tools for conversion operations between modules in a scalable video encoding tool or scalable video decoding tool are described. For example, given reconstructed base layer video in a low resolution format (e.g., 4:2:0 video with 8 bits per sample) an encoding tool and decoding tool adaptively filter the reconstructed base layer video and upsample its sample values to a higher sample depth (e.g., 10 bits per sample). The tools also adaptively scale chroma samples to a higher chroma sampling rate (e.g., 4:2:2). The adaptive filtering and chroma scaling help reduce energy in inter-layer residual video by making the reconstructed base layer video closer to input video, which typically makes compression of the inter-layer residual video more efficient. The encoding tool also remaps sample values of the inter-layer residual video to adjust dynamic range before encoding, and the decoding tool performs inverse remapping after decoding.
    Type: Application
    Filed: August 25, 2008
    Publication date: February 25, 2010
    Applicant: Microsoft Corporation
    Inventors: Shijun Sun, Shankar Regunathan, Chengjie Tu, Chih-Lung Lin
  • Publication number: 20100046626
    Abstract: Techniques and tools for encoding and decoding data values that are hierarchically organized are presented. For example, an encoder encodes data as a set that has a hierarchy of subsets with set symbols. In the encoding, the encoder evaluates the data values of the set and selectively encodes a symbol combination code that indicates the set symbols of multiple subsets of the set. Then, for each of the multiple subsets considered as a new set, the encoder selectively repeats the evaluating, selective encoding and selective repetition for the new set. In corresponding decoding, a decoder decodes data encoded as a set that has a hierarchy of subsets with set symbols. In some implementations, the encoding and decoding are adaptive and use a symbol alphabet with nested elements.
    Type: Application
    Filed: August 22, 2008
    Publication date: February 25, 2010
    Applicant: Microsoft Corporation
    Inventors: Chengjie Tu, Shankar Regunathan, Shijun Sun, Chih-Lung Lin
  • Patent number: 7639739
    Abstract: A method is described for efficiently determining total end-to-end distortion of a pre-compressed data stream, such as video streams or other media streams, at the time of delivery over a lossy-network, and for providing adaptive error-resilient delivery schemes based on distortion estimates. The methods can be utilized with single or multilayer packet streams and are particularly well suited for video streams. By way of example, distortion estimates are performed by generating side-information at the time of data stream compression, wherein the side-information is used in conjunction with information about the network status to determine an estimated distortion for the group of packets when the data stream is transported over the network to a destination end. This estimation may be utilized within described resiliency techniques in which the error correction mechanism is selected in response to the estimated distortion, which may be additionally refined in reference to cost factors.
    Type: Grant
    Filed: November 1, 2002
    Date of Patent: December 29, 2009
    Assignee: The Regents of the University of California
    Inventors: Kenneth Rose, Rui Zhang, Shankar Regunathan
  • Publication number: 20090296808
    Abstract: Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels.
    Type: Application
    Filed: June 3, 2008
    Publication date: December 3, 2009
    Applicant: Microsoft Corporation
    Inventors: Shankar Regunathan, Shijun Sun, Chengjie Tu, Chih-Lung Lin