Patents by Inventor Shankar Regunathan
Shankar Regunathan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20120020409Abstract: Techniques and tools are described for compensating for rounding when estimating sample-domain distortion in the transform domain. For example, a video encoder estimates pixel-domain distortion in the transform domain for a block of transform coefficients after compensating for rounding in the DC coefficient of the block. In this way, the video encoder improves the accuracy of pixel-domain distortion estimation but retains the computational advantages of performing the estimation in the transform domain. Rounding compensation includes, for example, looking up an index (from a de-quantized transform coefficient) in a rounding offset table to determine a rounding offset, then adjusting the coefficient by the offset. Other techniques and tools described herein are directed to creating rounding offset tables and encoders that make encoding decisions after considering rounding effects that occur after an inverse frequency transform on de-quantized transform coefficient values.Type: ApplicationFiled: September 29, 2011Publication date: January 26, 2012Applicant: Microsoft CorporationInventors: Cheng Chang, Chih-Lung B. Lin, Shankar Regunathan
-
Publication number: 20110280303Abstract: Techniques and tools are described for flexible range reduction of samples of video. For example, an encoder signals a first set of one or more syntax elements for range reduction of luma samples and signals a second set of one or more syntax elements for range reduction of chroma samples. The encoder selectively scales down the luma samples and chroma samples in a manner consistent with the first syntax element(s) and second syntax element(s), respectively. Or, an encoder signals range reduction syntax element(s) in an entry point header for an entry point segment, where the syntax element(s) apply to pictures in the entry point segment. If range reduction is used for the pictures, the encoder scales down samples of the pictures. Otherwise, the encoder skips the scaling down. A decoder performs corresponding parsing and scaling up operations.Type: ApplicationFiled: July 26, 2011Publication date: November 17, 2011Applicant: Microsoft CorporationInventors: Shankar Regunathan, Sridhar Srinivasan, Jie Liang, Chih-Lung Lin, Minghui Xia
-
Patent number: 8059721Abstract: Techniques and tools are described for compensating for rounding when estimating sample-domain distortion in the transform domain. For example, a video encoder estimates pixel-domain distortion in the transform domain for a block of transform coefficients after compensating for rounding in the DC coefficient of the block. In this way, the video encoder improves the accuracy of pixel-domain distortion estimation but retains the computational advantages of performing the estimation in the transform domain. Rounding compensation includes, for example, looking up an index (from a de-quantized transform coefficient) in a rounding offset table to determine a rounding offset, then adjusting the coefficient by the offset. Other techniques and tools described herein are directed to creating rounding offset tables and encoders that make encoding decisions after considering rounding effects that occur after an inverse frequency transform on de-quantized transform coefficient values.Type: GrantFiled: April 7, 2006Date of Patent: November 15, 2011Assignee: Microsoft CorporationInventors: Cheng Chang, Chih-Lung Lin, Shankar Regunathan
-
Patent number: 8054886Abstract: Rules for the signaling and interpretation of chroma position are described. One rule, called the short rule, defines fifteen discrete chroma centering positions and corresponding four-bit syntax element. Another rule, called the extended rule, defines 81 discrete chroma centering positions and corresponding seven-bit syntax elements. A described method includes receiving digital media data at a digital media encoder, determining chroma position information for the received digital media data, and representing the chroma position information with one or more syntax elements in an encoded bitstream. The one or more syntax elements are operable to communicate the chroma position information to a digital media decoder. The chroma position information facilitates an image rotation or flip.Type: GrantFiled: June 29, 2007Date of Patent: November 8, 2011Assignee: Microsoft CorporationInventors: Sridhar Srinivasan, Chengjie Tu, Shankar Regunathan, Gary J. Sullivan
-
Patent number: 8014450Abstract: Techniques and tools are described for flexible range reduction of samples of video. For example, an encoder signals a first set of one or more syntax elements for range reduction of luma samples and signals a second set of one or more syntax elements for range reduction of chroma samples. The encoder selectively scales down the luma samples and chroma samples in a manner consistent with the first syntax element(s) and second syntax element(s), respectively. Or, an encoder signals range reduction syntax element(s) in an entry point header for an entry point segment, where the syntax element(s) apply to pictures in the entry point segment. If range reduction is used for the pictures, the encoder scales down samples of the pictures. Otherwise, the encoder skips the scaling down. A decoder performs corresponding parsing and scaling up operations.Type: GrantFiled: November 15, 2004Date of Patent: September 6, 2011Assignee: Microsoft CorporationInventors: Shankar Regunathan, Sridhar Srinivasan, Jie Liang, Chih-Lung Lin, Minghui Xia
-
Patent number: 8009739Abstract: Techniques and tools for intensity compensation for interlaced forward-predicted fields are described. For example, a video decoder receives and decodes a variable length code that indicates which of two reference fields for an interlaced forward-predicted field use intensity compensation (e.g., both, only the first, or only the second). The decoder performs intensity compensation on each of the two reference fields that uses intensity compensation. A video encoder performs corresponding intensity estimation/compensation and signaling.Type: GrantFiled: September 2, 2004Date of Patent: August 30, 2011Assignee: Microsoft CorporationInventors: Thomas W. Holcomb, Sridhar Srinivasan, Shankar Regunathan
-
Patent number: 7924921Abstract: A decoder receives an entry point header comprising plural control parameters for an entry point segment corresponding to the entry point header. The entry point header is in an entry point layer of a bitstream comprising plural layers. The decoder decodes the entry point header. The plural control parameters can include various combinations of control parameters such as a pan scan on/off parameter, a reference frame distance on/off parameter, a loop filtering on/off parameter, a fast chroma motion compensation on/off parameter, an extended range motion vector on/off parameter, a variable sized transform on/off parameter, an overlapped transform on/off parameter, a quantization decision parameter, and an extended differential motion vector coding on/off parameter, a broken link parameter, a closed entry parameter, one or more coded picture size parameters, one or more range mapping parameters, a hypothetical reference decoder buffer parameter, and/or other parameter(s).Type: GrantFiled: November 15, 2004Date of Patent: April 12, 2011Assignee: Microsoft CorporationInventors: Regis J. Crinon, Chih-Lung Lin, Jie Liang, Shankar Regunathan, Shuo-Jen Wu, Timothy E. Onders, Thomas W. Holcomb
-
Patent number: 7852919Abstract: A decoder receives a field start code for an entry point key frame. The field start code indicates a second coded interlaced video field in the entry point key frame following a first coded interlaced video field in the entry point key frame and indicates a point to begin decoding of the second coded interlaced video field. The first coded interlaced video field is a predicted field, and the second coded interlaced video field is an intra-coded field. The decoder decodes the second field without decoding the first field. The field start code can be followed by a field header. The decoder can receive a frame header for the entry point key frame. The frame header may comprise a syntax element indicating a frame coding mode for the entry point key frame and/or a syntax element indicating field types for the first and second coded interlaced video fields.Type: GrantFiled: November 15, 2004Date of Patent: December 14, 2010Assignee: Microsoft CorporationInventors: Regis J. Crinon, Thomas W. Holcomb, Shankar Regunathan, Sridhar Srinivasan
-
Patent number: 7839930Abstract: Techniques and tools for coding/decoding of digital video, and in particular, for determining, signaling and detecting entry points in video streams are described. Techniques and tools described herein are used to embed entry point indicator information in the bitstream that receivers, editing systems, insertion systems, and other systems can use to detect valid entry points in compressed video.Type: GrantFiled: June 30, 2004Date of Patent: November 23, 2010Assignee: Microsoft CorporationInventors: Thomas W. Holcomb, Regis J. Crinon, Timothy E. Onders, Sridhar Srinivasan, Shankar Regunathan
-
Publication number: 20100280827Abstract: Embodiments for implementing a speech recognition system that includes a speech classifier ensemble are disclosed. In accordance with one embodiment, the speech recognition system includes a classifier ensemble to convert feature vectors that represent a speech vector into log probability sets. The classifier ensemble includes a plurality of classifiers. The speech recognition system includes a decoder ensemble to transform the log probability sets into output symbol sequences. The speech recognition system further includes a query component to retrieve one or more speech utterances from a speech database using the output symbol sequences.Type: ApplicationFiled: April 30, 2009Publication date: November 4, 2010Applicant: Microsoft CorporationInventors: Kunal Mukerjee, Kazuhito Koishida, Shankar Regunathan
-
Publication number: 20100246671Abstract: Described tools and techniques relate to signaling for DC coefficients at small quantization step sizes. The techniques and tools can be used in combination or independently. For example, a tool such as a video encoder or decoder processes a VLC that indicates a DC differential for a DC coefficient, a FLC that indicates a value refinement for the DC differential, and a third code that indicates the sign for the DC differential. Even with the small quantization step sizes, the tool uses a VLC table with DC differentials for DC coefficients above the small quantization step sizes. The FLCs for DC differentials have lengths that vary depending on quantization step size.Type: ApplicationFiled: June 14, 2010Publication date: September 30, 2010Applicant: Microsoft CorporationInventors: Chih-Lung Lin, Shankar Regunathan, Sridhar Srinivasan
-
Patent number: 7738554Abstract: Described tools and techniques relate to signaling for DC coefficients at small quantization step sizes. The techniques and tools can be used in combination or independently. For example, a tool such as a video encoder or decoder processes a VLC that indicates a DC differential for a DC coefficient, a FLC that indicates a value refinement for the DC differential, and a third code that indicates the sign for the DC differential. Even with the small quantization step sizes, the tool uses a VLC table with DC differentials for DC coefficients above the small quantization step sizes. The FLCs for DC differentials have lengths that vary depending on quantization step size.Type: GrantFiled: July 17, 2004Date of Patent: June 15, 2010Assignee: Microsoft CorporationInventors: Chih-Lung Lin, Shankar Regunathan, Sridhar Srinivasan
-
Patent number: 7724827Abstract: Entropy coding and decoding techniques are described, which may be implemented separately or in combination. For example, a video encoder uses two-layer run level coding to reduce bitrate for frequency transform coefficients in a quick and efficient manner, and a video decoder uses corresponding two-layer run level decoding. This two-layer coding/decoding can be generalized to more than two layers of run level coding/decoding. The video encoder and decoder exploit common patterns in run level information to reduce code table size and create opportunities for early termination of decoding. Using zoned Huffman code tables helps limit overall table size while still providing a level of adaptivity in encoding and decoding. Using embedded Huffman code tables allows the encoder and decoder to reuse codes for 8×8, 8×4, 4×8, and 4×4 blocks.Type: GrantFiled: April 15, 2004Date of Patent: May 25, 2010Assignee: Microsoft CorporationInventors: Jie Liang, Chih-Lung Lin, Shankar Regunathan, Sridhar Srinivasan
-
Publication number: 20100104025Abstract: A method is described for efficiently determining total end-to-end distortion of a pre-compressed data stream, such as video streams or other media streams, at the time of delivery over a lossy-network, and for providing adaptive error-resilient delivery schemes based on distortion estimates. The methods can be utilized with single or multilayer packet streams and are particularly well suited for video streams. By way of example, distortion estimates are performed by generating side-information at the time of data stream compression, wherein the side-information is used in conjunction with information about the network status to determine an estimated distortion for the group of packets when the data stream is transported over the network to a destination end. This estimation may be utilized within described resiliency techniques in which the error correction mechanism is selected in response to the estimated distortion, which may be additionally refined in reference to cost factors.Type: ApplicationFiled: September 18, 2009Publication date: April 29, 2010Applicant: THE REGENTS OF THE UNIVERSITY OF CALIFORNIAInventors: Kenneth Rose, Rui Zhang, Shankar Regunathan
-
Publication number: 20100092098Abstract: In certain embodiments, overlap operators are applied during encoding and/or decoding of digital media, where the overlap operators have reduced DC gain mismatch and/or DC leakage between interior overlap operators and overlap operators at the edge and/or corner. In other embodiments, information indicating a selected tile boundary option for overlap processing can be encoded and/or decoded. The selected tile boundary option indicates one of a hard tile boundary option and a soft tile boundary option for processing with overlap operators. Overlap transform processing can then be applied based at least in part on the selected tile boundary option.Type: ApplicationFiled: September 30, 2009Publication date: April 15, 2010Applicant: MICROSOFT CORPORATIONInventors: DAN SCHONBERG, SHANKAR REGUNATHAN, SHIJUN SUN, GARY J. SULLIVAN, ZHI ZHOU, SRIDHAR SRINIVASAN
-
Publication number: 20100061447Abstract: Techniques and tools for skip modes in encoding and decoding of inter-layer residual video are described. For example, an encoder encodes multiple macroblocks of a picture of inter-layer residual video. For a current macroblock that is skipped, the encoder selects a skip mode from among multiple available skip modes and uses the selected skip mode when encoding the current macroblock. The skip modes can include intra skip mode and predicted-motion skip mode. A corresponding decoder, for the current macroblock, selects and uses the skip mode for the current macroblock during decoding. As another example, an encoder encodes multiple channels of a picture of inter-layer residual video. For each channel, the encoder determines whether to skip the channel. The encoder signals channel skip information to indicate which channels are skipped. A corresponding decoder parses the channel skip information and determines on a channel-by-channel basis whether to skip the respective channels.Type: ApplicationFiled: September 5, 2008Publication date: March 11, 2010Applicant: Microsoft CorporationInventors: Chengjie Tu, Shankar Regunathan, Shijun Sun, Chih-Lung Lin
-
Publication number: 20100046612Abstract: Techniques and tools for conversion operations between modules in a scalable video encoding tool or scalable video decoding tool are described. For example, given reconstructed base layer video in a low resolution format (e.g., 4:2:0 video with 8 bits per sample) an encoding tool and decoding tool adaptively filter the reconstructed base layer video and upsample its sample values to a higher sample depth (e.g., 10 bits per sample). The tools also adaptively scale chroma samples to a higher chroma sampling rate (e.g., 4:2:2). The adaptive filtering and chroma scaling help reduce energy in inter-layer residual video by making the reconstructed base layer video closer to input video, which typically makes compression of the inter-layer residual video more efficient. The encoding tool also remaps sample values of the inter-layer residual video to adjust dynamic range before encoding, and the decoding tool performs inverse remapping after decoding.Type: ApplicationFiled: August 25, 2008Publication date: February 25, 2010Applicant: Microsoft CorporationInventors: Shijun Sun, Shankar Regunathan, Chengjie Tu, Chih-Lung Lin
-
Publication number: 20100046626Abstract: Techniques and tools for encoding and decoding data values that are hierarchically organized are presented. For example, an encoder encodes data as a set that has a hierarchy of subsets with set symbols. In the encoding, the encoder evaluates the data values of the set and selectively encodes a symbol combination code that indicates the set symbols of multiple subsets of the set. Then, for each of the multiple subsets considered as a new set, the encoder selectively repeats the evaluating, selective encoding and selective repetition for the new set. In corresponding decoding, a decoder decodes data encoded as a set that has a hierarchy of subsets with set symbols. In some implementations, the encoding and decoding are adaptive and use a symbol alphabet with nested elements.Type: ApplicationFiled: August 22, 2008Publication date: February 25, 2010Applicant: Microsoft CorporationInventors: Chengjie Tu, Shankar Regunathan, Shijun Sun, Chih-Lung Lin
-
Patent number: 7639739Abstract: A method is described for efficiently determining total end-to-end distortion of a pre-compressed data stream, such as video streams or other media streams, at the time of delivery over a lossy-network, and for providing adaptive error-resilient delivery schemes based on distortion estimates. The methods can be utilized with single or multilayer packet streams and are particularly well suited for video streams. By way of example, distortion estimates are performed by generating side-information at the time of data stream compression, wherein the side-information is used in conjunction with information about the network status to determine an estimated distortion for the group of packets when the data stream is transported over the network to a destination end. This estimation may be utilized within described resiliency techniques in which the error correction mechanism is selected in response to the estimated distortion, which may be additionally refined in reference to cost factors.Type: GrantFiled: November 1, 2002Date of Patent: December 29, 2009Assignee: The Regents of the University of CaliforniaInventors: Kenneth Rose, Rui Zhang, Shankar Regunathan
-
Publication number: 20090296808Abstract: Techniques and tools for encoding enhancement layer video with quantization that varies spatially and/or between color channels are presented, along with corresponding decoding techniques and tools. For example, an encoding tool determines whether quantization varies spatially over a picture, and the tool also determines whether quantization varies between color channels in the picture. The tool signals quantization parameters for macroblocks in the picture in an encoded bit stream. In some implementations, to signal the quantization parameters, the tool predicts the quantization parameters, and the quantization parameters are signaled with reference to the predicted quantization parameters. A decoding tool receives the encoded bit stream, predicts the quantization parameters, and uses the signaled information to determine the quantization parameters for the macroblocks of the enhancement layer video. The decoding tool performs inverse quantization that can vary spatially and/or between color channels.Type: ApplicationFiled: June 3, 2008Publication date: December 3, 2009Applicant: Microsoft CorporationInventors: Shankar Regunathan, Shijun Sun, Chengjie Tu, Chih-Lung Lin