Patents by Inventor Sharath Manjunath

Sharath Manjunath has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

CODING OF TRANSITIONAL SPEECH FRAMES FOR LOW-BIT-RATE APPLICATIONS

Publication number: 20090319263

Abstract: Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.

Type: Application

Filed: October 30, 2008

Publication date: December 24, 2009

Applicant: QUALCOMM Incorporated

Inventors: Alok Kumar Gupta, Sharath Manjunath
Variable rate speech coding

Patent number: 7496505

Abstract: A method and apparatus for the variable rate coding of a speech signal. An input speech signal is classified and an appropriate coding mode is selected based on this classification. For each classification, the coding mode that achieves the lowest bit rate with an acceptable quality of speech reproduction is selected. Low average bit rates are achieved by only employing high fidelity modes (i.e., high bit rate, broadly applicable to different types of speech) during portions of the speech where this fidelity is required for acceptable output. Lower bit rate modes are used during portions of speech where these modes produce acceptable output. Input speech signal is classified into active and inactive regions. Active regions are further classified into voiced, unvoiced, and transient regions. Various coding modes are applied to active speech, depending upon the required level of fidelity. Coding modes may be utilized according to the strengths and weaknesses of each particular mode.

Type: Grant

Filed: November 13, 2006

Date of Patent: February 24, 2009

Assignee: QUALCOMM Incorporated

Inventors: Sharath Manjunath, William Gardner
METHOD AND APPARATUS FOR PREDICTIVELY QUANTIZING VOICED SPEECH

Publication number: 20080312917

Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.

Type: Application

Filed: August 12, 2008

Publication date: December 18, 2008

Applicant: QUALCOMM Incorporated

Inventors: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. Dejaco
TANDEM-FREE INTERSYSTEM VOICE COMMUNICATION

Publication number: 20080288245

Abstract: Techniques are presented herein to provide tandem-free operation between two wireless terminals through two otherwise incompatible wireless networks. Specifically, embodiments provide tandem-free operation between a wireless terminal communicating through a continuous transmission (CTX) wireless channel to a wireless terminal communicating through a discontinuous transmission (DTX) wireless channel. In a first aspect, inactive speech frames are translated between DTX and CTX formats. In a second aspect, each wireless terminal includes an active speech decoder that is compatible with the active speech encoder on the opposite end of the mobile-to-mobile connection.

Type: Application

Filed: July 29, 2008

Publication date: November 20, 2008

Applicant: QUALCOMM Incorporated

Inventors: Khaled Helmi El-Maleh, Ananthapadmanabhan Arasanipalai Kandhadai, Sharath Manjunath
Methods of Performing Error Concealment For Digital Video

Publication number: 20080232478

Abstract: Error concealment is used to hide the effects of errors detected within digital video information. A complex error concealment mode decision is disclosed to determine whether spatial error concealment (SEC) or temporal error concealment (TEC) should be used. The error concealment mode decision system uses different methods depending on whether the damaged frame is an intra-frame or an inter-frame. If the video frame is an intra-frame then a similarity metric is used to determine if the intra-frame represents a scene-change or not. If the video frame is an intra-frame, a complex multi-termed equation is used to determine whether SEC or TEC should be used. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction.

Type: Application

Filed: March 23, 2007

Publication date: September 25, 2008

Inventors: Chia-Yuan Teng, Sharath Manjunath
Method and apparatus for quantizing pitch, amplitude, phase and linear spectrum of voiced speech

Patent number: 7426466

Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.

Type: Grant

Filed: July 22, 2004

Date of Patent: September 16, 2008

Assignee: QUALCOMM Incorporated

Inventors: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco
3D VIDEO ENCODING

Publication number: 20080198920

Abstract: A stereo 3D video frame includes left and right components that are combined to produce a stereo image. For a given amount of distortion, the left and right components may have different impacts on perceptual visual quality of the stereo image due to asymmetry in the distortion response of the human eye. A 3D video encoder adjusts an allocation of coding bits between left and right components of the 3D video based on a frame-level bit budget and a weighting between the left and right components. The video encoder may generate the bit allocation in the rho (?) domain. The weighted bit allocation may be derived based on a quality metric that indicates overall quality produced by the left and right components. The weighted bit allocation compensates for the asymmetric distortion response to reduce overall perceptual distortion in the stereo image and thereby enhance or maintain visual quality.

Type: Application

Filed: February 21, 2007

Publication date: August 21, 2008

Inventors: Kai Chieh Yang, Haohong Wang, Khaled Helmi El-Maleh, Sharath Manjunath
DISTORTION ESTIMATION FOR QUANTIZED DATA

Publication number: 20080192821

Abstract: Techniques for estimating distortion due to quantization of data are described. A histogram with multiple bins may be obtained for a set of coefficients to be quantized. Distortion due to quantization of the set of coefficients may be estimated based on the histogram and average distortions for the histogram bins. The number of coefficients in each bin may be multiplied with an average distortion for the bin to obtain a per-bin distortion. The per-bin distortions for all of the bins may be accumulated and scaled with a correction factor to obtain the estimated distortion. The techniques may be used to estimate distortions for a set of coding elements. Distortion and rate may be estimated for each coding element for each of multiple quantization steps. A set of quantization steps may be selected for the set of coding elements based on the estimated distortions and the estimated rates for the set of coding elements for different quantization steps.

Type: Application

Filed: February 8, 2007

Publication date: August 14, 2008

Inventors: Narendranath Malayath, Sharath Manjunath
Tandem-free intersystem voice communication

Patent number: 7406096

Abstract: Techniques are presented herein to provide tandem-free operation between two wireless terminals through two otherwise incompatible wireless networks. Specifically, embodiments provide tandem-free operation between a wireless terminal communicating through a continuous transmission (CTX) wireless channel to a wireless terminal communicating through a discontinuous transmission (DTX) wireless channel. In a first aspect, inactive speech frames are translated between DTX and CTX formats. In a second aspect, each wireless terminal includes an active speech decoder that is compatible with the active speech encoder on the opposite end of the mobile-to-mobile connection.

Type: Grant

Filed: December 6, 2002

Date of Patent: July 29, 2008

Assignee: QUALCOMM Incorporated

Inventors: Khaled Helmi El-Maleh, Ananthapadmanabhan Arasanipalai Kandhadai, Sharath Manjunath
RENDERING 3D VIDEO IMAGES ON A STEREO-ENABLED DISPLAY

Publication number: 20080165181

Abstract: The rendering of 3D video images on a stereo-enabled display (e.g., stereoscopic or autostereoscopic display) is described. The process includes culling facets facing away from a viewer, defining foreground facets for Left and Right Views and common background facets, determining lighting for these facets, and performing screen mapping and scene rendering for one view (e.g., Right View) using computational results for facets of the other view (i.e., Left View). In one embodiment, visualization of images is provided on the stereo-enabled display of a low-power device, such as mobile phone, a computer, a video game platform, or a Personal Digital Assistant (PDA) device.

Type: Application

Filed: January 5, 2007

Publication date: July 10, 2008

Inventors: Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath, Yingyong Qi
COMPLEXITY-ADAPTIVE 2D-TO-3D VIDEO SEQUENCE CONVERSION

Publication number: 20080150945

Abstract: Techniques for complexity-adaptive and automatic two-dimensional (2D) to three-dimensional (3D) image and video conversion which classifies a frame of a 2D input into one of a flat image class and a non-flat image class are described. The flat image class frame is directly converted into 3D stereo for display. The frame that is classified as a non-flat image class is further processed automatically and adaptively, based on complexity, to create a depth map estimate. Thereafter, the non-flat image class frame is converted into a 3D stereo image using the depth map estimate or an adjusted depth map. The adjusted depth map is processed based on the complexity.

Type: Application

Filed: December 22, 2006

Publication date: June 26, 2008

Inventors: Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath
Real-time capturing and generating stereo images and videos with a monoscopic low power mobile device

Publication number: 20080031327

Abstract: A monoscopic low-power mobile device is capable of creating real-time stereo images and videos from a single captured view. The device uses statistics from an autofocusing process to create a block depth map of a single capture view. Artifacts in the block depth map are reduced and an image depth map is created. Stereo three-dimensional (3D) left and right views are created from the image depth map using a Z-buffer based 3D surface recover process and a disparity map which is a function of the geometry of binocular vision.

Type: Application

Filed: August 1, 2006

Publication date: February 7, 2008

Inventors: Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath
Mobile device with dual digital camera sensors and methods of using the same

Publication number: 20080024614

Abstract: A mobile device comprising a first image sensor, a second image sensor configured to change position with respect to the first image sensor, a controller configured to control the position of the second image sensor, and an image processing module configured to process and combine images captured by the first and second image sensors.

Type: Application

Filed: July 25, 2006

Publication date: January 31, 2008

Inventors: Hsiang-Tsun Li, Behnam Katibian, Haohong Wang, Sharath Manjunath
Stereo image and video capturing device with dual digital sensors and methods of using the same

Publication number: 20080024596

Abstract: An apparatus comprising a first image sensor, a second image sensor spaced apart from the first image sensor, a diversity combine module to combine image data from the first and second image sensors, and an image processing module configured to process combined image data from the diversity combine module.

Type: Application

Filed: July 25, 2006

Publication date: January 31, 2008

Inventors: Hsiang-Tsun Li, Behnam Katibian, Haohong Wang, Sharath Manjunath
VIDEO CODING WITH FINE GRANULARITY SCALABILITY USING CYCLE-ALIGNED FRAGMENTS

Publication number: 20080013622

Abstract: The disclosure describes FGS video coding techniques that use cycle-aligned fragments (CAFs). The techniques may perform cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulate cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, cycles can be readily accessed via individual fragments. Some cycles may be controlled with a vector mode to scan to a predefined position within a block before moving to another block. In this manner, the number of cycles can be reduced, reducing the number of fragments and associated overhead. The CAFs may be entropy coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding may permit parallel decoding and simultaneous processing of fragments.

Type: Application

Filed: July 12, 2007

Publication date: January 17, 2008

Inventors: Yiliang Bao, Narendranath Malayath, Sharath Manjunath, Yan Ye
SELECTION OF ENCODING MODES AND/OR ENCODING RATES FOR SPEECH COMPRESSION WITH CLOSED LOOP RE-DECISION

Publication number: 20070244695

Abstract: In a device configurable to encode speech performing an closed loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. In a first closed loop stage, a first set of compressed components and a first set of uncompressed components for a current frame may be generated. A first set of features may be generated by comparing current and past frame amplitude and/or phase components. In a second closed loop stage, a second set of compressed components for the current frame may be generated by compressing the first set of compressed components and compressing the first set of uncompressed components. Generation of a second set of features may be based on the second set of compressed components from the current frame and a combination of amplitude and/or phase components from the past frame.

Type: Application

Filed: January 22, 2007

Publication date: October 18, 2007

Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhada, Eddie Choy
SELECTION OF ENCODING MODES AND/OR ENCODING RATES FOR SPEECH COMPRESSION WITH OPEN LOOP RE-DECISION

Publication number: 20070219787

Abstract: In a device configurable to encode speech performing an open loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. During the current frame, there may be an extraction of uncompressed amplitude components and uncompressed phase components. The amplitude components and the phase components from the past frame may then be retrieved. A set of features may be generated based on the uncompressed amplitude components from the current frame, the uncompressed phase components from the current frame, the amplitude components from the past frame, and the phase components from the past frame. The set of features may be checked as part of the open loop re-decision, and determining a final encoding decision based on the checking may be performed. The final encoding decision may be an encoding mode and/or encoding rate.

Type: Application

Filed: January 22, 2007

Publication date: September 20, 2007

Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhadai, Eddie Choy
SYSTEMS, METHODS, AND APPARATUS FOR FREQUENCY-DOMAIN WAVEFORM ALIGNMENT

Publication number: 20070185708

Abstract: Systems, methods, and apparatus described include waveform alignment operations in which a single set of evaluated cosines and sines is used to calculate cross-correlations of two periodic waveforms at two different phase shifts.

Type: Application

Filed: December 1, 2006

Publication date: August 9, 2007

Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhadai
VARIABLE RATE SPEECH CODING

Publication number: 20070179783

Abstract: A method and apparatus for the variable rate coding of a speech signal. An input speech signal is classified and an appropriate coding mode is selected based on this classification. For each classification, the coding mode that achieves the lowest bit rate with an acceptable quality of speech reproduction is selected. Low average bit rates are achieved by only employing high fidelity modes (i.e., high bit rate, broadly applicable to different types of speech) during portions of the speech where this fidelity is required for acceptable output. Lower bit rate modes are used during portions of speech where these modes produce acceptable output. Input speech signal is classified into active and inactive regions. Active regions are further classified into voiced, unvoiced, and transient regions. Various coding modes are applied to active speech, depending upon the required level of fidelity. Coding modes may be utilized according to the strengths and weaknesses of each particular mode.

Type: Application

Filed: November 13, 2006

Publication date: August 2, 2007

Inventors: Sharath Manjunath, William Gardner
ARBITRARY AVERAGE DATA RATES FOR VARIABLE RATE CODERS

Publication number: 20070171931

Abstract: Methods and apparatus are provided for achieving an arbitrary average data rate for a variable rate coder. One method includes selecting a set (e.g., a pair) of initial composite rates surrounding the arbitrary average data rate. A reallocation fraction is then calculated based on the initial composite rates. The reallocation fraction is used to reassign a number of frames from one component rate of an initial composite rate to another in order to achieve the arbitrary average data rate. Such a method may be configured such that selecting an initial composite rate on one side of (e.g., less than) the arbitrary average data rate implicitly selects the initial composite rate on the other side of the arbitrary average data rate.

Type: Application

Filed: January 22, 2007

Publication date: July 26, 2007

Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhadai

prev 1 2 3 4 5 next