Patents by Inventor Sharath Manjunath

Sharath Manjunath has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20090319261
    Abstract: Systems, methods, and apparatus for low-bit-rate coding of transitional speech frames are disclosed.
    Type: Application
    Filed: June 20, 2008
    Publication date: December 24, 2009
    Applicant: Qualcomm Incorporated
    Inventors: Alok Kumar Gupta, Sharath Manjunath, Ananthapadmanabhan A. Kandhadai
  • Patent number: 7496505
    Abstract: A method and apparatus for the variable rate coding of a speech signal. An input speech signal is classified and an appropriate coding mode is selected based on this classification. For each classification, the coding mode that achieves the lowest bit rate with an acceptable quality of speech reproduction is selected. Low average bit rates are achieved by only employing high fidelity modes (i.e., high bit rate, broadly applicable to different types of speech) during portions of the speech where this fidelity is required for acceptable output. Lower bit rate modes are used during portions of speech where these modes produce acceptable output. Input speech signal is classified into active and inactive regions. Active regions are further classified into voiced, unvoiced, and transient regions. Various coding modes are applied to active speech, depending upon the required level of fidelity. Coding modes may be utilized according to the strengths and weaknesses of each particular mode.
    Type: Grant
    Filed: November 13, 2006
    Date of Patent: February 24, 2009
    Assignee: QUALCOMM Incorporated
    Inventors: Sharath Manjunath, William Gardner
  • Publication number: 20080312917
    Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.
    Type: Application
    Filed: August 12, 2008
    Publication date: December 18, 2008
    Applicant: QUALCOMM Incorporated
    Inventors: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. Dejaco
  • Publication number: 20080288245
    Abstract: Techniques are presented herein to provide tandem-free operation between two wireless terminals through two otherwise incompatible wireless networks. Specifically, embodiments provide tandem-free operation between a wireless terminal communicating through a continuous transmission (CTX) wireless channel to a wireless terminal communicating through a discontinuous transmission (DTX) wireless channel. In a first aspect, inactive speech frames are translated between DTX and CTX formats. In a second aspect, each wireless terminal includes an active speech decoder that is compatible with the active speech encoder on the opposite end of the mobile-to-mobile connection.
    Type: Application
    Filed: July 29, 2008
    Publication date: November 20, 2008
    Applicant: QUALCOMM Incorporated
    Inventors: Khaled Helmi El-Maleh, Ananthapadmanabhan Arasanipalai Kandhadai, Sharath Manjunath
  • Publication number: 20080232478
    Abstract: Error concealment is used to hide the effects of errors detected within digital video information. A complex error concealment mode decision is disclosed to determine whether spatial error concealment (SEC) or temporal error concealment (TEC) should be used. The error concealment mode decision system uses different methods depending on whether the damaged frame is an intra-frame or an inter-frame. If the video frame is an intra-frame then a similarity metric is used to determine if the intra-frame represents a scene-change or not. If the video frame is an intra-frame, a complex multi-termed equation is used to determine whether SEC or TEC should be used. A novel spatial error concealment technique is disclosed for use when the error concealment mode decision determines that spatial error concealment should be used for reconstruction.
    Type: Application
    Filed: March 23, 2007
    Publication date: September 25, 2008
    Inventors: Chia-Yuan Teng, Sharath Manjunath
  • Patent number: 7426466
    Abstract: A method and apparatus for predictively quantizing voiced speech includes a parameter generator and a quantizer. The parameter generator is configured to extract parameters from frames of predictive speech such as voiced speech, and to transform the extracted information to a frequency-domain representation. The quantizer is configured to subtract a weighted sum of the parameters for previous frames from the parameter for the current frame. The quantizer is configured to quantize the difference value. A prototype extractor may be added to first extract a pitch period prototype to be processed by the parameter generator.
    Type: Grant
    Filed: July 22, 2004
    Date of Patent: September 16, 2008
    Assignee: QUALCOMM Incorporated
    Inventors: Arasanipalai K. Ananthapadmanabhan, Sharath Manjunath, Pengjun Huang, Eddie-Lun Tik Choy, Andrew P. DeJaco
  • Publication number: 20080198920
    Abstract: A stereo 3D video frame includes left and right components that are combined to produce a stereo image. For a given amount of distortion, the left and right components may have different impacts on perceptual visual quality of the stereo image due to asymmetry in the distortion response of the human eye. A 3D video encoder adjusts an allocation of coding bits between left and right components of the 3D video based on a frame-level bit budget and a weighting between the left and right components. The video encoder may generate the bit allocation in the rho (?) domain. The weighted bit allocation may be derived based on a quality metric that indicates overall quality produced by the left and right components. The weighted bit allocation compensates for the asymmetric distortion response to reduce overall perceptual distortion in the stereo image and thereby enhance or maintain visual quality.
    Type: Application
    Filed: February 21, 2007
    Publication date: August 21, 2008
    Inventors: Kai Chieh Yang, Haohong Wang, Khaled Helmi El-Maleh, Sharath Manjunath
  • Publication number: 20080192821
    Abstract: Techniques for estimating distortion due to quantization of data are described. A histogram with multiple bins may be obtained for a set of coefficients to be quantized. Distortion due to quantization of the set of coefficients may be estimated based on the histogram and average distortions for the histogram bins. The number of coefficients in each bin may be multiplied with an average distortion for the bin to obtain a per-bin distortion. The per-bin distortions for all of the bins may be accumulated and scaled with a correction factor to obtain the estimated distortion. The techniques may be used to estimate distortions for a set of coding elements. Distortion and rate may be estimated for each coding element for each of multiple quantization steps. A set of quantization steps may be selected for the set of coding elements based on the estimated distortions and the estimated rates for the set of coding elements for different quantization steps.
    Type: Application
    Filed: February 8, 2007
    Publication date: August 14, 2008
    Inventors: Narendranath Malayath, Sharath Manjunath
  • Patent number: 7406096
    Abstract: Techniques are presented herein to provide tandem-free operation between two wireless terminals through two otherwise incompatible wireless networks. Specifically, embodiments provide tandem-free operation between a wireless terminal communicating through a continuous transmission (CTX) wireless channel to a wireless terminal communicating through a discontinuous transmission (DTX) wireless channel. In a first aspect, inactive speech frames are translated between DTX and CTX formats. In a second aspect, each wireless terminal includes an active speech decoder that is compatible with the active speech encoder on the opposite end of the mobile-to-mobile connection.
    Type: Grant
    Filed: December 6, 2002
    Date of Patent: July 29, 2008
    Assignee: QUALCOMM Incorporated
    Inventors: Khaled Helmi El-Maleh, Ananthapadmanabhan Arasanipalai Kandhadai, Sharath Manjunath
  • Publication number: 20080165181
    Abstract: The rendering of 3D video images on a stereo-enabled display (e.g., stereoscopic or autostereoscopic display) is described. The process includes culling facets facing away from a viewer, defining foreground facets for Left and Right Views and common background facets, determining lighting for these facets, and performing screen mapping and scene rendering for one view (e.g., Right View) using computational results for facets of the other view (i.e., Left View). In one embodiment, visualization of images is provided on the stereo-enabled display of a low-power device, such as mobile phone, a computer, a video game platform, or a Personal Digital Assistant (PDA) device.
    Type: Application
    Filed: January 5, 2007
    Publication date: July 10, 2008
    Inventors: Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath, Yingyong Qi
  • Publication number: 20080150945
    Abstract: Techniques for complexity-adaptive and automatic two-dimensional (2D) to three-dimensional (3D) image and video conversion which classifies a frame of a 2D input into one of a flat image class and a non-flat image class are described. The flat image class frame is directly converted into 3D stereo for display. The frame that is classified as a non-flat image class is further processed automatically and adaptively, based on complexity, to create a depth map estimate. Thereafter, the non-flat image class frame is converted into a 3D stereo image using the depth map estimate or an adjusted depth map. The adjusted depth map is processed based on the complexity.
    Type: Application
    Filed: December 22, 2006
    Publication date: June 26, 2008
    Inventors: Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath
  • Publication number: 20080031327
    Abstract: A monoscopic low-power mobile device is capable of creating real-time stereo images and videos from a single captured view. The device uses statistics from an autofocusing process to create a block depth map of a single capture view. Artifacts in the block depth map are reduced and an image depth map is created. Stereo three-dimensional (3D) left and right views are created from the image depth map using a Z-buffer based 3D surface recover process and a disparity map which is a function of the geometry of binocular vision.
    Type: Application
    Filed: August 1, 2006
    Publication date: February 7, 2008
    Inventors: Haohong Wang, Hsiang-Tsun Li, Sharath Manjunath
  • Publication number: 20080024596
    Abstract: An apparatus comprising a first image sensor, a second image sensor spaced apart from the first image sensor, a diversity combine module to combine image data from the first and second image sensors, and an image processing module configured to process combined image data from the diversity combine module.
    Type: Application
    Filed: July 25, 2006
    Publication date: January 31, 2008
    Inventors: Hsiang-Tsun Li, Behnam Katibian, Haohong Wang, Sharath Manjunath
  • Publication number: 20080024614
    Abstract: A mobile device comprising a first image sensor, a second image sensor configured to change position with respect to the first image sensor, a controller configured to control the position of the second image sensor, and an image processing module configured to process and combine images captured by the first and second image sensors.
    Type: Application
    Filed: July 25, 2006
    Publication date: January 31, 2008
    Inventors: Hsiang-Tsun Li, Behnam Katibian, Haohong Wang, Sharath Manjunath
  • Publication number: 20080013622
    Abstract: The disclosure describes FGS video coding techniques that use cycle-aligned fragments (CAFs). The techniques may perform cycle-based coding of FGS video data block coefficients and syntax elements, and encapsulate cycles in fragments for transmission. The fragments may be cycle-aligned such that a start of a payload of each of the fragments substantially coincides with a start of one of the cycles. In this manner, cycles can be readily accessed via individual fragments. Some cycles may be controlled with a vector mode to scan to a predefined position within a block before moving to another block. In this manner, the number of cycles can be reduced, reducing the number of fragments and associated overhead. The CAFs may be entropy coded independently of one another so that each fragment may be readily accessed and decoded without waiting for decoding of other fragments. Independent entropy coding may permit parallel decoding and simultaneous processing of fragments.
    Type: Application
    Filed: July 12, 2007
    Publication date: January 17, 2008
    Inventors: Yiliang Bao, Narendranath Malayath, Sharath Manjunath, Yan Ye
  • Publication number: 20070244695
    Abstract: In a device configurable to encode speech performing an closed loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. In a first closed loop stage, a first set of compressed components and a first set of uncompressed components for a current frame may be generated. A first set of features may be generated by comparing current and past frame amplitude and/or phase components. In a second closed loop stage, a second set of compressed components for the current frame may be generated by compressing the first set of compressed components and compressing the first set of uncompressed components. Generation of a second set of features may be based on the second set of compressed components from the current frame and a combination of amplitude and/or phase components from the past frame.
    Type: Application
    Filed: January 22, 2007
    Publication date: October 18, 2007
    Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhada, Eddie Choy
  • Publication number: 20070219787
    Abstract: In a device configurable to encode speech performing an open loop re-decision may comprise representing a speech signal by amplitude components and phase components for a current frame and a past frame. During the current frame, there may be an extraction of uncompressed amplitude components and uncompressed phase components. The amplitude components and the phase components from the past frame may then be retrieved. A set of features may be generated based on the uncompressed amplitude components from the current frame, the uncompressed phase components from the current frame, the amplitude components from the past frame, and the phase components from the past frame. The set of features may be checked as part of the open loop re-decision, and determining a final encoding decision based on the checking may be performed. The final encoding decision may be an encoding mode and/or encoding rate.
    Type: Application
    Filed: January 22, 2007
    Publication date: September 20, 2007
    Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhadai, Eddie Choy
  • Publication number: 20070185708
    Abstract: Systems, methods, and apparatus described include waveform alignment operations in which a single set of evaluated cosines and sines is used to calculate cross-correlations of two periodic waveforms at two different phase shifts.
    Type: Application
    Filed: December 1, 2006
    Publication date: August 9, 2007
    Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhadai
  • Publication number: 20070179783
    Abstract: A method and apparatus for the variable rate coding of a speech signal. An input speech signal is classified and an appropriate coding mode is selected based on this classification. For each classification, the coding mode that achieves the lowest bit rate with an acceptable quality of speech reproduction is selected. Low average bit rates are achieved by only employing high fidelity modes (i.e., high bit rate, broadly applicable to different types of speech) during portions of the speech where this fidelity is required for acceptable output. Lower bit rate modes are used during portions of speech where these modes produce acceptable output. Input speech signal is classified into active and inactive regions. Active regions are further classified into voiced, unvoiced, and transient regions. Various coding modes are applied to active speech, depending upon the required level of fidelity. Coding modes may be utilized according to the strengths and weaknesses of each particular mode.
    Type: Application
    Filed: November 13, 2006
    Publication date: August 2, 2007
    Inventors: Sharath Manjunath, William Gardner
  • Publication number: 20070171931
    Abstract: Methods and apparatus are provided for achieving an arbitrary average data rate for a variable rate coder. One method includes selecting a set (e.g., a pair) of initial composite rates surrounding the arbitrary average data rate. A reallocation fraction is then calculated based on the initial composite rates. The reallocation fraction is used to reassign a number of frames from one component rate of an initial composite rate to another in order to achieve the arbitrary average data rate. Such a method may be configured such that selecting an initial composite rate on one side of (e.g., less than) the arbitrary average data rate implicitly selects the initial composite rate on the other side of the arbitrary average data rate.
    Type: Application
    Filed: January 22, 2007
    Publication date: July 26, 2007
    Inventors: Sharath Manjunath, Ananthapadmanabhan Kandhadai