Patents by Inventor Sapna George

Sapna George has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20110099021
    Abstract: A time-domain system and method of modifying the time scale of digital audio signals includes a pre-processor. The pre-processor forms a synthesized signal for processing with minimum computation and that has optional features to give preference to certain audio channels and/or frequency bands, a mechanism of adaptively characterizing the temporal features of the synthesized signal by its normalized power and zero-crossing count, and a mechanism of identifying a segment of the synthesized signal where the time scale can be modified without introducing artifacts or losing content.
    Type: Application
    Filed: October 4, 2010
    Publication date: April 28, 2011
    Applicant: STMICROELECTRONICS ASIA PACIFIC PTE LTD
    Inventors: Wenbo Zong, Yuan Wu, Sapna George
  • Patent number: 7873510
    Abstract: A system and method for adaptive rate control in audio processing is provided. The process could include receiving uncompressed audio data from an input and generating MDCT spectrum for each frame of the uncompressed audio data using a filterbank. The process could also include estimating masking thresholds for current frame to be encoded based on the MDCT spectrum. The masking thresholds reflect a bit budget for the current frame. The process could also include performing quantization of the current frame based on the masking thresholds. After the quantization of the current frame, the bit budget for next frame is updated for estimating the masking thresholds of the next frame. The process could also include encoding the quantized audio data.
    Type: Grant
    Filed: April 26, 2007
    Date of Patent: January 18, 2011
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Evelyn Kurniawati, Sapna George
  • Patent number: 7873515
    Abstract: A method includes receiving a sequence of frames containing audio information and determining that a frame is missing in the sequence of frames. The method also includes comparing the frame that precedes the missing frame to the received frames to identify a selected frame. The method further includes identifying a replacement frame comprising the frame that follows the selected frame. In addition, the method includes inserting the replacement frame into the sequence of frames in place of the missing frame.
    Type: Grant
    Filed: November 23, 2004
    Date of Patent: January 18, 2011
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Kabi P. Padhi, Sudhir K. Kumar, Sapna George
  • Publication number: 20100208899
    Abstract: The quality of music output from audio systems is improved by simulating the effect of low frequency signals in the human ear. This thus allows listeners to perceive the lower frequency signals, even though the speakers may be incapable of providing such low frequency outputs. A method is provided for processing enhancing bass effect in audio signals. The method also results in the bass enhancement being computationally less intensive. The bass effect enhancement techniques are based on the response of sine and cosine transfer functions and on the directional independence of low frequency components. The human ear is unable to resolve directions from low frequency components. The bass effect enhancement technique alternatively is based on the response of an exponential transfer function.
    Type: Application
    Filed: February 18, 2010
    Publication date: August 19, 2010
    Applicant: STMICROELECTRONICS ASIA PACIFIC PTE. LTD.
    Inventors: Sudhir K. Kasargod, Sapna George
  • Publication number: 20100169102
    Abstract: The invention provides for the encoding of surround sound produced by any coincident microphone techniques with coincident-to-virtual microphone signal matrixing. An encoding scheme provides significantly lower computational demand, by deriving the spatial parameters and output downmixes from the coincident microphone array signals and the coincident-to-surround channel-coefficients matrix, instead of the multi-channel signals.
    Type: Application
    Filed: March 16, 2009
    Publication date: July 1, 2010
    Applicant: STMicroelectronics Asia Pacific Pte.Ltd.
    Inventors: Samsudin ., Sapna George
  • Patent number: 7725323
    Abstract: An MPEG-1 layer 3 audio encoder, including a scalefactor generator for determining first scalefactors for encoding a block of audio data if a temporal masking transient is not detected in said block of audio data; and for selecting the maximum of said scalefactors for encoding said block of audio data if a temporal masking transient is detected in said block of audio data to enable greater compression of said audio data. Increases in quantization error, due to use of the maximum scalefactor are pre-masked or post-masked by the temporal masking transient. In cases where the last portion of a block includes a temporal masking transient that masks the preceding portions of the block, the maximum scalefactor is only used to encode the block if the resulting increase in quantization error is less than 30% of the quantization error for the block.
    Type: Grant
    Filed: September 14, 2004
    Date of Patent: May 25, 2010
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Kabi Prakash Padhi, Sudhir Kumar Kasargod, Sapna George
  • Patent number: 7705912
    Abstract: A method of decoding audio data, encoded in multiple DIF blocks in a Digital Video (DV) data stream, and outputting said audio data as a PCM frame, includes fetching a single Digital Interface Frame (DIF) block from the DV data stream. A first byte in the single DIF block is de-shuffled to determine its index (n) in the PCM frame. Each byte in the in the single DIF block is de-shuffled to determine its respective index (n) in the PCM frame. The de-shuffled data is written into the PCM frame for output if the present DIF block is the last in the present DV frame. Subsequent DIF blocks in the DV frame are processed in the manner described above.
    Type: Grant
    Filed: March 8, 2004
    Date of Patent: April 27, 2010
    Assignee: STMicroelectronics Asia Pacific Pte, Ltd.
    Inventors: Jianhua Sun, Sapna George
  • Patent number: 7680671
    Abstract: AC-3 is a high quality audio compression format widely used in feature films and, more recently, on Digital Versatile Disks (DVD). For consumer applications the algorithm is usually coded into the firmware of a DSP Processor, which due to cost considerations may be capable of only fixed point arithmetic. It is generally assumed that 16-bit processing is incapable of delivering the high fidelity audio, expected from the AC-3 technology. Double precision computation can be utilized on such processors to provide the high quality; but the computational burden of such implementation will be beyond the capacity of the processor to enable real-time operation. Through extensive simulation study of a high quality AC-3 encoder implementation, a multi-precision technique for each processing block is presented whereby the quality of the encoder on a 16-bit processor matches the single precision 24-bit implementation very closely without excessive additional computational complexity.
    Type: Grant
    Filed: September 8, 2006
    Date of Patent: March 16, 2010
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Mohammed Javed Absar, Sapna George, Antonio Mario Alvarez-Tinoco
  • Patent number: 7660718
    Abstract: Pitch detection of speech signals finds numerous applications in karaoke, voice recognition and scoring applications. While most of the existing techniques rely on time domain methods, the invention utilizes frequency domain methods. There is provided a method and system for determining the pitch of speech from a speech signal. The method includes the steps of: producing or obtaining the speech signal; distinguishing the speech signal into voiced, unvoiced or silence sections using speech signal energy levels; applying a Fourier Transform to the speech signal and obtaining speech signal parameters; determining peaks of the Fourier transformed speech signal; tracking the speech signal parameters of the determined peaks to select partials; and determining the pitch from the selected partials using a two-way mismatch error calculation.
    Type: Grant
    Filed: September 23, 2004
    Date of Patent: February 9, 2010
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Kabi Prakash Padhi, Sapna George
  • Patent number: 7653537
    Abstract: A system and method is provided for determining whether a data frame of a coded speech signal corresponds to voice or to noise. In one embodiment, a voice activity detector determines a cross-correlation of data. If the cross-correlation is lower than a predetermined cross-correlation value, then the data frame corresponds to noise. If not, then the voice activity detector determines a periodicity of the cross-correlation and a variance of the periodicity. If the variance is less than a predetermined variance value, then the data frame corresponds to voice. In another embodiment, a method determines energy of the data frame and an average energy of the coded speech signal. If the data frame is one of a predetermined number of initial data frames, then a comparison between the average energy to the energy of the data frame is used to determine whether the data frame is noise or voice.
    Type: Grant
    Filed: September 28, 2004
    Date of Patent: January 26, 2010
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Kabi Prakash Padhi, Sapna George
  • Patent number: 7626110
    Abstract: An energy-based pattern recognition algorithm receives the input frames of an audio signal and a test frame sequence and returns a best match in the audio signal to the given test frame sequence. The energy of each input frame is computed, and input frames for which the energy is within a predetermined degree of closeness to the local maximum energy within the test frame sequence are identified as probable matches. Probable matches are then eliminated if the respective probable match does not correspond to a local maximum within a respective neighborhood of adjacent frames. The difference between overall energy for frames neighboring the remaining probable matches and the test frame sequence is computed as a percentage, with the minimum percent deviation in energy from the test frame sequence being returned as the best pattern match. Local signal characteristic matching may be employed to refine matching.
    Type: Grant
    Filed: September 30, 2004
    Date of Patent: December 1, 2009
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Kabi Prakash Padhi, Sapna George
  • Patent number: 7563971
    Abstract: An energy-based pattern recognition algorithm receives the input frames of an audio signal and a test frame sequence and returns a best match in the audio signal to the given test frame sequence. The energy of each input frame is computed and input frames for which the energy is both within a predetermined degree of closeness to the local maximum energy within the test frame sequence and a local maximum within a respective neighborhood of adjacent frames are identified as probable matches. The difference between overall energy for frames neighboring the remaining probable matches and the test frame sequence is computed as a percentage. The best match is selected based on a weighted combination of difference between local maximum energies and minimum percent deviation in overall energy. Local signal characteristic matching may be employed, with weighting, to refine matching.
    Type: Grant
    Filed: September 30, 2004
    Date of Patent: July 21, 2009
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Kabi Prakash Padhi, Sapna George
  • Patent number: 7424502
    Abstract: A method of searching for a best-match decimation vector of decimation factors for non-uniform filter bank, the best match vector allowing perfect or near-perfect reconstruction of an input signal of the non-uniform filter bank, the method including the steps of: a) selecting a partial decimation vector having a number, l, of decimation factors, where l does not exceed a maximum number, K, of decimation factors of said best-match decimation vector; b) testing said l decimation factors to determine whether said partial decimation vector satisfies a feasibility criterion; c) testing a least common multiplier value of said l decimation factors to determine whether said least common multiplier value is greater than a predetermined value; d) testing a maximum decimation value, Dmax, of said partial decimation vector to determine whether Dmax is less than one; e) testing a minimum decimation value, Dmin, of said partial decimation vector to determine whether Dmin is greater than one; and f) if said feasibility crit
    Type: Grant
    Filed: September 28, 2001
    Date of Patent: September 9, 2008
    Assignee: STMicroelectronics Asia Pacific PTE Ltd.
    Inventors: Mohammed Javed Absar, Sapna George
  • Publication number: 20080199014
    Abstract: A method and audio device are presented that preserve mono energy during downmixing of a hybrid coding process of an audio signal. The method includes calculating a stereo scaling factor in a group level that is definable within a stereo band. The method may also include updating the stereo scaling factor using an update rate and synchronizing the update rate of a spatial parameter during a fast changing transient portion of the signal. A number of groups in a first stereo band may be greater than a number of groups in a second stereo band, and the first stereo band may be a lower frequency band than the second band or may be perceptually more important than the second band.
    Type: Application
    Filed: December 28, 2007
    Publication date: August 21, 2008
    Applicant: STMicroelectronics Asia Pacific PTE Ltd
    Inventors: Evelyn Kurniawati, Sapna George
  • Publication number: 20080189104
    Abstract: An apparatus for adaptively suppressing noise in an input signal frequency spectrum derived from overlapping input frames is provided. The system includes a psychoacoustic power computation module configured to compute a noisy signal power in psychoacoustic bands, a voice activity scoring module configured to compute a probabilistic score for a presence of a speech, and a noise estimation module configured to estimate a noise power in the psychoacoustic bands based on information of past frames, the probabilistic score, and the computed noisy signal power. The system also includes a gain computation module configured to compute a gain for each frequency, based on a probabilistic heuristic, the probabilistic score and the information on the past frames, and a gain post-processing module configured to perform a gain time smoothing, a gain frequency smoothing, and a gain regulation for the computed gain.
    Type: Application
    Filed: January 18, 2008
    Publication date: August 7, 2008
    Applicant: STMICROELECTRONICS ASIA PACIFIC PTE LTD
    Inventors: Wenbo Zong, Yuan Wu, Sapna George
  • Publication number: 20080137875
    Abstract: An device and method of generating environmental reverberation effects for digital audio signals is presented. The device includes a reverberation controller. The reverberation controller pre-processes one or more predetermined characteristics of a first audio signal to produce a pre-processed signal and generates a plurality of delayed outputs from the pre-processed signal, each output having a predetermined delay. The reverberation controller also produces a plurality of reflection outputs from the plurality of delayed outputs and combines the plurality of reflection outputs to produce a second audio signal having a desired reverberation response.
    Type: Application
    Filed: November 5, 2007
    Publication date: June 12, 2008
    Applicant: STMICROELECTRONICS ASIA PACIFIC PTE LTD
    Inventors: Wenbo Zong, Yuan Wu, Sapna George
  • Patent number: 7369989
    Abstract: A unified filter bank for use in encoding and decoding MPEG-1 audio data, wherein input audio data is encoded into coded audio data and the coded audio data is subsequently decoded into output audio data. The unified filter bank includes a plurality of filters, with each filter of the plurality of filters being a cosine modulation of a prototype filter. The unified filter bank is operational as an analysis filter bank during audio data encoding and as a synthesis filter bank during audio data decoding, wherein the unified filter bank is effective to substantially eliminate the effects of aliasing, phase distortion and amplitude distortion in the output audio data.
    Type: Grant
    Filed: June 8, 2001
    Date of Patent: May 6, 2008
    Assignee: STMicroelectronics Asia Pacific Pte, Ltd.
    Inventors: Mohammed Javed Absar, Sapna George
  • Patent number: 7363216
    Abstract: A method of parametrically encoding a transient audio signal, including the steps of: determining a set V of the N largest frequency components of the transient audio signal, where N is a predetermined number; determining an approximate envelope of the transient audio signal; and determining a predetermined number P of samples W of the approximate envelope for use in generating a spline approximation of the approximate envelope, whereby a parametric representation of the transient audio signal is given by parameters including V, N, P and W, such that a decoder receiving the parametric representation can reproduce a received approximation of the transient audio signal.
    Type: Grant
    Filed: July 23, 2003
    Date of Patent: April 22, 2008
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Mohammed Javed Absar, Sapna George
  • Patent number: 7359521
    Abstract: A method for effecting aliasing cancellation in an audio effects algorithm using a delay modulated signal, derived from interpolation of a delay modulator at an instantaneous sampling frequency, including: determining the instantaneous sampling frequency 1/Tisf and band limiting an input signal, to which the audio effects algorithm is to be applied to ½ Tisf prior to interpolation.
    Type: Grant
    Filed: November 24, 1999
    Date of Patent: April 15, 2008
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Mohammed Javed Absar, Sapna George, Antonio Mario Alvarez-Tinoco
  • Patent number: 7337025
    Abstract: A method and apparatus for assigning an exponent coding strategy in a digital audio transform coder. Different coding strategies having different differential coding limits may be assigned to different set of transform exponents according to the frequency domain characteristics of the audio signal. A neural network processing system is utilised to perform an efficient mapping of each exponent set to an appropriate coding strategy.
    Type: Grant
    Filed: February 12, 1998
    Date of Patent: February 26, 2008
    Assignee: STMicroelectronics Asia Pacific Pte. Ltd.
    Inventors: Mohammed Javed Absar, Sapna George, Antonio Mario Alvarez-Tinoco