Patents by Inventor Evelyn Kurniawati

Evelyn Kurniawati has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Computer-implement voice command authentication method and electronic device

Patent number: 11227601

Abstract: A computer-implement voice command authentication method is provided. The method includes obtaining a sound signal stream; calculating a Signal-to-Noise Ratio (SNR) value of the sound signal stream; converting the sound signal stream into a Mel-Frequency Cepstral Coefficients (MFCC) stream; calculating a Dynamic Time Warping (DTW) distance corresponding to the MFCC stream according to the MFCC stream and one of a plurality of sample streams generated by the Gaussian Mixture Model with Universal Background Model (GMM-UBM); calculating, according to the MFCC stream and the sample streams, a Log-likelihood ratio value corresponding to the MFCC stream as a GMM-UBM score; determining whether the sound signal stream passes a voice command authentication according to the GMM-UBM score, the DTW distance and the SNR value; in response to determining that the sound signal stream passes the voice command authentication, determining that the sound signal stream is a voice stream spoken from a legal user.

Type: Grant

Filed: September 21, 2019

Date of Patent: January 18, 2022

Assignee: Merry Electronics(Shenzhen) Co., Ltd.

Inventors: Evelyn Kurniawati, Sasiraj Somarajan
COMPUTER-IMPLEMENT VOICE COMMAND AUTHENTICATION METHOD AND ELECTRONIC DEVICE

Publication number: 20210090577

Abstract: A computer-implement voice command authentication method is provided. The method includes obtaining a sound signal stream; calculating a Signal-to-Noise Ratio (SNR) value of the sound signal stream; converting the sound signal stream into a Mel-Frequency Cepstral Coefficients (MFCC) stream; calculating a Dynamic Time Warping (DTW) distance corresponding to the MFCC stream according to the MFCC stream and one of a plurality of sample streams generated by the Gaussian Mixture Model with Universal Background Model (GMM-UBM); calculating, according to the MFCC stream and the sample streams, a Log-likelihood ratio value corresponding to the MFCC stream as a GMM-UBM score; determining whether the sound signal stream passes a voice command authentication according to the GMM-UBM score, the DTW distance and the SNR value; in response to determining that the sound signal stream passes the voice command authentication, determining that the sound signal stream is a voice stream spoken from a legal user.

Type: Application

Filed: September 21, 2019

Publication date: March 25, 2021

Applicant: Merry Electronics(Shenzhen) Co., Ltd.

Inventors: Evelyn Kurniawati, Sasiraj Somarajan
Methods, systems, and circuits for speaker dependent voice recognition with a single lexicon

Patent number: 9633652

Abstract: Embodiments reduce the complexity of speaker dependent speech recognition systems and methods by representing the code phrase (i.e., the word or words to be recognized) using a single Gaussian Mixture Model (GMM) which is adapted from a Universal Background Model (UBM). Only the parameters of the GMM need to be stored. Further reduction in computation is achieved by only checking the GMM component that is relevant to the keyword template. In this scheme, keyword template is represented by a sequence of the index of best performing component of the GMM of the keyword model. Only one template is saved by combining the registration template using Longest Common Sequence algorithm. The quality of the word model is continuously updated by performing expectation maximization iteration using the test word which is accepted as keyword model.

Type: Grant

Filed: March 31, 2013

Date of Patent: April 25, 2017

Assignee: STMicroelectronics Asia Pacific Pte Ltd.

Inventors: Evelyn Kurniawati, Sapna George
Methods, systems, and circuits for text independent speaker recognition with automatic learning features

Patent number: 9530417

Abstract: Methods and systems of text independent speaker recognition provide a complexity comparable to text dependent speaker recognition system. These methods and systems exploit the fact that speech is a quasi-stationary signal and simplify the recognition process based on this theory. The speaker modeling allows a speaker profile to be updated progressively with new speech samples that are acquired during usage over time by the speaker.

Type: Grant

Filed: April 1, 2013

Date of Patent: December 27, 2016

Assignee: STMicroelectronics Asia Pacific Pte Ltd.

Inventors: Evelyn Kurniawati, Sapna George
METHODS, SYSTEMS, AND CIRCUITS FOR SPEAKER DEPENDENT VOICE RECOGNITION WITH A SINGLE LEXICON

Publication number: 20140200890

Abstract: Embodiments reduce the complexity of speaker dependent speech recognition systems and methods by representing the code word (i.e., the word to be recognized) using a single Gaussian Mixture Model (GMM) which is adapted from a Universal Background Model (UBM). Only the parameters of the GMM need to be stored. Further reduction in computation is achieved by only checking the GMM component that is relevant to the keyword template. In this scheme, keyword template is represented by a sequence of the index of best performing component of the GMM of the keyword model. Only one template is saved by combining the registration template using Longest Common Sequence algorithm. The quality of the word model is continuously updated by performing expectation maximization iteration using the test word which is accepted as keyword model.

Type: Application

Filed: March 31, 2013

Publication date: July 17, 2014

Applicant: STMicroelectronics Asia Pacific Pte Ltd.

Inventors: Evelyn Kurniawati, Sapna George
Scalable hybrid auto coder for transient detection in advanced audio coding with spectral band replication

Patent number: 8489391

Abstract: A system method of reusing information in a low power scalable hybrid audio encoder are disclosed. The includes determining a state of an advanced audio coding (AAC) transient flag, performing spectral band replication (SBR) transient detection on at least two possible locations upon a determination that the AAC transient flag is equal to a first value, performing SBR transient detection on a high frequency upon a determination that the AAC transient flag is equal to a second value, and determining whether a transient exists. The system includes a spectral band replication (SBR) coding module configured to determine a state of an advanced audio coding (AAC) transient flag and perform SBR transient detection on at least one location based upon an energy in a signal upon a determination that the AAC transient flag is equal to a first value.

Type: Grant

Filed: August 5, 2010

Date of Patent: July 16, 2013

Assignee: STMicroelectronics Asia Pacific Pte., Ltd.

Inventors: Evelyn Kurniawati, Sapna George
Estimating rate controlling parameters in perceptual audio encoders

Patent number: 8374857

Abstract: Perceptual audio coder refers to audio compression schemes that exploit the properties of human auditory perception. The coder allocates the quantization noise below the masking threshold such that even with the bit rate limitation, the noise is imperceptible to the ear. These distortion and bit rate requirement makes the bit allocation-quantization process a considerable computational effort. One method includes incrementally adjusting a global gain according to a gradient. The gradient could be adjusted each time the number of bits used to represent a quantized value is counted. Another method includes limiting a rate controlling parameter to a predetermined number of loops. The method could also include deriving a global gain to ensure exit from the loop. Accordingly, embodiments of the present disclosure provide a fast and efficient method to derive the rate controlling parameter and can be applied to generic perceptual audio encoders where low computational complexity is required.

Type: Grant

Filed: August 3, 2007

Date of Patent: February 12, 2013

Assignee: STMicroelectronics Asia Pacific Pte, Ltd.

Inventors: Evelyn Kurniawati, Kim Hann Kuah, Sapna George
System and method for low power stereo perceptual audio coding using adaptive masking threshold

Patent number: 8332216

Abstract: A method for stereo audio perceptual encoding of an input signal includes masking threshold estimation and bit allocation. The masking threshold estimation and bit allocation are performed once every two encoding processes. Another method for stereo audio perceptual encoding of an input signal includes performing a time-to-frequency transformation, performing a quantization, performing a bitstream formatting to produce an output stream, and performing a psychoacoustics analysis. The psychoacoustics analysis includes masking threshold estimation on a first of every two successive frames of the input signal.

Type: Grant

Filed: August 22, 2006

Date of Patent: December 11, 2012

Assignee: STMicroelectronics Asia Pacific PTE., Ltd.

Inventors: Evelyn Kurniawati, Sapna George
NOISE DETECTION AND REDUCTION IN AUDIO DEVICES

Publication number: 20120163622

Abstract: Methods and apparatuses for detection and reduction of wind noise in audio devices are disclosed. In an embodiment, a method includes acquiring and transforming the audio signals. Correlations from the transformed audio signals are computed. A cross correlation index is compared to a predetermined value to determine if a wind noise spectral content is present. In another embodiment, an apparatus includes an audio processing unit to receive non-decomposed audio signals, and an audio decomposition unit to receive the non-decomposed audio signals and to generate decomposed audio signals. A wind noise spectrum estimation unit receives non-decomposed audio signals and decomposed audio signals and identifies wind noise spectral components in at least one of the non-decomposed and decomposed audio signals. A wind noise spectrum reduction unit receives the wind noise spectral components and removes the wind noise spectral components from at least one of the non-decomposed and the decomposed audio signals.

Type: Application

Filed: December 28, 2010

Publication date: June 28, 2012

Applicant: STMICROELECTRONICS ASIA PACIFIC PTE LTD

Inventors: Muralidhar KARTHIK, Samsudin, Evelyn KURNIAWATI, Sapna GEORGE
Low power downmix energy equalization in parametric stereo encoders

Patent number: 8200351

Abstract: A method and audio device are presented that preserve mono energy during downmixing of a hybrid coding process of an audio signal. The method includes calculating a stereo scaling factor in a group level that is definable within a stereo band. The method may also include updating the stereo scaling factor using an update rate and synchronizing the update rate of a spatial parameter during a fast changing transient portion of the signal. A number of groups in a first stereo band may be greater than a number of groups in a second stereo band, and the first stereo band may be a lower frequency band than the second band or may be perceptually more important than the second band.

Type: Grant

Filed: December 28, 2007

Date of Patent: June 12, 2012

Assignee: STMicroelectronics Asia PTE., Ltd.

Inventors: Evelyn Kurniawati, Sapna George
INFORMATION REUSE IN LOW POWER SCALABLE HYBRID AUDIO ENCODERS

Publication number: 20120035936

Abstract: A system and method of reusing information in low power scalable hybrid audio encoders. The system and method provides a transform coder and parameterization of high frequency spectrum (SBR).

Type: Application

Filed: August 5, 2010

Publication date: February 9, 2012

Applicant: STMICROELECTRONICS ASIA PACIFIC PTE LTD

Inventors: Evelyn Kurniawati, Sapna George
Adaptive rate control algorithm for low complexity AAC encoding

Patent number: 7873510

Abstract: A system and method for adaptive rate control in audio processing is provided. The process could include receiving uncompressed audio data from an input and generating MDCT spectrum for each frame of the uncompressed audio data using a filterbank. The process could also include estimating masking thresholds for current frame to be encoded based on the MDCT spectrum. The masking thresholds reflect a bit budget for the current frame. The process could also include performing quantization of the current frame based on the masking thresholds. After the quantization of the current frame, the bit budget for next frame is updated for estimating the masking thresholds of the next frame. The process could also include encoding the quantized audio data.

Type: Grant

Filed: April 26, 2007

Date of Patent: January 18, 2011

Assignee: STMicroelectronics Asia Pacific Pte. Ltd.

Inventors: Evelyn Kurniawati, Sapna George
Low power downmix energy equalization in parametric stereo encoders

Publication number: 20080199014

Abstract: A method and audio device are presented that preserve mono energy during downmixing of a hybrid coding process of an audio signal. The method includes calculating a stereo scaling factor in a group level that is definable within a stereo band. The method may also include updating the stereo scaling factor using an update rate and synchronizing the update rate of a spatial parameter during a fast changing transient portion of the signal. A number of groups in a first stereo band may be greater than a number of groups in a second stereo band, and the first stereo band may be a lower frequency band than the second band or may be perceptually more important than the second band.

Type: Application

Filed: December 28, 2007

Publication date: August 21, 2008

Applicant: STMicroelectronics Asia Pacific PTE Ltd

Inventors: Evelyn Kurniawati, Sapna George
Estimating rate controlling parameters in perceptual audio encoders

Publication number: 20080040120

Abstract: Perceptual audio coder refers to audio compression schemes that exploit the properties of human auditory perception. The coder allocates the quantization noise below the masking threshold such that even with the bit rate limitation, the noise is imperceptible to the ear. These distortion and bit rate requirement makes the bit allocation-quantization process a considerable computational effort. One method includes incrementally adjusting a global gain according to a gradient. The gradient could be adjusted each time the number of bits used to represent a quantized value is counted. Another method includes limiting a rate controlling parameter to a predetermined number of loops. The method could also include deriving a global gain to ensure exit from the loop. Accordingly, embodiments of the present disclosure provide a fast and efficient method to derive the rate controlling parameter and can be applied to generic perceptual audio encoders where low computational complexity is required.

Type: Application

Filed: August 3, 2007

Publication date: February 14, 2008

Applicant: STMicroelectronics Asia Pacific Pte., Ltd.

Inventors: Evelyn Kurniawati, Kim Kuah, Sapna George
Adaptive rate control algorithm for low complexity AAC encoding

Publication number: 20070255562

Abstract: A system and method for adaptive rate control in audio processing is provided. The process could include receiving uncompressed audio data from an input and generating MDCT spectrum for each frame of the uncompressed audio data using a filterbank. The process could also include estimating masking thresholds for current frame to be encoded based on the MDCT spectrum. The masking thresholds reflect a bit budget for the current frame. The process could also include performing quantization of the current frame based on the masking thresholds. After the quantization of the current frame, the bit budget for next frame is updated for estimating the masking thresholds of the next frame. The process could also include encoding the quantized audio data.

Type: Application

Filed: April 26, 2007

Publication date: November 1, 2007

Applicant: STMicroelectronics Asia Pacific Pte., Ltd.

Inventors: Evelyn Kurniawati, Sapna George
System and method for low power stereo perceptual audio coding using adaptive masking threshold

Publication number: 20070162277

Abstract: A method for stereo audio perceptual encoding of an input signal includes masking threshold estimation and bit allocation. The masking threshold estimation and bit allocation are performed once every two encoding processes. Another method for stereo audio perceptual encoding of an input signal includes performing a time-to-frequency transformation, performing a quantization, performing a bitstream formatting to produce an output stream, and performing a psychoacoustics analysis. The psychoacoustics analysis includes masking threshold estimation on a first of every two successive frames of the input signal.

Type: Application

Filed: August 22, 2006

Publication date: July 12, 2007

Applicant: STMicroelectronics Asia Pacific Pte., Ltd.

Inventors: Evelyn Kurniawati, Sapna George