Patents by Inventor Bengt J. Borgstrom

Bengt J. Borgstrom has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for improving model-based speech enhancement with neural networks

Patent number: 11227586

Abstract: Systems and methods improving the performance of statistical model-based single-channel speech enhancement systems using a deep neural network (DNN) are disclosed. Embodiments include a DNN-trained system to predict speech presence in the input signal, and this information can be used to create frameworks for tracking noise and conducting a priori signal to-noise ratio estimation. Example frameworks provide increased flexibility for various aspects of system design, such as gain estimation. Examples include training a DNN to detect speech in the presence of both noise and reverberation, enabling joint suppression of additive noise and reverberation. Example frameworks provide significant improvements in objective speech quality metrics relative to baseline systems.

Type: Grant

Filed: September 11, 2019

Date of Patent: January 18, 2022

Assignee: Massachusetts Institute of Technology

Inventors: Bengt J. Borgstrom, Michael S. Brandstein, Robert B. Dunn
SYSTEMS AND METHODS FOR IMPROVING MODEL-BASED SPEECH ENHANCEMENT WITH NEURAL NETWORKS

Publication number: 20210074282

Abstract: Systems and methods improving the performance of statistical model-based single-channel speech enhancement systems using a deep neural network (DNN) are disclosed. Embodiments include a DNN-trained system to predict speech presence in the input signal, and this information can be used to create frameworks for tracking noise and conducting a priori signal to-noise ratio estimation. Example frameworks provide increased flexibility for various aspects of system design, such as gain estimation. Examples include training a DNN to detect speech in the presence of both noise and reverberation, enabling joint suppression of additive noise and reverberation. Example frameworks provide significant improvements in objective speech quality metrics relative to baseline systems.

Type: Application

Filed: September 11, 2019

Publication date: March 11, 2021

Inventors: Bengt J. Borgstrom, Michael S. Brandstein, Robert B. Dunn
Isolated word training and detection using generated phoneme concatenation models of audio inputs

Patent number: 10719115

Abstract: Methods, systems, and apparatuses are described for isolated word training and detection. Isolated word training devices and systems are provided in which a user may provide a wake-up phrase from 1 to 3 times to train the device or system. A concatenated phoneme model of the user-provided wake-up phrase may be generated based on the provided wake-up phrase and a pre-trained phoneme model database. A word model of the wake-up phrase may be subsequently generated from the concatenated phoneme model and the provided wake-up phrase. Once trained, the user-provided wake-up phrase may be used to unlock the device or system and/or to wake up the device or system from a standby mode of operation. The word model of the user-provided wake-up phrase may be further adapted based on additional provisioning of the wake-up phrase.

Type: Grant

Filed: January 27, 2015

Date of Patent: July 21, 2020

Assignee: AVAGO TECHNOLOGIES INTERNATIONAL SALES PTE. LIMITED

Inventors: Robert W. Zopf, Bengt J. Borgstrom
Single channel suppression of interfering sources

Patent number: 9570087

Abstract: Techniques described herein are directed to performing back-end single-channel suppression of one or more types of interfering sources (e.g., additive noise) in an uplink path of a communication device. The back-end single-channel suppression techniques may suppress types(s) of additive noise using one or more suppression branches (e.g., a non-spatial (or stationary noise) branch, a spatial (or non-stationary noise) branch, a residual echo suppression branch, etc.). The non-spatial branch may be configured to suppress stationary noise from the single-channel audio signal, the spatial branch may be configured to suppress non-stationary noise from the single-channel audio signal and the residual echo suppression branch may be configured to suppress residual echo from the signal-channel audio signal. The spatial branch may be disabled based on an operational mode (e.g.

Type: Grant

Filed: November 13, 2014

Date of Patent: February 14, 2017

Assignee: Broadcom Corporation

Inventors: Jes Thyssen, Bengt J. Borgstrom
Adaptive modulation filtering for spectral feature enhancement

Patent number: 9520138

Abstract: Techniques described herein are directed to the enhancement of spectral features of an audio signal via adaptive modulation filtering. The adaptive modulation filtering process is based on observed modulation envelope autocorrelation coefficients obtained from the audio signal. The modulation envelope autocorrelation coefficients are used to determine parameters of an adaptive filter configured to filter the spectral features of the audio signal to provide filtered spectral features. The parameters are updated based on the observed modulation envelope autocorrelation coefficients to adapt to changing acoustic conditions, such as signal-to-noise ratio (SNR) or reverberation time. Accordingly, such acoustic conditions are not required to be estimated explicitly. Techniques described herein also allow for the estimation of useful side information, e.g.

Type: Grant

Filed: March 13, 2014

Date of Patent: December 13, 2016

Assignee: Broadcom Corporation

Inventor: Bengt J. Borgstrom
MULTI-MICROPHONE SOURCE TRACKING AND NOISE SUPPRESSION

Publication number: 20160241955

Abstract: Methods, systems, and apparatuses are described for improved multi-microphone source tracking and noise suppression. In multi-microphone devices and systems, frequency domain acoustic echo cancellation is performed on each microphone input, and microphone levels and sensitivity are normalized. Methods, systems, and apparatuses are also described for improved acoustic scene analysis and source tracking using steered null error transforms, on-line adaptive acoustic scene modeling, and speaker-dependent information. Switched super-directive beamforming reinforces desired audio sources and closed-form blocking matrices suppress desired audio sources based on spatial information derived from microphone pairings. Underlying statistics are tracked and used to updated filters and models. Automatic detection of single-user and multi-user scenarios, and single-channel suppression using spatial information, non-spatial information, and residual echo are also described.

Type: Application

Filed: April 22, 2016

Publication date: August 18, 2016

Inventors: Jes Thyssen, Ashutosh Pandey, Bengt J. Borgstrom, Daniele Giacobello, Juin-Hwey Chen
ISOLATED WORD TRAINING AND DETECTION

Publication number: 20160189706

Abstract: Methods, systems, and apparatuses are described for isolated word training and detection. Isolated word training devices and systems are provided in which a user may provide a wake-up phrase from 1 to 3 times to train the device or system. A concatenated phoneme model of the user-provided wake-up phrase may be generated based on the provided wake-up phrase and a pre-trained phoneme model database. A word model of the wake-up phrase may be subsequently generated from the concatenated phoneme model and the provided wake-up phrase. Once trained, the user-provided wake-up phrase may be used to unlock the device or system and/or to wake up the device or system from a standby mode of operation. The word model of the user-provided wake-up phrase may be further adapted based on additional provisioning of the wake-up phrase.

Type: Application

Filed: January 27, 2015

Publication date: June 30, 2016

Inventors: Robert W. Zopf, Bengt J. Borgstrom
Multi-microphone source tracking and noise suppression

Patent number: 9338551

Abstract: Methods, systems, and apparatuses are described for improved multi-microphone source tracking and noise suppression. In multi-microphone devices and systems, frequency domain acoustic echo cancellation is performed on each microphone input, and microphone levels and sensitivity are normalized. Methods, systems, and apparatuses are also described for improved acoustic scene analysis and source tracking using steered null error transforms, on-line adaptive acoustic scene modeling, and speaker-dependent information. Switched super-directive beamforming reinforces desired audio sources and closed-form blocking matrices suppress desired audio sources based on spatial information derived from microphone pairings. Underlying statistics are tracked and used to updated filters and models. Automatic detection of single-user and multi-user scenarios, and single-channel suppression using spatial information, non-spatial information, and residual echo are also described.

Type: Grant

Filed: March 17, 2014

Date of Patent: May 10, 2016

Assignee: Broadcom Corporation

Inventors: Jes Thyssen, Ashutosh Pandey, Bengt J. Borgstrom, Daniele Giacobello, Juin-Hwey Chen
Speaker-identification-assisted speech processing systems and methods

Patent number: 9293140

Abstract: Methods, systems, and apparatuses are described for performing speaker-identification-assisted speech processing. In accordance with certain embodiments, a communication device includes speaker identification (SID) logic that is configured to identify a user of the communication device and/or the identity of a far-end speaker participating in a voice call with a user of the communication device. Knowledge of the identity of the user and/or far-end speaker is then used to improve the performance of one or more speech processing algorithms implemented on the communication device.

Type: Grant

Filed: August 13, 2013

Date of Patent: March 22, 2016

Assignee: Broadcom Corporation

Inventors: Juin-Hwey Chen, Robert W. Zopf, Bengt J. Borgstrom, Elias Nemer, Ashutosh Pandey, Jes Thyssen
Speaker-identification-assisted uplink speech processing systems and methods

Patent number: 9269368

Abstract: Methods, systems, and apparatuses are described for performing speaker-identification-assisted speech processing in an uplink path of a communication device. In accordance with certain embodiments, a communication device includes speaker identification (SID) logic that is configured to identify the identity of a near-end speaker. Knowledge of the identity of the near-end speaker is then used to improve the performance of one or more uplink speech processing algorithms implemented on the communication device.

Type: Grant

Filed: October 31, 2013

Date of Patent: February 23, 2016

Assignee: Broadcom Corporation

Inventors: Juin-Hwey Chen, Jes Thyssen, Elias Nemer, Bengt J. Borgstrom, Ashutosh Pandey, Robert W. Zopf
SINGLE-CHANNEL SUPPRESSION OF INTEFERING SOURCES

Publication number: 20150071461

Abstract: Techniques described herein are directed to performing back-end single-channel suppression of one or more types of interfering sources (e.g., additive noise) in an uplink path of a communication device. The back-end single-channel suppression techniques may suppress types(s) of additive noise using one or more suppression branches (e.g., a non-spatial (or stationary noise) branch, a spatial (or non-stationary noise) branch, a residual echo suppression branch, etc.). The non-spatial branch may be configured to suppress stationary noise from the single-channel audio signal, the spatial branch may be configured to suppress non-stationary noise from the single-channel audio signal and the residual echo suppression branch may be configured to suppress residual echo from the signal-channel audio signal. The spatial branch may be disabled based on an operational mode (e.g.

Type: Application

Filed: November 13, 2014

Publication date: March 12, 2015

Inventors: Jes Thyssen, Bengt J. Borgstrom
MULTI-MICROPHONE SOURCE TRACKING AND NOISE SUPPRESSION

Publication number: 20140286497

Abstract: Methods, systems, and apparatuses are described for improved multi-microphone source tracking and noise suppression. In multi-microphone devices and systems, frequency domain acoustic echo cancellation is performed on each microphone input, and microphone levels and sensitivity are normalized. Methods, systems, and apparatuses are also described for improved acoustic scene analysis and source tracking using steered null error transforms, on-line adaptive acoustic scene modeling, and speaker-dependent information. Switched super-directive beamforming reinforces desired audio sources and closed-form blocking matrices suppress desired audio sources based on spatial information derived from microphone pairings. Underlying statistics are tracked and used to updated filters and models. Automatic detection of single-user and multi-user scenarios, and single-channel suppression using spatial information, non-spatial information, and residual echo are also described.

Type: Application

Filed: March 17, 2014

Publication date: September 25, 2014

Applicant: Broadcom Corporation

Inventors: Jes Thyssen, Ashutosh Pandey, Bengt J. Borgstrom, Daniele Giacobello, Juin-Hwey Chen
SPEAKER-IDENTIFICATION-ASSISTED SPEECH PROCESSING SYSTEMS AND METHODS

Publication number: 20140278417

Abstract: Methods, systems, and apparatuses are described for performing speaker-identification-assisted speech processing. In accordance with certain embodiments, a communication device includes speaker identification (SID) logic that is configured to identify a user of the communication device and/or the identity of a far-end speaker participating in a voice call with a user of the communication device. Knowledge of the identity of the user and/or far-end speaker is then used to improve the performance of one or more speech processing algorithms implemented on the communication device.

Type: Application

Filed: August 13, 2013

Publication date: September 18, 2014

Applicant: Broadcom Corporation

Inventors: Juin-Hwey Chen, Robert W. Zopf, Bengt J. Borgstrom, Elias Nemer, Ashutosh Pandey, Jes Thyssen
SPEAKER-IDENTIFICATION-ASSISTED DOWNLINK SPEECH PROCESSING SYSTEMS AND METHODS

Publication number: 20140278418

Abstract: Methods, systems, and apparatuses are described for performing speaker-identification-assisted speech processing in a downlink path of a communication device. In accordance with certain embodiments, a communication device includes speaker identification (SID) logic that is configured to identify the identity of a far-end speaker participating in a voice call with a user of the communication device. Knowledge of the identity of the far-end speaker is then used to improve the performance of one or more downlink speech processing algorithms implemented on the communication device.

Type: Application

Filed: September 30, 2013

Publication date: September 18, 2014

Inventors: Juin-Hwey Chen, Robert W. Zopf, Bengt J. Borgstrom, Elias Nemer, Ashutosh Pandey, Jes Thyssen
ADAPTIVE MODULATION FILTERING FOR SPECTRAL FEATURE ENHANCEMENT

Publication number: 20140270226

Abstract: Techniques described herein are directed to the enhancement of spectral features of an audio signal via adaptive modulation filtering. The adaptive modulation filtering process is based on observed modulation envelope autocorrelation coefficients obtained from the audio signal. The modulation envelope autocorrelation coefficients are used to determine parameters of an adaptive filter configured to filter the spectral features of the audio signal to provide filtered spectral features. The parameters are updated based on the observed modulation envelope autocorrelation coefficients to adapt to changing acoustic conditions, such as signal-to-noise ratio (SNR) or reverberation time. Accordingly, such acoustic conditions are not required to be estimated explicitly. Techniques described herein also allow for the estimation of useful side information, e.g.

Type: Application

Filed: March 13, 2014

Publication date: September 18, 2014

Applicant: Broadcom Corporation

Inventor: Bengt J. Borgstrom
SPEAKER-IDENTIFICATION-ASSISTED UPLINK SPEECH PROCESSING SYSTEMS AND METHODS

Publication number: 20140278397

Abstract: Methods, systems, and apparatuses are described for performing speaker-identification-assisted speech processing in an uplink path of a communication device. In accordance with certain embodiments, a communication device includes speaker identification (SID) logic that is configured to identify the identity of a near-end speaker. Knowledge of the identity of the near-end speaker is then used to improve the performance of one or more uplink speech processing algorithms implemented on the communication device.

Type: Application

Filed: October 31, 2013

Publication date: September 18, 2014

Applicant: Broadcom Corporation

Inventors: Juin-Hwey Chen, Jes Thyssen, Elias Nemer, Bengt J. Borgstrom, Ashutosh Pandey, Robert W. Zopf