Patents by Inventor Felix Weninger

Felix Weninger has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and Method for Dynamically Adjusting a Number of Emissions in Speech Processing Systems Operating with Large Stride Values

Publication number: 20250078817

Abstract: A method, computer program product, and computing system for dynamically adjusting the number of emitted tokens per frame in speech processing systems operating with large stride values.

Type: Application

Filed: September 6, 2023

Publication date: March 6, 2025

Inventors: Nicola Ferri, Felix Weninger, Puming Zhan
System and Method for Automatically Determining Stride Values in Online Streaming Speech Processing Systems

Publication number: 20240394589

Abstract: A method, computer program product, and computing system for determining a stride value for a first machine learning model. Transfer learning from the first machine learning model to a second machine learning model is performed, wherein the second machine learning model is an online streaming machine learning model. A spectral pooling layer is inserted into the second machine learning model using the stride value. The second machine learning model is trained with the spectral pooling layer.

Type: Application

Filed: May 25, 2023

Publication date: November 28, 2024

Inventors: Dario Albesano, Felix Weninger, Puming Zhan
SYSTEM AND METHOD FOR TRAINING SPEECH PROCESSING NEURAL NETWORKS FOR DYNAMIC LOADS

Publication number: 20240347042

Abstract: A method, computer program product, and computing system for dividing a speech signal into a plurality of chunks. A first context window is defined with a first period of past context for processing the plurality of chunks with a neural network of a speech processing system. The neural network is trained using the first context window. A second context window is defined with a first period of past context for processing the plurality of chunks with the neural network. The neural network is trained using the second context window.

Type: Application

Filed: April 11, 2023

Publication date: October 17, 2024

Inventors: Felix Weninger, Marco Gaudesi, Puming Zhan
SYSTEM AND METHOD FOR DYNAMICALLY ADJUSTING SPEECH PROCESSING NEURAL NETWORKS

Publication number: 20240347047

Abstract: A method, computer program product, and computing system for dividing a speech signal into a plurality of chunks. A context window is defined for processing a chunk of the plurality of chunks using a neural network of a speech processing system. A processing load associated with the speech processing system is determined. The context window is dynamically adjusted based upon, at least in part, the processing load associated with the speech processing system.

Type: Application

Filed: April 11, 2023

Publication date: October 17, 2024

Inventors: Felix Weninger, Marco Gaudesi, Puming Zhan
Multi-encoder end-to-end automatic speech recognition (ASR) for joint modeling of multiple input devices

Patent number: 11978433

Abstract: An end-to-end automatic speech recognition (ASR) system includes: a first encoder configured for close-talk input captured by a close-talk input mechanism; a second encoder configured for far-talk input captured by a far-talk input mechanism; and an encoder selection layer configured to select at least one of the first and second encoders for use in producing ASR output. The selection is made based on at least one of short-time Fourier transform (STFT), Mel-frequency Cepstral Coefficient (MFCC) and filter bank derived from at least one of the close-talk input and the far-talk input. If signals from both the close-talk input mechanism and the far-talk input mechanism are present for a speech segment, the encoder selection layer dynamically selects between the close-talk encoder and the far-talk encoder to select the encoder that better recognizes the speech segment. An encoder-decoder model is used to produce the ASR output.

Type: Grant

Filed: June 22, 2021

Date of Patent: May 7, 2024

Assignee: Microsoft Technology Licensing, LLC.

Inventors: Felix Weninger, Marco Gaudesi, Ralf Leibold, Puming Zhan
System and Method for Spectral Pooling in Streaming Speech Processing

Publication number: 20240127802

Abstract: A method, computer program product, and computing system for inserting a spectral pooling layer into a neural network of a speech processing system. An output of a hidden layer of the neural network is filtered using the spectral pooling layer with a non-integer stride. The filtered output is provided to a subsequent hidden layer of the neural network.

Type: Application

Filed: January 31, 2023

Publication date: April 18, 2024

Inventors: Felix Weninger, Dario Albesano, Puming Zhan
Neural networks for transforming signals

Patent number: 10592800

Abstract: A method for transforms input signals, by first defining a model for transforming the input signals, wherein the model is specified by constraints and a set of model parameters. An iterative inference procedure is derived from the model and the set of model parameters and unfolded into a set of layers, wherein there is one layer for each iteration of the procedure, and wherein a same set of network parameters is used by all layers. A neural network is formed by untying the set of network parameters such that there is one set of network parameters for each layer and each set of network parameters is separately maintainable and separately applicable to the corresponding layer. The neural network is trained to obtain a trained neural network, and then input signals are transformed using the trained neural network to obtain output signals.

Type: Grant

Filed: November 3, 2016

Date of Patent: March 17, 2020

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: John Hershey, Jonathan Le Roux, Felix Weninger
Source signal separation by discriminatively-trained non-negative matrix factorization

Patent number: 9679559

Abstract: A method estimates source signals from a mixture of source signals by first training an analysis model and a reconstruction model using training data. The analysis model is applied to the mixture of source signals to obtain an analysis representation of the mixture of source signals, and the reconstruction model is applied to the analysis representation to obtain an estimate of the source signals, wherein the analysis model utilizes an analysis linear basis representation, and the reconstruction model utilizes a reconstruction linear basis representation.

Type: Grant

Filed: May 29, 2014

Date of Patent: June 13, 2017

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Jonathan Le Roux, John R. Hershey, Felix Weninger, Shinji Watanabe
Neural networks for transforming signals

Patent number: 9582753

Abstract: A method for transforms input signals, by first defining a model for transforming the input signals, wherein the model is specified by constraints and a set of model parameters. An iterative inference procedure is derived from the model and the set of model parameters and unfolded into a set of layers, wherein there is one layer for each iteration of the procedure, and wherein a same set of network parameters is used by all layers. A neural network is formed by untying the set of network parameters such that there is one set of network parameters for each layer and each set of network parameters is separately maintainable and separately applicable to the corresponding layer. The neural network is trined to obtain a trained neural network, and then input signals are transformed using the trained neural network to obtain output signals.

Type: Grant

Filed: July 30, 2014

Date of Patent: February 28, 2017

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: John Hershey, Jonathan Le Roux, Felix Weninger
Neural Networks for Transforming Signals

Publication number: 20170053203

Abstract: A method for transforms input signals, by first defining a model for transforming the input signals, wherein the model is specified by constraints and a set of model parameters. An iterative inference procedure is derived from the model and the set of model parameters and unfolded into a set of layers, wherein there is one layer for each iteration of the procedure, and wherein a same set of network parameters is used by all layers. A neural network is formed by untying the set of network parameters such that there is one set of network parameters for each layer and each set of network parameters is separately maintainable and separately applicable to the corresponding layer. The neural network is trained to obtain a trained neural network, and then input signals are transformed using the trained neural network to obtain output signals.

Type: Application

Filed: November 3, 2016

Publication date: February 23, 2017

Applicant: Mitsubishi Electric Research Laboratories, Inc.

Inventors: John Hershey, Jonathan Le Roux, Felix Weninger
Method and device for reconstructing a target signal from a noisy input signal

Patent number: 9536538

Abstract: A method for reconstructing at least one target signal comprises determining a first set of feature vectors from the input signal, the first set of feature vectors forming a non-negative input matrix; determining a second set of feature vectors, the second set of feature vectors forming a non-negative noise matrix; decomposing the input matrix into a sum of a first matrix and a second matrix, the first matrix representing a product of a non-negative bases matrix and a non-negative weight matrix, and the second matrix representing a combination of the noise matrix and a noise weight vector; and reconstructing the at least one target signal based on the non-negative bases matrix and the non-negative weight matrix.

Type: Grant

Filed: May 19, 2015

Date of Patent: January 3, 2017

Assignee: Huawei Technologies Co., Ltd.

Inventors: Cyril Joder, Felix Weninger, Bjoern Schuller, David Virette
APPARATUS AND METHOD FOR IMPROVING A PERCEPTION OF A SOUND SIGNAL

Publication number: 20160247518

Abstract: The present invention relates to an apparatus for improving a perception of a sound signal, the apparatus comprising: a separation unit configured to separate the sound signal into at least one speech component and at least one noise component; and a spatial rendering unit configured to generate an auditory impression of the at least one speech component at a first virtual position with respect to a user, when output via a transducer unit, and of the at least one noise component at a second virtual position with respect to the user, when output via the transducer unit.

Type: Application

Filed: May 5, 2016

Publication date: August 25, 2016

Inventors: Bjoern Schuller, Felix Weninger, Christian Kirst, Peter Grosche
Neural Networks for Transforming Signals

Publication number: 20160034810

Abstract: A method for transforms input signals, by first defining a model for transforming the input signals, wherein the model is specified by constraints and a set of model parameters. An iterative inference procedure is derived from the model and the set of model parameters and unfolded into a set of layers, wherein there is one layer for each iteration of the procedure, and wherein a same set of network parameters is used by all layers. A neural network is formed by untying the set of network parameters such that there is one set of network parameters for each layer and each set of network parameters is separately maintainable and separately applicable to the corresponding layer. The neural network is trined to obtain a trained neural network, and then input signals are transformed using the trained neural network to obtain output signals.

Type: Application

Filed: July 30, 2014

Publication date: February 4, 2016

Inventors: John Hershey, Jonathan Le Roux, Felix Weninger
Source Signal Separation by Discriminatively-Trained Non-Negative Matrix Factorization

Publication number: 20150348537

Abstract: A method estimates source signals from a mixture of source signals by first training an analysis model and a reconstruction model using training data. The analysis model is applied to the mixture of source signals to obtain an analysis representation of the mixture of source signals, and the reconstruction model is applied to the analysis representation to obtain an estimate of the source signals, wherein the analysis model utilizes an analysis linear basis representation, and the reconstruction model utilizes a reconstruction linear basis representation.

Type: Application

Filed: May 29, 2014

Publication date: December 3, 2015

Applicant: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Jonathan Le Roux, John R. Hershey, Felix Weninger, Shinji Watanabe
Method and Device for Reconstructing a Target Signal from a Noisy Input Signal

Publication number: 20150262590

Abstract: A method for reconstructing at least one target signal comprises determining a first set of feature vectors from the input signal, the first set of feature vectors forming a non-negative input matrix; determining a second set of feature vectors, the second set of feature vectors forming a non-negative noise matrix; decomposing the input matrix into a sum of a first matrix and a second matrix, the first matrix representing a product of a non-negative bases matrix and a non-negative weight matrix, and the second matrix representing a combination of the noise matrix and a noise weight vector; and reconstructing the at least one target signal based on the non-negative bases matrix and the non-negative weight matrix.

Type: Application

Filed: May 19, 2015

Publication date: September 17, 2015

Inventors: Cyril Joder, Felix Weninger, Bjoern Schuller, David Virette