Patents by Inventor Tore Rudberg

Tore Rudberg has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AUTOMATIC GAIN CONTROL BASED ON MACHINE LEARNING LEVEL ESTIMATION OF THE DESIRED SIGNAL

Publication number: 20240420719

Abstract: A system includes a memory and a processing device communicably coupled to the memory. The processing device identifies audio data associated with a plurality of input device. The processing devices determines a speech energy level for each input device by providing the audio data as input to a trained model. For each input device, a statistical value associated with the speech energy level is determined. A strongest input device is identified based on the statistical value. In response to determining that the statistical value associated with the speech energy level of the strongest input device satisfies a threshold condition, the processing device updates the gain value of an input device to an estimated target gain value based on the statistical value of the speech energy level of the respective input device.

Type: Application

Filed: August 26, 2024

Publication date: December 19, 2024

Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
Automatic gain control based on machine learning level estimation of the desired signal

Patent number: 12073845

Abstract: Method includes receiving, at a server device, from a plurality of input devices, audio data. The audio data of each input device corresponds to a time-related portion of the audio data. The method determines a speech energy level for each input device by providing the time-related audio portion as input to a trained model. For each input device, a statistical value associated with the speech energy level is determined. A strongest input device is identified based on the statistical value. The statistical value associated with the speech energy level of each input device other than the strongest input device is compared to the statistical value of the strongest input device. Depending on the comparison, the method determines whether to update the gain value of an input device to an estimated target gain value based on the statistical value of the speech energy level of the respective input device.

Type: Grant

Filed: March 13, 2023

Date of Patent: August 27, 2024

Assignee: Google LLC

Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
Automatic Cloud Normalization of Audio Transmissions for Teleconferencing

Publication number: 20240031489

Abstract: Methods, systems, and apparatus for normalizing audio transmissions from multiple endpoints within a teleconference. A first audio transmission from a first participant of a teleconference can be received for presentation at the teleconference. The first audio transmission can be analyzed to classify one or more audio signatures of the first audio transmission as speech. A difference can be determined between the audio level of the one or more audio signatures and an audio level of second transmissions. Based on the difference, the first audio transmission can be normalized to adjust a gain of the first transmission. The transmission can be output to the teleconference.

Type: Application

Filed: July 22, 2022

Publication date: January 25, 2024

Inventors: Henrik Fahlberg Lundin, Alessio Bazzica, Esbjörn Dominique, Per Erik Daniel Johansson, Tomas Gunnarsson, Markus Lindroth, Karl Allan Tore Rudberg
Audio noise reduction

Patent number: 11848023

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for reducing audio noise are disclosed. In one aspect, a method includes the actions of receiving first audio data of a user utterance. The actions further include determining an energy level of second audio data being outputted by the loudspeaker. The actions further include selecting a model from among (i) a first model that is trained using first audio data samples that each encode speech from one speaker and (ii) a second model that is trained using second audio data samples that each encode speech from either one speaker or two speakers. The actions further include providing the first audio data as an input to the selected model. The actions further include receiving processed first audio data. The actions further include outputting the processed first audio data.

Type: Grant

Filed: June 9, 2020

Date of Patent: December 19, 2023

Assignee: Google LLC

Inventors: Tore Rudberg, Marcus Wirebrand, Samuel Sonning, Christian Schuldt
AUTOMATIC GAIN CONTROL BASED ON MACHINE LEARNING LEVEL ESTIMATION OF THE DESIRED SIGNAL

Publication number: 20230215451

Abstract: Method includes receiving, at a server device, from a plurality of input devices, audio data. The audio data of each input device corresponds to a time-related portion of the audio data. The method determines a speech energy level for each input device by providing the time-related audio portion as input to a trained model. For each input device, a statistical value associated with the speech energy level is determined. A strongest input device is identified based on the statistical value. The statistical value associated with the speech energy level of each input device other than the strongest input device is compared to the statistical value of the strongest input device. Depending on the comparison, the method determines whether to update the gain value of an input device to an estimated target gain value based on the statistical value of the speech energy level of the respective input device.

Type: Application

Filed: March 13, 2023

Publication date: July 6, 2023

Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
Automatic gain control based on machine learning level estimation of the desired signal

Patent number: 11605392

Abstract: Method includes receiving, through a plurality of channels, audio data corresponding to a plurality of frequency ranges; determining, for each channel's frequency ranges, speech audio and/or noise energy level using a model trained by machine learning; determining a speech signal with removed noise for each channel; determining one or more statistical values associated with an energy level of a channel's speech signal with the removed noise; determining a strongest channel that has highest statistical values associated with an energy level of a speech signal; determining that the one or more statistical values associated with the energy level of the strongest channel's speech signal satisfy a threshold condition; comparing statistical values associated with an energy level of a speech signal of each channel with those of the strongest channel; and determining whether to update a gain value for a channel based on the channel's statistical values associated with the energy level.

Type: Grant

Filed: March 16, 2020

Date of Patent: March 14, 2023

Assignee: Google LLC

Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
AUTOMATIC GAIN CONTROL BASED ON MACHINE LEARNING LEVEL ESTIMATION OF THE DESIRED SIGNAL

Publication number: 20210287691

Abstract: Method includes receiving, through a plurality of channels, audio data corresponding to a plurality of frequency ranges; determining, for each channel's frequency ranges, speech audio and/or noise energy level using a model trained by machine learning; determining a speech signal with removed noise for each channel; determining one or more statistical values associated with an energy level of a channel's speech signal with the removed noise; determining a strongest channel that has highest statistical values associated with an energy level of a speech signal; determining that the one or more statistical values associated with the energy level of the strongest channel's speech signal satisfy a threshold condition; comparing statistical values associated with an energy level of a speech signal of each channel with those of the strongest channel; and determining whether to update a gain value for a channel based on the channel's statistical values associated with the energy level.

Type: Application

Filed: March 16, 2020

Publication date: September 16, 2021

Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
AUDIO NOISE REDUCTION

Publication number: 20200388297

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for reducing audio noise are disclosed. In one aspect, a method includes the actions of receiving first audio data of a user utterance. The actions further include determining an energy level of second audio data being outputted by the loudspeaker. The actions further include selecting a model from among (i) a first model that is trained using first audio data samples that each encode speech from one speaker and (ii) a second model that is trained using second audio data samples that each encode speech from either one speaker or two speakers. The actions further include providing the first audio data as an input to the selected model. The actions further include receiving processed first audio data. The actions further include outputting the processed first audio data.

Type: Application

Filed: June 9, 2020

Publication date: December 10, 2020

Inventors: Tore Rudberg, Marcus Wirebrand, Samuel Sonning, Christian Schuldt
AUDIO CHANNEL MIXING

Publication number: 20200388292

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for audio channel mixing are disclosed. In one aspect, a method includes the actions of receiving first audio data for a first audio channel. The actions further include transmitting the first audio data. The actions further include, while receiving and transmitting the first audio data, receiving second audio data for a second audio channel; determining a first speech audio energy level of the first audio data and a first noise energy level of the first audio data; determining a second speech audio energy level of the second audio data and a second noise energy level of the second audio data; and determining whether to switch to transmitting the second audio data or continue transmitting the first audio data. The actions further include transmitting the first audio data or the second audio data.

Type: Application

Filed: June 9, 2020

Publication date: December 10, 2020

Inventors: Tore Rudberg, Christian Schuldt
Controlling focus of audio signals on speaker during videoconference

Patent number: 10805575

Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.

Type: Grant

Filed: June 4, 2019

Date of Patent: October 13, 2020

Assignee: Google LLC

Inventors: Tore Rudberg, Christian Schuldt
CONTROLLING FOCUS OF AUDIO SIGNALS ON SPEAKER DURING VIDEOCONFERENCE

Publication number: 20190289259

Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.

Type: Application

Filed: June 4, 2019

Publication date: September 19, 2019

Inventors: Tore Rudberg, Christian Schuldt
CONTROLLING FOCUS OF AUDIO SIGNALS ON SPEAKER DURING VIDEOCONFERENCE

Publication number: 20190222804

Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.

Type: Application

Filed: January 16, 2018

Publication date: July 18, 2019

Inventors: Tore Rudberg, Christian Schuldt
Controlling focus of audio signals on speaker during videoconference

Patent number: 10356362

Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.

Type: Grant

Filed: January 16, 2018

Date of Patent: July 16, 2019

Assignee: GOOGLE LLC

Inventors: Tore Rudberg, Christian Schuldt