Patents by Inventor Tore Rudberg

Tore Rudberg has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240031489
    Abstract: Methods, systems, and apparatus for normalizing audio transmissions from multiple endpoints within a teleconference. A first audio transmission from a first participant of a teleconference can be received for presentation at the teleconference. The first audio transmission can be analyzed to classify one or more audio signatures of the first audio transmission as speech. A difference can be determined between the audio level of the one or more audio signatures and an audio level of second transmissions. Based on the difference, the first audio transmission can be normalized to adjust a gain of the first transmission. The transmission can be output to the teleconference.
    Type: Application
    Filed: July 22, 2022
    Publication date: January 25, 2024
    Inventors: Henrik Fahlberg Lundin, Alessio Bazzica, Esbjörn Dominique, Per Erik Daniel Johansson, Tomas Gunnarsson, Markus Lindroth, Karl Allan Tore Rudberg
  • Patent number: 11848023
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for reducing audio noise are disclosed. In one aspect, a method includes the actions of receiving first audio data of a user utterance. The actions further include determining an energy level of second audio data being outputted by the loudspeaker. The actions further include selecting a model from among (i) a first model that is trained using first audio data samples that each encode speech from one speaker and (ii) a second model that is trained using second audio data samples that each encode speech from either one speaker or two speakers. The actions further include providing the first audio data as an input to the selected model. The actions further include receiving processed first audio data. The actions further include outputting the processed first audio data.
    Type: Grant
    Filed: June 9, 2020
    Date of Patent: December 19, 2023
    Assignee: Google LLC
    Inventors: Tore Rudberg, Marcus Wirebrand, Samuel Sonning, Christian Schuldt
  • Publication number: 20230215451
    Abstract: Method includes receiving, at a server device, from a plurality of input devices, audio data. The audio data of each input device corresponds to a time-related portion of the audio data. The method determines a speech energy level for each input device by providing the time-related audio portion as input to a trained model. For each input device, a statistical value associated with the speech energy level is determined. A strongest input device is identified based on the statistical value. The statistical value associated with the speech energy level of each input device other than the strongest input device is compared to the statistical value of the strongest input device. Depending on the comparison, the method determines whether to update the gain value of an input device to an estimated target gain value based on the statistical value of the speech energy level of the respective input device.
    Type: Application
    Filed: March 13, 2023
    Publication date: July 6, 2023
    Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
  • Patent number: 11605392
    Abstract: Method includes receiving, through a plurality of channels, audio data corresponding to a plurality of frequency ranges; determining, for each channel's frequency ranges, speech audio and/or noise energy level using a model trained by machine learning; determining a speech signal with removed noise for each channel; determining one or more statistical values associated with an energy level of a channel's speech signal with the removed noise; determining a strongest channel that has highest statistical values associated with an energy level of a speech signal; determining that the one or more statistical values associated with the energy level of the strongest channel's speech signal satisfy a threshold condition; comparing statistical values associated with an energy level of a speech signal of each channel with those of the strongest channel; and determining whether to update a gain value for a channel based on the channel's statistical values associated with the energy level.
    Type: Grant
    Filed: March 16, 2020
    Date of Patent: March 14, 2023
    Assignee: Google LLC
    Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
  • Publication number: 20210287691
    Abstract: Method includes receiving, through a plurality of channels, audio data corresponding to a plurality of frequency ranges; determining, for each channel's frequency ranges, speech audio and/or noise energy level using a model trained by machine learning; determining a speech signal with removed noise for each channel; determining one or more statistical values associated with an energy level of a channel's speech signal with the removed noise; determining a strongest channel that has highest statistical values associated with an energy level of a speech signal; determining that the one or more statistical values associated with the energy level of the strongest channel's speech signal satisfy a threshold condition; comparing statistical values associated with an energy level of a speech signal of each channel with those of the strongest channel; and determining whether to update a gain value for a channel based on the channel's statistical values associated with the energy level.
    Type: Application
    Filed: March 16, 2020
    Publication date: September 16, 2021
    Inventors: Karl Allan Tore Rudberg, Alessio Bazzica
  • Publication number: 20200388292
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for audio channel mixing are disclosed. In one aspect, a method includes the actions of receiving first audio data for a first audio channel. The actions further include transmitting the first audio data. The actions further include, while receiving and transmitting the first audio data, receiving second audio data for a second audio channel; determining a first speech audio energy level of the first audio data and a first noise energy level of the first audio data; determining a second speech audio energy level of the second audio data and a second noise energy level of the second audio data; and determining whether to switch to transmitting the second audio data or continue transmitting the first audio data. The actions further include transmitting the first audio data or the second audio data.
    Type: Application
    Filed: June 9, 2020
    Publication date: December 10, 2020
    Inventors: Tore Rudberg, Christian Schuldt
  • Publication number: 20200388297
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for reducing audio noise are disclosed. In one aspect, a method includes the actions of receiving first audio data of a user utterance. The actions further include determining an energy level of second audio data being outputted by the loudspeaker. The actions further include selecting a model from among (i) a first model that is trained using first audio data samples that each encode speech from one speaker and (ii) a second model that is trained using second audio data samples that each encode speech from either one speaker or two speakers. The actions further include providing the first audio data as an input to the selected model. The actions further include receiving processed first audio data. The actions further include outputting the processed first audio data.
    Type: Application
    Filed: June 9, 2020
    Publication date: December 10, 2020
    Inventors: Tore Rudberg, Marcus Wirebrand, Samuel Sonning, Christian Schuldt
  • Patent number: 10805575
    Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.
    Type: Grant
    Filed: June 4, 2019
    Date of Patent: October 13, 2020
    Assignee: Google LLC
    Inventors: Tore Rudberg, Christian Schuldt
  • Publication number: 20190289259
    Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.
    Type: Application
    Filed: June 4, 2019
    Publication date: September 19, 2019
    Inventors: Tore Rudberg, Christian Schuldt
  • Publication number: 20190222804
    Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.
    Type: Application
    Filed: January 16, 2018
    Publication date: July 18, 2019
    Inventors: Tore Rudberg, Christian Schuldt
  • Patent number: 10356362
    Abstract: A non-transitory computer-readable storage medium may include instructions stored thereon. When executed by at least one processor, the instructions may be configured to cause a computing system to determine that a video system is aiming at a single speaker of a plurality of people, receive audio signals from a plurality of microphones, the received audio signals including audio signals generated by the single speaker, based on determining that the video system is aiming at the single speaker, transmit a monophonic signal, the monophonic signal being based on the received audio signals, determine that the video system is not aiming at the single speaker, and based on the determining that the video system is not aiming at the single speaker, transmit a stereophonic signal, the stereophonic signal being based on the received audio signals.
    Type: Grant
    Filed: January 16, 2018
    Date of Patent: July 16, 2019
    Assignee: Google LLC
    Inventors: Tore Rudberg, Christian Schuldt
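
As an illustration of the multi-channel gain logic summarized in the abstracts for patent 11605392 and publication 20230215451, the following is a minimal sketch and not the patented implementation. It assumes that the "statistical value" is the mean speech energy in dB, that a channel qualifies for a gain update when it lies within a fixed gap of the strongest channel, and that the target gain is a fixed target level minus the channel's mean. The function name `decide_gain_updates` and the parameters `min_strongest_db`, `max_gap_db`, and `target_db` are hypothetical; none of them appear in the abstracts.

```python
from statistics import mean


def decide_gain_updates(speech_levels_db, current_gains_db,
                        min_strongest_db=-50.0, max_gap_db=6.0,
                        target_db=-20.0):
    """Return a new gain (dB) per channel.

    speech_levels_db: dict of channel -> list of denoised speech energy
        levels in dB (in the abstracts these come from a trained model;
        here they are plain inputs).
    current_gains_db: dict of channel -> current gain in dB.
    All thresholds are illustrative assumptions.
    """
    # One statistical value per channel (assumed: the mean level).
    stats = {ch: mean(levels) for ch, levels in speech_levels_db.items()}

    # The strongest channel is the one with the highest statistic.
    strongest = max(stats, key=stats.get)

    # Threshold condition on the strongest channel: if even the loudest
    # channel is too quiet, assume nobody is speaking and change nothing.
    if stats[strongest] < min_strongest_db:
        return dict(current_gains_db)

    new_gains = dict(current_gains_db)
    for ch, stat in stats.items():
        # Compare each channel with the strongest one. Channels far below
        # it are assumed to carry crosstalk or residual noise and keep
        # their current gain.
        if stats[strongest] - stat <= max_gap_db:
            # Estimated target gain that would bring this channel's
            # speech level to target_db.
            new_gains[ch] = target_db - stat
    return new_gains


levels = {"a": [-20.0, -22.0], "b": [-24.0, -26.0], "c": [-50.0, -52.0]}
gains = decide_gain_updates(levels, {"a": 0.0, "b": 0.0, "c": 0.0})
print(gains)
```

In this sketch, channels "a" and "b" are close enough to the strongest channel to receive a new gain, while "c" is left unchanged. The direction of the comparison and the choice of statistic are design assumptions; the abstracts only state that a comparison with the strongest channel determines whether a gain update happens.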