Patents by Inventor Jan K. Skoglund

Jan K. Skoglund has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SPEECH CODING USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Publication number: 20230368804

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.

Type: Application

Filed: May 8, 2023

Publication date: November 16, 2023

Inventors: Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
Speech coding using auto-regressive generative neural networks

Patent number: 11676613

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.

Type: Grant

Filed: May 27, 2021

Date of Patent: June 13, 2023

Assignee: Google LLC

Inventors: Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
SPEECH CODING USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Publication number: 20210366495

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.

Type: Application

Filed: May 27, 2021

Publication date: November 25, 2021

Inventors: Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
Speech coding using auto-regressive generative neural networks

Patent number: 11024321

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.

Type: Grant

Filed: November 30, 2018

Date of Patent: June 1, 2021

Assignee: Google LLC

Inventors: Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
SPEECH CODING USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Publication number: 20200176004

Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for coding speech using neural networks. One of the methods includes obtaining a bitstream of parametric coder parameters characterizing spoken speech; generating, from the parametric coder parameters, a conditioning sequence; generating a reconstruction of the spoken speech that includes a respective speech sample at each of a plurality of decoder time steps, comprising, at each decoder time step: processing a current reconstruction sequence using an auto-regressive generative neural network, wherein the auto-regressive generative neural network is configured to process the current reconstruction to compute a score distribution over possible speech sample values, and wherein the processing comprises conditioning the auto-regressive generative neural network on at least a portion of the conditioning sequence; and sampling a speech sample from the possible speech sample values.

Type: Application

Filed: November 30, 2018

Publication date: June 4, 2020

Inventors: Willem Bastiaan Kleijn, Jan K. Skoglund, Alejandro Luebs, Sze Chie Lim
Detection of speech activity using feature model adaptation

Patent number: 6993481

Abstract: According to the invention, a method for detecting speech activity for a signal is disclosed. In one step, a plurality of features is extracted from the signal. An active speech probability density function (PDF) of the plurality of features is modeled, and an inactive speech PDF of the plurality of features is modeled. The active and inactive speech PDFs are adapted to respond to changes in the signal over time. The signal is probability-based classifyied based, at least in part, on the plurality of features. Speech in the signal is distinguished based, at least in part, upon the probability-based classification.

Type: Grant

Filed: December 4, 2001

Date of Patent: January 31, 2006

Assignee: Global IP Sound AB

Inventors: Jan K. Skoglund, Jan T. Linden
Detection of sound activity

Publication number: 20020165713

Abstract: According to the invention, a method for detecting speech activity for a signal is disclosed. In one step, a plurality of features is extracted from the signal. An active speech probability density function (PDF) of the plurality of features is modeled, and an inactive speech PDF of the plurality of features is modeled. The active and inactive speech PDFs are adapted to respond to changes in the signal over time. The signal is probability-based classifyied based, at least in part, on the plurality of features. Speech in the signal is distinguished based, at least in part, upon the probability-based classification.

Type: Application

Filed: December 4, 2001

Publication date: November 7, 2002

Applicant: Global IP Sound AB

Inventors: Jan K. Skoglund, Jan T. Linden

SPEECH CODING USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Speech coding using auto-regressive generative neural networks

SPEECH CODING USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Speech coding using auto-regressive generative neural networks

SPEECH CODING USING AUTO-REGRESSIVE GENERATIVE NEURAL NETWORKS

Detection of speech activity using feature model adaptation

Detection of sound activity