Patents by Inventor Stephane Villette

Stephane Villette has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

CONTENT-BASED SWITCHABLE AUDIO CODEC

Publication number: 20250201255

Abstract: A device includes a machine-learning audio encoder and a waveform-matching audio encoder. The device includes a controller configured to cause a segment of audio data to be input to the machine-learning audio encoder, to the waveform-matching audio encoder, or to both, based on a classification associated with the segment.

Type: Application

Filed: December 13, 2023

Publication date: June 19, 2025

Inventors: Pravin Kumar RAMADAS, Vivek RAJENDRAN, Duminda DEWASURENDRA, Stephane VILLETTE
AUDIO SAMPLE RECONSTRUCTION USING A NEURAL NETWORK AND MULTIPLE SUBBAND NETWORKS

Publication number: 20250182769

Abstract: A device includes a neural network, a first subband neural network, a second subband neural network, and a reconstructor. The neural network processes neural network inputs to generate a neural network output. The neural network inputs include at least one previous audio sample. The first subband neural network processes first subband network inputs to generate a first subband audio sample. The first subband network inputs include at least the neural network output. The second subband neural network processes second subband network inputs to generate a second subband audio sample. The second subband network inputs include at least the neural network output. The reconstructor generates a reconstructed audio sample based on the first subband audio sample and the second subband audio sample. The at least one previous audio sample includes a previous subband audio sample, a previous reconstructed audio sample, or both.

Type: Application

Filed: February 24, 2023

Publication date: June 5, 2025

Inventors: Zisis Iason SKORDILIS, Vivek RAJENDRAN, Stephane VILLETTE
Avatar facial expressions based on semantical context

Patent number: 12315057

Abstract: A device includes a memory and one or more processors configured to process sensor data to determine a semantical context associated with the sensor data. The one or more processors are also configured to generate adjusted face data based on the determined semantical context and face data. The adjusted face data includes an avatar facial expression that corresponds to the semantical context.

Type: Grant

Filed: September 7, 2022

Date of Patent: May 27, 2025

Assignee: QUALCOMM Incorporated

Inventors: Scott Beith, Suzana Arellano, Michel Adib Sarkis, Matthew Fischler, Ke-Li Cheng, Stephane Villette
Media segment representation using fixed weights

Patent number: 12300233

Abstract: A device includes a memory configured to store a collection of sets of weights, each of the sets of weights representing a respective media segment. The device also includes one or more processors configured to generate data representing the detected first input speech segment and to pass the data representing the detected first input speech segment into a collection of memory units. Each memory unit of the collection of memory units includes a set of weights from the collection of sets of weights. The one or more processors are also configured to generate a first estimate of an associated media segment that represents the detected first input speech segment. The associated media segment corresponds to a first memory unit in the collection of memory units.

Type: Grant

Filed: October 18, 2022

Date of Patent: May 13, 2025

Assignee: QUALCOMM Incorporated

Inventors: Stephane Villette, Sen Li, Daniel Jared Sinder
Media segment prediction for media generation

Patent number: 12170094

Abstract: A device includes one or more processors configured to input one or more segments of an input media stream into a feature extractor. The one or more processors are further configured to pass an output of the feature extractor into an utterance classifier to produce at least one representation of at least one utterance class of a plurality of utterance classes. The one or more processors are further configured to pass the output of the feature extractor and the at least one representation into a segment matcher to produce a media output segment identifier.

Type: Grant

Filed: October 18, 2022

Date of Patent: December 17, 2024

Assignee: QUALCOMM Incorporated

Inventors: Stephane Villette, Sen Li, Pravin Kumar Ramadas, Daniel Jared Sinder
MEDIA SEGMENT REPRESENTATION USING FIXED WEIGHTS

Publication number: 20240127809

Abstract: A device includes a memory configured to store a collection of sets of weights, each of the sets of weights representing a respective media segment. The device also includes one or more processors configured to generate data representing the detected first input speech segment and to pass the data representing the detected first input speech segment into a collection of memory units. Each memory unit of the collection of memory units includes a set of weights from the collection of sets of weights. The one or more processors are also configured to generate a first estimate of an associated media segment that represents the detected first input speech segment. The associated media segment corresponds to a first memory unit in the collection of memory units.

Type: Application

Filed: October 18, 2022

Publication date: April 18, 2024

Inventors: Stephane VILLETTE, Sen LI, Daniel Jared SINDER
MEDIA SEGMENT PREDICTION FOR MEDIA GENERATION

Publication number: 20240127838

Abstract: A device includes one or more processors configured to input one or more segments of an input media stream into a feature extractor. The one or more processors are further configured to pass an output of the feature extractor into an utterance classifier to produce at least one representation of at least one utterance class of a plurality of utterance classes. The one or more processors are further configured to pass the output of the feature extractor and the at least one representation into a segment matcher to produce a media output segment identifier.

Type: Application

Filed: October 18, 2022

Publication date: April 18, 2024

Inventors: Stephane VILLETTE, Sen LI, Pravin Kumar RAMADAS, Daniel Jared SINDER
MATCHING AUDIO USING MACHINE LEARNING BASED AUDIO REPRESENTATIONS

Publication number: 20240127827

Abstract: Systems and techniques are described herein for encoding and/or decoding audio information. For example, a process can process an input audio segment to generate a representation of the input audio segment, and can compare the representation of the input audio segment to representations stored in a memory. The representations represent a plurality of audio segments. The process can determine, based on the comparison, target representation(s) of target audio segment(s) from the representations stored in the memory. The process can determine one or more indices associated with the target audio segment(s). The process can then packetize the one or more indices and transmit the one or more packetized indices (e.g., to a decoder configured to decode the packetized indices).

Type: Application

Filed: October 18, 2022

Publication date: April 18, 2024

Inventors: Stephane VILLETTE, Sen LI, Pravin Kumar RAMADAS, Daniel Jared SINDER
AVATAR FACIAL EXPRESSIONS BASED ON SEMANTICAL CONTEXT

Publication number: 20240078732

Abstract: A device includes a memory and one or more processors configured to process sensor data to determine a semantical context associated with the sensor data. The one or more processors are also configured to generate adjusted face data based on the determined semantical context and face data. The adjusted face data includes an avatar facial expression that corresponds to the semantical context.

Type: Application

Filed: September 7, 2022

Publication date: March 7, 2024

Inventors: Scott BEITH, Suzana ARELLANO, Michel Adib SARKIS, Matthew FISCHLER, Ke-Li CHENG, Stephane VILLETTE
AVATAR REPRESENTATION AND AUDIO GENERATION

Publication number: 20240078731

Abstract: A device includes a memory and one or more processors configured to process image data corresponding to a user's face to generate face data. The one or more processors are configured to process sensor data to generate feature data and to generate a representation of an avatar based on the face data and the feature data. The one or more processors are also configured to generate an audio output for the avatar based on the sensor data.

Type: Application

Filed: September 7, 2022

Publication date: March 7, 2024

Inventors: Scott BEITH, Suzana ARELLANO, Michel Adib SARKIS, Matthew FISCHLER, Ke-Li CHENG, Stephane VILLETTE
Data transmission

Patent number: 8472508

Abstract: A system for transmitting input data over a speech channel of a network comprising: a modulator arranged to produce a modulated waveform signal transforming the data for transmission over the network; a channel compensation filter arranged to filter the modulated waveform signal after it has been transmitted over the speech channel to compensate for the response of the speech channel; and a demodulator arranged to retrieve the data from the filtered waveform signal.

Type: Grant

Filed: May 6, 2005

Date of Patent: June 25, 2013

Assignee: Mulsys Ltd

Inventors: Ahmet Kondoz, Nilantha Nandima Katugampala, Kholdoon Taha Al-Naimi, Stephane Villette
Generating LSF vectors

Patent number: 7493255

Abstract: To alleviate problems of signal aliasing and to reduce complexity, Linear Predictive Coefficients (LPCS) are calculated from samples of audio signals and Line Spectral Frequency (LSF) vectors are extracted from the LPCs with a rate higher than a desired vector rate, the LSF vectors comprising values of different LSF parameters. Next, an LSF track is formed for at least one of the LSF parameters. At least one of the formed LSF tracks is then low pass filtered. Finally, decimated LSF vectors are reconstructed from the low pass filtered LSF tracks, the decimated number corresponding to the desired vector rate. The invention equally relates to a corresponding computer program, to corresponding devices and to a corresponding communication network.

Type: Grant

Filed: April 10, 2003

Date of Patent: February 17, 2009

Assignee: Nokia Corporation

Inventors: Khaldoon Taha Al-Naimi, Stephane Villette, Ahmet Kondoz
Data Transmission

Publication number: 20080165885

Abstract: A system for transmitting input data over a speech channel of a network comprising: a modulator arranged to produce a modulated waveform signal transforming the data for transmission over the network; a channel compensation filter arranged to filter the modulated waveform signal after it has been transmitted over the speech channel to compensate for the response of the speech channel; and a demodulator arranged to retrieve the data from the filtered waveform signal.

Type: Application

Filed: May 6, 2005

Publication date: July 10, 2008

Applicant: UNIVERSITY OF SURREY

Inventors: Ahmet Kondoz, Nilantha Nandima Katugampala, Kholdoon Taha Al-Naimi, Stephane Villette
Generating LSF vectors

Publication number: 20040006463

Abstract: To alleviate problems of signal aliasing and to reduce complexity, Linear Predictive Coefficients (LPCS) are calculated from samples of audio signals and Line Spectral Frequency (LSF) vectors are extracted from the LPCs with a rate higher than a desired vector rate, the LSF vectors comprising values of different LSF parameters. Next, an LSF track is formed for at least one of the LSF parameters. At least one of the formed LSF tracks is then low pass filtered. Finally, decimated LSF vectors are reconstructed from the low pass filtered LSF tracks, the decimated number corresponding to the desired vector rate. The invention equally relates to a corresponding computer program, to corresponding devices and to a corresponding communication network.

Type: Application

Filed: April 10, 2003

Publication date: January 8, 2004

Applicant: Nokia Corporation

Inventors: Khaldoon Taha Al-Naimi, Stephane Villette, Ahmet Kondoz