Patents by Inventor David Grangier
David Grangier has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240144957Abstract: A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes selecting a respective speaker embedding for the respective speaker by determining a probability that the corresponding temporal embedding includes a presence of voice activity by a single new speaker for which a speaker embedding was not previously selected during a previous iteration and selecting the respective speaker embedding for the respective speaker as the temporal embedding. The method also includes, at each time step, predicting a respective voice activity indicator for each respective speaker of the multiple speakers based on the respective speaker embeddings selected during the plurality of iterations and the temporal embedding.Type: ApplicationFiled: December 19, 2023Publication date: May 2, 2024Applicant: Google LLCInventors: David Grangier, Neil Zeghidour, Oliver Teboul
-
Publication number: 20240078412Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for generating a prediction of an audio signal. One of the methods includes receiving a request to generate an audio signal; obtaining a semantic representation of the audio signal; generating, using one or more generative neural networks and conditioned on at least the semantic representation, an acoustic representation of the audio signal; and processing at least the acoustic representation using a decoder neural network to generate the prediction of the audio signal.Type: ApplicationFiled: September 7, 2023Publication date: March 7, 2024Inventors: Neil Zeghidour, David Grangier, Marco Tagliasacchi, Raphaël Marinier, Olivier Teboul, Zalán Borsos
-
Patent number: 11887623Abstract: A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes selecting a respective speaker embedding for the respective speaker by determining a probability that the corresponding temporal embedding includes a presence of voice activity by a single new speaker for which a speaker embedding was not previously selected during a previous iteration and selecting the respective speaker embedding for the respective speaker as the temporal embedding. The method also includes, at each time step, predicting a respective voice activity indicator for each respective speaker of the multiple speakers based on the respective speaker embeddings selected during the plurality of iterations and the temporal embedding.Type: GrantFiled: June 22, 2021Date of Patent: January 30, 2024Assignee: Google LLCInventors: David Grangier, Neil Zeghidour, Oliver Teboul
-
Publication number: 20230259759Abstract: Provided are systems and methods for sequence-to-sequence modeling with neural quality metrics. More particularly, example aspects of the present disclosure relate to minimum bayes risk (MBR) decoding with neural metrics for machine translation. According to example aspects of the present disclosure, a set of candidate outputs can be sampled from a machine translation model given a source sequence. Given the set of candidate outputs, systems and methods according to example aspects of the present disclosure can select a hypothesis with high expected utility with respect to the distribution over a set of pseudo-references from the machine translation model.Type: ApplicationFiled: February 16, 2022Publication date: August 17, 2023Inventors: Qijun Tan, Markus Freitag, David Grangier
-
Publication number: 20230112265Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech separation. One of the methods includes obtaining a recording comprising speech from a plurality of speakers; processing the recording using a speaker neural network having speaker parameter values and configured to process the recording in accordance with the speaker parameter values to generate a plurality of per-recording speaker representations, each speaker representation representing features of a respective identified speaker in the recording; and processing the per-recording speaker representations and the recording using a separation neural network having separation parameter values and configured to process the recording and the speaker representations in accordance with the separation parameter values to generate, for each speaker representation, a respective predicted isolated audio signal that corresponds to speech of one of the speakers in the recording.Type: ApplicationFiled: October 17, 2022Publication date: April 13, 2023Inventors: Neil Zeghidour, David Grangier
-
Publication number: 20220375492Abstract: A method includes receiving an input audio signal corresponding to utterances spoken by multiple speakers. The method also includes encoding the input audio signal into a sequence of T temporal embeddings. During each of a plurality of iterations each corresponding to a respective speaker of the multiple speakers, the method includes selecting a respective speaker embedding for the respective speaker by determining a probability that the corresponding temporal embedding includes a presence of voice activity by a single new speaker for which a speaker embedding was not previously selected during a previous iteration and selecting the respective speaker embedding for the respective speaker as the temporal embedding. The method also includes, at each time step, predicting a respective voice activity indicator for each respective speaker of the multiple speakers based on the respective speaker embeddings selected during the plurality of iterations and the temporal embedding.Type: ApplicationFiled: June 22, 2021Publication date: November 24, 2022Applicant: Google LLCInventors: David Grangier, Neil Zeghidour, Oliver Teboul
-
Patent number: 11475909Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech separation. One of the methods includes obtaining a recording comprising speech from a plurality of speakers; processing the recording using a speaker neural network having speaker parameter values and configured to process the recording in accordance with the speaker parameter values to generate a plurality of per-recording speaker representations, each speaker representation representing features of a respective identified speaker in the recording; and processing the per-recording speaker representations and the recording using a separation neural network having separation parameter values and configured to process the recording and the speaker representations in accordance with the separation parameter values to generate, for each speaker representation, a respective predicted isolated audio signal that corresponds to speech of one of the speakers in the recording.Type: GrantFiled: February 8, 2021Date of Patent: October 18, 2022Assignee: Google LLCInventors: Neil Zeghidour, David Grangier
-
Publication number: 20220108174Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network having a plurality of model parameters to perform a main task. In one aspect, a method comprises: determining an auxiliary task update to the model parameters of the neural network that, if applied to the model parameters, is predicted to increase a performance of the neural network on an auxiliary task; determining a decomposition of the auxiliary task update into multiple constituent updates that, if applied to the model parameters, are each predicted to have a different impact on a performance of the neural network on the main task; determining a new auxiliary task update to the model parameters of the neural network as a function of the plurality of constituent updates; and applying the new auxiliary task update to the model parameters of the neural network.Type: ApplicationFiled: October 1, 2021Publication date: April 7, 2022Inventors: Yann Dauphin, David Grangier, Lucio Dery
-
Publication number: 20210249027Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for performing speech separation. One of the methods includes obtaining a recording comprising speech from a plurality of speakers; processing the recording using a speaker neural network having speaker parameter values and configured to process the recording in accordance with the speaker parameter values to generate a plurality of per-recording speaker representations, each speaker representation representing features of a respective identified speaker in the recording; and processing the per-recording speaker representations and the recording using a separation neural network having separation parameter values and configured to process the recording and the speaker representations in accordance with the separation parameter values to generate, for each speaker representation, a respective predicted isolated audio signal that corresponds to speech of one of the speakers in the recording.Type: ApplicationFiled: February 8, 2021Publication date: August 12, 2021Inventors: Neil Zeghidour, David Grangier
-
Patent number: 8706668Abstract: Methods and systems for classifying incomplete data are disclosed. In accordance with one method, pairs of features and values are generated based upon feature measurements on the incomplete data. In addition, a transformation function is applied on the pairs of features and values to generate a set of vectors by mapping each of the pairs to a corresponding vector in an embedding space. Further, a hardware processor applies a prediction function to the set of vectors to generate at least one confidence assessment for at least one class that indicates whether the incomplete data is of the at least one class. The method further includes outputting the at least one confidence assessment.Type: GrantFiled: June 2, 2011Date of Patent: April 22, 2014Assignee: NEC Laboratories America, Inc.Inventors: Iain Melvin, David Grangier
-
Patent number: 8359282Abstract: A system and method for determining a similarity between a document and a query includes providing a frequently used dictionary and an infrequently used dictionary in storage memory. For each word or gram in the infrequently used dictionary, n words or grams are correlated from the frequently used dictionary based on a first score. Features for a vector of the infrequently used words or grams are replaced with features from a vector of the correlated words or grams from the frequently used dictionary when the features from a vector of the correlated words or grams meet a threshold value. A similarity score is determined between weight vectors of a query and one or more documents in a corpus by employing the features from the vector of the correlated words or grams that met the threshold value.Type: GrantFiled: September 18, 2009Date of Patent: January 22, 2013Assignee: NEC Laboratories America, Inc.Inventors: Bing Bai, Jason Weston, Ronan Collorbert, David Grangier
-
Patent number: 8341095Abstract: A system and method for determining a similarity between a document and a query includes building a weight vector for each of a plurality of documents in a corpus of documents stored in memory and building a weight vector for a query input into a document retrieval system. A weight matrix is generated which distinguishes between relevant documents and lower ranked documents by comparing document/query tuples using a gradient step approach. A similarity score is determined between weight vectors of the query and documents in a corpus by determining a product of a document weight vector, a query weight vector and the weight matrix.Type: GrantFiled: September 18, 2009Date of Patent: December 25, 2012Assignee: NEC Laboratories America, Inc.Inventors: Bing Bai, Jason Weston, Ronan Collobert, David Grangier
-
Publication number: 20110302118Abstract: Methods and systems for classifying incomplete data are disclosed. In accordance with one method, pairs of features and values are generated based upon feature measurements on the incomplete data. In addition, a transformation function is applied on the pairs of features and values to generate a set of vectors by mapping each of the pairs to a corresponding vector in an embedding space. Further, a hardware processor applies a prediction function to the set of vectors to generate at least one confidence assessment for at least one class that indicates whether the incomplete data is of the at least one class. The method further includes outputting the at least one confidence assessment.Type: ApplicationFiled: June 2, 2011Publication date: December 8, 2011Applicant: NEC Laboratories America, Inc.Inventors: IAIN MELVIN, David Grangier
-
Publication number: 20100185659Abstract: A system and method for determining a similarity between a document and a query includes providing a frequently used dictionary and an infrequently used dictionary in storage memory. For each word or gram in the infrequently used dictionary, n words or grams are correlated from the frequently used dictionary based on a first score. Features for a vector of the infrequently used words or grams are replaced with features from a vector of the correlated words or grams from the frequently used dictionary when the features from a vector of the correlated words or grams meet a threshold value. A similarity score is determined between weight vectors of a query and one or more documents in a corpus by employing the features from the vector of the correlated words or grams that met the threshold value.Type: ApplicationFiled: September 18, 2009Publication date: July 22, 2010Applicant: NEC Laboratories America, Inc.Inventors: BING BAI, JASON WESTON, RONAN COLLORBERT, DAVID GRANGIER
-
Publication number: 20100179933Abstract: A system and method for determining a similarity between a document and a query includes building a weight vector for each of a plurality of documents in a corpus of documents stored in memory and building a weight vector for a query input into a document retrieval system. A weight matrix is generated which distinguishes between relevant documents and lower ranked documents by comparing document/query tuples using a gradient step approach. A similarity score is determined between weight vectors of the query and documents in a corpus by determining a product of a document weight vector, a query weight vector and the weight matrix.Type: ApplicationFiled: September 18, 2009Publication date: July 15, 2010Applicant: NEC Laboratories America, Inc.Inventors: BING BAI, Jason Weston, Ronan Collobert, David Grangier