Patents by Inventor Benoit Dumoulin

Benoit Dumoulin has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Combining machine-learning and social data to generate personalized recommendations

Patent number: 11514333

Abstract: A computing device receives a message including a request for a recommendation. A representation of a hypothetical ideal recommendation to provide in response to the message is determined based on the message content. Data regarding entities that are potential recommendations are retrieved from a data store, the data regarding each entity including a representation of the entity (e.g., a vector) derived from factual information about the entity and opinions of other users of the entity. Ranking scores are determined for at least a subset of the entities based on the difference between the entity representations and the representation of the hypothetical ideal recommendation. An entity to recommend is selected based on the ranking scores and a reply to the message is sent that identifies the selected entity.

Type: Grant

Filed: April 30, 2018

Date of Patent: November 29, 2022

Assignee: Meta Platforms, Inc.

Inventors: Kun Han, Fuchun Peng, Benoit Dumoulin, Bo Zeng
Intent arbitration for a virtual assistant

Patent number: 10878198

Abstract: A user interacts with a virtual digital assistant with the intent that it provides assistance with a task. The user sends messages to the virtual digital assistant that include content obtained via user input at a client device. An intent determination model is applied to the content to identify the user's intent. The virtual digital assistant identifies agents that are capable of servicing the intent are identified and retrieves contextual data relating to the message from a data store. An intent arbitration model is used to select one of the agents which is activated to provide assistance with the task. The contextual information may include global metrics of agent performance and/or information regarding the user's preferences.

Type: Grant

Filed: December 6, 2018

Date of Patent: December 29, 2020

Assignee: Facebook, Inc.

Inventors: Anuj Kumar, Benoit Dumoulin, Wenhai Yang, Rajen Subba
Applying templates to customize presentation of content based on surface type

Patent number: 10803077

Abstract: An online system receives a request to generate presentation content for presentation to a user. The online system receives a set of content items and identifies a surface for presenting the presentation information to the user. For example, the surface may be a voice only surface, a voice and graphical display, a graphical display only. Based on the identified surface, the online system ranks the set of content items. The online system then determines presentation information for a subset of the content items and transmits instructions to present the presentation information at the surface.

Type: Grant

Filed: April 30, 2018

Date of Patent: October 13, 2020

Assignee: Facebook, Inc.

Inventors: Fuchun Peng, Bo Zeng, Kun Han, Benoit Dumoulin
Flexible schema for language model customization

Patent number: 10497367

Abstract: The customization of language modeling components for speech recognition is provided. A list of language modeling components may be made available by a computing device. A hint may then be sent to a recognition service provider for combining the multiple language modeling components from the list. The hint may be based on a number of different domains. A customized combination of the language modeling components based on the hint may then be received from the recognition service provider.

Type: Grant

Filed: December 22, 2016

Date of Patent: December 3, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Michael Levit, Hernan Guelman, Shuangyu Chang, Sarangarajan Parthasarathy, Benoit Dumoulin
APPLYING TEMPLATES TO CUSTOMIZE PRESENTATION OF CONTENT BASED ON SURFACE TYPE

Publication number: 20190332709

Abstract: An online system receives a request to generate presentation content for presentation to a user. The online system receives a set of content items and identifies a surface for presenting the presentation information to the user. For example, the surface may be a voice only surface, a voice and graphical display, a graphical display only. Based on the identified surface, the online system ranks the set of content items. The online system then determines presentation information for a subset of the content items and transmits instructions to present the presentation information at the surface.

Type: Application

Filed: April 30, 2018

Publication date: October 31, 2019

Inventors: Fuchun Peng, Bo Zeng, Kun Han, Benoit Dumoulin
COMBINING MACHINE-LEARNING AND SOCIAL DATA TO GENERATE PERSONALIZED RECOMMENDATIONS

Publication number: 20190332946

Abstract: A computing device receives a message including a request for a recommendation. A representation of a hypothetical ideal recommendation to provide in response to the message is determined based on the message content. Data regarding entities that are potential recommendations are retrieved from a data store, the data regarding each entity including a representation of the entity (e.g., a vector) derived from factual information about the entity and opinions of other users of the entity. Ranking scores are determined for at least a subset of the entities based on the difference between the entity representations and the representation of the hypothetical ideal recommendation. An entity to recommend is selected based on the ranking scores and a reply to the message is sent that identifies the selected entity.

Type: Application

Filed: April 30, 2018

Publication date: October 31, 2019

Inventors: Kun Han, Fuchun Peng, Benoit Dumoulin, Bo Zeng
INTENT ARBITRATION FOR A VIRTUAL ASSISTANT

Publication number: 20190205386

Abstract: A user interacts with a virtual digital assistant with the intent that it provides assistance with a task. The user sends messages to the virtual digital assistant that include content obtained via user input at a client device. An intent determination model is applied to the content to identify the user's intent. The virtual digital assistant identifies agents that are capable of servicing the intent are identified and retrieves contextual data relating to the message from a data store. An intent arbitration model is used to select one of the agents which is activated to provide assistance with the task. The contextual information may include global metrics of agent performance and/or information regarding the user's preferences.

Type: Application

Filed: December 6, 2018

Publication date: July 4, 2019

Inventors: Anuj Kumar, Benoit Dumoulin, Wenhai Yang, Rajen Subba
Language modeling based on spoken and unspeakable corpuses

Patent number: 10192545

Abstract: A computer system for language modeling may collect training data from one or more information sources, generate a spoken corpus containing text of transcribed speech, and generate a typed corpus containing typed text. The computer system may derive feature vectors from the spoken corpus, analyze the typed corpus to determine feature vectors representing items of typed text, and generate an unspeakable corpus by filtering the typed corpus to remove each item of typed text represented by a feature vector that is within a similarity threshold of a feature vector derived from the spoken corpus. The computer system may derive feature vectors from the unspeakable corpus and train a classifier to perform discriminative data selection for language modeling based on the feature vectors derived from the spoken corpus and the feature vectors derived from the unspeakable corpus.

Type: Grant

Filed: June 5, 2017

Date of Patent: January 29, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Michael Levit, Shuangyu Chang, Benoit Dumoulin
Interactive reformulation of speech queries

Patent number: 10176219

Abstract: Methods and systems are provided for providing alternative query suggestions. For example, a spoken natural language expression may be received and converted to a textual query by a speech recognition component. The spoken natural language expression may include one or more words, terms, and/or phrases. A phonetically confusable segment of the textual query may be identified by a classifier component. The classifier component may determine at least one alternative query based on identifying at least the phonetically confusable segment of the textual query. The classifier may further determine whether to suggest the at least one alternative query based on whether the at least one alternative query is sensical and/or useful. When it is determined to suggest the at least one alternative query, the at least one alternative query may be provided to and displayed on a user interface display.

Type: Grant

Filed: March 13, 2015

Date of Patent: January 8, 2019

Assignee: Microsoft Technology Licensing, LLC

Inventors: Benoit Dumoulin, Ali Ahmadi, Sarangarajan Parthasarathy, Nick Craswell, Umut Ozertem, Milad Shokouhi, Karthik Raghunathan, Rosie Jones
Knowledge source personalization to improve language models

Patent number: 9997157

Abstract: Systems and methods are provided for improving language models for speech recognition by personalizing knowledge sources utilized by the language models to specific users or user-population characteristics. A knowledge source, such as a knowledge graph, is personalized for a particular user by mapping entities or user actions from usage history for the user, such as query logs, to the knowledge source. The personalized knowledge source may be used to build a personal language model by training a language model with queries corresponding to entities or entity pairs that appear in usage history. In some embodiments, a personalized knowledge source for a specific user can be extended based on personalized knowledge sources of similar users.

Type: Grant

Filed: May 16, 2014

Date of Patent: June 12, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventors: Murat Akbacak, Dilek Z. Hakkani-Tur, Gokhan Tur, Larry P. Heck, Benoit Dumoulin
Incremental utterance decoder combination for efficient and accurate decoding

Patent number: 9922654

Abstract: An incremental speech recognition system. The incremental speech recognition system incrementally decodes a spoken utterance using an additional utterance decoder only when the additional utterance decoder is likely to add significant benefit to the combined result. The available utterance decoders are ordered in a series based on accuracy, performance, diversity, and other factors. A recognition management engine coordinates decoding of the spoken utterance by the series of utterance decoders, combines the decoded utterances, and determines whether additional processing is likely to significantly improve the recognition result. If so, the recognition management engine engages the next utterance decoder and the cycle continues. If the accuracy cannot be significantly improved, the result is accepted and decoding stops.

Type: Grant

Filed: December 13, 2016

Date of Patent: March 20, 2018

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shuangyu Chang, Michael Levit, Abhik Lahiri, Barlas Oguz, Benoit Dumoulin
LANGUAGE MODELING BASED ON SPOKEN AND UNSPEAKABLE CORPUSES

Publication number: 20170270912

Abstract: A computer system for language modeling may collect training data from one or more information sources, generate a spoken corpus containing text of transcribed speech, and generate a typed corpus containing typed text. The computer system may derive feature vectors from the spoken corpus, analyze the typed corpus to determine feature vectors representing items of typed text, and generate an unspeakable corpus by filtering the typed corpus to remove each item of typed text represented by a feature vector that is within a similarity threshold of a feature vector derived from the spoken corpus. The computer system may derive feature vectors from the unspeakable corpus and train a classifier to perform discriminative data selection for language modeling based on the feature vectors derived from the spoken corpus and the feature vectors derived from the unspeakable corpus.

Type: Application

Filed: June 5, 2017

Publication date: September 21, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Michael Levit, Shuangyu Chang, Benoit Dumoulin
Language modeling based on spoken and unspeakable corpuses

Patent number: 9761220

Abstract: A computer system for language modeling may collect training data from one or more information sources, generate a spoken corpus containing text of transcribed speech, and generate a typed corpus containing typed text. The computer system may derive feature vectors from the spoken corpus, analyze the typed corpus to determine feature vectors representing items of typed text, and generate an unspeakable corpus by filtering the typed corpus to remove each item of typed text represented by a feature vector that is within a similarity threshold of a feature vector derived from the spoken corpus. The computer system may derive feature vectors from the unspeakable corpus and train a classifier to perform discriminative data selection for language modeling based on the feature vectors derived from the spoken corpus and the feature vectors derived from the unspeakable corpus.

Type: Grant

Filed: May 13, 2015

Date of Patent: September 12, 2017

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Michael Levit, Shuangyu Chang, Benoit Dumoulin
Language modeling for conversational understanding domains using semantic web resources

Patent number: 9679558

Abstract: Systems and methods are provided for training language models using in-domain-like data collected automatically from one or more data sources. The data sources (such as text data or user-interactional data) are mined for specific types of data, including data related to style, content, and probability of relevance, which are then used for language model training. In one embodiment, a language model is trained from features extracted from a knowledge graph modified into a probabilistic graph, where entity popularities are represented and the popularity information is obtained from data sources related to the knowledge. Embodiments of language models trained from this data are particularly suitable for domain-specific conversational understanding tasks where natural language is used, such as user interaction with a game console or a personal assistant application on personal device.

Type: Grant

Filed: May 15, 2014

Date of Patent: June 13, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Murat Akbacak, Dilek Z. Hakkani-Tur, Gokhan Tur, Larry P. Heck, Benoit Dumoulin
FLEXIBLE SCHEMA FOR LANGUAGE MODEL CUSTOMIZATION

Publication number: 20170103753

Abstract: The customization of language modeling components for speech recognition is provided. A list of language modeling components may be made available by a computing device. A hint may then be sent to a recognition service provider for combining the multiple language modeling components from the list. The hint may be based on a number of different domains. A customized combination of the language modeling components based on the hint may then be received from the recognition service provider.

Type: Application

Filed: December 22, 2016

Publication date: April 13, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Michael Levit, Hernan Guelman, Shuangyu Chang, Sarangarajan Parthasarathy, Benoit Dumoulin
INCREMENTAL UTTERANCE DECODER COMBINATION FOR EFFICIENT AND ACCURATE DECODING

Publication number: 20170092275

Abstract: An incremental speech recognition system. The incremental speech recognition system incrementally decodes a spoken utterance using an additional utterance decoder only when the additional utterance decoder is likely to add significant benefit to the combined result. The available utterance decoders are ordered in a series based on accuracy, performance, diversity, and other factors. A recognition management engine coordinates decoding of the spoken utterance by the series of utterance decoders, combines the decoded utterances, and determines whether additional processing is likely to significantly improve the recognition result. If so, the recognition management engine engages the next utterance decoder and the cycle continues. If the accuracy cannot be significantly improved, the result is accepted and decoding stops.

Type: Application

Filed: December 13, 2016

Publication date: March 30, 2017

Applicant: Microsoft Technology Licensing, LLC

Inventors: Shuangyu Chang, Michael Levit, Abhik Lahiri, Barlas Oguz, Benoit Dumoulin
Incremental utterance decoder combination for efficient and accurate decoding

Patent number: 9552817

Abstract: An incremental speech recognition system. The incremental speech recognition system incrementally decodes a spoken utterance using an additional utterance decoder only when the additional utterance decoder is likely to add significant benefit to the combined result. The available utterance decoders are ordered in a series based on accuracy, performance, diversity, and other factors. A recognition management engine coordinates decoding of the spoken utterance by the series of utterance decoders, combines the decoded utterances, and determines whether additional processing is likely to significantly improve the recognition result. If so, the recognition management engine engages the next utterance decoder and the cycle continues. If the accuracy cannot be significantly improved, the result is accepted and decoding stops.

Type: Grant

Filed: March 19, 2014

Date of Patent: January 24, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shuangyu Chang, Michael Levit, Abhik Lahiri, Barlas Oguz, Benoit Dumoulin
Flexible schema for language model customization

Patent number: 9529794

Abstract: The customization of language modeling components for speech recognition is provided. A list of language modeling components may be made available by a computing device. A hint may then be sent to a recognition service provider for combining the multiple language modeling components from the list. The hint may be based on a number of different domains. A customized combination of the language modeling components based on the hint may then be received from the recognition service provider.

Type: Grant

Filed: March 27, 2014

Date of Patent: December 27, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Michael Levit, Hernan Guelman, Shuangyu Chang, Sarangarajan Parthasarathy, Benoit Dumoulin
DISCRIMINATIVE DATA SELECTION FOR LANGUAGE MODELING

Publication number: 20160336006

Abstract: A computer system for language modeling may collect training data from one or more information sources, generate a spoken corpus containing text of transcribed speech, and generate a typed corpus containing typed text. The computer system may derive feature vectors from the spoken corpus, analyze the typed corpus to determine feature vectors representing items of typed text, and generate an unspeakable corpus by filtering the typed corpus to remove each item of typed text represented by a feature vector that is within a similarity threshold of a feature vector derived from the spoken corpus. The computer system may derive feature vectors from the unspeakable corpus and train a classifier to perform discriminative data selection for language modeling based on the feature vectors derived from the spoken corpus and the feature vectors derived from the unspeakable corpus.

Type: Application

Filed: May 13, 2015

Publication date: November 17, 2016

Applicant: Microsoft Technology Licensing, LLC

Inventors: Michael Levit, Shuangyu Chang, Benoit Dumoulin
INTERACTIVE REFORMULATION OF SPEECH QUERIES

Publication number: 20160267128

Abstract: Methods and systems are provided for providing alternative query suggestions. For example, a spoken natural language expression may be received and converted to a textual query by a speech recognition component. The spoken natural language expression may include one or more words, terms, and/or phrases. A phonetically confusable segment of the textual query may be identified by a classifier component. The classifier component may determine at least one alternative query based on identifying at least the phonetically confusable segment of the textual query. The classifier may further determine whether to suggest the at least one alternative query based on whether the at least one alternative query is sensical and/or useful. When it is determined to suggest the at least one alternative query, the at least one alternative query may be provided to and displayed on a user interface display.

Type: Application

Filed: March 13, 2015

Publication date: September 15, 2016

Applicant: Microsoft Technology Licensing , LLC

Inventors: Benoit Dumoulin, Ali Ahmadi, Sarangarajan Parthasarathy, Nick Craswell, Umut Ozertem, Milad Shokouhi, Karthik Raghunathan, Rosie Jones

1 2 3 next