Patents by Inventor Baiyang Liu

Baiyang Liu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

In-Call Experience Enhancement for Assistant Systems

Publication number: 20210120206

Abstract: In one embodiment, a method includes establishing a video call between multiple client systems while persistently maintaining access to an assistant system during the video call. A request to be performed by the assistant system during the video call may then be received from a first client system; this request may reference one or more second users in the video call. An intent of the request and one or more user identifiers of these one or more second users referenced by the request may be determined, and the assistant system may be instructed to execute the request based on the determined intent and user identifiers. Finally, a response to the request may be sent to one or more of the multiple client systems while maintaining the video call between these client systems.

Type: Application

Filed: April 13, 2020

Publication date: April 22, 2021

Inventors: Xiaohu Liu, Baiyang Liu, Rajen Subba

Content summarization for assistant systems

Patent number: 10977258

Abstract: In one embodiment, a method includes, by one or more computing systems, receiving, from a client system associated with a user, a request for a summary of user communications from a content source, accessing a plurality of user communications from the content source, identifying a plurality of segments associated with the plurality of user communications, wherein the plurality of segments is associated with a plurality of topics, respectively, calculating, for each segment of the plurality of segments, a user interest score for the segment, selecting one or more of the segments for summarization based on their user interest scores, generating one or more personalized summaries of the one or more selected segments, wherein the personalization of the summary is based on the user profile of the user, and sending, to the client system, instructions to present the personalized summaries to the user responsive to the request.

Type: Grant

Filed: January 14, 2019

Date of Patent: April 13, 2021

Assignee: Facebook, Inc.

Inventors: Xiaohu Liu, Baiyang Liu, Rajen Subba, Benoit F. Dumoulin
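
The segment-selection step in the abstract above (score each topic segment, then keep the highest-scoring ones) can be illustrated with a minimal Python sketch. The `Segment` class, the `interest_score` rule (a plain per-topic weight lookup), and the `top_k` cutoff are assumptions made for illustration; the listing does not say how the patent computes interest scores or generates summaries.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class Segment:
    """A group of messages about one topic (hypothetical structure)."""
    topic: str
    messages: list[str]


def interest_score(segment: Segment, interests: dict[str, float]) -> float:
    # Assumed scoring rule: look up the user's weight for the segment's topic.
    return interests.get(segment.topic, 0.0)


def select_segments(segments: list[Segment],
                    interests: dict[str, float],
                    top_k: int = 2) -> list[Segment]:
    # Rank segments by user interest score, highest first, and keep top_k.
    ranked = sorted(segments,
                    key=lambda s: interest_score(s, interests),
                    reverse=True)
    return ranked[:top_k]


segments = [
    Segment("sports", ["Game at 7?", "I'm in."]),
    Segment("travel", ["Flights are booked."]),
    Segment("work", ["Report is due Friday."]),
]
interests = {"travel": 0.9, "work": 0.6, "sports": 0.2}
selected = select_segments(segments, interests)
print([s.topic for s in selected])
```

In this sketch the selected segments would then be passed to a summarizer, which is not shown.
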
Assisting multiple users in a multi-user conversation thread

Patent number: 10958599

Abstract: In one embodiment, a method includes receiving an instruction to add an assistant xbot as a participant in a conversation thread from a first user of a plurality of users participating in the conversation thread, monitoring the conversation thread including user inputs by one or more users of the plurality of users via the assistant xbot, analyzing the user inputs to identify intents based on a natural-language understanding module, sending instructions for prompting one or more users of the plurality of users to provide information for completing tasks associated with the intents via the assistant xbot within the conversation thread, executing the tasks based on the information provided by one or more agents, and sending instructions for presenting information associated with one or more of the executed tasks via the assistant xbot within the conversation thread.

Type: Grant

Filed: October 2, 2018

Date of Patent: March 23, 2021

Assignee: Facebook, Inc.

Inventors: Francislav P. Penov, Baiyang Liu, Xiaohu Liu

Multiple wake words for systems with multiple smart assistants

Patent number: 10957329

Abstract: In one embodiment, a method includes, by a client system associated with a user, receiving, at the client system associated with the user, a user input, parsing the user input to identify an n-gram associated with a wake word from a plurality of wake words corresponding to a plurality of assistant systems associated with the client system, wherein each assistant system provides a particular set of functions, determining that the wake word corresponds to a first assistant system of the plurality of assistant systems, wherein the first assistant system provides a first set of functions, sending, to the first assistant system, a request to set an assistant xbot of the first assistant system into a listening mode, and receiving, from the first assistant system, an indication that the assistant xbot is in listening mode responsive to a determination that the user has permission to access the first assistant system.

Type: Grant

Filed: November 7, 2018

Date of Patent: March 23, 2021

Assignee: Facebook, Inc.

Inventors: Xiaohu Liu, Baiyang Liu, Rajen Subba
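
As a rough illustration of the flow in the abstract above, the following Python sketch scans a user input for an n-gram that matches a wake word, maps it to an assistant, and checks the user's permission before reporting listening mode. The wake words, assistant names, and permission table are all hypothetical; a real client would work on audio and talk to separate assistant systems.

```python
# Hypothetical wake word -> assistant mapping (not from the patent).
WAKE_WORDS = {
    "hey alpha": "alpha_assistant",
    "ok beta": "beta_assistant",
}

# Hypothetical per-user access table.
PERMISSIONS = {"alice": {"alpha_assistant"}}


def ngrams(tokens, n):
    """Return all contiguous n-token phrases in the token list."""
    return [" ".join(tokens[i:i + n]) for i in range(len(tokens) - n + 1)]


def route(user, utterance):
    """Return the assistant placed in listening mode, or None."""
    tokens = utterance.lower().split()
    longest = max(len(w.split()) for w in WAKE_WORDS)
    for n in range(1, longest + 1):
        for gram in ngrams(tokens, n):
            assistant = WAKE_WORDS.get(gram)
            if assistant is None:
                continue
            # Listening mode is granted only if the user may access it.
            if assistant in PERMISSIONS.get(user, set()):
                return assistant
            return None
    return None


print(route("alice", "Hey Alpha what's the weather"))
print(route("alice", "ok beta play music"))
```

The first call matches `hey alpha` for a permitted user; the second matches `ok beta`, which `alice` is not allowed to use in this made-up table, so no assistant starts listening.
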
Identifying users through conversations for assistant systems

Patent number: 10854206

Abstract: In one embodiment, a method includes receiving from a client system a user request from a first user, determining a necessity for resolving the first user to a known entity to execute one or more tasks associated with the user request based on privacy restrictions associated with the user request, determining a set of candidate entities for the first user based on one or more machine-learning models, each candidate entity being associated with a respective confidence score greater than a threshold score, sending instructions for prompting the first user to select a candidate entity from the set of candidate entities, resolving the first user to a selected candidate entity responsive to receiving a selection from the first user, and executing the one or more tasks associated with the user request based on a user profile associated with the selected candidate entity.

Type: Grant

Filed: December 21, 2018

Date of Patent: December 1, 2020

Assignee: Facebook, Inc.

Inventors: Xiaohu Liu, Baiyang Liu, Rajen Subba, Benoit F. Dumoulin
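
The candidate-set step in the abstract above keeps only entities whose confidence score exceeds a threshold, then resolves the user to whichever candidate the user selects. A minimal Python sketch, assuming the machine-learning models have already produced a score per entity (the names, scores, and the 0.5 threshold below are invented for illustration):

```python
def candidate_entities(scores, threshold=0.5):
    """Return (entity, score) pairs scoring above the threshold, best first."""
    kept = [(entity, s) for entity, s in scores.items() if s > threshold]
    return sorted(kept, key=lambda pair: pair[1], reverse=True)


def resolve(candidates, selection):
    """Resolve the user to the candidate they picked from the prompt."""
    names = [entity for entity, _ in candidates]
    if selection not in names:
        raise ValueError(f"{selection!r} is not one of the candidates")
    return selection


# Hypothetical model output: confidence that the speaker is each known user.
model_scores = {"alice": 0.82, "bob": 0.55, "carol": 0.10}
candidates = candidate_entities(model_scores)
print(candidates)
print(resolve(candidates, "bob"))
```

Tasks would then run against the profile of the resolved entity, which this sketch leaves out.
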
Intent Identification for Agent Matching by Assistant Systems

Publication number: 20200364069

Abstract: In one embodiment, a method includes receiving a user request from a client system associated with a first user, wherein the user request is associated with a semantic-intent, identifying one or more dialog-intents associated with the user request based on the semantic-intent and context information associated with the user request, wherein each dialog-intent is a sub-intent of the semantic-intent, determining one or more agents for executing one or more tasks associated with the one or more dialog-intents, and sending instructions for presenting information returned from the one or more agents responsive to executing the one or more tasks to the client system.

Type: Application

Filed: August 6, 2020

Publication date: November 19, 2020

Inventors: Baiyang Liu, Benoit F. Dumoulin, Carlos Garcia Jurado Suarez, Xiaohu Liu

AUTOMATIC SPEAKER IDENTIFICATION USING SPEECH RECOGNITION FEATURES

Publication number: 20200349957

Abstract: Features are disclosed for automatically identifying a speaker. Artifacts of automatic speech recognition (“ASR”) and/or other automatically determined information may be processed against individual user profiles or models. Scores may be determined reflecting the likelihood that individual users made an utterance. The scores can be based on, e.g., individual components of Gaussian mixture models (“GMMs”) that score best for frames of audio data of an utterance. A user associated with the highest likelihood score for a particular utterance can be identified as the speaker of the utterance. Information regarding the identified user can be provided to components of a spoken language processing system, separate applications, etc.

Type: Application

Filed: May 21, 2020

Publication date: November 5, 2020

Inventors: Hugh Evan Secker-Walker, Baiyang Liu, Frederick Victor Weber
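
One way to read the scoring idea in the abstract above: for each audio frame, take the best-scoring component of a user's Gaussian mixture model, accumulate those scores over the utterance, and pick the user with the highest total. The Python sketch below does this for one-dimensional features; the profiles, feature values, and the choice to sum per-frame maxima are assumptions for illustration, not the patented method.

```python
import math


def component_log_likelihood(x, weight, mean, var):
    """Log of weight * N(x; mean, var) for a one-dimensional Gaussian."""
    return math.log(weight) - 0.5 * (
        math.log(2 * math.pi * var) + (x - mean) ** 2 / var
    )


def utterance_score(frames, gmm):
    """Sum over frames of the best-scoring component's log-likelihood."""
    return sum(
        max(component_log_likelihood(x, *component) for component in gmm)
        for x in frames
    )


def identify_speaker(frames, profiles):
    """Return the user whose model gives the highest utterance score."""
    return max(profiles, key=lambda user: utterance_score(frames, profiles[user]))


# Hypothetical user models: lists of (weight, mean, variance) components.
profiles = {
    "user_a": [(0.5, 0.0, 1.0), (0.5, 2.0, 1.0)],
    "user_b": [(0.5, 8.0, 1.0), (0.5, 10.0, 1.0)],
}
frames = [0.1, 1.9, 0.4]  # made-up per-frame feature values
speaker = identify_speaker(frames, profiles)
print(speaker)
```

Real systems use multi-dimensional acoustic features and many more components; the one-dimensional case keeps the arithmetic visible.
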
Intent identification for agent matching by assistant systems

Patent number: 10761866

Abstract: In one embodiment, a method includes receiving a user request associated with one or more domains from a client system associated with a first user, parsing the user request to identify one or more semantic-intents associated with the one or more domains and one or more slots, identifying, based on a ranker model, one or more dialog-intents associated with the user request based on the one or more semantic-intents and slots and context information associated with the user request, wherein each dialog-intent is a sub-intent of one or more of the semantic-intents, determining one or more agents for executing one or more tasks associated with the one or more dialog-intents respectively, and sending instructions for presenting a communication content information returned from the one or more agents responsive to executing the one or more tasks responsive to the user input to the client system.

Type: Grant

Filed: August 30, 2018

Date of Patent: September 1, 2020

Assignee: Facebook, Inc.

Inventors: Baiyang Liu, Benoit F. Dumoulin, Carlos Garcia Jurado Suarez, Xiaohu Liu

Methods and systems for responding to inquiries based on social graph information

Patent number: 10726843

Abstract: Exemplary embodiments relate to improvements in digital assistants incorporating personalization based on social network data. Various aspects of the agent, such as the agent's voice, language style, and avatar, may be personalized. Personalization may be applied to components of an agent's architecture (e.g., the virtual agent's language model, natural language generator, voice generation component, etc.). Moreover, by interfacing with the social network's social graph, the agent may be provided with information useful to performing certain tasks (e.g., a calendar for scheduling, food preferences for ordering tasks, etc.). An agent may be provided (and personalized) for a single user, or a group of users (e.g., a family). The agent can be personalized to anyone, which may allow the agent, for example, to represent a celebrity or a person who is not currently available in interactions with others. Different agents can talk to each other, e.g., for purposes of scheduling meetings.

Type: Grant

Filed: December 20, 2017

Date of Patent: July 28, 2020

Assignee: Facebook, Inc.

Inventors: Xiaohu Liu, Benoit F. Dumoulin, Baiyang Liu

Automatic speaker identification using speech recognition features

Patent number: 10665245

Abstract: Features are disclosed for automatically identifying a speaker. Artifacts of automatic speech recognition (“ASR”) and/or other automatically determined information may be processed against individual user profiles or models. Scores may be determined reflecting the likelihood that individual users made an utterance. The scores can be based on, e.g., individual components of Gaussian mixture models (“GMMs”) that score best for frames of audio data of an utterance. A user associated with the highest likelihood score for a particular utterance can be identified as the speaker of the utterance. Information regarding the identified user can be provided to components of a spoken language processing system, separate applications, etc.

Type: Grant

Filed: June 21, 2019

Date of Patent: May 26, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Hugh Evan Secker-Walker, Baiyang Liu, Frederick Victor Weber

AUTOMATIC SPEAKER IDENTIFICATION USING SPEECH RECOGNITION FEATURES

Publication number: 20190378517

Abstract: Features are disclosed for automatically identifying a speaker. Artifacts of automatic speech recognition (“ASR”) and/or other automatically determined information may be processed against individual user profiles or models. Scores may be determined reflecting the likelihood that individual users made an utterance. The scores can be based on, e.g., individual components of Gaussian mixture models (“GMMs”) that score best for frames of audio data of an utterance. A user associated with the highest likelihood score for a particular utterance can be identified as the speaker of the utterance. Information regarding the identified user can be provided to components of a spoken language processing system, separate applications, etc.

Type: Application

Filed: June 21, 2019

Publication date: December 12, 2019

Inventors: Hugh Evan Secker-Walker, Baiyang Liu, Frederick Victor Weber

Intent Identification for Agent Matching by Assistant Systems

Publication number: 20190325081

Abstract: In one embodiment, a method includes receiving a user request associated with one or more domains from a client system associated with a first user, parsing the user request to identify one or more semantic-intents associated with the one or more domains and one or more slots, identifying, based on a ranker model, one or more dialog-intents associated with the user request based on the one or more semantic-intents and slots and context information associated with the user request, wherein each dialog-intent is a sub-intent of one or more of the semantic-intents, determining one or more agents for executing one or more tasks associated with the one or more dialog-intents respectively, and sending instructions for presenting a communication content information returned from the one or more agents responsive to executing the one or more tasks responsive to the user input to the client system.

Type: Application

Filed: August 30, 2018

Publication date: October 24, 2019

Inventors: Baiyang Liu, Benoit F. Dumoulin, Carlos Garcia Jurado Suarez, Xiaohu Liu

Aggregating Semantic Information for Improved Understanding of Users

Publication number: 20190327331

Abstract: In one embodiment, a method includes receiving, from a client system associated with a first user, a user input by the first user, parsing the user input to identify one or more n-grams associated with the user input, accessing a user profile associated with the first user, wherein the user profile is stored in a first data store, accessing ontology data based on the one or more identified n-grams from one or more information graphs, wherein the one or more information graphs are stored in one or more second data stores, respectively, determining contextual information associated with the user input, generating semantic information by aggregating the user profile, ontology data, and contextual information, generating a feature representation for the identified one or more n-grams based on the semantic information, and resolving one or more entities associated with the one or more n-grams based on the feature representation.

Type: Application

Filed: April 30, 2018

Publication date: October 24, 2019

Inventors: Vivek Natarajan, Baiyang Liu, Xiaohu Liu, Ahmed Aly

Automatic speaker identification using speech recognition features

Patent number: 10332525

Abstract: Features are disclosed for automatically identifying a speaker. Artifacts of automatic speech recognition (“ASR”) and/or other automatically determined information may be processed against individual user profiles or models. Scores may be determined reflecting the likelihood that individual users made an utterance. The scores can be based on, e.g., individual components of Gaussian mixture models (“GMMs”) that score best for frames of audio data of an utterance. A user associated with the highest likelihood score for a particular utterance can be identified as the speaker of the utterance. Information regarding the identified user can be provided to components of a spoken language processing system, separate applications, etc.

Type: Grant

Filed: January 30, 2017

Date of Patent: June 25, 2019

Assignee: Amazon Technologies, Inc.

Inventors: Hugh Evan Secker-Walker, Baiyang Liu, Frederick Victor Weber

METHODS AND SYSTEMS FOR RESPONDING TO INQUIRIES BASED ON SOCIAL GRAPH INFORMATION

Publication number: 20190189126

Abstract: Exemplary embodiments relate to improvements in digital assistants incorporating personalization based on social network data. Various aspects of the agent, such as the agent's voice, language style, and avatar, may be personalized. Personalization may be applied to components of an agent's architecture (e.g., the virtual agent's language model, natural language generator, voice generation component, etc.). Moreover, by interfacing with the social network's social graph, the agent may be provided with information useful to performing certain tasks (e.g., a calendar for scheduling, food preferences for ordering tasks, etc.). An agent may be provided (and personalized) for a single user, or a group of users (e.g., a family). The agent can be personalized to anyone, which may allow the agent, for example, to represent a celebrity or a person who is not currently available in interactions with others. Different agents can talk to each other, e.g., for purposes of scheduling meetings.

Type: Application

Filed: December 20, 2017

Publication date: June 20, 2019

Inventors: Xiaohu Liu, Benoit F. Dumoulin, Baiyang Liu

Error tolerant neural network model compression

Patent number: 10229356

Abstract: Features are disclosed for error tolerant model compression. Such features could be used to reduce the size of a deep neural network model including several hidden node layers. The size reduction in an error tolerant fashion ensures predictive applications relying on the model do not experience performance degradation due to model compression. Such predictive applications include automatic recognition of speech, image recognition, and recommendation engines. Partially quantized models are re-trained such that any degradation of accuracy is “trained out” of the model providing improved error tolerance with compression.

Type: Grant

Filed: December 23, 2014

Date of Patent: March 12, 2019

Assignee: Amazon Technologies, Inc.

Inventors: Baiyang Liu, Michael Reese Bastian, Bjorn Hoffmeister, Sankaran Panchapagesan, Ariya Rastrow
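
The abstract above mentions quantizing part of a model and then retraining to recover accuracy. The Python sketch below shows only one ingredient, uniform quantization of a weight list to a small number of bits, so the size and error trade-off is concrete. The min/max scaling scheme and the sample weights are assumptions; the retraining step that the abstract describes is not shown.

```python
def quantize(weights, bits=8):
    """Map floats to integer codes in [0, 2**bits - 1] using min/max scaling."""
    lo, hi = min(weights), max(weights)
    levels = (1 << bits) - 1
    scale = (hi - lo) / levels if hi > lo else 1.0
    codes = [round((w - lo) / scale) for w in weights]
    return codes, lo, scale


def dequantize(codes, lo, scale):
    """Recover approximate float weights from integer codes."""
    return [lo + c * scale for c in codes]


weights = [-0.50, -0.12, 0.0, 0.31, 0.75]  # made-up layer weights
codes, lo, scale = quantize(weights, bits=4)
restored = dequantize(codes, lo, scale)

# Rounding to the nearest level bounds the error by half a quantization step.
max_error = max(abs(w - r) for w, r in zip(weights, restored))
print(codes, max_error <= scale / 2 + 1e-9)
```

Retraining the remaining full-precision weights after a step like this is what the abstract credits with "training out" the accuracy loss.
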
Language model speech endpointing

Patent number: 10121471

Abstract: An automatic speech recognition (ASR) system detects an endpoint of an utterance using the active hypotheses under consideration by a decoder. The ASR system calculates the amount of non-speech detected by a plurality of hypotheses and weights the non-speech duration by the probability of each hypothesis. When the aggregate weighted non-speech exceeds a threshold, an endpoint may be declared.

Type: Grant

Filed: June 29, 2015

Date of Patent: November 6, 2018

Assignee: Amazon Technologies, Inc.

Inventors: Bjorn Hoffmeister, Ariya Rastrow, Baiyang Liu
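
The endpointing rule in the abstract above reduces to a weighted sum compared against a threshold. A minimal Python sketch, assuming each active hypothesis is a pair of (probability, trailing non-speech duration in seconds); the normalization by total probability, the 0.5 s threshold, and the sample numbers are illustrative choices not stated in the listing.

```python
def expected_non_speech(hypotheses):
    """Probability-weighted non-speech duration across active hypotheses."""
    total = sum(p for p, _ in hypotheses)
    return sum(p * duration for p, duration in hypotheses) / total


def is_endpoint(hypotheses, threshold_seconds=0.5):
    """Declare an endpoint when the weighted non-speech exceeds the threshold."""
    return expected_non_speech(hypotheses) > threshold_seconds


# Hypothetical decoder state: (probability, seconds of trailing non-speech).
active = [(0.6, 0.9), (0.3, 0.4), (0.1, 0.0)]
print(is_endpoint(active))
```

Weighting by probability means a single unlikely hypothesis with a long silence does not trigger the endpoint on its own.
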
Voice controlled assistant with non-verbal user input

Patent number: 9864576

Abstract: A voice controlled assistant has a housing to hold one or more microphones, one or more speakers, and various computing components. The voice controlled assistant facilitates transactions and other functions primarily through verbal interactions with a user. In some situations, a transaction may require entry of a code, which the user may wish to enter in a non-verbal way. The voice controlled assistant is configured to analyze an audio signal to detect user interactions with the surface of the voice controlled assistant and to interpret the detected interactions as entry of the code.

Type: Grant

Filed: September 9, 2013

Date of Patent: January 9, 2018

Assignee: Amazon Technologies, Inc.

Inventors: Baiyang Liu, Hugh Evan Secker-Walker

AUTOMATIC SPEAKER IDENTIFICATION USING SPEECH RECOGNITION FEATURES

Publication number: 20170140761

Abstract: Features are disclosed for automatically identifying a speaker. Artifacts of automatic speech recognition (“ASR”) and/or other automatically determined information may be processed against individual user profiles or models. Scores may be determined reflecting the likelihood that individual users made an utterance. The scores can be based on, e.g., individual components of Gaussian mixture models (“GMMs”) that score best for frames of audio data of an utterance. A user associated with the highest likelihood score for a particular utterance can be identified as the speaker of the utterance. Information regarding the identified user can be provided to components of a spoken language processing system, separate applications, etc.

Type: Application

Filed: January 30, 2017

Publication date: May 18, 2017

Inventors: Hugh Evan Secker-Walker, Baiyang Liu, Frederick Victor Weber

Automatic speaker identification using speech recognition features

Patent number: 9558749

Abstract: Features are disclosed for automatically identifying a speaker. Artifacts of automatic speech recognition (“ASR”) and/or other automatically determined information may be processed against individual user profiles or models. Scores may be determined reflecting the likelihood that individual users made an utterance. The scores can be based on, e.g., individual components of Gaussian mixture models (“GMMs”) that score best for frames of audio data of an utterance. A user associated with the highest likelihood score for a particular utterance can be identified as the speaker of the utterance. Information regarding the identified user can be provided to components of a spoken language processing system, separate applications, etc.

Type: Grant

Filed: August 1, 2013

Date of Patent: January 31, 2017

Assignee: Amazon Technologies, Inc.

Inventors: Hugh Evan Secker-Walker, Baiyang Liu, Frederick Victor Weber
