Patents Examined by Qi Han

Terminal device and method for controlling thereof

Patent number: 11031008

Abstract: A terminal device is provided. The terminal device includes a communication interface, and a processor configured to receive performance information of one or more other terminal devices from each of the one or more other terminal devices, identify an edge device to perform voice recognition based on the performance information received from each of the one or more other terminal devices, based on the terminal device being identified as the edge device, receive information associated with reception quality from one or more other terminal devices which receive a sound wave including a triggering word, determine a terminal device from which to acquire the sound wave for voice recognition based on the received information associated with the reception quality, and transmit, to the determined terminal device, a command to transmit the sound wave acquired for voice recognition to an external voice recognition device.

Type: Grant

Filed: April 10, 2019

Date of Patent: June 8, 2021

Assignee: Samsung Electronics Co., Ltd.

Inventor: Minseok Kim

Automatic answer rephrasing based on talking style

Patent number: 11030990

Abstract: Techniques are described that facilitate automatically providing entities with rephrased versions of standard answers. In one embodiment, a computer-implemented method is provided that comprises determining, by a device operatively coupled to a processor, a talking style of a plurality of talking styles that an entity is associated with based on reception of natural language input from the entity proposing a question related to a defined topic. The method further comprises selecting, by the device based on the talking style, an answer rephrasing model from a plurality of answer rephrasing models respectively configured to generate different rephrased versions of a standard answer to the question, and employing, by the device, the answer rephrasing model to generate a rephrased version of the standard answer that corresponds to the talking style.

Type: Grant

Filed: September 5, 2019

Date of Patent: June 8, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Jian Min Jiang, Yuan Ni, Guo Yu Tang, Guo Tong Xie, Shi Wan Zhao

Noise filling concept

Patent number: 11031022

Abstract: Noise filling of a spectrum of an audio signal is improved in quality with respect to the noise filled spectrum so that the reproduction of the noise filled audio signal is less annoying, by performing the noise filling in a manner dependent on a tonality of the audio signal.

Type: Grant

Filed: July 26, 2019

Date of Patent: June 8, 2021

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Marc Gayer, Christian Helmrich, Goran Markovic, Maria Luis Valero

Orchestrating execution of a series of actions requested to be performed via an automated assistant

Patent number: 11031007

Abstract: Implementations are set forth herein for creating an order of execution for actions that were requested by a user, via a spoken utterance to an automated assistant. The order of execution for the requested actions can be based on how each requested action can, or is predicted to, affect other requested actions. In some implementations, an order of execution for a series of actions can be determined based on an output of a machine learning model, such as a model that has been trained according to supervised learning. A particular order of execution can be selected to mitigate waste of processing, memory, and network resources—at least relative to other possible orders of execution. Using interaction data that characterizes past performances of automated assistants, certain orders of execution can be adapted over time, thereby allowing the automated assistant to learn from past interactions with one or more users.

Type: Grant

Filed: February 7, 2019

Date of Patent: June 8, 2021

Assignee: GOOGLE LLC

Inventors: Mugurel Ionut Andreica, Vladimir Vuskovic, Joseph Lange, Sharon Stovezky, Marcin Nowak-Przygodzki

Query translation

Patent number: 11023461

Abstract: Translating a natural language search query into a query language includes receiving a natural language query for a database, processing the natural language query to generate a modified text input, generating an entity tree based on the modified text input, including assigning one or more semantic markers to one or more words or one or more groups of words within the modified text input, wherein each semantic marker denotes a semantic class for each respective word or group of words, and converting the entity tree into the query language associated with the database.

Type: Grant

Filed: January 16, 2019

Date of Patent: June 1, 2021

Assignee: ServiceNow, Inc.

Inventors: Mikhail Rumiantsau, Alyaksandr Zaytsav, Alexey Zenovich, Aliaksei Vertsel

Portable audio device with voice capabilities

Patent number: 11024309

Abstract: A portable audio device includes a network card able to connect with a WLAN and a wireless modem to connect to a WWAN. The portable audio device communicates with a voice services platform and/or content provider via the network card and WLAN or the wireless modem and WWAN. If the portable audio device does not have access to the WLAN, the portable audio device may process and respond to voice queries by communicating with the voice services platform via the wireless modem and WWAN. The portable audio device also includes a battery that provides power for the various hardware and software components of the portable audio device to perform various functions, such as advanced voice functions. The portable audio device provides true portability and may be used in any environment, such as within a home or building environment or outside of the home or building environment.

Type: Grant

Filed: October 16, 2017

Date of Patent: June 1, 2021

Assignee: Harman International Industries, Incorporated

Inventor: David Owens

Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface

Patent number: 11017766

Abstract: Determining a language for speech recognition of a spoken utterance received via an automated assistant interface for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without requiring a user to explicitly designate a language to be utilized for each interaction. Implementations determine a user profile that corresponds to audio data that captures a spoken utterance, and utilize language(s), and optionally corresponding probabilities, assigned to the user profile in determining a language for speech recognition of the spoken utterance. Some implementations select only a subset of languages, assigned to the user profile, to utilize in speech recognition of a given spoken utterance of the user.

Type: Grant

Filed: October 17, 2018

Date of Patent: May 25, 2021

Assignee: GOOGLE LLC

Inventors: Pu-sen Chao, Diego Melendo Casado, Ignacio Lopez Moreno, William Zhang

Systems and methods for dispensing consumable products with voice interface

Patent number: 11017768

Abstract: A method of dispensing a beverage from a beverage dispenser includes: detecting a user in proximity to the beverage dispenser; prompting the user to provide a first input, wherein the first input is audible; retrieving a user profile for the user based on the first input; receiving a second input from the user, wherein the second input comprises information about a beverage selection of the user, and wherein the second input is provided in a different manner than the first input; and dispensing the beverage.

Type: Grant

Filed: April 26, 2018

Date of Patent: May 25, 2021

Assignee: PepsiCo, Inc.

Inventor: Robert Crawford

Real-time vocal features extraction for automated emotional or mental state assessment

Patent number: 11004461

Abstract: Embodiments of the present systems and methods may provide techniques for extracting vocal features from voice signals to determine an emotional or mental state of one or more persons, such as to determine a risk of suicide and other mental health issues. For example, as a person's mental state may indirectly alter his or her speech, suicidal risk in, for example, hotline calls, may be determined through speech analysis. In embodiments, such techniques may include preprocessing of the original recording, vocal feature extraction, and prediction processing. For example, in an embodiment, a computer-implemented method of determining an emotional or mental state of a person comprises acquiring an audio signal relating to a conversation including the person, extracting signal components relating to an emotional or mental state of at least the person, and outputting information characterizing the extracted emotional or mental state of the person.

Type: Grant

Filed: August 20, 2018

Date of Patent: May 11, 2021

Inventor: Newton Howard

Language identification system for live language interpretation via a computing device

Patent number: 11003853

Abstract: A configuration is implemented to generate, with a processor, an image and audio user interface which has a language identification indicium that is image-based. Further, the configuration sends, with the processor, the image and audio user interface to a computing device so that the image and audio user interface is displayed to a user. Moreover, the configuration receives, with the processor, audio data captured by the computing device from a user positioned at the computing device upon activation of the language identification indicium. Additionally, the configuration performs, with the processor, an audio analysis on the captured audio data to identify a language spoken by the user. Finally, the configuration establishes, with the processor, a language interpretation session between the computing device and a communication device associated with a language interpreter based on the identified language.

Type: Grant

Filed: May 14, 2019

Date of Patent: May 11, 2021

Assignee: Language Line Services, Inc.

Inventors: Jeffrey Cordell, James Boutcher, Lindsay D'Penha

Smart home automation systems and methods

Patent number: 10992491

Abstract: A smart home interaction system is presented. It is built on a multi-modal, multithreaded conversational dialog engine. The system provides a natural language user interface for the control of household devices, appliances or household functionality. The smart home automation agent can receive input from users through sensing devices such as a smart phone, a tablet computer or a laptop computer. Users interact with the system from within the household or from remote locations. The smart home system can receive input from sensors or any other machines with which it is interfaced. The system employs interaction guide rules for processing reaction to both user and sensor input and driving the conversational interactions that result from such input. The system adaptively learns based on both user and sensor input and can learn the preferences and practices of its users.

Type: Grant

Filed: July 29, 2019

Date of Patent: April 27, 2021

Assignee: NANT HOLDINGS IP, LLC

Inventors: Farzad Ehsani, Silke Maren Witt-Ehsani, Walter Rolandi

Data collection using voice and messaging side channel

Patent number: 10978071

Abstract: An approach is provided in which an information handling system sends a first request to a user over a voice channel through a first communication network. The request is in an audio format and requests a user data set from the user. The information handling system establishes a messaging channel with a user device utilized by the user through a second communication network. The messaging channel is an end-to-end digital data channel between the information handling system and the user device. The information handling system receives a set of user data corresponding to the first request from the user device over the messaging channel, and sends the set of user data to a conversation system.

Type: Grant

Filed: September 24, 2019

Date of Patent: April 13, 2021

Assignee: International Business Machines Corporation

Inventors: Scott W. Graham, Lior Luker, Nitzan Nissim, Brian L. Pulito

Speaker diarization

Patent number: 10978070

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for speaker diarization are disclosed. In one aspect, a method includes the actions of receiving audio data corresponding to an utterance. The actions further include determining that the audio data includes an utterance of a predefined hotword spoken by a first speaker. The actions further include identifying a first portion of the audio data that includes speech from the first speaker. The actions further include identifying a second portion of the audio data that includes speech from a second, different speaker. The actions further include transmitting the first portion of the audio data that includes speech from the first speaker and suppressing transmission of the second portion of the audio data that includes speech from the second, different speaker.

Type: Grant

Filed: August 27, 2019

Date of Patent: April 13, 2021

Inventors: Aleksandar Kracun, Richard Cameron Rose

Speech interaction device

Patent number: 10971146

Abstract: A speech interaction device includes an ascertaining section and a control section. The ascertaining section ascertains a direction of a speech utterer by audio emitted by the speech utterer. The control section controls directionality of audio output through a speaker when outputting audio toward the speech utterer, such that directionality of audio in the direction ascertained by the ascertaining section is higher than directionality of audio in other directions.

Type: Grant

Filed: December 28, 2018

Date of Patent: April 6, 2021

Assignee: Toyota Jidosha Kabushiki Kaisha

Inventors: Hideki Kobayashi, Akihiro Muguruma, Yukiya Sugiyama, Shota Higashihara, Riho Matsuo, Naoki Yamamuro

Information processing device and information processing method

Patent number: 10943587

Abstract: An information processing device including an electronic control unit is provided. The electronic control unit is configured: to acquire speech data which is uttered by a user; to acquire context information associated with a situation of the user; to convert the speech data into text data; to select a dictionary which is referred to for determining a meaning of a word included in the text data based on the context information when the speech data has been acquired; to give the meaning of the word determined with reference to the selected dictionary to the text data; and to provide a service based on the text data to which the meaning of the word is given.

Type: Grant

Filed: December 28, 2018

Date of Patent: March 9, 2021

Assignee: Toyota Jidosha Kabushiki Kaisha

Inventor: Koichi Suzuki

System, method, and recording medium for controlling dialogue interruptions by a speech output device

Patent number: 10943592

Abstract: A computer speech output control method, system, and non-transitory computer readable medium, include a computer speech output unit configured to output a computer speech, a human speech monitoring circuit configured to determine whether ambient human conversation including human-to-human speech is occurring, and an interruption determining circuit configured to determine whether to cause the computer speech output unit to output the computer speech based on a status of the human conversation.

Type: Grant

Filed: October 31, 2017

Date of Patent: March 9, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Christopher J. Hardee, Steven Robert Joroff, Pamela Ann Nesbitt, Scott Edward Schneider

Systems and methods for text input using neuromuscular information

Patent number: 10937414

Abstract: Systems and methods for text input based on neuromuscular information. The system includes a plurality of neuromuscular sensors, arranged on one or more wearable devices, wherein the plurality of neuromuscular sensors is configured to continuously record a plurality of neuromuscular signals from a user, at least one storage device configured to store one or more trained statistical models, and at least one computer processor programmed to obtain the plurality of neuromuscular signals from the plurality of neuromuscular sensors, provide as input to the one or more trained statistical models, the plurality of neuromuscular signals or signals derived from the plurality of neuromuscular signals, and determine based, at least in part, on an output of the one or more trained statistical models, one or more linguistic tokens.

Type: Grant

Filed: May 8, 2018

Date of Patent: March 2, 2021

Assignee: Facebook Technologies, LLC

Inventors: Adam Berenzweig, Alan Huan Du, Jeffrey Scott Seely

System and method for generating blocks of natural language

Patent number: 10902210

Abstract: The invention relates to a system and method for generating a block of natural language, the system comprising a digital data store capable of storing a data graph according to a data schema, input sub-system for entering natural language data units to the data graph, and a data processor for generating a block of natural language based on the data graph. Further, the data schema allows storage of recursively nested natural language data units and relation data units associated with the natural language data units into the data graph, the relation data units being configured to define relations between natural language data units in the data graph. The data processor is adapted to generate said block of natural language utilizing a plurality of natural language data units and relations between the natural language data units as defined by the relation data units associated therewith.

Type: Grant

Filed: December 28, 2018

Date of Patent: January 26, 2021

Inventors: Sakari Arvela, Juho Kallio

Systems and methods for multi-speaker neural text-to-speech

Patent number: 10896669

Abstract: Described herein are systems and methods for augmenting neural speech synthesis networks with low-dimensional trainable speaker embeddings in order to generate speech from different voices from a single model. As a starting point for multi-speaker experiments, improved single-speaker model embodiments, which may be referred to generally as Deep Voice 2 embodiments, were developed, as well as a post-processing neural vocoder for Tacotron (a neural character-to-spectrogram model). New techniques for multi-speaker speech synthesis were performed for both Deep Voice 2 and Tacotron embodiments on two multi-speaker TTS datasets—showing that neural text-to-speech systems can learn hundreds of unique voices from twenty-five minutes of audio per speaker.

Type: Grant

Filed: May 8, 2018

Date of Patent: January 19, 2021

Assignee: Baidu USA LLC

Inventors: Sercan O. Arik, Gregory Diamos, Andrew Gibiansky, John Miller, Kainan Peng, Wei Ping, Jonathan Raiman, Yanqi Zhou

Optimization of speech analytics system recognition thresholds for target word identification in a contact center

Patent number: 10891940

Abstract: An approach for optimizing a confidence score threshold that is used to recognize a target word(s) in an audio source. A variety of potential instances of the target word can be detected and classified using an initial confidence score threshold value. Each potential instance of the target word is audibly reviewed and validated by a user. After a determination of the correctness of each potential instance's classification, a different confidence score threshold value can be used to produce an updated set of classification results without requiring the user to revalidate the results. By using a variety of confidence score threshold values to produce various sets of classification results, an optimized confidence threshold setting can be determined for the identified target word based on minimizing errors in the various results. This value can then be applied for future analysis of the target word in an audio source.

Type: Grant

Filed: December 13, 2018

Date of Patent: January 12, 2021

Assignee: Noble Systems Corporation

Inventors: Steven K. Mammen, Patrick M. McDaniel, Karl H. Koster
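Illustrative note: the threshold optimization described in the abstract of patent 10891940 can be sketched roughly as follows. This is a minimal sketch and not the patented method. It assumes each detected instance carries a confidence score and a one-time user validation, and that "minimizing errors" means minimizing the count of misclassified instances. The function name `best_threshold`, the sample scores, and the candidate thresholds are all hypothetical.

```python
from typing import List, Tuple


def best_threshold(
    validated: List[Tuple[float, bool]],
    candidates: List[float],
) -> Tuple[float, int]:
    """Return (threshold, error_count) for the candidate with the fewest errors.

    validated: (confidence score, user-confirmed "is the target word") pairs.
    An instance is classified as the target word when score >= threshold.
    Ties are resolved in favor of the earliest candidate in the list.
    """
    best_t, best_errors = candidates[0], len(validated) + 1
    for t in candidates:
        # Re-classify every instance at this threshold and compare against
        # the stored validation, so the user never has to review again.
        errors = sum((score >= t) != is_target for score, is_target in validated)
        if errors < best_errors:
            best_t, best_errors = t, errors
    return best_t, best_errors


# Hypothetical validated detections: (confidence, confirmed by reviewer)
validated = [
    (0.35, False), (0.48, False), (0.52, True),
    (0.61, False), (0.70, True), (0.88, True),
]
candidates = [0.3, 0.4, 0.5, 0.6, 0.7, 0.8]

threshold, errors = best_threshold(validated, candidates)
print(threshold, errors)  # prints: 0.5 1
```

The user validations are collected once and reused for every candidate threshold, which mirrors the abstract's point that updated classification results can be produced without revalidation.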