Patents Examined by Richard Z Zhu

Automatic speaker identification in calls using multiple speaker-identification parameters

Patent number: 11417343

Abstract: A speaker identification system (“system”) automatically assigns a speaker to voiced segments in a call. The system identifies one or more speakers in a call using one or more speaker-identification parameters. The system processes the call to determine one or more speaker-identification parameters, such as a transcript of the call, a facial image of the speaker, a scene image, which is an image of a scene in which the speaker is located during the call, or textual data associated with the call such as names of the speaker or an organization that are retrieved from the scene images or video data of the call. The system analyzes one or more of the speaker-identification parameters and determines the identity of the speaker. The system then identifies the voice segments associated with the identified speaker and marks the voice segments with the identity of the speaker.

Type: Grant

Filed: July 2, 2018

Date of Patent: August 16, 2022

Assignee: ZOOMINFO CONVERSE LLC

Inventors: Raphael Cohen, Erez Volk, Russell Levy, Micha Yochanan Breakstone, Orgad Keller, Ilana Tuil, Amit Ashkenazi
Voice user interface for nested content

Patent number: 11410638

Abstract: Methods and systems for causing a voice-activated electronic device to identify that a step of a series of steps can begin while a previous step is ongoing. In some embodiments, a first step will have a waiting period. The methods and systems, in some embodiments, identify this waiting period and determine that a second step can begin during the waiting period of step one. In some embodiments, nested sets of sequential steps are identified within the series of steps. The nested sets of sequential steps, in some embodiments, can be called upon.

Type: Grant

Filed: August 30, 2017

Date of Patent: August 9, 2022

Assignee: Amazon Technologies, Inc.

Inventor: Eshan Bhatnagar
Text-to-speech (TTS) processing with transfer of vocal characteristics

Patent number: 11410684

Abstract: Audio data from a first, source speaker is received and processed to determine linguistic units and vocal characteristics corresponding to those linguistic units. The linguistic units may either be determined from received text data or may be determined from the audio data using automatic speech recognition. A model is trained using training data from a second, target speaker. The trained model concatenates the linguistic units with the vocal characteristics to produce output speech that has the “voice” of the target speaker and the vocal characteristics of the source speaker.

Type: Grant

Filed: June 4, 2019

Date of Patent: August 9, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Viacheslav Klimkov, Thomas Renaud Drugman, Alexander Galkin, Srikanth Ronanki
Method and apparatus for controlling device

Patent number: 11398225

Abstract: A method and apparatus for controlling a device are disclosed. The method includes: performing voice recognition on a received sound signal to obtain a voice recognition result; determining keywords using the voice recognition result; determining a target intelligent device having attribute information matched with the keywords from intelligent devices, where relationships between the intelligent devices and attribute information of the intelligent devices are constructed in advance, and the attribute information characterizes a device operation provided by the intelligent device corresponding to the attribute information; and controlling the target intelligent device to perform an operation indicated by the voice recognition result.

Type: Grant

Filed: September 25, 2019

Date of Patent: July 26, 2022

Assignee: Beijing Xiaomi Intelligent Technology Co., Ltd.

Inventor: Fuxin Li
Method and apparatus for recognizing a voice

Patent number: 11393459

Abstract: Disclosed are a speech recognition device and a speech recognition method which perform speech recognition by executing an artificial intelligence (AI) algorithms and/or a machine learning algorithm installed thereon, to communicate with other electronic devices and an external server in a 5G communication environment. The speech recognition method according to an embodiment of the present disclosure may include converting a series of spoken utterance signals to a text item, extracting a discordant named-entity that is discordant with a parent domain inferred form the text, calculating probabilities of candidate words associated with the discordant named-entity based on calculated distances between a term representing the parent domain and each candidate word associated with the discordant named-entity, and based on the calculated probabilities, modifying the discordant named-entity in the text to one of the candidate words associated with the discordant named-entity.

Type: Grant

Filed: September 24, 2019

Date of Patent: July 19, 2022

Assignee: LG ELECTRONICS INC.

Inventors: Jong Hoon Chae, Esther Park, Su Il Choe
Discriminating ambiguous expressions to enhance user experience

Patent number: 11386268

Abstract: Methods and systems are provided for discriminating ambiguous expressions to enhance user experience. For example, a natural language expression may be received by a speech recognition component. The natural language expression may include at least one of words, terms, and phrases of text. A dialog hypothesis set from the natural language expression may be created by using contextual information. In some cases, the dialog hypothesis set has at least two dialog hypotheses. A plurality of dialog responses may be generated for the dialog hypothesis set. The dialog hypothesis set may be ranked based on an analysis of the plurality of the dialog responses. An action may be performed based on ranking the dialog hypothesis set.

Type: Grant

Filed: December 4, 2017

Date of Patent: July 12, 2022

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jean-Philippe Robichaud, Ruhi Sarikaya
Generation of alternate representions of utterances

Patent number: 11380304

Abstract: A system is provided for handling errors during automatic speech recognition by processing a potentially defective utterance to determine an alternative, potentially successful utterance. The system processes an ASR hypothesis, using a probabilistic graph, to determine a likelihood that it will result in an error. Using the probabilistic graph, the system determines an alternate utterance.

Type: Grant

Filed: March 25, 2019

Date of Patent: July 5, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Pragaash Ponnusamy, Alireza Roshan Ghias, Chenlei Guo
Intelligent presentation method

Patent number: 11380323

Abstract: Disclosed is an intelligent presentation method. The intelligent presentation method of the present disclosure may support a presentation to be smoothly performed by learning content of the presentation while a presenter is presenting and performing a function required for the presentation in response to a command voice. The intelligent presentation-assisting device of the present disclosure may be associated with an artificial intelligence module, a robot, an augmented reality (AR) device, a virtual reality (VR) device, a device related to a 5G service, and the like.

Type: Grant

Filed: November 22, 2019

Date of Patent: July 5, 2022

Assignee: LG ELECTRONICS INC.

Inventors: Wonho Shin, Jichan Maeng
Agent device, system, control method of agent device, and storage medium

Patent number: 11380325

Abstract: An agent device includes one or more agent controllers configured to provide a service including causing an output device to output a response of voice according to a voice of an occupant which is collected in a vehicle interior of a vehicle, a receiver configured to receive an input from the occupant, and a starting method setter configured to change or add a starting method of the agent controller on the basis of content received by the receiver.

Type: Grant

Filed: March 12, 2020

Date of Patent: July 5, 2022

Assignee: HONDA MOTOR CO., LTD.

Inventors: Masaki Kurihara, Shinichi Kikuchi, Shinya Yasuhara, Yusuke Oi, Hiroshi Honda
Determining context and intent in omnichannel communications using machine learning based artificial intelligence (AI) techniques

Patent number: 11373045

Abstract: A system for determining context and intent in a conversation using machine learning (ML) based artificial intelligence (AI) in omnichannel data communications is disclosed. The system may comprise a data store to store and manage data within a network, a server to facilitate operations using information from the one or more data stores, and a ML-based AI subsystem to communicate with the server and the data store in the network. The ML-based AI subsystem may comprise a data access interface to receive data associated with a conversation with a user via a communication channel. The ML-based AI subsystem may comprise a processor to provide a proactive, adaptive, and intelligent conversation by applying hierarchical multi-intent data labeling framework, training at least one model with training data, and generating and deploying a production-ready model based on the trained and retained at least one model.

Type: Grant

Filed: September 24, 2019

Date of Patent: June 28, 2022

Assignee: CONTACTENGINE LIMITED

Inventors: Dominic Bealby-Wright, Cosmin Dragos Davidescu
System, method, and recording medium for corpus pattern paraphrasing

Patent number: 11334721

Abstract: A corpus pattern paraphrasing method, system, and non-transitory computer readable medium, include aligning slots of patterns for verbal phrases based on syntactical and lexical features along with calculated synonyms to predict paraphrases that are not previously stored in a corpus of sentences in a database.

Type: Grant

Filed: July 31, 2019

Date of Patent: May 17, 2022

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Octavian Popescu, Vadim Sheinin
User adaptive conversation apparatus and method based on monitoring of emotional and ethical states

Patent number: 11328711

Abstract: A user adaptive conversation apparatus generating a talk for a conversation based on emotional and ethical states of a user. A voice recognition unit converts a talk of the user in a conversational situation into a natural language script form to generate talk information. An artificial visualization unit generates situation information by recognizing talking situation from a video and generates intention information indicating an intention of the talk. A natural language analysis unit converts the situation information and the intention information into the natural language script form. A natural language analysis unit analyzes the talk information, the intention information, and the situation information.

Type: Grant

Filed: July 5, 2019

Date of Patent: May 10, 2022

Assignee: KOREA ELECTRONICS TECHNOLOGY INSTITUTE

Inventors: Saim Shin, Hyedong Jung, Jinyea Jang
Method and apparatus for generating caption

Patent number: 11330342

Abstract: A method and apparatus for generating a caption are provided. The method of generating a caption according to one embodiment comprises: generating caption text which corresponds to a voice of a speaker included in broadcast data; generating reference voice information using a part of the voice of the speaker included in the broadcast data; and generating caption style information for the caption text based on the voice of the speaker and the reference voice information.

Type: Grant

Filed: June 3, 2019

Date of Patent: May 10, 2022

Assignee: NCSOFT Corporation

Inventors: Byungju Kim, Songhee So, Euijoon Son, Seungjoon Ahn, Sungyoung Yoon
Cross domain personalized vocabulary learning in intelligent assistants

Patent number: 11314940

Abstract: A method includes determining, by an electronic device, a skill from a first natural language (NL) input. Upon successful determination of the skill, the first NL input is transmitted to a custom skill parser for determination of a skill intent. The custom skill parser is trained based on data including at least a custom training data set. Upon unsuccessful determination of the skill, the first NL input is transmitted to a generic parser for determination of a general intent of the first NL input.

Type: Grant

Filed: May 22, 2018

Date of Patent: April 26, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Avik Ray, Yilin Shen, Hongxia Jin
Natural language processing apparatus and program

Patent number: 11308941

Abstract: A natural language processing apparatus includes: a first calculation unit configured to calculate a distributed vector of a word included in a plurality of sentences based on a database that manages the plurality of sentences associated with a classification word; a second calculation unit configured to calculate a distributed vector of the sentence based on the distributed vector of the word included in each sentence; and a third calculation unit configured to calculate a distributed vector of the classification word based on the distributed vector of each sentence associated with the same classification word.

Type: Grant

Filed: March 25, 2020

Date of Patent: April 19, 2022

Assignee: Nomura Research Institute, Ltd.

Inventors: Junichiro Maki, Satoshi Tobita, Shuichi Watanabe, Yosuke Hori, Jun Eijima
Cascaded adaptive interference cancellation algorithms

Patent number: 11277685

Abstract: Techniques for improving adaptive interference cancellation (AIC) using cascaded AIC algorithms are described. To improve an accuracy of detecting speech, a device may perform a first stage of AIC to generate isolated audio data and may generate speech mask data indicating time windows when speech is detected in the isolated audio data. Based on the speech mask data, the device may perform second AIC to generate output audio data, with adaptation of the adaptive filter enabled when the speech is not detected and disabled when the speech is detected. Thus, the first AIC improves the accuracy with which the device detects that speech is present and the second AIC reduces distortion in the output audio data by not updating filter coefficient values when the speech is present. The first AIC may use playback audio data, microphone audio data or beamformed audio data as reference signals.

Type: Grant

Filed: November 5, 2018

Date of Patent: March 15, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Robert Ayrapetian, Philip Ryan Hilmes, Mohamed Mansour, Carlo Murgia
Cross domain personalized vocabulary learning in intelligent assistants

Patent number: 11275896

Abstract: A method includes determining, by an electronic device, a skill from a first natural language (NL) input. Upon successful determination of the skill, the first NL input is transmitted to a custom skill parser for determination of a skill intent. The custom skill parser is trained based on data including at least a custom training data set. Upon unsuccessful determination of the skill, the first NL input is transmitted to a generic parser for determination of a general intent of the first NL input.

Type: Grant

Filed: May 22, 2018

Date of Patent: March 15, 2022

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Avik Ray, Yilin Shen, Hongxia Jin
Method for waking up robot and robot thereof

Patent number: 11276402

Abstract: A method for waking up a robot includes: acquiring sight range information when a voice command issuer issues a voice command; if the sight range information of the voice command issuer when issuing the voice command is acquired, determining, based on the sight range information, whether the voice command issuer gazes the robot when the voice command is issued; and determining that the robot is called if the voice command issuer gazes the robot.

Type: Grant

Filed: November 8, 2019

Date of Patent: March 15, 2022

Assignee: CLOUDMINDS ROBOTICS CO., LTD.

Inventor: Lei Luo
Information processing device and information processing method

Patent number: 11269936

Abstract: An information processing device includes a processor. The processor is configured to: receive an input of a question; hold a response, when data required to output response content in response to the question is insufficient; and output, when insufficient data is collected while the response is being held, an announcement that the response is made and the response content.

Type: Grant

Filed: February 15, 2019

Date of Patent: March 8, 2022

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventors: Chikage Kubo, Takuji Yamada
Indicator for voice-based communications

Patent number: 11264030

Abstract: Systems, methods, and devices for outputting indications regarding voice-based interactions are described. A first speech-controlled device detects spoken audio corresponding to recipient information. The first device captures the audio and sends audio data corresponding to the captured audio to a server. The server determines a second speech-controlled device of the recipient and sends a signal to the recipient's second speech-controlled device representing a message is forthcoming. The recipient's second speech-controlled device outputs and indication representing a message is forthcoming.

Type: Grant

Filed: January 2, 2020

Date of Patent: March 1, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Christo Frank Devaraj, Manish Kumar Dalmia, Tony Roy Hardie, Ran Mokady, Nick Ciubotariu, Sandra Lemon

prev 1 2 3 4 5 6 7 8 9 … next