Patents Examined by Vu B. Hang

Systems and methods for detecting fake voice commands to smart devices

Patent number: 11972760

Abstract: The present disclosure relates to detecting the use of fake voice command to activate microphones of smart devices. In one embodiment, sound characteristics associated with an audio signal from a microphone of smart device may be compared with other microphones of the smart device in order to detect fake voice commands. In another embodiment, sound characteristics associated with the audio signal from the microphone may be compared with a threshold range of stored sound characteristics in order to detect fake voice commands. In some embodiments, a controller may triangulate a position associated with a source of a sound in order to detect a fake voice command. In a further embodiment, a controller may verify that a user or associated electronic device are near a smart device to authorize a voice command.

Type: Grant

Filed: July 28, 2020

Date of Patent: April 30, 2024

Assignee: United Services Automobile Association (USAA)

Inventors: Carlos J P Chavez, Sacha Melquiades De'Angeli, Oscar Guerra, David M. Jones, Jr., Gregory Brian Meyer, Christopher Russell, Arthur Quentin Smith
Adapting client application of feature phone based on experiment parameters

Patent number: 11966764

Abstract: Some implementations are directed to adapting a client application on a feature phone based on experiment parameters. Some of those implementations are directed to adapting an assistant client application, where the assistant client application interacts with remote assistant component(s) to provide automated assistant functionalities via the assistant client application of the feature phone. Some implementations are additionally or alternatively directed to determining whether an invocation, of an assistant client application on a feature phone, is a request for transcription of voice data received in conjunction with the invocation, or is instead a request for an assistant response that is responsive to the transcription of the voice data (e.g., includes assistant content that is based on and in addition to the transcription, and that optionally lacks the transcription itself).

Type: Grant

Filed: December 16, 2021

Date of Patent: April 23, 2024

Assignee: GOOGLE LLC

Inventors: Diego Accame, Abraham Lee, Yujie Wan, Shriya Raghunathan, Raymond Carino, Feng Ji, Shashwat Lal Das, Nickolas Westman
Contextual speech recognition methods and systems

Patent number: 11967306

Abstract: Methods and systems are provided for assisting operation of a vehicle using speech recognition. One method involves automatically identifying an input element based at least in part on an audio communication with respect to the vehicle, identifying one or more constraints associated with the input element, obtaining a limited command vocabulary for the input element using the one or more constraints, and automatically constructing a contextual speech recognition graph for the input element prior to user selection of the input element using the limited command vocabulary. Thereafter, subsequently received audio input is recognized using the contextual speech recognition graph that was automatically and prospectively generated.

Type: Grant

Filed: June 22, 2021

Date of Patent: April 23, 2024

Assignee: HONEYWELL INTERNATIONAL INC.

Inventors: Hariharan Saptharishi, Gobinathan Baladhandapani, Sivakumar Kanagarajan, Amal Leo
Hotword detection on multiple devices

Patent number: 11955121

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio data that corresponds to an utterance. The actions further include determining a likelihood that the utterance includes a hotword. The actions further include determining a loudness score for the audio data. The actions further include based on the loudness score, determining an amount of delay time. The actions further include, after the amount of delay time has elapsed, transmitting a signal that indicates that the computing device will initiate speech recognition processing on the audio data.

Type: Grant

Filed: April 28, 2021

Date of Patent: April 9, 2024

Assignee: GOOGLE LLC

Inventors: Jakob Nicolaus Foerster, Alexander H. Gruenstein
Layered coding and data structure for compressed higher-order Ambisonics sound or sound field representations

Patent number: 11955130

Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.

Type: Grant

Filed: May 19, 2022

Date of Patent: April 9, 2024

Assignee: DOLBY INTERNATIONAL AB

Inventors: Sven Kordon, Alexander Krueger
Natural language generation by an edge computing device

Patent number: 11954453

Abstract: Systems and methods for natural language generation by an edge computing device are disclosed. In one embodiments, a method comprises: receiving, by an edge computing device, event data from an edge event; determining, by the edge computing device, that a network connection to a cloud server is not available; extracting, by the edge computing device, features of the event data; predicting, by a local neural network of the edge computing device, an action for the edge computing device to take based on the features of the event data, wherein the action is associated with a confidence level; and determining, by the edge computing device, whether the confidence level meets a predetermined threshold value.

Type: Grant

Filed: March 12, 2019

Date of Patent: April 9, 2024

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Chih-Hsiung Liu, I-Chien Lin, Cheng-Fang Lin, Joey H. Y. Tseng
Layered coding for compressed sound or sound field represententations

Patent number: 11948584

Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.

Type: Grant

Filed: May 23, 2022

Date of Patent: April 2, 2024

Assignee: DOLBY INTERNATIONAL AB

Inventors: Sven Kordon, Alexander Krueger
Coordinating electronic personal assistants

Patent number: 11929065

Abstract: A method includes receiving a representation of a spoken utterance, processing the representation of the spoken utterance to identify, from a number of candidate domains, a request and a serving domain, and routing the request to a personal assistant based on the request and the serving domain. Identification of the serving domain is based on one or more of a contextual state, a behavior profile of a speaker of the utterance, and a semantic content of the utterance.

Type: Grant

Filed: September 30, 2021

Date of Patent: March 12, 2024

Assignee: Cerence Operating Company

Inventors: Giuseppe Iacobelli, Binh Nguyen, Josef Anastasiadis
System and method for rule based modifications to variable slots based on context

Patent number: 11915693

Abstract: Methods, programming, and system for modifying a slot value are described herein. In a non-limiting embodiment, an intent may be determined based on a first utterance. A first slot-value pair may be obtained for the first utterance based on the intent, the first slot-value pair including a first slot and a first value associated with the first slot. A second value associated with the first slot may be identified, the second value being identified from a second utterance that was previously received. Based on the intent and the first slot, a type of update to be performed with respect to the second value may be determined. The second value may then be updated based on the first value and the type of update.

Type: Grant

Filed: September 21, 2020

Date of Patent: February 27, 2024

Assignee: YAHOO ASSETS LLC

Inventors: Prakhar Biyani, Cem Akkaya, Kostas Tsioutsiouliklis
Audio modifying conferencing system

Patent number: 11915716

Abstract: A computer-implemented method for modifying audio-based communications produced during a conference call is disclosed. The computer-implemented method can include monitoring a plurality of utterances transmitted via an audio feed of a device connected to the conference call. The computer-implemented method can identify a first unwanted audio component transmitted via the audio feed. The computer-implemented method can actively modify the audio feed by removing the first unwanted audio component from the audio feed.

Type: Grant

Filed: July 16, 2020

Date of Patent: February 27, 2024

Assignee: International Business Machines Corporation

Inventors: Craig M. Trim, Adam Lee Griffin, Shikhar Kwatra, Hyman David Chantz
Device for processing user voice input

Patent number: 11915700

Abstract: An electronic device according to an embodiment comprises a microphone, a communication circuitry, a memory storing utterance pattern information of a first user registered in the electronic device and instructions, and a processor connected to the microphone, the communication circuitry, and the memory. The instructions, when executed by the processor, cause the electronic device to: obtain a utterance through the microphone; determine whether the utterance is uttered by the first user based on the utterance pattern information; based on being determined the utterance is uttered by the first user, transmit the utterance to an external server through the communication circuitry; receive a response message corresponding to the utterance from the external server through the communication circuitry; and execute at least one function corresponding to the response message. The response message is generated with reference to utterance history of a second user different from the first user.

Type: Grant

Filed: August 19, 2022

Date of Patent: February 27, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventors: Yangkyun Oh, Jaeyung Yeo, Changryong Heo
Task modification after task initiation

Patent number: 11908473

Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example process includes, at an electronic device having one or more processors and memory: performing a first task specified in a first user speech input; receiving a second user speech input; and in accordance with a determination that the second user speech input includes a modification to the first task, performing a second task, wherein performance of the second task modifies at least a portion of the performance of the first task.

Type: Grant

Filed: September 21, 2022

Date of Patent: February 20, 2024

Assignee: Apple Inc.

Inventors: Yi Ma, Arash Dawoodi, Antoine R. Raux, Humza M. Siddiqui
Detection of potential exfiltration of audio data from digital assistant applications

Patent number: 11908459

Abstract: The present disclosure is generally related to a data processing system to detect potential exfiltration of audio data by agent applications can include a data processing system. The data processing system can identify, from an I/O record, an input received from the digital assistant application via a microphone of a client device, an output received from the agent application after the input, and a microphone status for the microphone. The data processing system can determine that the output is terminal based on the input and the output. The data processing system can identify the microphone status as in the enabled state subsequent to the input. The data processing system can determine that the agent application is unauthorized to access audio data acquired via the microphone of the client device based on determining that the output is terminal and identifying the microphone status as enabled.

Type: Grant

Filed: May 14, 2021

Date of Patent: February 20, 2024

Assignee: GOOGLE LLC

Inventors: Yan Huang, Nikhil Rao
Adapting client application of feature phone based on experiment parameters

Patent number: 11893402

Abstract: Some implementations are directed to adapting a client application on a feature phone based on experiment parameters. Some of those implementations are directed to adapting an assistant client application, where the assistant client application interacts with remote assistant component(s) to provide automated assistant functionalities via the assistant client application of the feature phone. Some implementations are additionally or alternatively directed to determining whether an invocation, of an assistant client application on a feature phone, is a request for transcription of voice data received in conjunction with the invocation, or is instead a request for an assistant response that is responsive to the transcription of the voice data (e.g., includes assistant content that is based on and in addition to the transcription, and that optionally lacks the transcription itself).

Type: Grant

Filed: December 16, 2021

Date of Patent: February 6, 2024

Assignee: GOOGLE LLC

Inventors: Diego Accame, Abraham Lee, Yujie Wan, Shriya Raghunathan, Raymond Carino, Feng Ji, Shashwat Lal Das, Nickolas Westman
Server and system including the same

Patent number: 11893987

Abstract: The present disclosure relates to a server and a system including the same.

Type: Grant

Filed: May 28, 2021

Date of Patent: February 6, 2024

Assignee: LG ELECTRONICS, INC.

Inventors: Yookyoung Choi, Kiwon Park, Jaekyung Lee
Search system and method having civility score

Patent number: 11893981

Abstract: A scoring system and method identifies personal attacks in a piece of audio content and generates a civility score for the piece of audio content that can differentiate between personal attacks and vernacular/casual banter. The piece of audio content may be a podcast.

Type: Grant

Filed: September 7, 2023

Date of Patent: February 6, 2024

Assignee: SEEKR TECHNOLOGIES INC.

Inventors: Robin J. Clark, Ali Taleb Zadeh Kasgari, Stefanos Poulis
System and method for conversation agent selection based on processing contextual data from speech

Patent number: 11881216

Abstract: A system for identifying computer agents to perform a particular task requested by a user, receives an audio signal to perform the particular task. The system extracts a set of features from the audio signal. The set of features represents at least a first keyword indicating the particular task. The system determines which one or more computer agents from a plurality of computer agents is predetermined to perform the particular task by comparing the first keyword with a plurality of keywords associated with the plurality of keywords. The system determines a first computer agent associated with a second keyword that corresponds to the first keyword. The system executes the first computer agent to perform the particular task.

Type: Grant

Filed: June 8, 2021

Date of Patent: January 23, 2024

Assignee: Bank of America Corporation

Inventor: Rajan Jigish Jhaveri
Voice trigger for a digital assistant

Patent number: 11862186

Abstract: A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant.

Type: Grant

Filed: October 7, 2022

Date of Patent: January 2, 2024

Assignee: Apple Inc.

Inventors: Justin Binder, Samuel D. Post, Onur Tackin, Thomas R. Gruber
Systems and methods for processing speech dialogues

Patent number: 11862143

Abstract: The present disclosure is related to systems and methods for processing speech dialogue. The method includes obtaining target speech dialogue data. The method includes obtaining a text vector representation sequence, a phonetic symbol vector representation sequence, and a role vector representation sequence by performing a vector transformation on the target speech dialogue data based on a text embedding model, a phonetic symbol embedding model, and a role embedding model, respectively. The method includes determining a representation vector corresponding to the target speech dialogue data by inputting the text vector representation sequence, the phonetic symbol vector representation sequence, and the role vector representation sequence into a trained speech dialogue coding model. The method includes determining a summary of the target speech dialogue data by inputting the representation vector into a classification model.

Type: Grant

Filed: August 19, 2020

Date of Patent: January 2, 2024

Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.

Inventors: Haiyang Xu, Kun Han
Skill dispatching method and apparatus for speech dialogue platform

Patent number: 11862150

Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.

Type: Grant

Filed: November 18, 2020

Date of Patent: January 2, 2024

Assignee: AI SPEECH CO., LTD.

Inventors: Chengya Zhu, Shuai Fan, Weisi Shi

1 2 3 4 5 … next