Patents Examined by Vu B. Hang
  • Patent number: 11966764
    Abstract: Some implementations are directed to adapting a client application on a feature phone based on experiment parameters. Some of those implementations are directed to adapting an assistant client application, where the assistant client application interacts with remote assistant component(s) to provide automated assistant functionalities via the assistant client application of the feature phone. Some implementations are additionally or alternatively directed to determining whether an invocation, of an assistant client application on a feature phone, is a request for transcription of voice data received in conjunction with the invocation, or is instead a request for an assistant response that is responsive to the transcription of the voice data (e.g., includes assistant content that is based on and in addition to the transcription, and that optionally lacks the transcription itself).
    Type: Grant
    Filed: December 16, 2021
    Date of Patent: April 23, 2024
    Assignee: GOOGLE LLC
    Inventors: Diego Accame, Abraham Lee, Yujie Wan, Shriya Raghunathan, Raymond Carino, Feng Ji, Shashwat Lal Das, Nickolas Westman
  • Patent number: 11967306
    Abstract: Methods and systems are provided for assisting operation of a vehicle using speech recognition. One method involves automatically identifying an input element based at least in part on an audio communication with respect to the vehicle, identifying one or more constraints associated with the input element, obtaining a limited command vocabulary for the input element using the one or more constraints, and automatically constructing a contextual speech recognition graph for the input element prior to user selection of the input element using the limited command vocabulary. Thereafter, subsequently received audio input is recognized using the contextual speech recognition graph that was automatically and prospectively generated.
    Type: Grant
    Filed: June 22, 2021
    Date of Patent: April 23, 2024
    Assignee: HONEYWELL INTERNATIONAL INC.
    Inventors: Hariharan Saptharishi, Gobinathan Baladhandapani, Sivakumar Kanagarajan, Amal Leo
  • Patent number: 11955121
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio data that corresponds to an utterance. The actions further include determining a likelihood that the utterance includes a hotword. The actions further include determining a loudness score for the audio data. The actions further include based on the loudness score, determining an amount of delay time. The actions further include, after the amount of delay time has elapsed, transmitting a signal that indicates that the computing device will initiate speech recognition processing on the audio data.
    Type: Grant
    Filed: April 28, 2021
    Date of Patent: April 9, 2024
    Assignee: GOOGLE LLC
    Inventors: Jakob Nicolaus Foerster, Alexander H. Gruenstein
  • Patent number: 11955130
    Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.
    Type: Grant
    Filed: May 19, 2022
    Date of Patent: April 9, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 11954453
    Abstract: Systems and methods for natural language generation by an edge computing device are disclosed. In one embodiments, a method comprises: receiving, by an edge computing device, event data from an edge event; determining, by the edge computing device, that a network connection to a cloud server is not available; extracting, by the edge computing device, features of the event data; predicting, by a local neural network of the edge computing device, an action for the edge computing device to take based on the features of the event data, wherein the action is associated with a confidence level; and determining, by the edge computing device, whether the confidence level meets a predetermined threshold value.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: April 9, 2024
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Chih-Hsiung Liu, I-Chien Lin, Cheng-Fang Lin, Joey H. Y. Tseng
  • Patent number: 11948584
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Grant
    Filed: May 23, 2022
    Date of Patent: April 2, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 11929065
    Abstract: A method includes receiving a representation of a spoken utterance, processing the representation of the spoken utterance to identify, from a number of candidate domains, a request and a serving domain, and routing the request to a personal assistant based on the request and the serving domain. Identification of the serving domain is based on one or more of a contextual state, a behavior profile of a speaker of the utterance, and a semantic content of the utterance.
    Type: Grant
    Filed: September 30, 2021
    Date of Patent: March 12, 2024
    Assignee: Cerence Operating Company
    Inventors: Giuseppe Iacobelli, Binh Nguyen, Josef Anastasiadis
  • Patent number: 11915693
    Abstract: Methods, programming, and system for modifying a slot value are described herein. In a non-limiting embodiment, an intent may be determined based on a first utterance. A first slot-value pair may be obtained for the first utterance based on the intent, the first slot-value pair including a first slot and a first value associated with the first slot. A second value associated with the first slot may be identified, the second value being identified from a second utterance that was previously received. Based on the intent and the first slot, a type of update to be performed with respect to the second value may be determined. The second value may then be updated based on the first value and the type of update.
    Type: Grant
    Filed: September 21, 2020
    Date of Patent: February 27, 2024
    Assignee: YAHOO ASSETS LLC
    Inventors: Prakhar Biyani, Cem Akkaya, Kostas Tsioutsiouliklis
  • Patent number: 11915716
    Abstract: A computer-implemented method for modifying audio-based communications produced during a conference call is disclosed. The computer-implemented method can include monitoring a plurality of utterances transmitted via an audio feed of a device connected to the conference call. The computer-implemented method can identify a first unwanted audio component transmitted via the audio feed. The computer-implemented method can actively modify the audio feed by removing the first unwanted audio component from the audio feed.
    Type: Grant
    Filed: July 16, 2020
    Date of Patent: February 27, 2024
    Assignee: International Business Machines Corporation
    Inventors: Craig M. Trim, Adam Lee Griffin, Shikhar Kwatra, Hyman David Chantz
  • Patent number: 11915700
    Abstract: An electronic device according to an embodiment comprises a microphone, a communication circuitry, a memory storing utterance pattern information of a first user registered in the electronic device and instructions, and a processor connected to the microphone, the communication circuitry, and the memory. The instructions, when executed by the processor, cause the electronic device to: obtain a utterance through the microphone; determine whether the utterance is uttered by the first user based on the utterance pattern information; based on being determined the utterance is uttered by the first user, transmit the utterance to an external server through the communication circuitry; receive a response message corresponding to the utterance from the external server through the communication circuitry; and execute at least one function corresponding to the response message. The response message is generated with reference to utterance history of a second user different from the first user.
    Type: Grant
    Filed: August 19, 2022
    Date of Patent: February 27, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Yangkyun Oh, Jaeyung Yeo, Changryong Heo
  • Patent number: 11908473
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example process includes, at an electronic device having one or more processors and memory: performing a first task specified in a first user speech input; receiving a second user speech input; and in accordance with a determination that the second user speech input includes a modification to the first task, performing a second task, wherein performance of the second task modifies at least a portion of the performance of the first task.
    Type: Grant
    Filed: September 21, 2022
    Date of Patent: February 20, 2024
    Assignee: Apple Inc.
    Inventors: Yi Ma, Arash Dawoodi, Antoine R. Raux, Humza M. Siddiqui
  • Patent number: 11908459
    Abstract: The present disclosure is generally related to a data processing system to detect potential exfiltration of audio data by agent applications can include a data processing system. The data processing system can identify, from an I/O record, an input received from the digital assistant application via a microphone of a client device, an output received from the agent application after the input, and a microphone status for the microphone. The data processing system can determine that the output is terminal based on the input and the output. The data processing system can identify the microphone status as in the enabled state subsequent to the input. The data processing system can determine that the agent application is unauthorized to access audio data acquired via the microphone of the client device based on determining that the output is terminal and identifying the microphone status as enabled.
    Type: Grant
    Filed: May 14, 2021
    Date of Patent: February 20, 2024
    Assignee: GOOGLE LLC
    Inventors: Yan Huang, Nikhil Rao
  • Patent number: 11893402
    Abstract: Some implementations are directed to adapting a client application on a feature phone based on experiment parameters. Some of those implementations are directed to adapting an assistant client application, where the assistant client application interacts with remote assistant component(s) to provide automated assistant functionalities via the assistant client application of the feature phone. Some implementations are additionally or alternatively directed to determining whether an invocation, of an assistant client application on a feature phone, is a request for transcription of voice data received in conjunction with the invocation, or is instead a request for an assistant response that is responsive to the transcription of the voice data (e.g., includes assistant content that is based on and in addition to the transcription, and that optionally lacks the transcription itself).
    Type: Grant
    Filed: December 16, 2021
    Date of Patent: February 6, 2024
    Assignee: GOOGLE LLC
    Inventors: Diego Accame, Abraham Lee, Yujie Wan, Shriya Raghunathan, Raymond Carino, Feng Ji, Shashwat Lal Das, Nickolas Westman
  • Patent number: 11893987
    Abstract: The present disclosure relates to a server and a system including the same.
    Type: Grant
    Filed: May 28, 2021
    Date of Patent: February 6, 2024
    Assignee: LG ELECTRONICS, INC.
    Inventors: Yookyoung Choi, Kiwon Park, Jaekyung Lee
  • Patent number: 11893981
    Abstract: A scoring system and method identifies personal attacks in a piece of audio content and generates a civility score for the piece of audio content that can differentiate between personal attacks and vernacular/casual banter. The piece of audio content may be a podcast.
    Type: Grant
    Filed: September 7, 2023
    Date of Patent: February 6, 2024
    Assignee: SEEKR TECHNOLOGIES INC.
    Inventors: Robin J. Clark, Ali Taleb Zadeh Kasgari, Stefanos Poulis
  • Patent number: 11881216
    Abstract: A system for identifying computer agents to perform a particular task requested by a user, receives an audio signal to perform the particular task. The system extracts a set of features from the audio signal. The set of features represents at least a first keyword indicating the particular task. The system determines which one or more computer agents from a plurality of computer agents is predetermined to perform the particular task by comparing the first keyword with a plurality of keywords associated with the plurality of keywords. The system determines a first computer agent associated with a second keyword that corresponds to the first keyword. The system executes the first computer agent to perform the particular task.
    Type: Grant
    Filed: June 8, 2021
    Date of Patent: January 23, 2024
    Assignee: Bank of America Corporation
    Inventor: Rajan Jigish Jhaveri
  • Patent number: 11862186
    Abstract: A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant.
    Type: Grant
    Filed: October 7, 2022
    Date of Patent: January 2, 2024
    Assignee: Apple Inc.
    Inventors: Justin Binder, Samuel D. Post, Onur Tackin, Thomas R. Gruber
  • Patent number: 11862143
    Abstract: The present disclosure is related to systems and methods for processing speech dialogue. The method includes obtaining target speech dialogue data. The method includes obtaining a text vector representation sequence, a phonetic symbol vector representation sequence, and a role vector representation sequence by performing a vector transformation on the target speech dialogue data based on a text embedding model, a phonetic symbol embedding model, and a role embedding model, respectively. The method includes determining a representation vector corresponding to the target speech dialogue data by inputting the text vector representation sequence, the phonetic symbol vector representation sequence, and the role vector representation sequence into a trained speech dialogue coding model. The method includes determining a summary of the target speech dialogue data by inputting the representation vector into a classification model.
    Type: Grant
    Filed: August 19, 2020
    Date of Patent: January 2, 2024
    Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.
    Inventors: Haiyang Xu, Kun Han
  • Patent number: 11862150
    Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.
    Type: Grant
    Filed: November 18, 2020
    Date of Patent: January 2, 2024
    Assignee: AI SPEECH CO., LTD.
    Inventors: Chengya Zhu, Shuai Fan, Weisi Shi
  • Patent number: 11842144
    Abstract: Embodiments are directed to summarizing conversational speech. Conversation segments may be provided based on a conversation stream and segmentation models. Summarization models may be determined based on characteristics of the conversation segments. Summarization information may be generated for each of the conversation segments based on the summarization models such that the summarization information includes a text-based summarization of the conversation segment. Summarization profiles may be generated for the conversation segments based on the summarization information such that each summarization profile is associated with quality scores. Summarization models may be modified based on the summarization profiles and the associated quality scores such that the summarization profiles are updated based on the modified summarization models. Modified summarization models and the updated summarization profiles may be employed to provide reports to a user.
    Type: Grant
    Filed: March 6, 2023
    Date of Patent: December 12, 2023
    Assignee: Rammer Technologies, Inc.
    Inventors: Toshish Arun Jawale, Sekhar Vallath, Pratik Abhaykumar Budruk