Patents Examined by Vu B. Hang
  • Patent number: 12046235
    Abstract: One embodiment provides a method, including: receiving, at an input device associated with an information handling device, audio input; determining, using a processor, that an audible anomaly exists in the audio input, wherein the audible anomaly corresponds to a deviation from an established speech input pattern of a user; and performing, responsive to determining that the audible anomaly exists in the audio input, a remedial action to address the audible anomaly. Other aspects are described and claimed.
    Type: Grant
    Filed: July 29, 2021
    Date of Patent: July 23, 2024
    Assignee: LENOVO (SINGAPORE) PTE. LTD.
    Inventor: Matthew Tucker
  • Patent number: 12039968
    Abstract: System and method for operating an always-on ASR (automatic speech recognition) system by selecting target keywords and continuously detecting the selected target keywords in voice commands in a mobile device are provided. In the mobile device, a processor is configured to collect keyword candidates, collect usage frequency data for keywords in the keyword candidates, collect situational usage frequency data for the keywords in the keyword candidates, select target keywords from the keyword candidates based on the usage frequency data and the situational usage frequency data, and detect one or more of the target keywords in a voice command using continuous detection of the target keywords.
    Type: Grant
    Filed: September 30, 2020
    Date of Patent: July 16, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Wonil Chang, Jinseok Lee, Mingu Lee, Jinkyu Lee, Byeonggeun Kim, Dooyong Sung, Jae-Won Choi, Kyu Woong Hwang
  • Patent number: 12039970
    Abstract: A system and method for authenticating sound verbalized or otherwise generated by a live source within a monitored setting for voice-controlled or sound-controlled automation of a responsive process. One or more classifiers each generate a decision value according to values of predetermined signal features extracted from a received digital stream, and a sound type classification is computed according to an aggregate score of a predetermined number of decision values. The actuation of the responsive process is authenticated when the system discriminately indicates the captured sound signals to be verbalized or generated by a live source. The responsive process is thereby suppressed when the sound is instead determined to be reproduced or otherwise previously transduced, for example by a transmission or recording.
    Type: Grant
    Filed: July 29, 2022
    Date of Patent: July 16, 2024
    Assignee: Renesas Electronics America
    Inventor: Jeffrey Sieracki
  • Patent number: 12020714
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Grant
    Filed: May 23, 2022
    Date of Patent: June 25, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 12020696
    Abstract: [Object] Technology is provided to enable a mobile terminal to function as a digital assistant even when the mobile terminal is in a state where it cannot communicate with a server apparatus. [Solution] When a user terminal 200 receives a query A from a user, user terminal 200 sends query A to a server 100. Server 100 interprets the meaning of query A using a grammar A. Server 100 obtains a response to query A based on the meaning of query A and sends the response to user terminal 200. Server 100 further sends grammar A to user terminal 200. That is, server 100 sends to user terminal 200 a grammar used to interpret the query received from user terminal 200.
    Type: Grant
    Filed: October 21, 2019
    Date of Patent: June 25, 2024
    Assignee: SoundHound AI IP, LLC
    Inventor: Karl Stahl
  • Patent number: 12014728
    Abstract: A computer implemented method classifies an input corresponding to multiple different kinds of input. The method includes obtaining a set of features from the input, providing the set of features to multiple different models to generate state predictions, generating a set of state-dependent predicted weights, and combining the state predictions from the multiple models, based on the state-dependent predicted weights for classification of the set of features.
    Type: Grant
    Filed: March 25, 2019
    Date of Patent: June 18, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Kshitiz Kumar, Yifan Gong
  • Patent number: 12009007
    Abstract: A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant.
    Type: Grant
    Filed: April 17, 2023
    Date of Patent: June 11, 2024
    Assignee: Apple Inc.
    Inventors: Justin Binder, Samuel D. Post, Onur Tackin, Thomas R. Gruber
  • Patent number: 12002454
    Abstract: Embodiments of the innovation relate to, in a contact center apparatus, a method for recognizing user intent associated with user interaction with the contact center apparatus.
    Type: Grant
    Filed: December 18, 2020
    Date of Patent: June 4, 2024
    Assignee: Swampfox Technologies, Inc.
    Inventors: Sergey A. Razin, Robert S. Cooper, Rick Ulmer, Tom Hanson
  • Patent number: 12002470
    Abstract: Systems and methods for providing multi-source based knowledge data for Artificial Intelligence (AI) characters are provided. An example method includes providing a plurality of data sources; receiving, from a user, at least one word during a conversation between the user and an AI character; ascertaining a speech style of the AI character; analyzing the at least one word to determine a type of information needed to generate a reply to the user; selecting, based on the type of information, at least one data source from the plurality of data sources; generating, based on the at least one word, one or more queries; sending the one or more queries to the at least one data source; receiving one or more responses from the at least one data source; forming, based on the one or more responses and the speech style of the AI character, the reply for providing to the user.
    Type: Grant
    Filed: December 31, 2023
    Date of Patent: June 4, 2024
    Assignee: Theai, Inc.
    Inventors: Ilya Gelfenbeyn, Mikhail Ermolenko, Kylan Gibbs, Kirill Ryzhov, Nathan Yu
  • Patent number: 11996095
    Abstract: The exemplary embodiments disclose a method, a computer program product, and a computer system for managing user commands. The exemplary embodiments may include a user giving one or more commands to one or more devices, collecting data of the one or more commands, extracting one or more features from the collected data, and determining which one or more of the commands should be executed on which one or more of the devices based on the extracted one or more features and one or more models.
    Type: Grant
    Filed: August 12, 2020
    Date of Patent: May 28, 2024
    Assignee: KYNDRYL, INC.
    Inventors: Cesar Augusto Rodriguez Bravo, David Alonso Campos Batista, Sarbajit K. Rakshit
  • Patent number: 11991194
    Abstract: Embodiments presented herein describe techniques for generating a linguistic model of input data obtained from a data source (e.g., a video camera). According to one embodiment of the present disclosure, a sequence of symbols is generated based on an ordered stream of normalized vectors generated from the input data. A dictionary of words is generated from combinations of the ordered sequence of symbols based on a frequency at which combinations of symbols appear in the ordered sequence of symbols. A plurality of phrases is generated based an ordered sequence of words from the dictionary observed in the ordered sequence of symbols based on a frequency by which combinations of words in ordered sequence of words appear relative to one another.
    Type: Grant
    Filed: July 6, 2021
    Date of Patent: May 21, 2024
    Assignee: Intellective Ai, Inc.
    Inventors: Ming-Jung Seow, Wesley Kenneth Cobb, Gang Xu, Tao Yang, Aaron Poffenberger, Lon W. Risinger, Kishor Adinath Saitwal, Michael S. Yantosca, David M. Solum, Alex David Hemsath, Dennis G. Urech, Duy Trong Nguyen, Charles Richard Morgan
  • Patent number: 11978442
    Abstract: A system and methods are provided to analyze audio signals from an incoming voice call. The system includes a processor and a computer readable medium operably coupled thereto, to perform voice analysis operations which include receiving a first audio signal comprising a first audio waveform of a first speech between at least two users during the incoming voice call, accessing speech segment parameters for analyzing the audio signals, determining one or more talk-over segments in the first audio waveform using the speech segment parameters, extracting audio features from each of the one or more talk-over segments, determining, using a machine learning (ML) model trained for interruption analysis of the audio signals, whether each of the one or more talk-over segments are a negative interruption or a non-negative interruption based on the audio features, and determining whether to output a first notification for the negative interruption or the non-negative interruption.
    Type: Grant
    Filed: January 6, 2022
    Date of Patent: May 7, 2024
    Assignee: NICE LTD.
    Inventors: Gennadi Lembersky, Neta Rosenfeld
  • Patent number: 11978451
    Abstract: Systems and methods to translate a spoken command to a selection sequence are disclosed. Exemplary implementations may: obtain audio information representing sounds captured by a client computing platform; analyze the sounds to determine spoken terms; determine whether the spoken terms include one or more of the terms that are correlated with the commands; responsive to determining that the spoken terms are terms that are correlated with a particular command stored in the electronic storage, perform a set of operations that correspond to the particular command; responsive to determine that the spoken terms are not the terms correlated with the commands stored in the electronic storage, determining a selection sequence that causes a result subsequent to the analysis of the sounds; correlate the spoken terms with the selection sequence; store the correlation of the spoken terms with the selection sequence; and perform the selection sequence to cause the result.
    Type: Grant
    Filed: November 17, 2022
    Date of Patent: May 7, 2024
    Assignee: Suki AI, Inc.
    Inventors: Maneesh Dewan, Jatin Chhugani, Ganesh Satish Mallya, Alan Diec, Vamsi Reddy Chagari, Sudheer Tumu, Nithyanand Kota
  • Patent number: 11978437
    Abstract: Devices and techniques are generally described for learning personalized concepts for natural language processing. In various examples, a first natural language input may be received. In some examples, a determination may be made that the first natural language input comprises non-actionable slot data. A dialog session may be initiated with the user. In some examples, first slot data that is indicated by the user during the dialog session may be determined. In various examples, data representing the first slot data may be stored in a database in association with the first natural language input.
    Type: Grant
    Filed: December 11, 2020
    Date of Patent: May 7, 2024
    Assignee: Amazon Technologies, Inc.
    Inventors: Govindarajan Sundaram Thattai, Qing Ping, Feiyang Niu, Joel Joseph Chengottusseriyil, Prashanth Rajagopal, Qiaozi Gao, Aishwarya Naresh Reganti, Gokhan Tur, Dilek Hakkani-Tur, Rohit Prasad, Premkumar Natarajan
  • Patent number: 11972760
    Abstract: The present disclosure relates to detecting the use of fake voice command to activate microphones of smart devices. In one embodiment, sound characteristics associated with an audio signal from a microphone of smart device may be compared with other microphones of the smart device in order to detect fake voice commands. In another embodiment, sound characteristics associated with the audio signal from the microphone may be compared with a threshold range of stored sound characteristics in order to detect fake voice commands. In some embodiments, a controller may triangulate a position associated with a source of a sound in order to detect a fake voice command. In a further embodiment, a controller may verify that a user or associated electronic device are near a smart device to authorize a voice command.
    Type: Grant
    Filed: July 28, 2020
    Date of Patent: April 30, 2024
    Assignee: United Services Automobile Association (USAA)
    Inventors: Carlos J P Chavez, Sacha Melquiades De'Angeli, Oscar Guerra, David M. Jones, Jr., Gregory Brian Meyer, Christopher Russell, Arthur Quentin Smith
  • Patent number: 11967306
    Abstract: Methods and systems are provided for assisting operation of a vehicle using speech recognition. One method involves automatically identifying an input element based at least in part on an audio communication with respect to the vehicle, identifying one or more constraints associated with the input element, obtaining a limited command vocabulary for the input element using the one or more constraints, and automatically constructing a contextual speech recognition graph for the input element prior to user selection of the input element using the limited command vocabulary. Thereafter, subsequently received audio input is recognized using the contextual speech recognition graph that was automatically and prospectively generated.
    Type: Grant
    Filed: June 22, 2021
    Date of Patent: April 23, 2024
    Assignee: HONEYWELL INTERNATIONAL INC.
    Inventors: Hariharan Saptharishi, Gobinathan Baladhandapani, Sivakumar Kanagarajan, Amal Leo
  • Patent number: 11966764
    Abstract: Some implementations are directed to adapting a client application on a feature phone based on experiment parameters. Some of those implementations are directed to adapting an assistant client application, where the assistant client application interacts with remote assistant component(s) to provide automated assistant functionalities via the assistant client application of the feature phone. Some implementations are additionally or alternatively directed to determining whether an invocation, of an assistant client application on a feature phone, is a request for transcription of voice data received in conjunction with the invocation, or is instead a request for an assistant response that is responsive to the transcription of the voice data (e.g., includes assistant content that is based on and in addition to the transcription, and that optionally lacks the transcription itself).
    Type: Grant
    Filed: December 16, 2021
    Date of Patent: April 23, 2024
    Assignee: GOOGLE LLC
    Inventors: Diego Accame, Abraham Lee, Yujie Wan, Shriya Raghunathan, Raymond Carino, Feng Ji, Shashwat Lal Das, Nickolas Westman
  • Patent number: 11955121
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio data that corresponds to an utterance. The actions further include determining a likelihood that the utterance includes a hotword. The actions further include determining a loudness score for the audio data. The actions further include based on the loudness score, determining an amount of delay time. The actions further include, after the amount of delay time has elapsed, transmitting a signal that indicates that the computing device will initiate speech recognition processing on the audio data.
    Type: Grant
    Filed: April 28, 2021
    Date of Patent: April 9, 2024
    Assignee: GOOGLE LLC
    Inventors: Jakob Nicolaus Foerster, Alexander H. Gruenstein
  • Patent number: 11955130
    Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.
    Type: Grant
    Filed: May 19, 2022
    Date of Patent: April 9, 2024
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 11954453
    Abstract: Systems and methods for natural language generation by an edge computing device are disclosed. In one embodiments, a method comprises: receiving, by an edge computing device, event data from an edge event; determining, by the edge computing device, that a network connection to a cloud server is not available; extracting, by the edge computing device, features of the event data; predicting, by a local neural network of the edge computing device, an action for the edge computing device to take based on the features of the event data, wherein the action is associated with a confidence level; and determining, by the edge computing device, whether the confidence level meets a predetermined threshold value.
    Type: Grant
    Filed: March 12, 2019
    Date of Patent: April 9, 2024
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Chih-Hsiung Liu, I-Chien Lin, Cheng-Fang Lin, Joey H. Y. Tseng