Patents Examined by Vu B. Hang
-
Patent number: 11972760Abstract: The present disclosure relates to detecting the use of fake voice command to activate microphones of smart devices. In one embodiment, sound characteristics associated with an audio signal from a microphone of smart device may be compared with other microphones of the smart device in order to detect fake voice commands. In another embodiment, sound characteristics associated with the audio signal from the microphone may be compared with a threshold range of stored sound characteristics in order to detect fake voice commands. In some embodiments, a controller may triangulate a position associated with a source of a sound in order to detect a fake voice command. In a further embodiment, a controller may verify that a user or associated electronic device are near a smart device to authorize a voice command.Type: GrantFiled: July 28, 2020Date of Patent: April 30, 2024Assignee: United Services Automobile Association (USAA)Inventors: Carlos J P Chavez, Sacha Melquiades De'Angeli, Oscar Guerra, David M. Jones, Jr., Gregory Brian Meyer, Christopher Russell, Arthur Quentin Smith
-
Patent number: 11966764Abstract: Some implementations are directed to adapting a client application on a feature phone based on experiment parameters. Some of those implementations are directed to adapting an assistant client application, where the assistant client application interacts with remote assistant component(s) to provide automated assistant functionalities via the assistant client application of the feature phone. Some implementations are additionally or alternatively directed to determining whether an invocation, of an assistant client application on a feature phone, is a request for transcription of voice data received in conjunction with the invocation, or is instead a request for an assistant response that is responsive to the transcription of the voice data (e.g., includes assistant content that is based on and in addition to the transcription, and that optionally lacks the transcription itself).Type: GrantFiled: December 16, 2021Date of Patent: April 23, 2024Assignee: GOOGLE LLCInventors: Diego Accame, Abraham Lee, Yujie Wan, Shriya Raghunathan, Raymond Carino, Feng Ji, Shashwat Lal Das, Nickolas Westman
-
Patent number: 11967306Abstract: Methods and systems are provided for assisting operation of a vehicle using speech recognition. One method involves automatically identifying an input element based at least in part on an audio communication with respect to the vehicle, identifying one or more constraints associated with the input element, obtaining a limited command vocabulary for the input element using the one or more constraints, and automatically constructing a contextual speech recognition graph for the input element prior to user selection of the input element using the limited command vocabulary. Thereafter, subsequently received audio input is recognized using the contextual speech recognition graph that was automatically and prospectively generated.Type: GrantFiled: June 22, 2021Date of Patent: April 23, 2024Assignee: HONEYWELL INTERNATIONAL INC.Inventors: Hariharan Saptharishi, Gobinathan Baladhandapani, Sivakumar Kanagarajan, Amal Leo
-
Patent number: 11955121Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio data that corresponds to an utterance. The actions further include determining a likelihood that the utterance includes a hotword. The actions further include determining a loudness score for the audio data. The actions further include based on the loudness score, determining an amount of delay time. The actions further include, after the amount of delay time has elapsed, transmitting a signal that indicates that the computing device will initiate speech recognition processing on the audio data.Type: GrantFiled: April 28, 2021Date of Patent: April 9, 2024Assignee: GOOGLE LLCInventors: Jakob Nicolaus Foerster, Alexander H. Gruenstein
-
Patent number: 11955130Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.Type: GrantFiled: May 19, 2022Date of Patent: April 9, 2024Assignee: DOLBY INTERNATIONAL ABInventors: Sven Kordon, Alexander Krueger
-
Patent number: 11954453Abstract: Systems and methods for natural language generation by an edge computing device are disclosed. In one embodiments, a method comprises: receiving, by an edge computing device, event data from an edge event; determining, by the edge computing device, that a network connection to a cloud server is not available; extracting, by the edge computing device, features of the event data; predicting, by a local neural network of the edge computing device, an action for the edge computing device to take based on the features of the event data, wherein the action is associated with a confidence level; and determining, by the edge computing device, whether the confidence level meets a predetermined threshold value.Type: GrantFiled: March 12, 2019Date of Patent: April 9, 2024Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Chih-Hsiung Liu, I-Chien Lin, Cheng-Fang Lin, Joey H. Y. Tseng
-
Patent number: 11948584Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.Type: GrantFiled: May 23, 2022Date of Patent: April 2, 2024Assignee: DOLBY INTERNATIONAL ABInventors: Sven Kordon, Alexander Krueger
-
Patent number: 11929065Abstract: A method includes receiving a representation of a spoken utterance, processing the representation of the spoken utterance to identify, from a number of candidate domains, a request and a serving domain, and routing the request to a personal assistant based on the request and the serving domain. Identification of the serving domain is based on one or more of a contextual state, a behavior profile of a speaker of the utterance, and a semantic content of the utterance.Type: GrantFiled: September 30, 2021Date of Patent: March 12, 2024Assignee: Cerence Operating CompanyInventors: Giuseppe Iacobelli, Binh Nguyen, Josef Anastasiadis
-
Patent number: 11915693Abstract: Methods, programming, and system for modifying a slot value are described herein. In a non-limiting embodiment, an intent may be determined based on a first utterance. A first slot-value pair may be obtained for the first utterance based on the intent, the first slot-value pair including a first slot and a first value associated with the first slot. A second value associated with the first slot may be identified, the second value being identified from a second utterance that was previously received. Based on the intent and the first slot, a type of update to be performed with respect to the second value may be determined. The second value may then be updated based on the first value and the type of update.Type: GrantFiled: September 21, 2020Date of Patent: February 27, 2024Assignee: YAHOO ASSETS LLCInventors: Prakhar Biyani, Cem Akkaya, Kostas Tsioutsiouliklis
-
Patent number: 11915716Abstract: A computer-implemented method for modifying audio-based communications produced during a conference call is disclosed. The computer-implemented method can include monitoring a plurality of utterances transmitted via an audio feed of a device connected to the conference call. The computer-implemented method can identify a first unwanted audio component transmitted via the audio feed. The computer-implemented method can actively modify the audio feed by removing the first unwanted audio component from the audio feed.Type: GrantFiled: July 16, 2020Date of Patent: February 27, 2024Assignee: International Business Machines CorporationInventors: Craig M. Trim, Adam Lee Griffin, Shikhar Kwatra, Hyman David Chantz
-
Patent number: 11915700Abstract: An electronic device according to an embodiment comprises a microphone, a communication circuitry, a memory storing utterance pattern information of a first user registered in the electronic device and instructions, and a processor connected to the microphone, the communication circuitry, and the memory. The instructions, when executed by the processor, cause the electronic device to: obtain a utterance through the microphone; determine whether the utterance is uttered by the first user based on the utterance pattern information; based on being determined the utterance is uttered by the first user, transmit the utterance to an external server through the communication circuitry; receive a response message corresponding to the utterance from the external server through the communication circuitry; and execute at least one function corresponding to the response message. The response message is generated with reference to utterance history of a second user different from the first user.Type: GrantFiled: August 19, 2022Date of Patent: February 27, 2024Assignee: Samsung Electronics Co., Ltd.Inventors: Yangkyun Oh, Jaeyung Yeo, Changryong Heo
-
Patent number: 11908473Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example process includes, at an electronic device having one or more processors and memory: performing a first task specified in a first user speech input; receiving a second user speech input; and in accordance with a determination that the second user speech input includes a modification to the first task, performing a second task, wherein performance of the second task modifies at least a portion of the performance of the first task.Type: GrantFiled: September 21, 2022Date of Patent: February 20, 2024Assignee: Apple Inc.Inventors: Yi Ma, Arash Dawoodi, Antoine R. Raux, Humza M. Siddiqui
-
Patent number: 11908459Abstract: The present disclosure is generally related to a data processing system to detect potential exfiltration of audio data by agent applications can include a data processing system. The data processing system can identify, from an I/O record, an input received from the digital assistant application via a microphone of a client device, an output received from the agent application after the input, and a microphone status for the microphone. The data processing system can determine that the output is terminal based on the input and the output. The data processing system can identify the microphone status as in the enabled state subsequent to the input. The data processing system can determine that the agent application is unauthorized to access audio data acquired via the microphone of the client device based on determining that the output is terminal and identifying the microphone status as enabled.Type: GrantFiled: May 14, 2021Date of Patent: February 20, 2024Assignee: GOOGLE LLCInventors: Yan Huang, Nikhil Rao
-
Patent number: 11893402Abstract: Some implementations are directed to adapting a client application on a feature phone based on experiment parameters. Some of those implementations are directed to adapting an assistant client application, where the assistant client application interacts with remote assistant component(s) to provide automated assistant functionalities via the assistant client application of the feature phone. Some implementations are additionally or alternatively directed to determining whether an invocation, of an assistant client application on a feature phone, is a request for transcription of voice data received in conjunction with the invocation, or is instead a request for an assistant response that is responsive to the transcription of the voice data (e.g., includes assistant content that is based on and in addition to the transcription, and that optionally lacks the transcription itself).Type: GrantFiled: December 16, 2021Date of Patent: February 6, 2024Assignee: GOOGLE LLCInventors: Diego Accame, Abraham Lee, Yujie Wan, Shriya Raghunathan, Raymond Carino, Feng Ji, Shashwat Lal Das, Nickolas Westman
-
Patent number: 11893987Abstract: The present disclosure relates to a server and a system including the same.Type: GrantFiled: May 28, 2021Date of Patent: February 6, 2024Assignee: LG ELECTRONICS, INC.Inventors: Yookyoung Choi, Kiwon Park, Jaekyung Lee
-
Patent number: 11893981Abstract: A scoring system and method identifies personal attacks in a piece of audio content and generates a civility score for the piece of audio content that can differentiate between personal attacks and vernacular/casual banter. The piece of audio content may be a podcast.Type: GrantFiled: September 7, 2023Date of Patent: February 6, 2024Assignee: SEEKR TECHNOLOGIES INC.Inventors: Robin J. Clark, Ali Taleb Zadeh Kasgari, Stefanos Poulis
-
Patent number: 11881216Abstract: A system for identifying computer agents to perform a particular task requested by a user, receives an audio signal to perform the particular task. The system extracts a set of features from the audio signal. The set of features represents at least a first keyword indicating the particular task. The system determines which one or more computer agents from a plurality of computer agents is predetermined to perform the particular task by comparing the first keyword with a plurality of keywords associated with the plurality of keywords. The system determines a first computer agent associated with a second keyword that corresponds to the first keyword. The system executes the first computer agent to perform the particular task.Type: GrantFiled: June 8, 2021Date of Patent: January 23, 2024Assignee: Bank of America CorporationInventor: Rajan Jigish Jhaveri
-
Patent number: 11862186Abstract: A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant.Type: GrantFiled: October 7, 2022Date of Patent: January 2, 2024Assignee: Apple Inc.Inventors: Justin Binder, Samuel D. Post, Onur Tackin, Thomas R. Gruber
-
Patent number: 11862143Abstract: The present disclosure is related to systems and methods for processing speech dialogue. The method includes obtaining target speech dialogue data. The method includes obtaining a text vector representation sequence, a phonetic symbol vector representation sequence, and a role vector representation sequence by performing a vector transformation on the target speech dialogue data based on a text embedding model, a phonetic symbol embedding model, and a role embedding model, respectively. The method includes determining a representation vector corresponding to the target speech dialogue data by inputting the text vector representation sequence, the phonetic symbol vector representation sequence, and the role vector representation sequence into a trained speech dialogue coding model. The method includes determining a summary of the target speech dialogue data by inputting the representation vector into a classification model.Type: GrantFiled: August 19, 2020Date of Patent: January 2, 2024Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.Inventors: Haiyang Xu, Kun Han
-
Patent number: 11862150Abstract: A skill dispatching method for a speech dialogue platform including: receiving, by a central control dispatching service, a semantic result of recognizing a user's voice sent by a data distribution service; dispatching, by the central control dispatching service, a plurality of skill services related to the semantic result in parallel, and obtaining skill parsing results from the plurality of skill services; sorting the skill parsing results based on priorities of the skill services, and exporting a result with the highest priority to a skill realization discrimination service; when failure in realization, selecting a result with the highest priority among the rest of skill parsing results and exporting the same to the skill realization discrimination service, and when success in realization, sending the result with the highest priority to the data distribution service for feedback to the user. The method improves skill dispatching efficiency, reduces delay, and improves user experience.Type: GrantFiled: November 18, 2020Date of Patent: January 2, 2024Assignee: AI SPEECH CO., LTD.Inventors: Chengya Zhu, Shuai Fan, Weisi Shi