Patents Examined by Michael Colucci

End-to-end streaming keyword spotting

Patent number: 11967310

Abstract: A method for training hotword detection includes receiving a training input audio sequence including a sequence of input frames that define a hotword that initiates a wake-up process on a device. The method also includes feeding the training input audio sequence into an encoder and a decoder of a memorized neural network. Each of the encoder and the decoder of the memorized neural network include sequentially-stacked single value decomposition filter (SVDF) layers. The method further includes generating a logit at each of the encoder and the decoder based on the training input audio sequence. For each of the encoder and the decoder, the method includes smoothing each respective logit generated from the training input audio sequence, determining a max pooling loss from a probability distribution based on each respective logit, and optimizing the encoder and the decoder based on all max pooling losses associated with the training input audio sequence.

Type: Grant

Filed: May 23, 2023

Date of Patent: April 23, 2024

Assignee: Google LLC

Inventors: Raziel Alvarez Guevara, Hyun Jin Park, Patrick Violette
Voice communication analysis system

Patent number: 11967307

Abstract: Techniques are disclosed for applying a trained machine learning model to incoming voice communications to determine whether the voice communications are genuine or not genuine. The trained machine learning model may identify vocal attributes within the target call and use the identified attributes, and the training, determine whether the target call is genuine or not genuine. An applied trained machine learning model may include multiple different types of trained machine learning models, where each of different types of machine learning models is trained and/or configured for a different function within the analysis.

Type: Grant

Filed: February 12, 2021

Date of Patent: April 23, 2024

Assignee: Oracle International Corporation

Inventor: Suraj Shinde
Systems and methods for monitoring technology infrastructure

Patent number: 11954444

Abstract: Systems and methods of monitoring technology infrastructure using alerts indicative service events and tickets indicative of incidents reported to the support system, including transmitting, to a client via a network, structured support data including issue data and correlation data. The issue data represents issues, which are fewer than the number of tickets, generated by processing textual data of the tickets through a clustering engine implementing a generative probabilistic model and generating the correlation data by associating alerts and tickets by correlating alert-specific identifiers and ticket-specific identifiers. The identifiers are of least one of identifier times, locations, names, or descriptions. A prioritization engine is also disclosed.

Type: Grant

Filed: August 30, 2021

Date of Patent: April 9, 2024

Assignee: ROYAL BANK OF CANADA

Inventors: Seyedramin Alikiaamiri, Mehdi Rostamiforooshani, Morteza Mashayekhi, Frank Liu, Martin Mendoza, Keerthi Ningegowda, Chuhang Liu
Wakeword selection

Patent number: 11948571

Abstract: A system and method are disclosed capable of parsing a spoken utterance into a natural language request and a speech audio segment, where the natural language request directs the system to use the speech audio segment as a new wakeword. In response to this wakeword assignment directive, the system and method are further capable of immediately building a new wakeword spotter to activate the device upon matching the new wakeword in the input audio. Different approaches to promptly building a new wakeword spotter are described. Variations of wakeword assignment directives can make the new wakeword public or private. They can also add the new wakeword to earlier wakewords, or replace earlier wakewords.

Type: Grant

Filed: March 30, 2022

Date of Patent: April 2, 2024

Assignee: SoundHound AI IP, LLC

Inventor: Bernard Mont-Reynaud
Adapting to differences in device state reporting of third party servers

Patent number: 11943075

Abstract: Implementations herein relate to information describing one or more internal states of a technical system. Implementations herein are provided for characterizing reliability of various different third party servers, at least when reporting third party device statuses, as well as adapting protocols for device ecosystems affected by such reliability. Latency can affect accuracy of device states represented by assistant devices. Certain servers can be characterized as especially delayed when reporting an updated device state in response to a user request, and, as a result, the third party server can be correlated to a metric that characterizes the relative latency of the third party server. When the metric fails to satisfy a particular threshold, a server and/or client associated with the “ecosystem” of third party devices can affirmatively operate to retrieve device state updates, rather than passively await updates from a corresponding third party server.

Type: Grant

Filed: December 6, 2021

Date of Patent: March 26, 2024

Assignee: GOOGLE LLC

Inventor: Yuzhao Ni
Enhanced spoken language understanding using joint model training

Patent number: 11942080

Abstract: Systems and methods for improved Spoken Language Understanding (“SLU”) are provided. The methods may comprise receiving an utterance from a user, contextualizing a plurality of words in the utterance, providing the contextualized words to the slot detector to determine the probability of a word forming the beginning or end of a slot to determine slots and nested slots, an intent classifier to determine the probability of a word conveying a user intent, and a slot classifier that applies specific labels to each slot and nest slot. The SLU method may employ a model and jointly trains the model for each task (determining beginning and end of slots, intents, and slot classifications) using a combined loss function.

Type: Grant

Filed: January 29, 2021

Date of Patent: March 26, 2024

Assignee: Walmart Apollo, LLC

Inventor: Seyed Iman Mirrezaei
End-to-end streaming keyword spotting

Patent number: 11929064

Abstract: A method for detecting a hotword includes receiving a sequence of input frames that characterize streaming audio captured by a user device and generating a probability score indicating a presence of a hotword in the streaming audio using a memorized neural network. The network includes sequentially-stacked single value decomposition filter (SVDF) layers and each SVDF layer includes at least one neuron. Each neuron includes a respective memory component, a first stage configured to perform filtering on audio features of each input frame individually and output to the memory component, and a second stage configured to perform filtering on all the filtered audio features residing in the respective memory component. The method also includes determining whether the probability score satisfies a hotword detection threshold and initiating a wake-up process on the user device for processing additional terms.

Type: Grant

Filed: January 9, 2023

Date of Patent: March 12, 2024

Assignee: Google LLC

Inventors: Raziel Alvarez Guevara, Hyun Jin Park
Electronic message transmission

Patent number: 11922937

Abstract: One or more computing devices, systems, and/or methods for detecting trigger phrases and transmitting electronic messages to devices are provided. For example, audio received via a microphone of a first device may be monitored. Responsive to detecting a first trigger phrase in a first audio segment identified during the monitoring, a first electronic message comprising instructions to activate a microphone function of a second device may be generated and the first electronic message may be transmitted to the second device. Responsive to detecting a second trigger phrase in a second audio segment identified during the monitoring, a second electronic message comprising instructions to activate a microphone function of a third device may be generated and the second electronic message may be transmitted to the third device.

Type: Grant

Filed: October 18, 2021

Date of Patent: March 5, 2024

Assignee: Yahoo Assets LLC

Inventor: Varun Bhagwan
Hotword detection on multiple devices

Patent number: 11915706

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for hotword detection on multiple devices are disclosed. In one aspect, a method includes the actions of receiving, by a first computing device, audio data that corresponds to an utterance. The actions further include determining a first value corresponding to a likelihood that the utterance includes a hotword. The actions further include receiving a second value corresponding to a likelihood that the utterance includes the hotword, the second value being determined by a second computing device. The actions further include comparing the first value and the second value. The actions further include based on comparing the first value to the second value, initiating speech recognition processing on the audio data.

Type: Grant

Filed: January 5, 2023

Date of Patent: February 27, 2024

Assignee: Google LLC

Inventor: Matthew Sharifi
Speaker adaptation for attention-based encoder-decoder

Patent number: 11915686

Abstract: Embodiments are associated with a speaker-independent attention-based encoder-decoder model to classify output tokens based on input speech frames, the speaker-independent attention-based encoder-decoder model associated with a first output distribution, and a speaker-dependent attention-based encoder-decoder model to classify output tokens based on input speech frames, the speaker-dependent attention-based encoder-decoder model associated with a second output distribution. The second attention-based encoder-decoder model is trained to classify output tokens based on input speech frames of a target speaker and simultaneously trained to maintain a similarity between the first output distribution and the second output distribution.

Type: Grant

Filed: January 5, 2022

Date of Patent: February 27, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Zhong Meng, Yashesh Gaur, Jinyu Li, Yifan Gong
System and method for automated patient interaction

Patent number: 11914953

Abstract: Provided is a system and method for automated patient interaction. The method includes parsing a patient complaint comprising a plurality of words, determining a subset of patient queries from a plurality of patient queries based on the patient complaint and patient data, communicating the subset of patient queries to a first computing device; receiving, from the first computing device, responses to at least a portion of the subset of patient queries; generating output data based on the subset of patient queries and the responses; communicating the output data to a second computing device; receiving, from the second computing device, a user input corresponding to at least one patient query of the subset of patient queries; and training, based on the user input, at least one machine-learning algorithm configured to output at least one patient query based on at least one of the patient complaint and a subsequent patient complaint.

Type: Grant

Filed: November 15, 2019

Date of Patent: February 27, 2024

Assignee: 98point6 Inc.

Inventors: Damon Lanphear, Keith Trnka, Robbie Schwietzer
Method and electronic device for translating speech signal

Patent number: 11915684

Abstract: A method and an electronic device for translating a speech signal between a first language and a second language with minimized translation delay by translating fewer than all words of the speech signal according to a level of understanding of the second language by a user that receives the translation.

Type: Grant

Filed: January 26, 2022

Date of Patent: February 27, 2024

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Ji-sang Yu, Sang-ha Kim, Jong-youb Ryu, Yoon-jung Choi, Eun-kyoung Kim, Jae-won Lee
Reduced training intent recognition techniques

Patent number: 11914962

Abstract: The present disclosure relates generally to determining intent based upon speech input using a dialog system. More particularly, techniques are described using matching-based machine learning techniques to identify an intent corresponding to speech input in a dialog system. These procedures do not require training when intents are added or removed from the set of possible intents.

Type: Grant

Filed: July 29, 2020

Date of Patent: February 27, 2024

Assignee: Oracle International Corporation

Inventor: Mark Edward Johnson
Speech separation model training method and apparatus, storage medium and computer device

Patent number: 11908455

Abstract: A speech separation model training method and apparatus, a computer-readable storage medium, and a computer device are provided, the method including: obtaining first audio and second audio, the first audio including target audio and having corresponding labeled audio, and the second audio including noise audio.

Type: Grant

Filed: February 15, 2022

Date of Patent: February 20, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Jun Wang, Wingyip Lam, Dan Su, Dong Yu
Automatic extraction of conversation highlights

Patent number: 11908477

Abstract: This disclosure describes techniques for generating a conversation summary. The techniques may include processing at least one statement indication of the conversation to determine at least one statement that is a candidate highlight of the conversation. The techniques may further include applying linguistic filtering rules to the candidate highlight to determine the candidate highlight is an actual highlight. The techniques may further include generating the conversation summary including providing the actual highlight as at least a portion of the conversation summary.

Type: Grant

Filed: August 28, 2020

Date of Patent: February 20, 2024

Inventors: Varsha Ravikumar Embar, Karthik Raghunathan
Adaptive interface in a voice-activated network

Patent number: 11908462

Abstract: The systems and methods of the present disclosure generally relate to a data processing system that can identify and surface alternative requests when presented with ambiguous, unclear, or other requests to which a data processing system may not be able to respond. The data processing system can improve the efficiency of network transmissions to reduce network bandwidth usage and processor utilization by selecting alternative requests that are responsive to the intent of the original request.

Type: Grant

Filed: March 21, 2022

Date of Patent: February 20, 2024

Assignee: GOOGLE LLC

Inventors: Gleb Skobeltsyn, Mihaly Kozsevnyikov, Vladimir Vuskovic
Updating trained voice bot(s) utilizing example-based voice bot development techniques

Patent number: 11902222

Abstract: Implementations are directed to updating a trained voice bot that is deployed for conducting conversations on behalf of a third-party. A third-party developer can interact with a voice bot development system that enables the third-party developer to train, update, validate, and monitor performance of the trained voice bot. In various implementations, the trained voice bot can be updated by updating a corpus of training instances that was initially utilized to train the voice bot, and updating the trained voice bot based on the updated corpus. In some implementations, the corpus of training instances may be updated in response to identifying occurrence(s) of behavioral error(s) of the trained voice bot while the conversations are being conducted on behalf of the third-party. In additional or alternative implementations, the corpus of training instances may be updated in response to determining the trained voice bot does not include a desired behavior.

Type: Grant

Filed: February 8, 2021

Date of Patent: February 13, 2024

Assignee: GOOGLE LLC

Inventors: Asaf Aharoni, Eyal Segalis, Ofer Ron, Sasha Goldshtein, Tomer Amiaz, Razvan Mathias, Yaniv Leviathan
Interface for patient-provider conversation and auto-generation of note or summary

Patent number: 11894140

Abstract: A computer-implemented method includes receiving, by a computing device, a particular textual description of a scene. The method also includes applying a neural network for text-to-image generation to generate an output image rendition of the scene, the neural network having been trained to cause two image renditions associated with a same textual description to attract each other and two image renditions associated with different textual descriptions to repel each other based on mutual information between a plurality of corresponding pairs, wherein the plurality of corresponding pairs comprise an image-to-image pair and a text-to-image pair. The method further includes predicting the output image rendition of the scene.

Type: Grant

Filed: December 21, 2021

Date of Patent: February 6, 2024

Assignee: Google LLC

Inventors: Melissa Strader, William Ito, Christopher Co, Katherine Chou, Alvin Rajkomar, Rebecca Rolfe
Global re-ranker

Patent number: 11887585

Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example method includes, at an electronic device having one or more processors and memory: receiving a natural language speech input; determining, based on the natural language speech input, a plurality of candidate intents; obtaining contextual data associated with the user device; ranking, based on the contextual data, the plurality of candidate intents using a machine learning model, wherein the machine learning model is pre-trained at least partially on the user device; determining a user intent based on the ranked candidate intents; and performing a task corresponding to the determined user intent.

Type: Grant

Filed: May 5, 2020

Date of Patent: January 30, 2024

Assignee: Apple Inc.

Inventors: Srinivas Chappidi, Arash Dawoodi
Voice-based interactions with a graphical user interface

Patent number: 11887589

Abstract: Techniques for voice-based interactions are described. In an example, a device presents a user interface on a display. The device starts an operational mode of the device. The operational mode restricts voice-based interactions with the user interface to a set of commands. The set of commands is defined in a language model that is stored on the device. Further, the device receives, at a microphone of the device, audio data corresponding to a natural language utterance and generates, from the audio data, text data that corresponds to the natural language utterance. The device determines, based at least in part on the language model, that semantics of the text data correspond to a command from the set of commands and presents, on the display, an outcome of performing the command.

Type: Grant

Filed: June 17, 2020

Date of Patent: January 30, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Senthil Kumar Dayalan, Manikandan Thangarathnam, Sai Vinayak, Suraj Gopalakrishnan

1 2 3 4 5 … next