Patents Examined by Michael Colucci

Voice-based interactions with a graphical user interface

Patent number: 11887589

Abstract: Techniques for voice-based interactions are described. In an example, a device presents a user interface on a display. The device starts an operational mode of the device. The operational mode restricts voice-based interactions with the user interface to a set of commands. The set of commands is defined in a language model that is stored on the device. Further, the device receives, at a microphone of the device, audio data corresponding to a natural language utterance and generates, from the audio data, text data that corresponds to the natural language utterance. The device determines, based at least in part on the language model, that semantics of the text data correspond to a command from the set of commands and presents, on the display, an outcome of performing the command.

Type: Grant

Filed: June 17, 2020

Date of Patent: January 30, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Senthil Kumar Dayalan, Manikandan Thangarathnam, Sai Vinayak, Suraj Gopalakrishnan
Updating models with trained model update objects

Patent number: 11887583

Abstract: Some devices may perform processing using machine learning models trained at a centralized system and distributed to the device. The centralized system may update the machine learning model and distribute the update to the device (or devices). To reduce the size of an update, the centralized system may train a model update object, which may be smaller in size than the model itself and thus more suitable for sending to the device(s). A device may receive the model update object and use it to update the on-device machine learning model; for example, by changing some parameters of the model. Parameters left unchanged during the update may retain their previous value. Thus, using the model update object to update the on-device model may result in a more accurate updated model when compared to sending an updated model compressed to a size similar to that of the model update object.

Type: Grant

Filed: June 9, 2021

Date of Patent: January 30, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Grant Strimel, Jonathan Jenner Macoskey, Ariya Rastrow
Far-field pickup device and method for collecting voice signal in far-field pickup device

Patent number: 11871176

Abstract: A far-field pickup device including a device body and a microphone pickup unit is provided. The microphone pickup unit is configured to collect user speech and an echo of a first sound signal output by the device body, and transmit, to the device body, a signal obtained through digital conversion of the collected user speech and the echo. The device body includes a signal playback source, a synchronizing signal generator, a horn, a delay determining unit, and an echo cancellation unit configured to perform echo cancellation on the signal transmitted by the microphone pickup unit to obtain a collected human voice signal.

Type: Grant

Filed: September 25, 2020

Date of Patent: January 9, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LTD

Inventors: Ji Meng Zheng, Meng Yu, Dan Su
Language agnostic automated voice activity detection

Patent number: 11869537

Abstract: Systems, methods, and computer-readable media are disclosed for systems and methods for language agnostic automated voice activity detection. Example methods may include determining an audio file associated with video content, generating audio segments using the audio file, the audio segments including a first segment and a second segment, and determining that the first segment includes first voice activity. Methods may include determining that the second segment comprises second voice activity, determining that voice activity is present between a first timestamp associated with the first segment and a second timestamp associated with the second segment, and generating text data representing the voice activity that is present between the first timestamp and the second timestamp.

Type: Grant

Filed: November 10, 2021

Date of Patent: January 9, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Mayank Sharma, Sandeep Joshi, Muhammad Raffay Hamid
Talk back from actions in applications

Patent number: 11862156

Abstract: Embodiments of the present invention provide systems, methods, and computer storage media directed to providing talk back automation for applications installed on a mobile device. To do so actions (e.g., talk back features) can be created, via the digital assistant, by recording a series of events that are typically provided by a user of the mobile device when manually invoking the desired action. At a desired state, the user may select an object that represents the output of the application. The recording embodies the action and can be associated with a series of verbal commands that the user would typically announce to the digital assistant when an invocation of the action is desired. In response, the object is verbally communicated to the user via the digital assistant, a different digital assistant, or even another device. Alternatively, the object may be communicated to the same application or another application as input.

Type: Grant

Filed: July 2, 2021

Date of Patent: January 2, 2024

Assignee: Peloton Interactive, Inc.

Inventors: Mark Robinson, Matan Levi, Kiran Bindhu Hemaraj, Rajat Mukherjee
Many or one detection classification systems and methods

Patent number: 11853884

Abstract: A classification training system comprises a neural network configured to perform classification of input data, a training dataset including pre-segmented, labeled training samples, and a classification training module configured to train the neural network using the training dataset. The classification training module includes a forward pass processing module, and a backward pass processing module. The backward pass processing module is configured to determine whether a current frame is in a region of target (ROT), determine ROT information such as beginning and length of the ROT and update weights and biases using a cross-entropy cost function and a tunable many-or-one detection (MOOD) cost function, that comprises a tunable hyperparameter for tuning the classifier for a particular task. The backward pass module further computes a soft target value using ROT information and computes a signal output error using the soft target value and network output value.

Type: Grant

Filed: April 28, 2021

Date of Patent: December 26, 2023

Assignee: Synaptics Incorporated

Inventor: Saeed Mosayyebpour Kaskari
Adaptive interface in a voice-activated network

Patent number: 11848009

Abstract: The systems and methods of the present disclosure generally relate to a data processing system that can identify and surface alternative requests when presented with ambiguous, unclear, or other requests to which a data processing system may not be able to respond. The data processing system can improve the efficiency of network transmissions to reduce network bandwidth usage and processor utilization by selecting alternative requests that are responsive to the intent of the original request.

Type: Grant

Filed: August 9, 2021

Date of Patent: December 19, 2023

Assignee: GOOGLE LLC

Inventors: Gleb Skobeltsyn, Mihaly Kozsevnyikov, Vladimir Vuskovic
Seamless authentication and enrollment

Patent number: 11842740

Abstract: Some aspects of the invention may include a computer-implemented method for enrolling voice prints generated from audio streams, in a database. The method may include receiving an audio stream of a communication session and creating a preliminary association between the audio stream and an identity of a customer that has engaged in the communication session based on identification information. The method may further include determining a confidence level of the preliminary association based on authentication information related to the customer and if the confidence level is higher than a threshold, sending a request to compare the audio stream to a database of voice prints of known fraudsters. If the audio stream does not match any known fraudsters, sending a request to generate from the audio stream a current voice print associated with the customer and enrolling the voice print in a customer voice print database.

Type: Grant

Filed: October 15, 2020

Date of Patent: December 12, 2023

Assignee: NICE LTD.

Inventors: Shahar Faians, Avraham Lousky, Elad Hoffman, Alon Moshe Sabban, Jade Tarni Kahn, Roie Mandler
Managing software defined networks using human language

Patent number: 11837223

Abstract: A human language software defined network (SDN) control system, including: a voice to text machine learning model configured to convert user speech to text; a machine learning language processing engine configured to control the operation of a SDN controller based upon the text; and a machine learning minimal language processing engine configured to control the operation of a SDN element based upon commands from the SDN controller produced by the machine learning language processing engine.

Type: Grant

Filed: December 18, 2020

Date of Patent: December 5, 2023

Assignee: NOKIA SOLUTIONS AND NETWORKS OY

Inventor: Sowrirajan Padmanabhan
Personal conversationalist system

Patent number: 11822889

Abstract: A personal conversationalist system includes a processor, and a computer-readable medium storing instructions which, when executed by the processor, cause the processor to operations that include receiving a first data input feed, accessing a first user profile that is as associated with a first user, detecting a conversation event when first data in the first data input feed satisfies a first conversation event condition, generating a first conversationalist persona based on the conversation event, the first user profile, and data provided in the first data input feed; and initiating a first conversation session via the first conversationalist persona by outputting a first conversationalist persona response that is based on the conversation event, the first user profile, and the data provided in the first data input feed.

Type: Grant

Filed: March 20, 2020

Date of Patent: November 21, 2023

Assignee: AT&T INTELLECTUAL PROPERTY I, L.P.

Inventors: Joseph Soryal, Naila Jaoude, Samuel N. Zellner
Activation trigger processing

Patent number: 11823670

Abstract: Utterance-based user interfaces can include activation trigger processing techniques for detecting activation triggers and causing execution of certain commands associated with particular command pattern activation triggers without waiting for output from a separate speech processing engine. The activation trigger processing techniques can also detect speech analysis patterns and selectively activate a speech processing engine.

Type: Grant

Filed: April 17, 2020

Date of Patent: November 21, 2023

Assignee: Spotify AB

Inventor: Richard Mitic
Systems and methods for continual updating of response generation by an artificial intelligence chatbot

Patent number: 11823061

Abstract: Methods and systems are provided for a natural language processing system comprising a chatbot adapted for dialog generation. In one example, the system may include a combination of a variational autoencoder (VAE) and a generative adversarial network (GAN) for generating natural responses to input queries. The VAE may convert queries into vector embeddings that may then be used by the GAN to continuously update and improve responses provided by the chatbot.

Type: Grant

Filed: October 31, 2022

Date of Patent: November 21, 2023

Assignee: CAMBIA HEALTH SOLUTIONS, INC.

Inventors: Weicheng Ma, Kai Cao, Bei Pan, Lin Chen, Xiang Li
Speech recognition through disambiguation feedback

Patent number: 11823659

Abstract: A request including audio data is received from a voice-enabled device. A string of phonemes present in the utterance is determined through speech recognition. At a later time, a subsequent user input corresponding to the request may be received, in which the user input is associated with one or more text keywords. The subsequent user input may be obtained in response to an active request. Alternatively, feedback may not be actively elicited, but rather collected passively. However it is obtained, the one or more keywords associated with the subsequent user input may be associated with the string of phonemes to indicate that the user is saying or mean those words when they product that string of phonemes. A user-specific speech recognition key for the user account is then updated to associate the string of phonemes with these words. A general speech recognition model can also be trained using the association.

Type: Grant

Filed: December 11, 2019

Date of Patent: November 21, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Julia Reinspach, Oleg Rokhlenko, Ramakanthachary Gottumukkala, Giovanni Clemente, Ankit Agrawal, Swayam Bhardwaj, Guy Michaeli, Vaidyanathan Puthucode Krishnamoorthy, Costantino Vlachos, Nalledath P. Vinodkrishnan, Shaun M. Vickers, Sethuraman Ramachandran, Charles C. Moore
Adaptive interface in a voice-based networked system

Patent number: 11817084

Abstract: The present disclosure relates generally to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. The system can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.

Type: Grant

Filed: May 21, 2020

Date of Patent: November 14, 2023

Assignee: GOOGLE LLC

Inventors: Pu-sen Chao, Diego Melendo Casado, Ignacio Lopez Moreno
Automatically determining language for speech recognition of spoken utterance received via an automated assistant interface

Patent number: 11817085

Abstract: Implementations relate to determining a language for speech recognition of a spoken utterance, received via an automated assistant interface, for interacting with an automated assistant. Implementations can enable multilingual interaction with the automated assistant, without necessitating a user explicitly designate a language to be utilized for each interaction. Selection of a speech recognition model for a particular language can based on one or more interaction characteristics exhibited during a dialog session between a user and an automated assistant. Such interaction characteristics can include anticipated user input types, anticipated user input durations, a duration for monitoring for a user response, and/or an actual duration of a provided user response.

Type: Grant

Filed: December 14, 2020

Date of Patent: November 14, 2023

Assignee: GOOGLE LLC

Inventors: Pu-Sen Chao, Diego Melendo Casado, Ignacio Lopez Moreno
Speech recognition method and apparatus, and method and apparatus for training speech recognition model

Patent number: 11798531

Abstract: A speech recognition method, a speech recognition apparatus, and a method and an apparatus for training a speech recognition model are provided. The speech recognition method includes: recognizing a target word speech from a hybrid speech, and obtaining, as an anchor extraction feature of a target speech, an anchor extraction feature of the target word speech based on the target word speech; obtaining a mask of the target speech according to the anchor extraction feature of the target speech; and recognizing the target speech according to the mask of the target speech.

Type: Grant

Filed: October 22, 2020

Date of Patent: October 24, 2023

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Jun Wang, Dan Su, Dong Yu
System and method for generating subjective wellbeing analytics score

Patent number: 11797779

Abstract: A system includes at least one processor to perform natural language processing on text from at least one document and assign the at least one document to at least one subjective wellbeing dimension by comparing the text from the at least one document with a subjective wellbeing dimension filter for each subjective wellbeing dimension, insert the at least one document into at least one bin, each bin associated with a particular subjective wellbeing dimension, and analyze each document in each bin associated with the particular subjective wellbeing dimension to determine a score for each subjective wellbeing dimension and an overall score that is based on each score for each subjective wellbeing dimension.

Type: Grant

Filed: January 31, 2023

Date of Patent: October 24, 2023

Assignee: TSG Technologies, LLC

Inventors: Anthony L Hinrichs, Andrea E DiGiovanni, Willem S Maritz, Anthony M Sardella
Resolving unique personal identifiers during corresponding conversations between a voice bot and a human

Patent number: 11790906

Abstract: Implementations are directed to causing a voice bot to utilize a plurality of ML layers in resolving unique personal identifier(s) for a human while the voice bot is engaged in a corresponding conversation with the human. The unique personal identifier(s) can include a unique sequence of alphanumeric characters that is personal to the human. In some implementations, ASR speech hypothes(es) corresponding to spoken utterance(s) that include the unique personal identifier(s) can be processed to generate candidate unique personal identifier(s), given alphanumeric character(s) of the candidate unique personal identifier(s) can be selected, and the voice bot can prompt the human with clarification request(s) to clarify the given alphanumeric character(s) until it is predicted to correspond to the an actual unique personal identifier(s) for the human(s). The unique personal identifier(s) can then be utilized in performance of further action(s) by the voice bot and/or other systems.

Type: Grant

Filed: January 25, 2021

Date of Patent: October 17, 2023

Assignee: GOOGLE LLC

Inventors: Rafael Goldfarb, Or Guz, Lior Alon, Assaf Hurwitz Michaely, Golan Pundak, Shmuel Leibtag, Tomer Amiaz, Dan Rasin, Asaf Aharoni
Phoneme recognizer customizable keyword spotting system with keyword adaptation

Patent number: 11790912

Abstract: A wake-up word for a digital assistant may be specified by a user to trigger the digital assistant to respond to the wake-up word, with the user providing one or more initial pronunciations of the wake-up word. The wake-up word may be unique, or at least not determined beforehand by a device manufacturer or developer of the digital assistant. The initial pronunciation(s) of the keyword may then be augmented with other potential pronunciations of the wake-up word that might be provided in the future, and those other potential pronunciations may then be pruned down to a threshold number of other potential pronunciations. One or more recordings of the initial pronunciation(s) of the wake-up may then be used to train a phoneme recognizer model to better recognize future instances of the wake-up word being spoken by the user or another person using the initial pronunciation or other potential pronunciations.

Type: Grant

Filed: January 3, 2022

Date of Patent: October 17, 2023

Assignee: Sony Interactive Entertainment Inc.

Inventors: Lakshmish Kaushik, Zhenhao Ge, Xiaoyu Liu
System and method for automated patient interaction

Patent number: 11783124

Abstract: Provided is a system and method for automated patient interaction. The method includes parsing a patient complaint comprising a plurality of words, determining a subset of patient queries from a plurality of patient queries based on the patient complaint and patient data, communicating the subset of patient queries to a first computing device; receiving, from the first computing device, responses to at least a portion of the subset of patient queries; generating output data based on the subset of patient queries and the responses; communicating the output data to a second computing device; receiving, from the second computing device, a user input corresponding to at least one patient query of the subset of patient queries; and training, based on the user input, at least one machine-learning algorithm configured to output at least one patient query based on at least one of the patient complaint and a subsequent patient complaint.

Type: Grant

Filed: November 15, 2019

Date of Patent: October 10, 2023

Assignee: 98point6 Inc.

Inventors: Damon Lanphear, Keith Trnka, Robbie Schwietzer

prev 1 2 3 4 5 6 … next