Adaptation (epo) Patents (Class 704/E15.009)

E Subclasses

In the frequency domain (epo) (Class 704/E15.01)

To speaker (epo) (Class 704/E15.011)

Selectively storing, with multiple user accounts and/or to a shared assistant device: speech recognition biasing, NLU biasing, and/or other data

Patent number: 12190892

Abstract: Some implementations relate to performing speech biasing, NLU biasing, and/or other biasing based on historical assistant interaction(s). It can be determined, for one or more given historical interactions of a given user, whether to affect future biasing for (1) the given user account, (2) additional user account(s), and/or (3) the shared assistant device as a whole. Some implementations disclosed herein additionally and/or alternatively relate to: determining, based on utterance(s) of a given user to a shared assistant device, an association of first data and second data; storing the association as accessible to a given user account of the given user; and determining whether to store the association as also accessible by additional user account(s) and/or the shared assistant device.

Type: Grant

Filed: October 18, 2023

Date of Patent: January 7, 2025

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
Method and apparatus for normalizing features extracted from audio data for signal recognition or modification

Patent number: 12175965

Abstract: A feature vector may be extracted from each frame of input digitized microphone audio data. The feature vector may include a power value for each frequency band of a plurality of frequency bands. A feature history data structure, including a plurality of feature vectors, may be formed. A normalized feature set that includes a normalized feature data structure may be produced by determining normalized power values for a plurality of frequency bands of each feature vector of the feature history data structure. A signal recognition or modification process may be based, at least in part, on the normalized feature data structure.

Type: Grant

Filed: July 25, 2020

Date of Patent: December 24, 2024

Assignee: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Richard J. Cartwright
Systems and methods for rapidly building, managing, and sharing machine learning models

Patent number: 12106078

Abstract: In some aspects, systems and methods for rapidly building, managing, and sharing machine learning models are provided. Managing the lifecycle of machine learning models can include: receiving a set of unannotated data; requesting annotations of samples of the unannotated data to produce an annotated set of data; building a machine learning model based on the annotated set of data; deploying the machine learning model to a client system, wherein production annotations are generated; collecting the generated production annotations and generating a new machine learning model incorporating the production annotations; and selecting one of the machine learning model built based on the annotated set of data or the new machine learning model.

Type: Grant

Filed: May 14, 2018

Date of Patent: October 1, 2024

Assignee: Digital Reasoning Systems, Inc.

Inventors: Cory Hughes, Timothy Estes, John Liu, Brandon Carl, Uday Kamath
Speech recognition device, speech recognition method, and program

Patent number: 12057105

Abstract: Provided is a speech recognition device capable of implementing end-to-end speech recognition considering a context.

Type: Grant

Filed: January 27, 2020

Date of Patent: August 6, 2024

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Ryo Masumura, Tomohiro Tanaka, Takanobu Oba
Selectively storing, with multiple user accounts and/or to a shared assistant device: speech recognition biasing, NLU biasing, and/or other data

Patent number: 11817106

Abstract: Some implementations relate to performing speech biasing, NLU biasing, and/or other biasing based on historical assistant interaction(s). It can be determined, for one or more given historical interactions of a given user, whether to affect future biasing for (1) the given user account, (2) additional user account(s), and/or (3) the shared assistant device as a whole. Some implementations disclosed herein additionally and/or alternatively relate to: determining, based on utterance(s) of a given user to a shared assistant device, an association of first data and second data; storing the association as accessible to a given user account of the given user; and determining whether to store the association as also accessible by additional user account(s) and/or the shared assistant device.

Type: Grant

Filed: November 8, 2022

Date of Patent: November 14, 2023

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
Method and apparatus for building a conversation understanding system based on artificial intelligence, device and computer-readable storage medium

Patent number: 11727302

Abstract: A method and apparatus for building a conversation understanding system based on artificial intelligence, a device and a computer-readable storage medium. In embodiments of the present disclosure, it is feasible to obtain the training feedback information provided by conversation service conducted by the user and the basic conversation understanding system, then according to the training feedback information, perform adjustment processing for a service state of the basic conversation understanding system, to obtain an adjustment state of the basic conversation understanding system. It is possible to perform data merging processing according to the training feedback information and the adjustment state of the basic conversation understanding system, to obtain model training data for building the model conversation understanding system.

Type: Grant

Filed: June 12, 2018

Date of Patent: August 15, 2023

Assignee: BEIJING BAIDU NETCOM SCIENCE AND TECHNOLOGY CO., LTD.

Inventors: Ke Sun, Shiqi Zhao, Dianhai Yu, Haifeng Wang
Updating weight values in a machine learning system

Patent number: 11715036

Abstract: A machine learning system includes a learning section and an operating section including a memory. The operating section holds a required accuracy, and an internal state and a weight value of a learner in the memory and executes calculation processing by using data input to the machine learning system and the weight value held in the memory to update the internal state. An accuracy of the internal state is calculated from a result of the calculation processing and an evaluation value is calculated using the data input to the machine learning system, the weight value, and the updated internal state held in the memory when the calculated accuracy is higher than the required accuracy. The evaluation value is transmitted to the learning section, which updates the weight value by using the evaluation value and notifies the number of times of updating the weight value to the operating section.

Type: Grant

Filed: June 26, 2020

Date of Patent: August 1, 2023

Assignee: HITACHI, LTD.

Inventor: Hiroshi Uchigaito
Selectively storing, with multiple user accounts and/or to a shared assistant device: speech recognition biasing, NLU biasing, and/or other data

Patent number: 11532313

Abstract: Some implementations relate to performing speech biasing, NLU biasing, and/or other biasing based on historical assistant interaction(s). It can be determined, for one or more given historical interactions of a given user, whether to affect future biasing for (1) the given user account, (2) additional user account(s), and/or (3) the shared assistant device as a whole. Some implementations disclosed herein additionally and/or alternatively relate to: determining, based on utterance(s) of a given user to a shared assistant device, an association of first data and second data; storing the association as accessible to a given user account of the given user; and determining whether to store the association as also accessible by additional user account(s) and/or the shared assistant device.

Type: Grant

Filed: August 27, 2020

Date of Patent: December 20, 2022

Assignee: GOOGLE LLC

Inventors: Matthew Sharifi, Victor Carbune
METHOD AND SYSTEM FOR AUTOMATIC DOMAIN ADAPTATION IN SPEECH RECOGNITION APPLICATIONS

Publication number: 20130262106

Abstract: A system and method for adapting a language model to a specific environment by receiving interactions captured the specific environment, generating a collection of documents from documents retrieved from external resources, detecting in the collection of documents terms related to the environment that are not included in an initial language model and adapting the initial language model to include the terms detected.

Type: Application

Filed: March 29, 2012

Publication date: October 3, 2013

Inventors: Eyal HURVITZ, Ezra Daya, Oren Pereg, Moshe Wasserblat
MODEL ADAPTATION DEVICE, METHOD THEREOF, AND PROGRAM THEREOF

Publication number: 20110224985

Abstract: A model adaptation device includes a text database that stores a plurality of sentences containing predetermined phonemes; a sentence list that includes a plurality of sentences that describe the contents of the input voice; an input unit to which the input voice is input; a model adaptation unit that performs the model adaptation using the input voice and the sentence list and outputs adapting characteristic information, which is for making the model approximate to the input voice; a statistic database that stores the adapting characteristic information; a distance calculation unit that outputs a value of an acoustic distance between the adapting characteristic information and the model for each phoneme; a phoneme detection unit that outputs a distance value, among the distance values, which is greater than a threshold value as a detection result; and a label generation unit that extracts from the text database a sentence containing a phoneme associated with the detection result and outputs the sentence.

Type: Application

Filed: October 23, 2009

Publication date: September 15, 2011

Inventors: Ken Hanazawa, Yoshifumi Onishi
METHOD FOR PERFORMING SPEECH RECOGNITION AND PROCESSING SYSTEM

Publication number: 20100256978

Abstract: A method for performing speech recognition relating to an object for the purpose of affecting automatic processing of the object by a processing system. The object carries information with at least a character string of processing information. The character string spoken by an operator is processed by way of a speech recognition procedure to generate a first result. Based on the need for more information of an element of the first result additional processing data is requested. An operator's response generates a second result. The first result is then modified to achieve consistency with the operator's response.

Type: Application

Filed: April 6, 2010

Publication date: October 7, 2010

Applicant: SIEMENS AKTIENGESELLSCHAFT

Inventor: Walter Rosenbaum
Natural Language System and Method Based on Unisolated Performance Metric

Publication number: 20090030692

Abstract: A natural language business system and method is developed to understand the underlying meaning of a person's speech, such as during a transaction with the business system. The system includes a speech recognition engine, and action classification engine, and a control module.

Type: Application

Filed: May 15, 2008

Publication date: January 29, 2009

Applicant: International Business Machines, Inc.

Inventors: Sabine Deligne, Yuqing Gao, Vaibhava Goel, Hong-Kwang Kuo, Cheng Wu
Method For Speech Recognition From a Partitioned Vocabulary

Publication number: 20080126090

Abstract: A is recognized using a predefinable vocabulary that is partitioned in sections of phonetically similar words. In a recognition process, first oral input is associated with one of the sections, then the oral input is determined from the vocabulary of the associated section.

Type: Application

Filed: October 4, 2005

Publication date: May 29, 2008

Inventor: Niels Kunstmann