To Speaker (epo) Patents (Class 704/E15.011)

E Subclasses

Supervised, i.e., under machine guidance (epo) (Class 704/E15.012)

Unsupervised (epo) (Class 704/E15.013)

Construction of a machine learning model

Patent number: 11966851

Abstract: Implement one or both of processing of computer queries using machine learning models and/or generation of machine learning models in a computer system. A computer processor generates a plurality of stored machine learning models. A computer processor extracts a plurality of updated parameters sets from the plurality of stored machine learning models. A computer processor creates a new machine learning model based on the respective distribution of each parameter included in the plurality of updated parameters sets. A computer processor processes at least one new query using the new machine learning model.

Type: Grant

Filed: April 2, 2019

Date of Patent: April 23, 2024

Assignee: International Business Machines Corporation

Inventor: Aoun Lutfi
Detecting unrelated utterances in a chatbot system

Patent number: 11928430

Abstract: Techniques are described to determine whether an input utterance is unrelated to a set of skill bots associated with a master bot. In some embodiments, a system described herein includes a training system and a master bot. The training system trains a classifier of the master bot. The training includes accessing training utterances associated with the skill bots and generating training feature vectors from the training utterances. The training further includes generating multiple set representations of the training feature vectors, where each set representation corresponds to a subset of the training feature vectors, and configuring the classifier with the set representations. The master bot accesses an input utterance and generates an input feature vector. The master bot uses the classifier to compare the input feature vector to the multiple set representations so as to determine whether the input feature falls outside and, thus, cannot be handled by the skill bots.

Type: Grant

Filed: September 10, 2020

Date of Patent: March 12, 2024

Assignee: Oracle International Corporation

Inventors: Crystal C. Pan, Gautam Singaraju, Vishal Vishnoi, Srinivasa Phani Kumar Gadde
Method and system for processing user spoken utterance

Patent number: 11817093

Abstract: There is disclosed a method and system for processing a user spoken utterance, the method comprising: receiving, from a user, an indication of the user spoken utterance; generating, a text representation hypothesis based on the user spoken utterance; processing, using a first trained scenario model and a second trained scenario model, the text representation hypothesis to generate a first scenario hypothesis and a second scenario hypothesis, respectively; the first trained scenario model and the second trained scenario model having been trained using at least partially different corpus of texts; analyzing, using a Machine Learning Algorithm (MLA), the first scenario hypothesis and the second scenario hypothesis to determine a winning scenario having a higher confidence score; based on the winning scenario, determining by an associated one of the first trained scenario model and the second trained scenario model, an action to be executed by the electronic device; executing the action.

Type: Grant

Filed: December 7, 2020

Date of Patent: November 14, 2023

Assignee: YANDEX EUROPE AG

Inventors: Vyacheslav Vyacheslavovich Alipov, Oleg Aleksandrovich Sadovnikov, Nikita Vladimirovich Zubkov
System and method of video capture and search optimization for creating an acoustic voiceprint

Patent number: 11776547

Abstract: Systems and method of diarization of audio files use an acoustic voiceprint model. A plurality of audio files are analyzed to arrive at an acoustic voiceprint model associated to an identified speaker. Metadata associate with an audio file is used to select an acoustic voiceprint model. The selected acoustic voiceprint model is applied in a diarization to identify audio data of the identified speaker.

Type: Grant

Filed: January 17, 2022

Date of Patent: October 3, 2023

Assignee: Verint Systems Inc.

Inventors: Omer Ziv, Ran Achituv, Ido Shapira, Jeremie Dreyfuss
Guest Speaker Robust Adapted Speech Recognition

Publication number: 20120245940

Abstract: A method for speech recognition is implemented in the specific form of computer processes that function in a computer processor. That is, one or more computer processes: process a speech input to produce a sequence of representative speech vectors and perform multiple recognition passes to determine a recognition output corresponding to the speech input. At least one generic recognition pass is based on a generic speech recognition arrangement using generic modeling of a broad general class of input speech. And at least one adapted recognition pass is based on a speech adapted arrangement using pre-adapted modeling of a specific sub-class of the general class of input speech.

Type: Application

Filed: December 8, 2009

Publication date: September 27, 2012

Applicant: NUANCE COMMUNICATIONS, INC.

Inventors: Daniel Willett, Lambert Mathias, Chuang He, Jianxiong Wu
SYSTEM AND METHOD OF MULTI MODEL ADAPTATION AND VOICE RECOGNITION

Publication number: 20110301953

Abstract: Provided is a system of voice recognition that adapts and stores a voice of a speaker for each feature to each of a basic voice model and new independent multi models and provides stable real-time voice recognition through voice recognition using a multi adaptive model.

Type: Application

Filed: April 11, 2011

Publication date: December 8, 2011

Applicant: Seoby Electronic Co., Ltd

Inventor: Sung-Sub Lee
ADAPTIVE VOICE PRINT FOR CONVERSATIONAL BIOMETRIC ENGINE

Publication number: 20110196676

Abstract: A computer-implemented method, system and/or program product update voice prints over time. A receiving computer receives an initial voice print. A determining period of time is calculated for that initial voice print. This determining period of time is a length of time during which an expected degree of change in subsequent voice prints, in comparison to the initial voice print, is predicted to occur. A new voice print is received after the determining period of time has passed, and the new voice print is compared with the initial voice print. In response to a change to the new voice print falling within the expected degree of change in comparison to the initial voice print, a voice print store is updated with the new voice print.

Type: Application

Filed: February 9, 2010

Publication date: August 11, 2011

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: SHERI GAYLE DAYE, PEEYUSH JAISWAL, FANG WANG
RESOURCE CONSERVATIVE TRANSFORMATION BASED UNSUPERVISED SPEAKER ADAPTATION

Publication number: 20090198494

Abstract: The present invention discloses a solution for conserving computing resources when implementing transformation based adaptation techniques. The disclosed solution limits the amount of speech data used by real-time adaptation algorithms to compute a transformation, which results in substantial computational savings. Appreciably, application of a transform is a relatively low memory and computationally cheap process compared to memory and resource requirements for computing the transform to be applied.

Type: Application

Filed: February 6, 2008

Publication date: August 6, 2009

Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: JOHN W. ECKHART, MICHAEL FLORIO, RADEK HAMPL, PAVEL KRBEC, JONATHAN PALGON
SPEECH RECOGNITION BASED ON SYMBOLIC REPRESENTATION OF A TARGET SENTENCE

Publication number: 20090119107

Abstract: Systems and methods for processing a user speech input to determine whether the user has correctly read a target sentence string are provided. One disclosed method may include receiving a sentence array including component words of the target sentence string and processing the sentence array to generate a symbolic representation of the target sentence string. The symbolic representation may include a subset of words selected from the component words of the target sentence string, having fewer words than the sentence array. The method may include processing user speech input to recognize in the user speech input each of the words in the subset of words in the symbolic representation of the target sentence string. The method may further include, upon recognizing the subset of words, making a determination that the user has correctly read the target sentence string.

Type: Application

Filed: November 1, 2007

Publication date: May 7, 2009

Applicant: MICROSOFT CORPORATION

Inventor: Duncan
APPARATUS AND METHODS FOR VOCAL TRACT ANALYSIS OF SPEECH SIGNALS

Publication number: 20080162134

Abstract: The present invention provides for speech processing apparatus arranged for the input or output of a speech data signal and including a function generating means arranged for producing a representation of a vocal-tract potential function representative of a speech source and as an example, a speaker identification process can comprise means to capture an incoming voice signal, for example from a microphone or telephone line; means to process the signal electronically to generate a time varying series of binary vocal-tract potentials and associated non-vowel binary parameters; means to refine the signal to revoke the speaker-independent speech components; and means to compare the residual signal with a database of such residual features of known individuals.

Type: Application

Filed: January 7, 2008

Publication date: July 3, 2008

Applicant: King's College London

Inventors: Barbara Janey Forbes, Edward Roy Pike