Patents by Inventor Shane Settle

Shane Settle has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Recognition of out-of-vocabulary in direct acoustics-to-word speech recognition using acoustic word embedding

Patent number: 10839792

Abstract: A method (and structure and computer product) for learning Out-of-Vocabulary (OOV) words in an Automatic Speech Recognition (ASR) system includes using an Acoustic Word Embedding Recurrent Neural Network (AWE RNN) to receive a character sequence for a new OOV word for the ASR system, the RNN providing an Acoustic Word Embedding (AWE) vector as an output thereof. The AWE vector output from the AWE RNN is provided as an input into an Acoustic Word Embedding-to-Acoustic-to-Word Neural Network (AWE?A2W NN) trained to provide an OOV word weight value from the AWE vector. The OOV word weight is inserted into a listing of Acoustic-to-Word (A2W) word embeddings used by the ASR system to output recognized words from an input of speech acoustic features, wherein the OOV word weight is inserted into the A2W word embeddings list relative to existing weights in the A2W word embeddings list.

Type: Grant

Filed: February 5, 2019

Date of Patent: November 17, 2020

Assignees: INTERNATIONAL BUSINESS MACHINES CORPORATION, TOYOTA TECHNOLOGICAL INSTITUTE AT CHICAGO

Inventors: Kartik Audhkhasi, Karen Livescu, Michael Picheny, Shane Settle
Methods and systems for recognizing simultaneous speech by multiple speakers

Patent number: 10811000

Abstract: Systems and methods for a speech recognition system for recognizing speech including overlapping speech by multiple speakers. The system including a hardware processor. A computer storage memory to store data along with having computer-executable instructions stored thereon that, when executed by the processor is to implement a stored speech recognition network. An input interface to receive an acoustic signal, the received acoustic signal including a mixture of speech signals by multiple speakers, wherein the multiple speakers include target speakers. An encoder network and a decoder network of the stored speech recognition network are trained to transform the received acoustic signal into a text for each target speaker. Such that the encoder network outputs a set of recognition encodings, and the decoder network uses the set of recognition encodings to output the text for each target speaker. An output interface to transmit the text for each target speaker.

Type: Grant

Filed: April 13, 2018

Date of Patent: October 20, 2020

Assignee: Mitsubishi Electric Research Laboratories, Inc.

Inventors: Jonathan Le Roux, Takaaki Hori, Shane Settle, Hiroshi Seki, Shinji Watanabe, John Hershey
RECOGNITION OF OUT-OF-VOCABULARY IN DIRECT ACOUSTICS- TO-WORD SPEECH RECOGNITION USING ACOUSTIC WORD EMBEDDING

Publication number: 20200251096

Abstract: A method (and structure and computer product) for learning Out-of-Vocabulary (OOV) words in an Automatic Speech Recognition (ASR) system includes using an Acoustic Word Embedding Recurrent Neural Network (AWE RNN) to receive a character sequence for a new OOV word for the ASR system, the RNN providing an Acoustic Word Embedding (AWE) vector as an output thereof. The AWE vector output from the AWE RNN is provided as an input into an Acoustic Word Embedding-to-Acoustic-to-Word Neural Network (AWE?A2W NN) trained to provide an OOV word weight value from the AWE vector. The OOV word weight is inserted into a listing of Acoustic-to-Word (A2W) word embeddings used by the ASR system to output recognized words from an input of speech acoustic features, wherein the OOV word weight is inserted into the A2W word embeddings list relative to existing weights in the A2W word embeddings list.

Type: Application

Filed: February 5, 2019

Publication date: August 6, 2020

Inventors: Kartik AUDHKHASI, Karen Livescu, Michael Picheny, Shane Settle
Methods and Systems for Recognizing Simultaneous Speech by Multiple Speakers

Publication number: 20190318725

Abstract: Systems and methods for a speech recognition system for recognizing speech including overlapping speech by multiple speakers. The system including a hardware processor. A computer storage memory to store data along with having computer-executable instructions stored thereon that, when executed by the processor is to implement a stored speech recognition network. An input interface to receive an acoustic signal, the received acoustic signal including a mixture of speech signals by multiple speakers, wherein the multiple speakers include target speakers. An encoder network and a decoder network of the stored speech recognition network are trained to transform the received acoustic signal into a text for each target speaker. Such that the encoder network outputs a set of recognition encodings, and the decoder network uses the set of recognition encodings to output the text for each target speaker. An output interface to transmit the text for each target speaker.

Type: Application

Filed: April 13, 2018

Publication date: October 17, 2019

Inventors: Jonathan Le Roux, Takaaki Hori, Shane Settle, Hiroshi Seki, Shinji Watanabe, John Hershey

Recognition of out-of-vocabulary in direct acoustics-to-word speech recognition using acoustic word embedding

Methods and systems for recognizing simultaneous speech by multiple speakers

RECOGNITION OF OUT-OF-VOCABULARY IN DIRECT ACOUSTICS- TO-WORD SPEECH RECOGNITION USING ACOUSTIC WORD EMBEDDING

Methods and Systems for Recognizing Simultaneous Speech by Multiple Speakers