Patents by Inventor Julian J. Odell

Julian J. Odell has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Combination and federation of local and remote speech recognition

Patent number: 8892439

Abstract: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.

Type: Grant

Filed: July 15, 2009

Date of Patent: November 18, 2014

Assignee: Microsoft Corporation

Inventors: Julian J. Odell, Robert L. Chambers
Recognizing multiple semantic items from single utterance

Patent number: 8725492

Abstract: Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.

Type: Grant

Filed: March 5, 2008

Date of Patent: May 13, 2014

Assignee: Microsoft Corporation

Inventors: Julian J Odell, Robert L. Chambers, Oliver Scholz
Confidence threshold tuning

Patent number: 8396715

Abstract: An expected dialog-turn (ED) value is estimated for evaluating a speech application. Parameters such as a confidence threshold setting can be adjusted based on the expected dialog-turn value. In a particular example, recognition results and corresponding confidence scores are used to estimate the expected dialog-turn value. The recognition results can be associated with a possible outcome for the speech application and a cost for the possible outcome can be used to estimate the expected dialog-turn value.

Type: Grant

Filed: June 28, 2005

Date of Patent: March 12, 2013

Assignee: Microsoft Corporation

Inventors: Julian J. Odell, Li Jiang, Wei Zhang, Xiaodong He
Speech recognition system with display information

Patent number: 8364487

Abstract: A language processing system may determine a display form of a spoken word by analyzing the spoken form using a language model that includes dictionary entries for display forms of homonyms. The homonyms may include trade names as well as given names and other phrases. The language processing system may receive spoken language and produce a display form of the language while displaying the proper form of the homonym. Such a system may be used in search systems where audio input is converted to a graphical display of a portion of the spoken input.

Type: Grant

Filed: October 21, 2008

Date of Patent: January 29, 2013

Assignee: Microsoft Corporation

Inventors: Yun-Cheng Ju, Julian J. Odell
Voice user interface authoring tool

Patent number: 8315874

Abstract: A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files.

Type: Grant

Filed: April 11, 2006

Date of Patent: November 20, 2012

Assignee: Microsoft Corporation

Inventors: William F. Barton, Michelle S. Spina, David G. Ollason, Julian J. Odell
Classification filter for processing data for creating a language model

Patent number: 8165870

Abstract: The method and apparatus utilize a filter to remove a variety of non-dictated words from data based on probability and improve the effectiveness of creating a language model.

Type: Grant

Filed: February 10, 2005

Date of Patent: April 24, 2012

Assignee: Microsoft Corporation

Inventors: Alejandro Acero, Dong Yu, Julian J. Odell, Milind V. Mahajan, Peter K. L. Mau
Lightweight windowing method for screening harvested data for novelty

Patent number: 8069032

Abstract: Biasing of language model customization due to repetitious data is substantially reduced by introducing novelty screening to data harvesting process. Novelty detection based filtering is added to ensure that an adaptation system gives more weight to representative adaptation data that is not repetitious. The value of the adaptation data is preserved and the process prevented from being polluted when the same data is seen multiple times, such as the original posting in an email thread, various versions of the same document, and the like. The screening technique may be built on top of existing data harvesting mechanisms as already seen data is used to determine the novelty of a particular portion of the data. A window into the new data, fixed or variable size, is compared against the already collected data to determine the likelihood that the data is novel.

Type: Grant

Filed: July 27, 2006

Date of Patent: November 29, 2011

Assignee: Microsoft Corporation

Inventors: Julian J. Odell, Kunal Mukerjee
Automatic identification of dialog timing problems for an interactive speech dialog application using speech log data indicative of cases of barge-in and timing problems

Patent number: 7930183

Abstract: A method of analyzing dialog between a user and an interactive application having dialog turns is provided. The method includes accessing information indicative of a plurality of dialog turns between the application and at least one user and identifying instances where the application determined a response was received before an associated prompt had completed. The accessed information includes information related to operation of the application with a first grammar to recognize the response. The method includes identifying whether the response was received in a particular limited time period from when the associated prompt began. If the response was received in the limited time period, the method determines whether the response included one or more terms from the associated prompt by performing recognition on the response using a second grammar having more information related to grammar of a language than the first grammar.

Type: Grant

Filed: March 29, 2006

Date of Patent: April 19, 2011

Assignee: Microsoft Corporation

Inventors: Julian J. Odell, Stephen F. Potter
COMBINATION AND FEDERATION OF LOCAL AND REMOTE SPEECH RECOGNITION

Publication number: 20110015928

Abstract: Techniques to provide automatic speech recognition at a local device are described. An apparatus may include an audio input to receive audio data indicating a task. The apparatus may further include a local recognizer component to receive the audio data, to pass the audio data to a remote recognizer while receiving the audio data, and to recognize speech from the audio data. The apparatus may further include a federation component operative to receive one or more recognition results from the local recognizer and/or the remote recognizer, and to federate a plurality of recognition results to produce a most likely result. The apparatus may further include an application to perform the task indicated by the most likely result. Other embodiments are described and claimed.

Type: Application

Filed: July 15, 2009

Publication date: January 20, 2011

Applicant: MICROSOFT CORPORATION

Inventors: Julian J. Odell, Robert L. Chambers
Speech Recognition System with Display Information

Publication number: 20100100384

Abstract: A language processing system may determine a display form of a spoken word by analyzing the spoken form using a language model that includes dictionary entries for display forms of homonyms. The homonyms may include trade names as well as given names and other phrases. The language processing system may receive spoken language and produce a display form of the language while displaying the proper form of the homonym. Such a system may be used in search systems where audio input is converted to a graphical display of a portion of the spoken input.

Type: Application

Filed: October 21, 2008

Publication date: April 22, 2010

Applicant: Microsoft Corporation

Inventors: Yun-Cheng Ju, Julian J. Odell
Recognizing multiple semantic items from single utterance

Publication number: 20090228270

Abstract: Semantically distinct items are extracted from a single utterance by repeatedly recognizing the same utterance using constraints provided by semantic items already recognized. User feedback for selection or correction of partially recognized utterance may be used in a hierarchical, multi-modal, or single step manner. An accuracy of recognition is preserved while the less structured and more natural single utterance recognition form is allowed to be used.

Type: Application

Filed: March 5, 2008

Publication date: September 10, 2009

Applicant: Microsoft Corporation

Inventors: Julian J. Odell, Robert L. Chambers, Oliver Scholz
Block synchronous decoding

Patent number: 7529671

Abstract: A pattern recognition system and method are provided. Aspects of the invention are particularly useful in combination with multi-state Hidden Markov Models. Pattern recognition is effected by processing Hidden Markov Model Blocks. This block-processing allows the processor to perform more operations upon data while such data is in cache memory. By so increasing cache locality, aspects of the invention provide significantly improved pattern recognition speed.

Type: Grant

Filed: March 4, 2003

Date of Patent: May 5, 2009

Assignee: Microsoft Corporation

Inventors: William H. Rockenbeck, Julian J. Odell
Lightweight windowing method for screening harvested data for novelty

Publication number: 20080027706

Abstract: Biasing of language model customization due to repetitious data is substantially reduced by introducing novelty screening to data harvesting process. Novelty detection based filtering is added to ensure that an adaptation system gives more weight to representative adaptation data that is not repetitious. The value of the adaptation data is preserved and the process prevented from being polluted when the same data is seen multiple times, such as the original posting in an email thread, various versions of the same document, and the like. The screening technique may be built on top of existing data harvesting mechanisms as already seen data is used to determine the novelty of a particular portion of the data. A window into the new data, fixed or variable size, is compared against the already collected data to determine the likelihood that the data is novel.

Type: Application

Filed: July 27, 2006

Publication date: January 31, 2008

Applicant: Microsoft Corporation

Inventors: Julian J. Odell, Kunal Mukerjee
Word-specific acoustic models in a speech recognition system

Patent number: 7275034

Abstract: A speech recognizer has an acoustic model that includes word-specific models, that are specific to candidate words. The candidate words would otherwise be mapped to a series of general phones. A decoder transcribes input speech into words formed by shared phones, word-specific phones and a combination of shared word-specific phones.

Type: Grant

Filed: April 13, 2006

Date of Patent: September 25, 2007

Assignee: Microsoft Corporation

Inventors: Julian J. Odell, Shahid Durrani
Method and apparatus for quantizing model parameters

Patent number: 7272557

Abstract: A method of quantizing a model parameter includes applying the model parameter to a non-linear scaling function to produce a scaled model parameter and quantizing the scaled model parameter to form a quantized model parameter. In further embodiments, likelihoods for multiple frames of input feature vectors are determined for each retrieval of quantized model parameters from memory.

Type: Grant

Filed: May 1, 2003

Date of Patent: September 18, 2007

Assignee: Microsoft Corporation

Inventor: Julian J. Odell
Voice user interface authoring tool

Publication number: 20070156406

Abstract: A voice user interface authoring tool is configured to use categorized example caller responses, from which callflow paths, automatic speech recognition, and natural language processing control files can be generated automatically within a single, integrated authoring user interface. A voice user interface (VUI) design component allows an author to create an application incorporating various types of action nodes, including Prompt/Response Processing (PRP) nodes. At runtime, the system uses the information from each PRP node to prompt a user to say something, and to process the user's response in order to extract its meaning. An Automatic Speech Recognition/Natural Language Processing (ASR/NLP) Control Design component allows the author to associate sample inputs with each possible meaning, and automatically generates the necessary ASR and NLP runtime control files.

Type: Application

Filed: April 11, 2006

Publication date: July 5, 2007

Applicant: Microsoft Corporation

Inventors: William F. Barton, Michelle S. Spina, David G. Ollason, Julian J. Odell
Word-specific acoustic models in a speech recognition system

Patent number: 7062436

Abstract: An acoustic model includes word-specific models, that are specific to candidate words. The candidate words would otherwise be mapped to a series of general phones. A sub-series of the general phones representing the candidate word is modeled by a new phone and the new phone is dedicated to the candidate word, or a small group of similar words, but the new phone is not shared among all words that otherwise map to the sub-series of general phones.

Type: Grant

Filed: February 11, 2003

Date of Patent: June 13, 2006

Assignee: Microsoft Corporation

Inventors: Julian J. Odell, Shahid Durrani
Method and apparatus for quantizing model parameters

Publication number: 20040220804

Abstract: A method of quantizing a model parameter includes applying the model parameter to a non-linear scaling function to produce a scaled model parameter and quantizing the scaled model parameter to form a quantized model parameter. In further embodiments, likelihoods for multiple frames of input feature vectors are determined for each retrieval of quantized model parameters from memory.

Type: Application

Filed: May 1, 2003

Publication date: November 4, 2004

Applicant: Microsoft Corporation

Inventor: Julian J. Odell
Block synchronous decoding

Publication number: 20040176956

Abstract: A pattern recognition system and method are provided. Aspects of the invention are particularly useful in combination with multi-state Hidden Markov Models. Pattern recognition is effected by processing Hidden Markov Model Blocks. This block-processing allows the processor to perform more operations upon data while such data is in cache memory. By so increasing cache locality, aspects of the invention provide significantly improved pattern recognition speed.

Type: Application

Filed: March 4, 2003

Publication date: September 9, 2004

Applicant: Microsoft Corporation

Inventors: William H. Rockenbeck, Julian J. Odell
Network and language models for use in a speech recognition system

Patent number: 6668243

Abstract: A language model structure for use in a speech recognition system employs a tree-structured network model. The language model is structured such that identifiers associated with each word and contained therein are arranged such that each node of the network model with which the language model is associated spans a continuous range of identifiers. A method of transferring tokens through a tree-structured network in a speech recognition process is also provided.

Type: Grant

Filed: August 2, 2001

Date of Patent: December 23, 2003

Assignee: Microsoft Corporation

Inventor: Julian J. Odell