Patents by Inventor Alex Acero

Alex Acero has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Automatic reading tutoring using dynamically built language model

Patent number: 8306822

Abstract: A method of providing automatic reading tutoring is disclosed. The method includes retrieving a textual indication of a story from a data store and creating a language model including constructing a target context free grammar indicative of a first portion of the story. A first acoustic input is received and a speech recognition engine is employed to recognize the first acoustic input. An output of the speech recognition engine is compared to the language model and a signal indicative of whether the output of the speech recognition matches at least a portion of the target context free grammar is provided.

Type: Grant

Filed: September 11, 2007

Date of Patent: November 6, 2012

Assignee: Microsoft Corporation

Inventors: Xiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero
Conveying locations in spoken dialog systems

Patent number: 8065078

Abstract: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.

Type: Grant

Filed: August 10, 2007

Date of Patent: November 22, 2011

Assignee: Microsoft Corporation

Inventors: Ivan Tashev, Michael Lewis Seltzer, Yun-Cheng Ju, Alex Acero
Training system for a speech recognition application

Patent number: 7813926

Abstract: A training system for a speech recognition application is disclosed. In embodiments described, the training system is used to train a classification model or language model. The classification model is trained using an adaptive language model generated by an iterative training process. In embodiments described, the training data is recognized by the speech recognition component and the recognized text is used to create the adaptive language model which is used for speech recognition in a following training iteration.

Type: Grant

Filed: March 16, 2006

Date of Patent: October 12, 2010

Assignee: Microsoft Corporation

Inventors: Ye-Yi Wang, John Sie Yuen Lee, Alex Acero
Incrementally regulated discriminative margins in MCE training for speech recognition

Patent number: 7617103

Abstract: A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the acoustic model. From this score a misclassification measure is calculated and then a loss function is calculated from the misclassification measure. The loss function also includes a margin value that varies over each iteration in the training. Based on the calculated loss function the acoustic model is updated, where the loss function with the margin value is minimized. This process repeats until such time as an empirical convergence is met.

Type: Grant

Filed: August 25, 2006

Date of Patent: November 10, 2009

Assignee: Microsoft Corporation

Inventors: Xiaodong He, Alex Acero, Dong Yu, Li Deng
INTRA-LANGUAGE STATISTICAL MACHINE TRANSLATION

Publication number: 20090248422

Abstract: Training data may be provided, the training data including pairs of source phrases and target phrases. The pairs may be used to train an intra-language statistical machine translation model, where the intra-language statistical machine translation model, when given an input phrase of text in the human language, can compute probabilities of semantic equivalence of the input phrase to possible translations of the input phrase in the human language. The statistical machine translation model may be used to translate between queries and listings. The queries may be text strings in the human language submitted to a search engine. The listing strings may be text strings of formal names of real world entities that are to be searched by the search engine to find matches for the query strings.

Type: Application

Filed: March 28, 2008

Publication date: October 1, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Xiao Li, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero
Automatic reading tutoring

Publication number: 20090070112

Abstract: A method of providing automatic reading tutoring is disclosed. The method includes retrieving a textual indication of a story from a data store and creating a language model including constructing a target context free grammar indicative of a first portion of the story. A first acoustic input is received and a speech recognition engine is employed to recognize the first acoustic input. An output of the speech recognition engine is compared to the language model and a signal indicative of whether the output of the speech recognition matches at least a portion of the target context free grammar is provided.

Type: Application

Filed: September 11, 2007

Publication date: March 12, 2009

Applicant: Microsoft Corporation

Inventors: Xiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero
Conveying Locations In Spoken Dialog Systems

Publication number: 20090043497

Abstract: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.

Type: Application

Filed: August 10, 2007

Publication date: February 12, 2009

Applicant: Microsoft Corporation

Inventors: Ivan Tashev, Michael Lewis Seltzer, Yun-Cheng Ju, Alex Acero
A/V CONTENT EDITING

Publication number: 20080177536

Abstract: A/V content creation, editing and publishing is disclosed. Speech recognition can be performed on the A/V content to identify words therein and form a transcript of the words. The transcript can be aligned with the associated A/V content and displayed to allow selective editing of the transcript and associated A/V content. Keywords and a summary for the transcript can also be identified for use in publishing the A/V content.

Type: Application

Filed: January 24, 2007

Publication date: July 24, 2008

Applicant: Microsoft Corporation

Inventors: Adil Sherwani, Christopher Weare, Patrick Nguyen, Milind Mahajan, Alex Acero, Manuel Clement, Patrick Nelson
Incrementally regulated discriminative margins in MCE training for speech recognition

Publication number: 20080052075

Abstract: A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the acoustic model. From this score a misclassification measure is calculated and then a loss function is calculated from the misclassification measure. The loss function also includes a margin value that varies over each iteration in the training. Based on the calculated loss function the acoustic model is updated, where the loss function with the margin value is minimized. This process repeats until such time as an empirical convergence is met.

Type: Application

Filed: August 25, 2006

Publication date: February 28, 2008

Applicant: Microsoft Corporation

Inventors: Xiaodong He, Alex Acero, Dong Yu, Li Deng
PERSONALIZED SPEECH SERVICES ACROSS A NETWORK

Publication number: 20080004880

Abstract: A speech application accessible across a network is personalized for a particular user based on preferences for the user. The speech application can be modified based on the preferences.

Type: Application

Filed: June 15, 2006

Publication date: January 3, 2008

Applicant: Microsoft Corporation

Inventors: Alex Acero, Timothy S. Paek, Christopher A. Meek, David M. Chickering
Training system for a speech recognition application

Publication number: 20070219798

Abstract: A training system for a speech recognition application is disclosed. In embodiments described, the training system is used to train a classification model or language model. The classification model is trained using an adaptive language model generated by an iterative training process. In embodiments described, the training data is recognized by the speech recognition component and the recognized text is used to create the adaptive language model which is used for speech recognition in a following training iteration.

Type: Application

Filed: March 16, 2006

Publication date: September 20, 2007

Applicant: Microsoft Corporation

Inventors: Ye-Yi Wang, John Lee, Alex Acero