Patents by Inventor Alex Acero
Alex Acero has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8306822
Abstract: A method of providing automatic reading tutoring is disclosed. The method includes retrieving a textual indication of a story from a data store and creating a language model including constructing a target context free grammar indicative of a first portion of the story. A first acoustic input is received and a speech recognition engine is employed to recognize the first acoustic input. An output of the speech recognition engine is compared to the language model and a signal indicative of whether the output of the speech recognition matches at least a portion of the target context free grammar is provided.
Type: Grant
Filed: September 11, 2007
Date of Patent: November 6, 2012
Assignee: Microsoft Corporation
Inventors: Xiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero
-
Patent number: 8065078
Abstract: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.
Type: Grant
Filed: August 10, 2007
Date of Patent: November 22, 2011
Assignee: Microsoft Corporation
Inventors: Ivan Tashev, Michael Lewis Seltzer, Yun-Cheng Ju, Alex Acero
-
Patent number: 7813926
Abstract: A training system for a speech recognition application is disclosed. In embodiments described, the training system is used to train a classification model or language model. The classification model is trained using an adaptive language model generated by an iterative training process. In embodiments described, the training data is recognized by the speech recognition component and the recognized text is used to create the adaptive language model which is used for speech recognition in a following training iteration.
Type: Grant
Filed: March 16, 2006
Date of Patent: October 12, 2010
Assignee: Microsoft Corporation
Inventors: Ye-Yi Wang, John Sie Yuen Lee, Alex Acero
-
Patent number: 7617103
Abstract: A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the acoustic model. From this score a misclassification measure is calculated and then a loss function is calculated from the misclassification measure. The loss function also includes a margin value that varies over each iteration in the training. Based on the calculated loss function the acoustic model is updated, where the loss function with the margin value is minimized. This process repeats until such time as an empirical convergence is met.
Type: Grant
Filed: August 25, 2006
Date of Patent: November 10, 2009
Assignee: Microsoft Corporation
Inventors: Xiaodong He, Alex Acero, Dong Yu, Li Deng
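Illustrative note: the abstract of patent 7617103 describes a chain of quantities (class scores, a misclassification measure, and a loss with an iteration-dependent margin). The sketch below shows one conventional way such a chain is written in minimum-classification-error style training. It is not taken from the patent text: the function names, the smoothing parameters `eta` and `alpha`, the sigmoid form of the loss, and the linear margin schedule are all assumptions chosen for illustration.

```python
import math


def misclassification_measure(correct_score, competitor_scores, eta=1.0):
    """Assumed form: d = -g_correct + (1/eta) * log(mean(exp(eta * g_j))).

    d < 0 means the correct class outscores the (soft-max of the) competitors.
    """
    m = max(competitor_scores)
    # Shift by the maximum before exponentiating for numerical stability.
    mean_exp = sum(math.exp(eta * (s - m)) for s in competitor_scores) / len(
        competitor_scores
    )
    return m + math.log(mean_exp) / eta - correct_score


def margin_for_iteration(iteration, start=0.0, step=0.1):
    """Hypothetical schedule: the margin grows linearly with the iteration."""
    return start + step * iteration


def margin_loss(d, margin, alpha=1.0):
    """Assumed sigmoid loss of the misclassification measure shifted by a margin."""
    return 1.0 / (1.0 + math.exp(-alpha * (d + margin)))


if __name__ == "__main__":
    d = misclassification_measure(2.0, [1.0, 0.0])
    for it in range(3):
        margin = margin_for_iteration(it)
        print(it, round(margin, 2), margin_loss(d, margin))
```

Under these assumptions, a larger margin raises the loss for the same token, so a token must be classified correctly by a wider gap before its loss becomes small. The model update step and the convergence test mentioned in the abstract are omitted here.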
-
Publication number: 20090248422
Abstract: Training data may be provided, the training data including pairs of source phrases and target phrases. The pairs may be used to train an intra-language statistical machine translation model, where the intra-language statistical machine translation model, when given an input phrase of text in the human language, can compute probabilities of semantic equivalence of the input phrase to possible translations of the input phrase in the human language. The statistical machine translation model may be used to translate between queries and listings. The queries may be text strings in the human language submitted to a search engine. The listing strings may be text strings of formal names of real world entities that are to be searched by the search engine to find matches for the query strings.
Type: Application
Filed: March 28, 2008
Publication date: October 1, 2009
Applicant: MICROSOFT CORPORATION
Inventors: Xiao Li, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero
-
Publication number: 20090070112
Abstract: A method of providing automatic reading tutoring is disclosed. The method includes retrieving a textual indication of a story from a data store and creating a language model including constructing a target context free grammar indicative of a first portion of the story. A first acoustic input is received and a speech recognition engine is employed to recognize the first acoustic input. An output of the speech recognition engine is compared to the language model and a signal indicative of whether the output of the speech recognition matches at least a portion of the target context free grammar is provided.
Type: Application
Filed: September 11, 2007
Publication date: March 12, 2009
Applicant: Microsoft Corporation
Inventors: Xiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero
-
Publication number: 20090043497
Abstract: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.
Type: Application
Filed: August 10, 2007
Publication date: February 12, 2009
Applicant: Microsoft Corporation
Inventors: Ivan Tashev, Michael Lewis Seltzer, Yun-Cheng Ju, Alex Acero
-
Publication number: 20080177536
Abstract: A/V content creation, editing and publishing is disclosed. Speech recognition can be performed on the A/V content to identify words therein and form a transcript of the words. The transcript can be aligned with the associated A/V content and displayed to allow selective editing of the transcript and associated A/V content. Keywords and a summary for the transcript can also be identified for use in publishing the A/V content.
Type: Application
Filed: January 24, 2007
Publication date: July 24, 2008
Applicant: Microsoft Corporation
Inventors: Adil Sherwani, Christopher Weare, Patrick Nguyen, Milind Mahajan, Alex Acero, Manuel Clement, Patrick Nelson
-
Publication number: 20080052075
Abstract: A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the acoustic model. From this score a misclassification measure is calculated and then a loss function is calculated from the misclassification measure. The loss function also includes a margin value that varies over each iteration in the training. Based on the calculated loss function the acoustic model is updated, where the loss function with the margin value is minimized. This process repeats until such time as an empirical convergence is met.
Type: Application
Filed: August 25, 2006
Publication date: February 28, 2008
Applicant: Microsoft Corporation
Inventors: Xiaodong He, Alex Acero, Dong Yu, Li Deng
-
Publication number: 20080004880
Abstract: A speech application accessible across a network is personalized for a particular user based on preferences for the user. The speech application can be modified based on the preferences.
Type: Application
Filed: June 15, 2006
Publication date: January 3, 2008
Applicant: Microsoft Corporation
Inventors: Alex Acero, Timothy S. Paek, Christopher A. Meek, David M. Chickering
-
Publication number: 20070219798
Abstract: A training system for a speech recognition application is disclosed. In embodiments described, the training system is used to train a classification model or language model. The classification model is trained using an adaptive language model generated by an iterative training process. In embodiments described, the training data is recognized by the speech recognition component and the recognized text is used to create the adaptive language model which is used for speech recognition in a following training iteration.
Type: Application
Filed: March 16, 2006
Publication date: September 20, 2007
Applicant: Microsoft Corporation
Inventors: Ye-Yi Wang, John Lee, Alex Acero