Patents by Inventor Alex Acero

Alex Acero has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8306822
    Abstract: A method of providing automatic reading tutoring is disclosed. The method includes retrieving a textual indication of a story from a data store and creating a language model including constructing a target context free grammar indicative of a first portion of the story. A first acoustic input is received and a speech recognition engine is employed to recognize the first acoustic input. An output of the speech recognition engine is compared to the language model and a signal indicative of whether the output of the speech recognition matches at least a portion of the target context free grammar is provided.
    Type: Grant
    Filed: September 11, 2007
    Date of Patent: November 6, 2012
    Assignee: Microsoft Corporation
    Inventors: Xiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero
  • Patent number: 8065078
    Abstract: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.
    Type: Grant
    Filed: August 10, 2007
    Date of Patent: November 22, 2011
    Assignee: Microsoft Corporation
    Inventors: Ivan Tashev, Michael Lewis Seltzer, Yun-Cheng Ju, Alex Acero
  • Patent number: 7813926
    Abstract: A training system for a speech recognition application is disclosed. In embodiments described, the training system is used to train a classification model or language model. The classification model is trained using an adaptive language model generated by an iterative training process. In embodiments described, the training data is recognized by the speech recognition component and the recognized text is used to create the adaptive language model which is used for speech recognition in a following training iteration.
    Type: Grant
    Filed: March 16, 2006
    Date of Patent: October 12, 2010
    Assignee: Microsoft Corporation
    Inventors: Ye-Yi Wang, John Sie Yuen Lee, Alex Acero
  • Patent number: 7617103
    Abstract: A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the acoustic model. From this score a misclassification measure is calculated and then a loss function is calculated from the misclassification measure. The loss function also includes a margin value that varies over each iteration in the training. Based on the calculated loss function the acoustic model is updated, where the loss function with the margin value is minimized. This process repeats until such time as an empirical convergence is met.
    Type: Grant
    Filed: August 25, 2006
    Date of Patent: November 10, 2009
    Assignee: Microsoft Corporation
    Inventors: Xiaodong He, Alex Acero, Dong Yu, Li Deng
  • Publication number: 20090248422
    Abstract: Training data may be provided, the training data including pairs of source phrases and target phrases. The pairs may be used to train an intra-language statistical machine translation model, where the intra-language statistical machine translation model, when given an input phrase of text in the human language, can compute probabilities of semantic equivalence of the input phrase to possible translations of the input phrase in the human language. The statistical machine translation model may be used to translate between queries and listings. The queries may be text strings in the human language submitted to a search engine. The listing strings may be text strings of formal names of real world entities that are to be searched by the search engine to find matches for the query strings.
    Type: Application
    Filed: March 28, 2008
    Publication date: October 1, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Xiao Li, Yun-Cheng Ju, Geoffrey Zweig, Alex Acero
  • Publication number: 20090070112
    Abstract: A method of providing automatic reading tutoring is disclosed. The method includes retrieving a textual indication of a story from a data store and creating a language model including constructing a target context free grammar indicative of a first portion of the story. A first acoustic input is received and a speech recognition engine is employed to recognize the first acoustic input. An output of the speech recognition engine is compared to the language model and a signal indicative of whether the output of the speech recognition matches at least a portion of the target context free grammar is provided.
    Type: Application
    Filed: September 11, 2007
    Publication date: March 12, 2009
    Applicant: Microsoft Corporation
    Inventors: Xiaolong Li, Li Deng, Yun-Cheng Ju, Alex Acero
  • Publication number: 20090043497
    Abstract: The presentation of location information to a user that is distracted by traveling can result in the user quickly forgetting, or never even comprehending, key parts of the location information, such as the street number. Identification can be made of intersections and points of interest near the user's destination, which can then be provided instead of, or in addition to, the address, thereby increasing user comprehension and retention, especially when distracted. Map data can be parsed into addresses, intersections and points of interest databases. These databases can be accessed to identify proximate intersections and points of interest, which can then be filtered and subsequently ranked to identify one intersection, one point of interest, or both, that can be presented to the user to aid the user in comprehending and retaining the location information even when distracted.
    Type: Application
    Filed: August 10, 2007
    Publication date: February 12, 2009
    Applicant: Microsoft Corporation
    Inventors: Ivan Tashev, Michael Lewis Seltzer, Yun-Cheng Ju, Alex Acero
  • Publication number: 20080177536
    Abstract: A/V content creation, editing and publishing is disclosed. Speech recognition can be performed on the A/V content to identify words therein and form a transcript of the words. The transcript can be aligned with the associated A/V content and displayed to allow selective editing of the transcript and associated A/V content. Keywords and a summary for the transcript can also be identified for use in publishing the A/V content.
    Type: Application
    Filed: January 24, 2007
    Publication date: July 24, 2008
    Applicant: Microsoft Corporation
    Inventors: Adil Sherwani, Christopher Weare, Patrick Nguyen, Milind Mahajan, Alex Acero, Manuel Clement, Patrick Nelson
  • Publication number: 20080052075
    Abstract: A method and apparatus for training an acoustic model are disclosed. A training corpus is accessed and converted into an initial acoustic model. Scores are calculated for a correct class and competitive classes, respectively, for each token given the acoustic model. From this score a misclassification measure is calculated and then a loss function is calculated from the misclassification measure. The loss function also includes a margin value that varies over each iteration in the training. Based on the calculated loss function the acoustic model is updated, where the loss function with the margin value is minimized. This process repeats until such time as an empirical convergence is met.
    Type: Application
    Filed: August 25, 2006
    Publication date: February 28, 2008
    Applicant: Microsoft Corporation
    Inventors: Xiaodong He, Alex Acero, Dong Yu, Li Deng
  • Publication number: 20080004880
    Abstract: A speech application accessible across a network is personalized for a particular user based on preferences for the user. The speech application can be modified based on the preferences.
    Type: Application
    Filed: June 15, 2006
    Publication date: January 3, 2008
    Applicant: Microsoft Corporation
    Inventors: Alex Acero, Timothy S. Paek, Christopher A. Meek, David M. Chickering
  • Publication number: 20070219798
    Abstract: A training system for a speech recognition application is disclosed. In embodiments described, the training system is used to train a classification model or language model. The classification model is trained using an adaptive language model generated by an iterative training process. In embodiments described, the training data is recognized by the speech recognition component and the recognized text is used to create the adaptive language model which is used for speech recognition in a following training iteration.
    Type: Application
    Filed: March 16, 2006
    Publication date: September 20, 2007
    Applicant: Microsoft Corporation
    Inventors: Ye-Yi Wang, John Lee, Alex Acero