Patents by Inventor Ye-Yi Wang

Ye-Yi Wang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20090327260
    Abstract: To construct a classifier, a data structure correlating queries to items identified by the queries is received, where the data structure contains initial labeled queries that have been labeled with respect to predetermined classes, and unlabeled queries that have not been labeled with respect to the predetermined classes. The data structure is used to label at least some of the unlabeled queries with respect to the predetermined classes. Queries in the data structure that have been labeled with respect to the predetermined classes are used as training data to train the classifier.
    Type: Application
    Filed: June 25, 2008
    Publication date: December 31, 2009
    Applicant: Microsoft Corporation
    Inventors: Xiao Li, Ye-Yi Wang
  • Publication number: 20090276380
    Abstract: The present invention uses a natural language understanding system that is currently being trained to assist in annotating training data for training that natural language understanding system. Unannotated training data is provided to the system and the system proposes annotations to the training data. The user is offered an opportunity to confirm or correct the proposed annotations, and the system is trained with the corrected or verified annotations.
    Type: Application
    Filed: May 6, 2009
    Publication date: November 5, 2009
    Applicant: Microsoft Corporation
    Inventors: Alejandro Acero, Ye-Yi Wang, Leon Wong
  • Patent number: 7599837
    Abstract: A method and system to generate a grammar adapted for use by a speech recognizer includes receiving a representation of an alphanumeric expression. For instance, the representation can take the form of a regular expression or a mask. The grammar is generated based on the representation.
    Type: Grant
    Filed: September 15, 2004
    Date of Patent: October 6, 2009
    Assignee: Microsoft Corporation
    Inventors: Ye-Yi Wang, Yun-Cheng Ju, Leonard Alan Collins, Mark Cecys, Alejandro Acero
  • Patent number: 7580942
    Abstract: A computer-implemented method is disclosed for providing a directory assistance service. The method includes generating an indexing file that is a representation of information associated with a collection of listings stored in an index. The indexing file is utilized as a basis for ranking listings in an index based on the strength of association with a query. Based at least in part on the ranking, an output is provided and is indicative of listings in the index that are likely correspond to the query. At least one particular listing in the index is excluded from the output without there ever being a comparison of features in the query with features in the one particular listing.
    Type: Grant
    Filed: January 12, 2007
    Date of Patent: August 25, 2009
    Assignee: Microsoft Corporation
    Inventors: Dong Yu, Alejandro Acero, Yun-Cheng Ju, Ye-Yi Wang
  • Patent number: 7548847
    Abstract: The present invention uses a natural language understanding system that is currently being trained to assist in annotating training data for training that natural language understanding system. Unannotated training data is provided to the system and the system proposes annotations to the training data. The user is offered an opportunity to confirm or correct the proposed annotations, and the system is trained with the corrected or verified annotations.
    Type: Grant
    Filed: May 10, 2002
    Date of Patent: June 16, 2009
    Assignee: Microsoft Corporation
    Inventors: Alejandro Acero, Ye-Yi Wang, Leon Wong
  • Publication number: 20090150308
    Abstract: Described is a technology by which a maximum entropy model used for classification is trained with a significantly lesser amount of training data than is normally used in training other maximum entropy models, yet provides similar accuracy to the others. The maximum entropy model is initially parameterized with parameter values determined from weights obtained by training a vector space model or an n-gram model. The weights may be scaled into the initial parameter values by determining a scaling factor. Gaussian mean values may also be determined, and used for regularization in training the maximum entropy model. Scaling may also be applied to the Gaussian mean values. After initial parameterization, training comprises using training data to iteratively adjust the initial parameters into adjusted parameters until convergence is determined.
    Type: Application
    Filed: December 7, 2007
    Publication date: June 11, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Ye-Yi Wang, Alejandro Acero
  • Patent number: 7529657
    Abstract: A method for authoring a grammar for use in a language processing application is provided. The method includes receiving at least one grammar configuration parameter relating to how to configure a grammar and creating the grammar based on the at least one grammar configuration parameter.
    Type: Grant
    Filed: September 24, 2004
    Date of Patent: May 5, 2009
    Assignee: Microsoft Corporation
    Inventors: Ye-Yi Wang, Alejandro Acero
  • Publication number: 20090037175
    Abstract: A voice search system has a speech recognizer, a search component, and a dialog manager. A confidence measure generator receives speech recognition features from the speech recognizer, search features from the search component, and dialog features from the dialog manager, and calculates an overall confidence measure for voice search results based upon the features received. The invention can be extended to include the generation of additional features, based on those received from the individual components of the voice search system.
    Type: Application
    Filed: August 3, 2007
    Publication date: February 5, 2009
    Applicant: Microsoft Corporation
    Inventors: Ye-Yi Wang, Yun-Cheng Ju, Dong Yu
  • Publication number: 20090019027
    Abstract: A directory assistance system includes a directory database and a search engine. The search engine is configured to search the directory database for a first set of residential listings based on at least one first search term. A second search term is received that is related to a cohabitant of the listing to be found. At least one search result is selected that satisfies the second search term.
    Type: Application
    Filed: July 9, 2007
    Publication date: January 15, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Yun-Cheng Ju, Ye-Yi Wang
  • Publication number: 20080281806
    Abstract: A database having listings rather than long documents is searched using a term frequency-inverse document frequency (Tf/Idf) algorithm.
    Type: Application
    Filed: May 10, 2007
    Publication date: November 13, 2008
    Applicant: Microsoft Corporation
    Inventors: Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Alejandro Acero, Geoffrey G. Zweig
  • Publication number: 20080281827
    Abstract: A structured database is used for webpage information extraction, and in particular, to obtain training data from the webpage for training a statistical model. The structured database has a plurality of entries, wherein each entry comprises a plurality of fields. One of the fields comprises a URL (uniform resource locater), while another field comprises information at least similar to other information to be located in a webpage associated with the URL. For at least some of the entries in the structured database, a web page associated with the URL is retrieved. The webpage is analyzed and if information is found in the webpage similar to the information in the structured database, the webpage is identified as being suitable to be considered as a training sample.
    Type: Application
    Filed: May 10, 2007
    Publication date: November 13, 2008
    Applicant: Microsoft Corporation
    Inventors: Ye-Yi Wang, Alejandro Acero, Mandar A. Rahurkar
  • Publication number: 20080177547
    Abstract: A novel system integrates speech recognition and semantic classification, so that acoustic scores in a speech recognizer that accepts spoken utterances may be taken into account when training both language models and semantic classification models. For example, a joint association score may be defined that is indicative of a correspondence of a semantic class and a word sequence for an acoustic signal. The joint association score may incorporate parameters such as weighting parameters for signal-to-class modeling of the acoustic signal, language model parameters and scores, and acoustic model parameters and scores. The parameters may be revised to raise the joint association score of a target word sequence with a target semantic class relative to the joint association score of a competitor word sequence with the target semantic class. The parameters may be designed so that the semantic classification errors in the training data are minimized.
    Type: Application
    Filed: January 19, 2007
    Publication date: July 24, 2008
    Applicant: Microsoft Corporation
    Inventors: Sibel Yaman, Li Deng, Dong Yu, Ye-Yi Wang, Alejandro Acero
  • Publication number: 20080172376
    Abstract: A computer-implemented method is disclosed for providing a directory assistance service. The method includes generating an indexing file that is a representation of information associated with a collection of listings stored in an index. The indexing file is utilized as a basis for ranking listings in an index based on the strength of association with a query. Based at least in part on the ranking, an output is provided and is indicative of listings in the index that are likely correspond to the query. At least one particular listing in the index is excluded from the output without there ever being a comparison of features in the query with features in the one particular listing.
    Type: Application
    Filed: January 12, 2007
    Publication date: July 17, 2008
    Applicant: Microsoft Corporation
    Inventors: Dong Yu, Alejandro Acero, Yun-Cheng Ju, Ye-Yi Wang
  • Publication number: 20080015846
    Abstract: An answering machine detection module is used to determine whether a call recipient is an actual person or an answering machine. The answering machine detection module includes a speech recognizer and a call analysis module. The speech recognizer receives an audible response of the call recipient to a call. The speech recognizer processes the audible response and provides an output indicative of recognized speech. The call analysis module processes the output of the speech recognizer to generate an output indicative of whether the call recipient is a person or an answering machine.
    Type: Application
    Filed: July 12, 2006
    Publication date: January 17, 2008
    Applicant: Microsoft Corporation
    Inventors: Alejandro Acero, Craig M. Fisher, Dong Yu, Ye-Yi Wang, Yun-Cheng Ju
  • Patent number: 7286978
    Abstract: A method for creating a language model from a task-independent corpus is provided. In one embodiment, a task dependent unified language model is created. The unified language model includes a plurality of context-free grammars having non-terminals and a hybrid N-gram model having at least some of the same non-terminals embedded therein.
    Type: Grant
    Filed: April 11, 2006
    Date of Patent: October 23, 2007
    Assignee: Microsoft Corporation
    Inventors: Xuedong D. Huang, Milind V. Mahajan, Ye-Yi Wang, Xiaolong Mou
  • Publication number: 20070219798
    Abstract: A training system for a speech recognition application is disclosed. In embodiments described, the training system is used to train a classification model or language model. The classification model is trained using an adaptive language model generated by an iterative training process. In embodiments described, the training data is recognized by the speech recognition component and the recognized text is used to create the adaptive language model which is used for speech recognition in a following training iteration.
    Type: Application
    Filed: March 16, 2006
    Publication date: September 20, 2007
    Applicant: Microsoft Corporation
    Inventors: Ye-Yi Wang, John Lee, Alex Acero
  • Publication number: 20070219793
    Abstract: A method of forming a shareable filler model (shareable model for garbage words) from a word n-gram model is provided. The word n-gram model is converted into a probabilistic context free grammar (PCFG). The PCFG is modified into a substantially application-independent PCFG, which constitutes the shareable filler model.
    Type: Application
    Filed: March 14, 2006
    Publication date: September 20, 2007
    Applicant: Microsoft Corporation
    Inventors: Alejandro Acero, Dong Yu, Ye-Yi Wang, Yun-Cheng Ju
  • Publication number: 20070129936
    Abstract: A conditional model is used in spoken language understanding. One such model is a conditional random field model.
    Type: Application
    Filed: March 17, 2006
    Publication date: June 7, 2007
    Applicant: Microsoft Corporation
    Inventors: Ye-Yi Wang, Alejandro Acero, John Lee, Milind Mahajan
  • Publication number: 20070055492
    Abstract: To provide application developers with the ability to easily form customized grammars, grammar extensions are provided that allow application developers to selectively include portions of grammar templates and to easily combine grammar elements to form new grammar structures.
    Type: Application
    Filed: October 26, 2005
    Publication date: March 8, 2007
    Applicant: Microsoft Corporation
    Inventors: Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Alejandro Acero
  • Publication number: 20060184354
    Abstract: A method for creating a language model from a task-independent corpus is provided. In one embodiment, a task dependent unified language model is created. The unified language model includes a plurality of context-free grammars having non-terminals and a hybrid N-gram model having at least some of the same non-terminals embedded therein.
    Type: Application
    Filed: April 11, 2006
    Publication date: August 17, 2006
    Applicant: Microsoft Corporation
    Inventors: Xuedong Huang, Milind Mahajan, Ye-Yi Wang, Xiaolong Mou