Patents by Inventor Andreas Stolcke

Andreas Stolcke has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20140214421
    Abstract: Prosodic features are used for discriminating computer-directed speech from human-directed speech. Statistics and models describing energy/intensity patterns over time, speech/pause distributions, pitch patterns, vocal effort features, and speech segment duration patterns may be used for prosodic modeling. The prosodic features for at least a portion of an utterance are monitored over a period of time to determine a shape associated with the utterance. A score may be determined to assist in classifying the current utterance as human directed or computer directed without relying on knowledge of preceding utterances or utterances following the current utterance. Outside data may be used for training lexical addressee detection systems for the H-H-C scenario. H-C training data can be obtained from a single-user H-C collection, and H-H speech can be modeled using general conversational speech. H-C and H-H language models may also be adapted using interpolation with small amounts of matched H-H-C data.
    Type: Application
    Filed: January 31, 2013
    Publication date: July 31, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Elizabeth Shriberg, Andreas Stolcke, Dilek Hakkani-Tur, Larry Heck, Heeyoung Lee
  • Publication number: 20130166295
    Abstract: The present invention relates to a method and apparatus for speaker-calibrated speaker detection. One embodiment of a method for generating a speaker model for use in detecting a speaker of interest includes identifying one or more speech features that best distinguish the speaker of interest from a plurality of impostor speakers and then incorporating the speech features in the speaker model.
    Type: Application
    Filed: December 21, 2011
    Publication date: June 27, 2013
    Inventors: Elizabeth Shriberg, Luciana Ferrer, Andreas Stolcke, Martin Graciarena, Nicolas Scheffer
  • Publication number: 20120166365
    Abstract: The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.
    Type: Application
    Filed: September 1, 2010
    Publication date: June 28, 2012
    Inventors: Gokhan Tur, Horacio E. Franco, Elizabeth Shriberg, Gregory K. Myers, William S. Mark, Norman D. Winarsky, Andreas Stolcke, Bart Peintner, Michael J. Wolverton, Luciana Ferrer, Martin Graciarena, Neil Yorke-Smith
  • Publication number: 20120029904
    Abstract: The present invention relates to a method and apparatus for adding new vocabulary to interactive translation and dialogue systems. In one embodiment, a method for adding a new word to a vocabulary of an interactive dialogue includes receiving an input signal that includes at least one word not currently in the vocabulary, inserting the word into a dynamic component of a search graph associated with the vocabulary, and compiling the dynamic component independently of a permanent component of the search graph to produce a new sub-grammar, where the permanent component comprises a plurality of words that are permanently part of the search graph.
    Type: Application
    Filed: July 30, 2010
    Publication date: February 2, 2012
    Inventors: Kristin Precoda, Horacio Franco, Jing Zheng, Michael Frandsen, Victor Abrash, Murat Akbacak, Andreas Stolcke
  • Publication number: 20080010065
    Abstract: A method and apparatus for speaker recognition are provided. One embodiment of a method for determining whether a given speech signal is produced by an alleged speaker, where a plurality of statistical models (including at least one support vector machine) have been produced for the alleged speaker based on a previous speech signal received from the alleged speaker, includes receiving the given speech signal, the speech signal representing an utterance made by a speaker claiming to be the alleged speaker, scoring the given speech signal using at least two modeling systems, where at least one of the modeling systems is a support vector machine, combining scores produced by the modeling systems, with equal weights, to produce a final score, and determining, in accordance with the final score, whether the speaker is likely the alleged speaker.
    Type: Application
    Filed: June 5, 2007
    Publication date: January 10, 2008
    Inventors: Harry Bratt, Luciana Ferrer, Martin Graciarena, Sachin Kajarekar, Elizabeth Shriberg, Mustafa Sonmez, Andreas Stolcke, Gokhan Tur, Anand Venkataraman
  • Publication number: 20050125224
    Abstract: A method and apparatus are provided for fusion of recognition results from multiple types of data sources. In one embodiment, the inventive method includes implementing a first processing technique to recognize at least a portion of terms contained in a first media source, implementing a second processing technique to recognize at least a portion of terms contained in a second media source that contains a different type of data than that contained in the first media source, and adapting the first processing technique based at least in part on results generated by the second processing technique.
    Type: Application
    Filed: November 8, 2004
    Publication date: June 9, 2005
    Inventors: Gregory Myers, Harry Bratt, Anand Venkataraman, Andreas Stolcke, Horacio Franco, Venkata Rao Gadde
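
As a rough illustration of the equal-weight score combination described in the abstract of publication 20080010065 above, the following Python sketch averages the scores from several modeling systems and compares the result to a threshold. The function names, the example scores, and the default threshold of 0.0 are hypothetical and do not come from the application; the abstract does not say how scores are normalized or how the decision threshold is chosen.

```python
from statistics import fmean


def fuse_scores(scores):
    """Combine per-system scores with equal weights (arithmetic mean).

    `scores` holds one score per modeling system; the abstract calls for
    at least two systems, so fewer than two is rejected here.
    """
    if len(scores) < 2:
        raise ValueError("need scores from at least two modeling systems")
    return fmean(scores)


def accept_claim(scores, threshold=0.0):
    """Return True if the fused score meets the (assumed) threshold."""
    return fuse_scores(scores) >= threshold


# Hypothetical scores, e.g. one from an SVM and one from another model.
example_scores = [1.0, -0.5]
decision = accept_claim(example_scores)
```

Equal weighting needs no tuning data, but it only makes sense when the systems produce scores on comparable scales, which this sketch simply assumes.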