Patents by Inventor Geoffrey G. Zweig

Geoffrey G. Zweig has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10134389
    Abstract: A system is provided that trains a spoken language understanding (SLU) classifier. A corpus of user utterances is received. For each of the user utterances in the corpus, the user utterance is semantically parsed, and the result of this semantic parsing is represented as a rooted semantic parse graph. The parse graphs representing all of the user utterances in the corpus are then combined into a single corpus graph that represents the semantic parses of the entire corpus. The user utterances in the corpus are then clustered into intent-wise homogeneous groups of user utterances, where this clustering includes finding subgraphs in the corpus graph that represent different groups of user utterances, and each of these different groups has a similar user intent. The intent-wise homogeneous groups of user utterances are then used to train the SLU classifier, and the trained SLU classifier is output.
    Type: Grant
    Filed: September 4, 2015
    Date of Patent: November 20, 2018
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Dilek Hakkani-Tur, Yun-Cheng Ju, Geoffrey G. Zweig, Gokhan Tur
  • Patent number: 10108608
    Abstract: A dialog state tracking system. One aspect of the system is the use of multiple utterance decoders and/or multiple spoken language understanding (SLU) engines generating competing results that improve the likelihood that the correct dialog state is available to the system and provide additional features for scoring dialog state hypotheses. An additional aspect is training a SLU engine and a dialog state scorer/ranker DSR engine using different subsets from a single annotated training data set. A further aspect is training multiple SLU/DSR engine pairs from inverted subsets of the annotated training data set. Another aspect is web-style dialog state ranking based on dialog state features using discriminative models with automatically generated feature conjunctions. Yet another aspect is using multiple parameter sets with each ranking engine and averaging the rankings. Each aspect independently improves dialog state tracking accuracy and may be combined in various combinations for greater improvement.
    Type: Grant
    Filed: June 12, 2014
    Date of Patent: October 23, 2018
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Jason D. Williams, Geoffrey G. Zweig
  • Publication number: 20180276704
    Abstract: Systems and methods for using Internet service co-branded financial instruments are disclosed. In one embodiment, a method for using Internet co-branded financial instruments may include: (1) a financial institution and a provider of an Internet service establishing a relationship, which may be a co-branded financial instrument, such as a credit card; (2) a user receiving the co-branded financial instrument; (3) the provider of the Internet service monitoring the user's Internet service use; (4) the provider of the Internet service may provide the user's Internet service use to the financial institution; (5) the financial institution issuing rewards to the user's account based on the Internet service use; (6) the financial institution may provide the provider of the Internet service with financial information for the user; and (7) the provider of the Internet service may charge an advertiser for ad placement based on the user financial information.
    Type: Application
    Filed: March 26, 2018
    Publication date: September 27, 2018
    Inventor: Geoffrey G. Zweig
  • Patent number: 9836671
    Abstract: Disclosed herein are technologies directed to discovering semantic similarities between images and text, which can include performing image search using a textual query, performing text search using an image as a query, and/or generating captions for images using a caption generator. A semantic similarity framework can include a caption generator and can be based on a deep multimodal similar model. The deep multimodal similarity model can receive sentences and determine the relevancy of the sentences based on similarity of text vectors generated for one or more sentences to an image vector generated for an image. The text vectors and the image vector can be mapped in a semantic space, and their relevance can be determined based at least in part on the mapping. The sentence associated with the text vector determined to be the most relevant can be output as a caption for the image.
    Type: Grant
    Filed: August 28, 2015
    Date of Patent: December 5, 2017
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Jianfeng Gao, Xiaodong He, Saurabh Gupta, Geoffrey G. Zweig, Forrest Iandola, Li Deng, Hao Fang, Margaret A. Mitchell, John C. Platt, Rupesh Kumar Srivastava
  • Publication number: 20170069310
    Abstract: A system is provided that trains a spoken language understanding (SLU) classifier. A corpus of user utterances is received. For each of the user utterances in the corpus, the user utterance is semantically parsed, and the result of this semantic parsing is represented as a rooted semantic parse graph. The parse graphs representing all of the user utterances in the corpus are then combined into a single corpus graph that represents the semantic parses of the entire corpus. The user utterances in the corpus are then clustered into intent-wise homogeneous groups of user utterances, where this clustering includes finding subgraphs in the corpus graph that represent different groups of user utterances, and each of these different groups has a similar user intent. The intent-wise homogeneous groups of user utterances are then used to train the SLU classifier, and the trained SLU classifier is output.
    Type: Application
    Filed: September 4, 2015
    Publication date: March 9, 2017
    Inventors: Dilek Hakkani-Tur, Yun-Cheng Ju, Geoffrey G. Zweig, Gokhan Tur
  • Publication number: 20170061250
    Abstract: Disclosed herein are technologies directed to discovering semantic similarities between images and text, which can include performing image search using a textual query, performing text search using an image as a query, and/or generating captions for images using a caption generator. A semantic similarity framework can include a caption generator and can be based on a deep multimodal similar model. The deep multimodal similarity model can receive sentences and determine the relevancy of the sentences based on similarity of text vectors generated for one or more sentences to an image vector generated for an image. The text vectors and the image vector can be mapped in a semantic space, and their relevance can be determined based at least in part on the mapping. The sentence associated with the text vector determined to be the most relevant can be output as a caption for the image.
    Type: Application
    Filed: August 28, 2015
    Publication date: March 2, 2017
    Inventors: Jianfeng Gao, Xiaodong He, Saurabh Gupta, Geoffrey G. Zweig, Forrest Iandola, Li Deng, Hao Fang, Margaret A. Mitchell, John C. Platt, Rupesh Kumar Srivastava
  • Patent number: 9519858
    Abstract: A system is described herein which uses a neural network having an input layer that accepts an input vector and a feature vector. The input vector represents at least part of input information, such as, but not limited to, a word or phrase in a sequence of input words. The feature vector provides supplemental information pertaining to the input information. The neural network produces an output vector based on the input vector and the feature vector. In one implementation, the neural network is a recurrent neural network. Also described herein are various applications of the system, including a machine translation application.
    Type: Grant
    Filed: February 10, 2013
    Date of Patent: December 13, 2016
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Geoffrey G. Zweig, Tomas Mikolov
  • Publication number: 20160196820
    Abstract: Domain-specific language understanding models that may be built, tested and improved quickly and efficiently are provided. Methods, systems and devices are provided that enable a developer to build user intent detection models, language entity extraction models, and language entity resolution models quickly and without specialized machine learning knowledge. These models may be built and implemented via single model systems that enable the models to be built in isolation or in an end-to-end pipeline system that enables the models to be built and improved in a simultaneous manner.
    Type: Application
    Filed: April 3, 2015
    Publication date: July 7, 2016
    Applicant: Microsoft Technology Licensing, LLC.
    Inventors: Jason Douglas Williams, Nobal Bikram Niraula, Pradeep Dasigi, Aparna Lakshmiratan, Geoffrey G. Zweig, Andrey Kolobov, Carlos Garcia Jurado Suarez, David Maxwell Chickering
  • Patent number: 9218412
    Abstract: A database having listings rather than long documents is searched using a term frequency-inverse document frequency (Tf/Idf) algorithm.
    Type: Grant
    Filed: May 10, 2007
    Date of Patent: December 22, 2015
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Ye-Yi Wang, Dong Yu, Yun-Cheng Ju, Alejandro Acero, Geoffrey G. Zweig
  • Publication number: 20150363393
    Abstract: A dialog state tracking system. One aspect of the system is the use of multiple utterance decoders and/or multiple spoken language understanding (SLU) engines generating competing results that improve the likelihood that the correct dialog state is available to the system and provide additional features for scoring dialog state hypotheses. An additional aspect is training a SLU engine and a dialog state scorer/ranker DSR engine using different subsets from a single annotated training data set. A further aspect is training multiple SLU/DSR engine pairs from inverted subsets of the annotated training data set. Another aspect is web-style dialog state ranking based on dialog state features using discriminative models with automatically generated feature conjunctions. Yet another aspect is using multiple parameter sets with each ranking engine and averaging the rankings. Each aspect independently improves dialog state tracking accuracy and may be combined in various combinations for greater improvement.
    Type: Application
    Filed: June 12, 2014
    Publication date: December 17, 2015
    Applicant: Microsoft Corporation
    Inventors: Jason D. Williams, Geoffrey G. Zweig
  • Patent number: 9025860
    Abstract: A document that includes a representation of a two-dimensional (2-D) image may be obtained. A selection indicator indicating a selection of at least a portion of the 2-D image may be obtained. A match correspondence may be determined between the selected portion of the 2-D image and a three-dimensional (3-D) image object stored in an object database, the match correspondence based on a web crawler analysis result. A 3-D rendering of the 3-D image object that corresponds to the selected portion of the 2-D image may be initiated.
    Type: Grant
    Filed: August 6, 2012
    Date of Patent: May 5, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Geoffrey G. Zweig, Eric J. Stollnitz, Richard Szeliski, Sudipta Sinha, Johannes Kopf
  • Patent number: 9020806
    Abstract: The subject disclosure is directed towards automated processes for generating sentence completion questions based at least in part on a language model. Using the language model, a sentence is located, and alternates for a focus word (or words) in the sentence are automatically provided. Also described is automated filtering candidate sentences to locate the sentence, filtering the alternates based upon elimination criteria, scoring sentences with the correct word and as modified the alternates, and ranking the alternates. Manual selection may be used along with the automated processes.
    Type: Grant
    Filed: November 30, 2012
    Date of Patent: April 28, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Geoffrey G. Zweig, Christopher J. C. Burges
  • Patent number: 8965765
    Abstract: Described is a technology by which a structured model of repetition is used to determine the words spoken by a user, and/or a corresponding database entry, based in part on a prior utterance. For a repeated utterance, a joint probability analysis is performed on (at least some of) the corresponding word sequences as recognized by one or more recognizers) and associated acoustic data. For example, a generative probabilistic model, or a maximum entropy model may be used in the analysis. The second utterance may be a repetition of the first utterance using the exact words, or another structural transformation thereof relative to the first utterance, such as an extension that adds one or more words, a truncation that removes one or more words, or a whole or partial spelling of one or more words.
    Type: Grant
    Filed: September 19, 2008
    Date of Patent: February 24, 2015
    Assignee: Microsoft Corporation
    Inventors: Geoffrey G. Zweig, Xiao Li, Dan Bohus, Alejandro Acero, Eric J. Horvitz
  • Publication number: 20140229158
    Abstract: A system is described herein which uses a neural network having an input layer that accepts an input vector and a feature vector. The input vector represents at least part of input information, such as, but not limited to, a word or phrase in a sequence of input words. The feature vector provides supplemental information pertaining to the input information. The neural network produces an output vector based on the input vector and the feature vector. In one implementation, the neural network is a recurrent neural network. Also described herein are various applications of the system, including a machine translation application.
    Type: Application
    Filed: February 10, 2013
    Publication date: August 14, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Geoffrey G. Zweig, Tomas Mikolov, Alejandro Acero
  • Publication number: 20140156260
    Abstract: The subject disclosure is directed towards automated processes for generating sentence completion questions based at least in part on a language model. Using the language model, a sentence is located, and alternates for a focus word (or words) in the sentence are automatically provided. Also described is automated filtering candidate sentences to locate the sentence, filtering the alternates based upon elimination criteria, scoring sentences with the correct word and as modified the alternates, and ranking the alternates. Manual selection may be used along with the automated processes.
    Type: Application
    Filed: November 30, 2012
    Publication date: June 5, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Geoffrey G. Zweig, Christopher J.C. Burges
  • Publication number: 20140067368
    Abstract: A document-term matrix may be generated based on a corpus. A term representation matrix may be generated based on modifying a plurality of elements of the document-term matrix based on antonym information included in the corpus. Similarities may be determined based on a plurality of elements of the term representation matrix.
    Type: Application
    Filed: August 29, 2012
    Publication date: March 6, 2014
    Applicant: Microsoft Corporation
    Inventors: Wen-tau Yih, Geoffrey G. Zweig, John C. Platt
  • Publication number: 20140037218
    Abstract: A document that includes a representation of a two-dimensional (2-D) image may be obtained. A selection indicator indicating a selection of at least a portion of the 2-D image may be obtained. A match correspondence may be determined between the selected portion of the 2-D image and a three-dimensional (3-D) image object stored in an object database, the match correspondence based on a web crawler analysis result. A 3-D rendering of the 3-D image object that corresponds to the selected portion of the 2-D image may be initiated.
    Type: Application
    Filed: August 6, 2012
    Publication date: February 6, 2014
    Applicant: MICROSOFT CORPORATION
    Inventors: Geoffrey G. Zweig, Eric J. Stollnitz, Richard Szeliski, Sudipta Sinha, Johannes Kopf
  • Publication number: 20110224982
    Abstract: Described is a technology in which information retrieval (IR) techniques are used in a speech recognition (ASR) system. Acoustic units (e.g., phones, syllables, multi-phone units, words and/or phrases) are decoded, and features found from those acoustic units. The features are then used with IR techniques (e.g., TF-IDF based retrieval) to obtain a target output (a word or words).
    Type: Application
    Filed: March 12, 2010
    Publication date: September 15, 2011
    Applicant: c/o Microsoft Corporation
    Inventors: Alejandro Acero, James Garnet Droppo, III, Xiaoqiang Xiao, Geoffrey G. Zweig
  • Publication number: 20100076752
    Abstract: The described implementations relate to automated data cleanup. One system includes a language model generated from language model seed text and a dictionary of possible data substitutions. This system also includes a transducer configured to cleanse a corpus utilizing the language model and the dictionary.
    Type: Application
    Filed: September 17, 2009
    Publication date: March 25, 2010
    Inventors: Geoffrey G. Zweig, Yun-Cheng Ju
  • Publication number: 20100076765
    Abstract: Described is a technology by which a structured model of repetition is used to determine the words spoken by a user, and/or a corresponding database entry, based in part on a prior utterance. For a repeated utterance, a joint probability analysis is performed on (at least some of) the corresponding word sequences as recognized by one or more recognizers) and associated acoustic data. For example, a generative probabilistic model, or a maximum entropy model may be used in the analysis. The second utterance may be a repetition of the first utterance using the exact words, or another structural transformation thereof relative to the first utterance, such as an extension that adds one or more words, a truncation that removes one or more words, or a whole or partial spelling of one or more words.
    Type: Application
    Filed: September 19, 2008
    Publication date: March 25, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Geoffrey G. Zweig, Xiao Li, Dan Bohus, Alejandro Acero, Eric J. Horvitz