Patents by Inventor Sonja S. Knoll

Sonja S. Knoll has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7634398
    Abstract: A method is provided for forming a reconstructed parse structure for an initial parse structure. Under the method, at least one rule used to form the initial parse structure and at least one additional rule are executed to form the reconstructed parse structure. In many embodiments, the reconstructed parse structure differs from the initial parse structure in that a node is located in a different position in the reconstructed parse structure than in the initial parse structure.
    Type: Grant
    Filed: May 16, 2002
    Date of Patent: December 15, 2009
    Assignee: Microsoft Corporation
    Inventors: Sonja S. Knoll, David N. Weise, David Parkinson
  • Patent number: 7480667
    Abstract: A computer implemented information retrieval system is provided. The system includes a user input configured to receive a user query relative to the corpus. A machine learning classifier is trained with a first set of training data comprising anchor text relative to at least some of the documents in the corpus. A processing unit is adapted to interact with the classifier to obtain search results relative to the query using the machine learning classifier. In some aspects, the classifier is also trained with a second set of training data. A method of integrating a new document into a corpus of documents is also provided. A method of training a machine learning classifier for retrieving documents from a corpus using two distinct types of training data is also provided.
    Type: Grant
    Filed: December 24, 2004
    Date of Patent: January 20, 2009
    Assignee: Microsoft Corporation
    Inventors: Chen Harr, Adwait Ratnaparkhi, Sonja S. Knoll, Hsiao-Wuen Hon
  • Patent number: 7464026
    Abstract: The present invention is a system and method for performing semantic analysis that interprets a linguistic structure output by a natural language linguistic analysis system. The semantic analysis system converts the linguistic output by the natural language linguistic analysis system into a data structure model referred to as a semantic discourse representation structure (SemDRS).
    Type: Grant
    Filed: December 19, 2006
    Date of Patent: December 9, 2008
    Assignee: Microsoft Corporation
    Inventors: Michael V. Calcagno, Par Jonas Barklund, Lei Zhao, Saliha Azzam, Sonja S. Knoll, Su Chang
  • Patent number: 7315810
    Abstract: The present invention is a named entity (NE) interface to a linguistic analysis layer. The interface exposes each input sentence to the NE recognizers of all applications and returns all recognized NEs. Thus, the present invention can accommodate NEs which dynamically change in the applications, because each input string will be handed to the applications for NE recognition. The present invention also includes a data structure which is a normalized form of recognized NEs.
    Type: Grant
    Filed: January 7, 2002
    Date of Patent: January 1, 2008
    Assignee: Microsoft Corporation
    Inventors: Saliha Azzam, Michael V. Calcagno, David N. Weise, Kevin R. Powell, Sonja S. Knoll, Pär Jonas Barklund, Evelyne Viegas
  • Patent number: 7269547
    Abstract: The present invention is a segmenter used in a natural language processing system. The segmenter segments a textual input string into tokens for further natural language processing. In accordance with one feature of the invention, the segmenter includes a tokeinzer engine that proposes segmentations and submits them to a linguistic knowledge component for validation. In accordance with another feature of the invention, the segmentation system includes language specific data that contains a precedence hierarchy for punctuation. If proposed tokens in the input string contain punctuation, they can illustratively be broken into subtokens based on the precedence hierarchy.
    Type: Grant
    Filed: July 15, 2005
    Date of Patent: September 11, 2007
    Assignee: Microsoft Corporation
    Inventors: Joseph E. Pentheroudakis, David G. Bradlee, Sonja S. Knoll
  • Patent number: 7177799
    Abstract: The present invention is a system and method for performing semantic analysis that interprets a linguistic structure output by a natural language linguistic analysis system. The semantic analysis system converts the linguistic output by the natural language linguistic analysis system into a data structure model referred to as a semantic discourse representation structure (SemDRS).
    Type: Grant
    Filed: January 14, 2002
    Date of Patent: February 13, 2007
    Assignee: Microsoft Corporation
    Inventors: Michael V. Calcagno, Pär Jonas Barklund, Lei Zhao, Saliha Azzam, Sonja S. Knoll, Su Chang
  • Patent number: 7092871
    Abstract: The present invention is a segmenter used in a natural language processing system. The segmenter segments a textual input string into tokens for further natural language processing. In accordance with one feature of the invention, the segmenter includes a tokenizer engine that proposes segmentations and submits them to a linguistic knowledge component for validation. In accordance with another feature of the invention, the segmentation system includes language-specific data that contains a precedence hierarchy for punctuation. If proposed tokens in the input string contain punctuation, they can illustratively be broken into subtokens based on the precedence hierarchy.
    Type: Grant
    Filed: March 30, 2001
    Date of Patent: August 15, 2006
    Assignee: Microsoft Corporation
    Inventors: Joseph E. Pentheroudakis, David G. Bradlee, Sonja S. Knoll
  • Publication number: 20030216904
    Abstract: A method is provided for forming a reconstructed parse structure for an initial parse structure. Under the method, at least one rule used to form the initial parse structure and at least one additional rule are executed to form the reconstructed parse structure. In many embodiments, the reconstructed parse structure differs from the initial parse structure in that a node is located in a different position in the reconstructed parse structure than in the initial parse structure.
    Type: Application
    Filed: May 16, 2002
    Publication date: November 20, 2003
    Inventors: Sonja S. Knoll, David N. Weise, David Parkinson
  • Publication number: 20030176999
    Abstract: The present invention is a system and method for performing semantic analysis that interprets a linguistic structure output by a natural language linguistic analysis system. The semantic analysis system converts the linguistic output by the natural language linguistic analysis system into a data structure model referred to as a semantic discourse representation structure (SemDRS).
    Type: Application
    Filed: January 14, 2002
    Publication date: September 18, 2003
    Inventors: Michael V. Calcagno, Par Jonas Barklund, Lei Zhao, Saliha Azzam, Sonja S. Knoll, Su Chang
  • Publication number: 20030130835
    Abstract: The present invention is a named entity (NE) interface to a linguistic analysis layer. The interface exposes each input sentence to the NE recognizers of all applications and returns all recognized NEs. Thus, the present invention can accommodate NEs which dynamically change in the applications, because each input string will be handed to the applications for NE recognition. The present invention also includes a data structure which is a normalized form of recognized NEs.
    Type: Application
    Filed: January 7, 2002
    Publication date: July 10, 2003
    Inventors: Saliha Azzam, Michael V. Calcagno, David N. Weise, Kevin R. Powell, Sonja S. Knoll, Par Jonas Barklund, Evelyne Viegao
  • Publication number: 20030023425
    Abstract: The present invention is a segmenter used in a natural language processing system. The segmenter segments a textual input string into tokens for further natural language processing. In accordance with one feature of the invention, the segmenter includes a tokenizer engine that proposes segmentations and submits them to a linguistic knowledge component for validation. In accordance with another feature of the invention, the segmentation system includes language-specific data that contains a precedence hierarchy for punctuation. If proposed tokens in the input string contain punctuation, they can illustratively be broken into subtokens based on the precedence hierarchy.
    Type: Application
    Filed: March 30, 2001
    Publication date: January 30, 2003
    Inventors: Joseph E. Pentheroudakis, David G. Bradlee, Sonja S. Knoll