Patents by Inventor Simon H. Corston

Simon H. Corston has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7788087
    Abstract: The present invention provides a system for identifying, extracting, clustering and analyzing sentiment-bearing text. In one embodiment, the invention implements a pipeline capable of accessing raw text and presenting it in a highly usable and intuitive way.
    Type: Grant
    Filed: April 14, 2005
    Date of Patent: August 31, 2010
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, Anthony Aue, Eric K. Ringger, Michael Gamon
  • Patent number: 7788086
    Abstract: The present invention provides a system for identifying, extracting, clustering and analyzing sentiment-bearing text. In one embodiment, the invention implements a pipeline capable of accessing raw text and presenting it in a highly usable and intuitive way.
    Type: Grant
    Filed: April 14, 2005
    Date of Patent: August 31, 2010
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, Anthony Aue, Eric K. Ringger, Michael Gamon
  • Patent number: 7536397
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
    Type: Grant
    Filed: November 2, 2004
    Date of Patent: May 19, 2009
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan, Hisami Suzuki
  • Patent number: 7496500
    Abstract: The present invention relates to systems and methods that determine intent for received data (e.g., email, voice, graphics . . . ) and respond to the data based on the intent. The systems and methods employ various combinations of features based on shallow and deep linguistic analysis (e.g., semantic and syntactic) to yield very high accuracy. The systems and methods analyze and categorize received data to locate data that can include intent. This data can be further refined by extracting features related to the intent. The features can be utilized by a classifier to determine the intent. If the intent warrants a response, the data are further scrutinized and reformulated to generate a description that is indicative of the intent. The reformulation can include representing the features in a logical form, transforming the form and generating a description of the intent that can be presented to a user visually and/or audibly.
    Type: Grant
    Filed: June 15, 2004
    Date of Patent: February 24, 2009
    Assignee: Microsoft Corporation
    Inventors: David R. Reed, Eric K. Ringger, Michael Gamon, Richard G. Campbell, Robert G. Atkinson, Simon H. Corston, Malcolm E. Pearson
  • Patent number: 7398203
    Abstract: A text processor processes text in a message. The text processor generates a plurality of compressed forms of components of the message. The processor performs a linguistic analysis on the body of text to obtain a linguistic output indicative of linguistic components of the body of text. The processor then generates the plurality of compressed forms that can be used to compress the body of text. The plurality of compressed forms are generated based on the linguistic output. The invention can be implemented as a method of generating the compressed forms and as an apparatus.
    Type: Grant
    Filed: April 4, 2006
    Date of Patent: July 8, 2008
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, Sharad Mathur
  • Patent number: 7392278
    Abstract: A system that facilitates performance of a focused search over a collection of sites comprises a subweb that corresponds to a topic and/or user characteristic(s) that are of interest to the user. The subweb includes a plurality of domains and/or paths (e.g. sites) that are related to the topic and/or the user characteristic(s). Each of the sites within the subweb is assigned a weight that indicates relevance of the site to the desirable topic and/or user characteristic(s). A search engine employs the subweb to facilitate focusing a search over a collection of sites. The search engine receives a query, and utilizes the subweb to focus a search over the selection of sites corresponding to the topic and/or user characteristic(s) represented by the subweb. The results from the search are returned to the user based at least in part upon the relevance weights assigned to the sites within the subweb.
    Type: Grant
    Filed: February 13, 2004
    Date of Patent: June 24, 2008
    Assignee: Microsoft Corporation
    Inventors: Harr Chen, Raman Chandrasekar, Simon H. Corston, Eric D. Brill
  • Patent number: 7299238
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
    Type: Grant
    Filed: November 2, 2004
    Date of Patent: November 20, 2007
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan, Hisami Suzuki
  • Patent number: 7290004
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
    Type: Grant
    Filed: November 2, 2004
    Date of Patent: October 30, 2007
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan, Hisami Suzuki
  • Patent number: 7290005
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
    Type: Grant
    Filed: November 2, 2004
    Date of Patent: October 30, 2007
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan, Hisami Suzuki
  • Patent number: 7287012
    Abstract: The present invention relates to a system and methodology that applies automated learning procedures for determining document relevance and assisting information retrieval activities. A system is provided that facilitates a machine-learned approach to determine document relevance. The system includes a storage component that receives a set of human selected items to be employed as positive test cases of highly relevant documents. A training component trains at least one classifier with the human selected items as positive test cases and one or more other items as negative test cases in order to provide a query-independent model, wherein the other items can be selected by a statistical search, for example. Also, the trained classifier can be employed to aid an individual in identifying and selecting new positive cases or utilized to filter or re-rank results from a statistical-based search.
    Type: Grant
    Filed: January 9, 2004
    Date of Patent: October 23, 2007
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston, Raman Chandrasekar, Harr Chen
  • Patent number: 7269594
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
    Type: Grant
    Filed: October 4, 2004
    Date of Patent: September 11, 2007
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan, Hisami Suzuki
  • Patent number: 7206787
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
    Type: Grant
    Filed: October 4, 2004
    Date of Patent: April 17, 2007
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan, Hisami Suzuki
  • Patent number: 7069207
    Abstract: A text processor processes text in a message. The text processor generates a plurality of compressed forms of components of the message. The processor performs a linguistic analysis on the body of text to obtain a linguistic output indicative of linguistic components of the body of text. The processor then generates the plurality of compressed forms that can be used to compress the body of text. The plurality of compressed forms are generated based on the linguistic output. The invention can be implemented as a method of generating the compressed forms and as an apparatus.
    Type: Grant
    Filed: January 26, 2001
    Date of Patent: June 27, 2006
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, Sharad Mathur
  • Patent number: 6901399
    Abstract: A system filters documents in a document set retrieved from a document store in response to a query. The system obtains a first set of logical forms based on a selected one of the query and the documents in the document set. The system obtains a second set of logical forms based on another of the query and the documents in the document set. The system then uses natural language processing techniques to modify the first logical forms to obtain a modified set of logical forms. The system filters documents in the document set based on a predetermined relationship between the modified set of logical forms and the second set of logical forms.
    Type: Grant
    Filed: June 16, 1998
    Date of Patent: May 31, 2005
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston, William B. Dolan, Lucy H. Vanderwende, Lisa Braden-Harder
  • Patent number: 6901402
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies constituents in the first textual input, having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the constituents identified. The constituents can be eliminated from the first textual input, weighted in the first textual input, or simply annotated in one of a variety of ways.
    Type: Grant
    Filed: June 18, 1999
    Date of Patent: May 31, 2005
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan, Hisami Suzuki
  • Publication number: 20020138248
    Abstract: A text processor processes text in a message. The text processor generates a plurality of compressed forms of components of the message. The processor performs a linguistic analysis on the body of text to obtain a linguistic output indicative of linguistic components of the body of text. The processor then generates the plurality of compressed forms that can be used to compress the body of text. The plurality of compressed forms are generated based on the linguistic output. The invention can be implemented as a method of generating the compressed forms and as an apparatus.
    Type: Application
    Filed: January 26, 2001
    Publication date: September 26, 2002
    Inventors: Simon H. Corston-Oliver, Sharad Mathur
  • Patent number: 6430552
    Abstract: A method is implemented in a computerized system that provides access to a search tool capable of searching at least one stored record. The method includes determining whether a search query is a logical query by comparing each search term to a set of logical operators. If a search term is in the set of logical operators it is removed from the search query to produce a modified search query. The modified search query is then passed to a search engine.
    Type: Grant
    Filed: December 24, 1998
    Date of Patent: August 6, 2002
    Assignee: Microsoft Corporation
    Inventor: Simon H. Corston-Oliver
  • Patent number: 6363374
    Abstract: A method of computerized searching receives parameters of a search query from a user and adds a restriction to the parameters to require that at least two of the search terms of the search query appear in a same sentence in a document. A representation of a set of documents is then searched based on the parameters of the search query and the added restriction. Documents that meet the search parameters and the added restriction are thus identified.
    Type: Grant
    Filed: December 31, 1998
    Date of Patent: March 26, 2002
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, Lucretia H. Vanderwende, William B. Dolan
  • Patent number: 6295529
    Abstract: A system is utilized for determining a relationship between first and second textual inputs. The system identifies clauses in the first textual input having predetermined characteristics indicative of usefulness in determining the relationship. The relationship is then determined based on the clauses identified. The clauses can be eliminated from the first textual input, weighted in the first textual input, or simply annotated.
    Type: Grant
    Filed: December 24, 1998
    Date of Patent: September 25, 2001
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, William B. Dolan
  • Patent number: 5933822
    Abstract: Apparatus and accompanying methods for an information retrieval system that utilizes natural language processing to process results retrieved by, for example, an information retrieval engine such as a conventional statistical-based search engine, in order to improve overall precision. Specifically, such a search ultimately yields a set of retrieved documents. Each such document is then subjected to natural language processing to produce a set of logical forms. Each such logical form encodes, in a word-relation-word manner, semantic relationships, particularly argument and adjunct structure, between words in a phrase. A user-supplied query is analyzed in the same manner to yield a set of corresponding logical forms therefor. Documents are ranked as a predefined function of the logical forms from the documents and the query.
    Type: Grant
    Filed: July 22, 1997
    Date of Patent: August 3, 1999
    Assignee: Microsoft Corporation
    Inventors: Lisa Braden-Harder, Simon H. Corston, William B. Dolan, Lucy H. Vanderwende