Patents Represented by Attorney, Agent or Law Firm T. Franek
  • Patent number: 6694055
    Abstract: A word segmentation method to identify proper names in input text includes locating a sequence of single-characters in the input text not forming part of a multiple-character word. The method further includes comparing the sequence of single-characters to a lexical knowledge base to identify if a first portion of the sequence corresponds to stored identifiable portions of a proper name, and comparing the sequence of single-characters to the lexical knowledge base to identify if a second portion of the sequence proximate the first portion includes characters known to comprise a second portion of a proper name. Instructions can be provided on a computer readable medium to implement the method.
    Type: Grant
    Filed: July 15, 1998
    Date of Patent: February 17, 2004
    Assignee: Microsoft Corporation
    Inventor: Andi Wu