Patents Examined by Jr, Envall
  • Patent number: 5251131
    Abstract: Classification of natural language data wherein the natural language data has an open-ended range of possible values or the data values do not have a relative order. A training database stores training records, wherein each training record includes predictor data fields. Each predictor data field containes a feature, wherein each feature is a natural language term, and a target data field containing a target value representing a classification of the record. Features may also include conjunctions of natural language terms and each feature may also be a member of a category subset of features. The training database stores, for each feature, a probability weight value representing the probability that a record will have the target value contained in the target data field if a feature contained in a corresponding predictor data field occurs in the record.
    Type: Grant
    Filed: July 31, 1991
    Date of Patent: October 5, 1993
    Assignee: Thinking Machines Corporation
    Inventors: Brij M. Masand, Stephen J. Smith