Patents Assigned to Word Data Corp.
-
Publication number: 20080183759Abstract: Disclosed are a method, machine-readable code, and a database for use in identifying, among a group of patent practitioners, one or more practitioners having expertise related to a given invention or technology. In the method, a search query related to the given invention or technology is used to identify one or more texts of patent abstracts or claims or patent class definitions having high term matches with the user-input query. The identified text(s) are linked to patent-class tags associated with the texts, and the identified tags are linked to one or more members of a group of patent practitioners who wrote and/or prosecuted patents having the patent-class assignments.Type: ApplicationFiled: January 28, 2008Publication date: July 31, 2008Applicant: WORD DATA CORPInventor: Peter J. Dehlinger
-
Patent number: 7386442Abstract: A computer method, system and code, for representing a natural-language document in a vector form suitable for text manipulation operations are disclosed. The method involves determining (a) for each of a plurality of terms selected from one of (i) non-generic words in the document, (ii) proximately arranged word groups in the document, and (iii) a combination of (i) and (ii), a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term.Type: GrantFiled: July 1, 2003Date of Patent: June 10, 2008Assignee: Word Data Corp.Inventors: Peter J. Dehlinger, Shao Chin
-
Patent number: 7181451Abstract: Disclosed is an automated system, machine-readable storage medium embodying computer-executable code, and method for generating descriptive words and optionally, multi-word groups derived from a digitally encoded, natural-language input text that describes a concept, invention, or event in a selected field. The system includes (a) an electronic digital computer, (b) a database of words and optionally, word-groups derived from a plurality of texts, and (c) machine-readable storage medium embodying computer-executable code for accessing the database. The database provides, or can be used to calculate, a selectivity value for each of the words and optionally, word groups contained in or derived from the input text. Words and optionally, word groups having an above-threshold selectivity value are selected as descriptive terms from the input text.Type: GrantFiled: September 30, 2002Date of Patent: February 20, 2007Assignee: Word Data Corp.Inventors: Peter J. Dehlinger, Shao Chin
-
Patent number: 7024408Abstract: Disclosed are a computer-readable code, system and method for classifying a target document in the form of a digitally encoded natural-language text as belonging to one or more of two or more different classes. For each of a plurality of non-generic words and/or words groups characterizing the target document, there is determined a selectivity value calculated as the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively, and the document is represented as a vector of terms, where the coefficient assigned to each term is a function of the selectivity value determined for that term. There is then determined, for each of the plurality of sample texts having associated classification identifiers, a match score related to the number of descriptive terms present in or derived from that text that match those in the target text.Type: GrantFiled: July 1, 2003Date of Patent: April 4, 2006Assignee: Word Data Corp.Inventors: Peter J. Dehlinger, Shao Chin
-
Patent number: 7016895Abstract: Disclosed are a computer-readable code, system and method for classifying a target document in the form of a digitally encoded natural-language text as belonging to one or more of two or more different classes. Each of a plurality of non-generic words and optionally, words groups characterizing the target document is selected as a descriptive term if the term has an above-threshold selectivity value in at least one library of texts in a field, where the selectivity value of a term is a measure of the field-specificity of that term. There is then determined, for each of the plurality of sample texts having associated classification identifiers, a match score related to the number of descriptive terms present in or derived from that text that match those in the target text. From the selected matched texts, and the associated classification identifiers, a classification determination of the target document is made.Type: GrantFiled: February 25, 2003Date of Patent: March 21, 2006Assignee: Word Data Corp.Inventors: Peter J. Dehlinger, Shao Chin
-
Patent number: 7003516Abstract: A computer method for representing a natural-language document in a vector form suitable for text manipulation operations is disclosed. The method involves determining (a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups in the document, a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term, and optionally related to the inverse document frequency of that word in one or more libraries of texts. Also disclosed are a computer-readable code for carrying out the method, a computer system that employs the code, and a vector produced by the method.Type: GrantFiled: May 15, 2003Date of Patent: February 21, 2006Assignee: Word Data Corp.Inventors: Peter J. Dehlinger, Shao Chin
-
Publication number: 20040064304Abstract: A computer method for representing a natural-language document in a vector form suitable for text manipulation operations is disclosed. The method involves determining (a) for each of a plurality of terms composed of non-generic words and, optionally, proximately arranged word groups in the document, a selectivity value of the term related to the frequency of occurrence of that term in a library of texts in one field, relative to the frequency of occurrence of the same term in one or more other libraries of texts in one or more other fields, respectively. The document is represented as a vector of terms, where the coefficient assigned to each term includes a function of the selectivity value determined for that term, and optionally related to the inverse document frequency of that word in one or more libraries of texts. Also disclosed are a computer-readable code for carrying out the method, a computer system that employs the code, and a vector produced by the method.Type: ApplicationFiled: May 15, 2003Publication date: April 1, 2004Applicant: WORD DATA CORPInventors: Peter J. Dehlinger, Shao Chin