Patents Examined by Joseph Thomas
  • Patent number: 6076051
    Abstract: The present invention is directed to performing information retrieval utilizing semantic representation of text. In a preferred embodiment, a tokenizer generates from an input string information retrieval tokens that characterize the semantic relationship expressed in the input string. The tokenizer first creates from the input string a primary logical form characterizing a semantic relationship between selected words in the input string. The tokenizer then identifies hypernyms that each have an "is a" relationship with one of the selected words in the input string. The tokenizer then constructs from the primary logical form one or more alternative logical forms. The tokenizer constructs each alternative logical form by, for each of one or more of the selected words in the input string, replacing the selected word in the primary logical form with an identified hypernym of the selected word. Finally, the tokenizer generates tokens representing both the primary logical form and the alternative logical forms.
    Type: Grant
    Filed: March 7, 1997
    Date of Patent: June 13, 2000
    Assignee: Microsoft Corporation
    Inventors: John J. Messerly, George E. Heidorn, Stephen D. Richardson, William B. Dolan, Karen Jensen
  • Patent number: 6073146
    Abstract: Phonetic Chinese (Pinyin and BPMF) is entered into a computer system and accurately converted into the Hanzi form. The system has a novel keyboard with diacritic keys (and corresponding ASCII coding) that permit the user to annotate each entered phonetic text syllable with a diacritic that indicates the tone of the syllable. A process executing on the system determines that a syllable has been entered when a diacritic (or delimiter) key is struck. An entered phonetic syllable is then compared to a list of acceptable phonetic syllables and abbreviations. If the entered syllable is on the list, the correctly spelled and accented syllable is stored in memory and displayed on a phonetic portion of a graphical display. The process continues for succeeding syllables until a delimiter is entered.
    Type: Grant
    Filed: August 29, 1997
    Date of Patent: June 6, 2000
    Assignee: International Business Machines Corporation
    Inventor: Chengjun Julian Chen
  • Patent number: 6064952
    Abstract: An information abstracting method and apparatus for extracting and displaying keywords as an information abstract. Given a large number of character string data sets divided into prescribed units, the extracted keywords are significant and effective in describing a topic common to the plurality of units. The information abstracting apparatus comprises an input section for accepting an input of character string data divided into prescribed units, with each individual character represented by a character code, and an output section for displaying the result of information abstracting. Keywords contained in each of the prescribed units are extracted by a keyword extracting section from the character string input data from the input section. A score is calculated for each keyword by a score calculating section, so that a higher score is given to a keyword extracted from a larger number of units.
    Type: Grant
    Filed: November 17, 1995
    Date of Patent: May 16, 2000
    Assignee: Matsushita Electric Industrial Co., Ltd.
    Inventors: Takeshi Imanaka, Mitsuteru Kataoka, Satoshi Matsuura
  • Patent number: 6064951
    Abstract: A query transformation system and method capable of not only solving an ambiguousness of words involved in the transformation of queries from one language to another language, but also executing its processing independently of the processing of an information retrieval system used, so that it can be applied to a variety of information retrieval systems, thereby enabling the information retrieval system used to function as a multilingual information retrieval system.
    Type: Grant
    Filed: January 12, 1998
    Date of Patent: May 16, 2000
    Assignee: Electronic and Telecommunications Research Institute
    Inventors: Dong-In Park, Tae-Wan Kim, Chul-Min Sim, Won Chang, Sung-Kwon Choi, Sang-Hwa Yuh, Young-Soog Chae, Young-Kil Kim, Han-Min Jung
  • Patent number: 6055494
    Abstract: In computerized processing of natural-language medical/clinical data including phrase parsing and regularizing, parameters are referred to whose value can be specified by the user. Thus, a computerized system can be provided with versatility, for the processing of data originating in diverse domains, for example. Further to a parser and a regularizer, the system includes a preprocessor, output filters, and an encoding mechanism.
    Type: Grant
    Filed: October 28, 1996
    Date of Patent: April 25, 2000
    Assignee: The Trustees of Columbia University in the City of New York
    Inventor: Carol Friedman
  • Patent number: 6041293
    Abstract: A document processing apparatus and method extracts, from document data, both a first word, whose translation is to be determined, and a second word that is either the preceding word or the subsequent word of the first word. Then a keyword is extracted based on the frequency of occurrence of the first word and the keyword is translated into a predetermined language by referring to a dictionary in a process that considers a meaning of the first and second words existing together in the document data.
    Type: Grant
    Filed: May 29, 1996
    Date of Patent: March 21, 2000
    Assignee: Canon Kabushiki Kaisha
    Inventors: Shogo Shibata, Minoru Fujita, Yuji Ikeda, Takaya Ueda, Fumiaki Itoh, Makoto Hirota
  • Patent number: 6038539
    Abstract: A computer-implemented scheduling system and method designates start times of a plurality of procedures processed by a plurality of resource devices.
    Type: Grant
    Filed: March 4, 1993
    Date of Patent: March 14, 2000
    Assignee: Fujitsu Limited
    Inventors: Fumihiro Maruyama, Yoriko Minoda, Shuho Sawada, Yuka Takizawa
  • Patent number: 6035269
    Abstract: Critiques are applied to phrase units included in morpho-syntactical information derived from the Japanese text. The critiques include a "trigger" and an "action," and are written in a special-purpose syntax that allows for easy specification of the error class and the rewrite generation. If a critique's trigger condition is satisfied, the associated action is carried out in order to generate a replacement text string. The process of generating replacement text strings employs a morphological graph that reflects possible word formations. In a first pass through the graph, a breadth first search is used to identify intermediate nodes along a path whose morpheme transitions satisfy at least part of the attributes of the text. In a second pass, a depth first search is used to select only those morpheme transitions that completely satisfy the rewrite criteria specified in the critique, while traversing the nodes identified in the breadth first search.
    Type: Grant
    Filed: June 23, 1998
    Date of Patent: March 7, 2000
    Assignee: Microsoft Corporation
    Inventor: Hyun-suk Kim
  • Patent number: 6035267
    Abstract: A user goal extracting unit extracts a user goal from an input statement 10 entered by a user. A system goal determining unit 15 determines a system goal in accordance with the user goal. A goal frame generating unit 17 generates a goal frame based on an action sequence knowledge corresponding to the system goal. An action feasibility judging unit 19 sets arguments for, and judges the feasibility of, an action in the goal frame. If the action is judged to be feasible, the action feasibility judging unit 19 outputs an action command to an external application 20. If the action is not judged to be feasible, the action feasibility judging unit 19 outputs a new system goal to the goal frame generating unit 17. The external application 20 outputs a result of execution of the action.
    Type: Grant
    Filed: March 24, 1998
    Date of Patent: March 7, 2000
    Assignee: Mitsubishi Denki Kabushiki Kaisha
    Inventors: Keisuke Watanabe, Akito Nagai
  • Patent number: 6026368
    Abstract: Prioritized queues of advertising and content data are generated by a queue builder and sent to an on-line queue manager. A computer mediated communications network provides content and subscriber data to the queue builder and receives content segment play lists from the on-line queue manager. An exposure accounting module calculates and stores information about the number of exposures of targeted material received by subscribers and generates billing information. An information warehouse manager is employed to receive data from advertisers' data bases and third party sources as well as from the computer mediated communications network.
    Type: Grant
    Filed: July 17, 1995
    Date of Patent: February 15, 2000
    Assignee: 24/7 Media, Inc.
    Inventors: Yale Robert Brown, Matthew Brown Walker
  • Patent number: 6023670
    Abstract: The language in which a computer document is written is identified. A plurality of words from the document are compared to words in a word list associated with a candidate language. The words in the word list are a selection of the most frequently used words in the candidate language. A count of matches between words in the document and words in the word list for each word in the word list to produce a sample count. The sample count is correlated to a reference count for the candidate language to produce a correlation score for the candidate language. The language of the document is identified based on the correlation score. Generally, there are a plurality of candidate languages. Thus, comparing, accumulating, correlating and identifying processes are practiced for each language. The language of the document is identified as the candidate language having a reference count which generates a highest correlation score.
    Type: Grant
    Filed: December 20, 1996
    Date of Patent: February 8, 2000
    Assignee: International Business Machines Corporation
    Inventors: Michael John Martino, Robert Charles Paulsen, Jr.
  • Patent number: 6014615
    Abstract: Phonetic Chinese (Pinyin and BPMF) is entered into a computer system and accurately converted into the Hanzi form. The system has a novel keyboard with diacritic keys (and corresponding ASCII coding) that permit the user to annotate each entered phonetic text syllable with a diacritic that indicates the tone of the syllable. A process executing on the system determines that a syllable has been entered when a diacritic (or delimiter) key is struck. An entered phonetic syllable is then compared to a list of acceptable phonetic syllables and abbreviations. If the entered syllable is on the list, the correctly spelled and accented syllable is stored in memory and displayed on a phonetic portion of a graphical display. The process continues for succeeding syllables until a delimiter is entered.
    Type: Grant
    Filed: August 29, 1997
    Date of Patent: January 11, 2000
    Assignee: International Business Machines Corporaiton
    Inventor: Chengjun Julian Chen
  • Patent number: 6014616
    Abstract: A method for monitoring the language used by an operating system to communicate with a user via a display device. Two compatible types of operating systems are those that use a WINDOWS 3.1 operating system format and a WINDOWS 95 operating system format. The cursor in the character input area of the display device has a different color depending on the language being used by the operating system. This greatly enhances efficiency when alternately typing information in multiple languages. The method provides a small window that indicates which language is currently being used by the operating system. Contained in this window is a language conversion button that has a language indicating symbol positioned on it. The color of the symbol matches the color of the cursor. When the button is selected using a mouse or a shortcut key, the operating system switches the linguistic characters generated by signals from the keyboard to that of a different language.
    Type: Grant
    Filed: November 13, 1997
    Date of Patent: January 11, 2000
    Assignee: SamSung Electronics Co., Ltd.
    Inventor: Hyun-Don Kim
  • Patent number: 6012035
    Abstract: Effectuation of a health care provision agency cooperative function is established through a communication network linking all the various entities of the cooperative. The entities include the third party payor members, the health providing individuals, clinics, or the like, along with secondary providers including pharmacies and laboratories, health care facilities such as hospitals, and the several entities associated with management of the cooperative and appropriate funds transfer functions. A coordinating interface system maintains data storage of the necessary information, and manages the entity intercommunications in accordance with the basic structure of the active and eligible elements of the agency cooperative.
    Type: Grant
    Filed: October 13, 1997
    Date of Patent: January 4, 2000
    Assignee: Integral Business Services, Inc.
    Inventors: Berkley Irving Freeman, Jr., Edgar William Smith
  • Patent number: 6009382
    Abstract: A language in which a document is written is identified through the use of sets of most frequently used words in each of a plurality of candidate languages. Each set of most frequently used words in a respective set of word tables for a respective candidate language according to letter pairs in each set of most frequently used words. In the preferred embodiment, each word table is an N.times.N bit table, where each bit represents a given letter pair at a particular place in one of the most frequently used words in one of the candidate languages. Words from the document are compared to the most frequently used words stored in the word tables. A count of the number of matches between the words from the document and the words stored in each respective set of word tables is kept for each respective language. The language of the document as the respective candidate language having the greatest number of matches.
    Type: Grant
    Filed: September 30, 1996
    Date of Patent: December 28, 1999
    Assignee: International Business Machines Corporation
    Inventors: Michael John Martino, Robert Charles Paulsen, Jr.
  • Patent number: 6002997
    Abstract: A method for translating sentences in a source language to sentences in a target language. A knowledge base includes a plurality of information patterns in the source language which represents substantially all possible information patterns of the source language. A corresponding plurality of information patterns in the target language are contained in the same knowledge base, and are associated with source language information patterns in accordance with a predetermined determination. To translate a source language sentence into a target language sentence, the source language sentence is analyzed and its constituent parts of speech are determined so that a specific information pattern is identified. The knowledge base is consulted, and the corresponding target language information pattern is identified. The source language words are then inserted into the target language information pattern to form a Linguistic Canonical Form which is translated.
    Type: Grant
    Filed: June 21, 1996
    Date of Patent: December 14, 1999
    Inventor: Julius T. Tou
  • Patent number: 5995918
    Abstract: The present invention is a computer software system that allows the developer of a speech-enabled system to create a grammar and corpus for use in the system. A table interface is used, and phrases in the grammar are entered into cells in the table. The table also includes token data which corresponds to each valid utterance. When the grammar is defined, the computer software system automatically traverses the table to enumerate all possible valid utterances in the grammar. This traversal generates a listing (corpus) of valid utterances and their respective tokens. This listing can then be used to interpret spoken utterances for a speech-enabled system. The computer software system also transcribes the grammar rules found in the table to a format compatible with a variety of supported commercially-available speech recognizers.
    Type: Grant
    Filed: September 17, 1997
    Date of Patent: November 30, 1999
    Assignee: Unisys Corporation
    Inventors: Daythal Lee Kendall, Dennis Lee Wadsworth, Ahmed Tewfik Bouzid, Deborah Anna Dahl, Hua Hua
  • Patent number: 5991711
    Abstract: A language information processing apparatus for translating a model phrase in a first language into a corresponding second language and outputting the translated phrase. The apparatus includes a display that displays the phrase, an input device for inputting a character or a word to be added in the first language to the phrase displayed by the display, a first voice information storage device that stores voice information in the second language corresponding to the phrase, and a second voice information storage device that prepares and stores voice information concerning the character or word in the first language with an intonation in the first language, which character or word is input by the input device.
    Type: Grant
    Filed: February 24, 1997
    Date of Patent: November 23, 1999
    Assignee: Fuji Xerox Co., Ltd.
    Inventors: Kunihiro Seno, Hiromi Furusawa, Nobuki Hagiwara, Kentaro Tsuchiya
  • Patent number: 5991713
    Abstract: A method for compressing text includes steps of parsing words from text in an input file and comparing the parsed words to a predetermined dictionary. The dictionary has a plurality of vocabulary words in it and numbers or tokens corresponding to each vocabulary word. A further step is determining which of the parsed words are not present in the predetermined dictionary and creating at least one supplemental dictionary including the parsed words that are not present in the predetermined dictionary. The predetermined dictionary and the supplemental dictionary are stored together in a compressed file. Also, the parsed words are replaced with numbers or tokens corresponding to the numbers assigned in the predetermined and supplemental dictionary and the numbers or tokens are stored in the compressed file.
    Type: Grant
    Filed: November 26, 1997
    Date of Patent: November 23, 1999
    Assignee: International Business Machines Corp.
    Inventors: Jay Unger, Glen Fuller
  • Patent number: 5991709
    Abstract: A computer system for automatically classifying or declassifying military, intelligence, government, or industrial documents. Inputs to the system are classification or declassification guidelines, which describe the sensitive information, and the document(s) that need to be processed, all of which are in electronic format (e.g., output from word processor or other digital format). A database is created by a software program from the classification guidelines or rules, which is then stored in the computer system. The document(s) to be processed are searched and the database is used to identify classified portions of the documents, using a second software program (driven by the rules for determining classification levels), and the sensitive material is identified and the document(s) is modified to show the proper classification markings.
    Type: Grant
    Filed: June 10, 1997
    Date of Patent: November 23, 1999
    Inventor: Neil Charles Schoen