Patents by Inventor Woojin Paik

Woojin Paik has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Information extraction system and method using concept-relation-concept (CRC) triples

Patent number: 6263335

Abstract: An information extraction system that allows users to ask questions about documents in a database, and responds to queries by returning possibly relevant information which is extracted from the documents. The system is domain-independent, and automatically builds its own subject knowledge base. It can be applied to any new corpus of text with quick results, and no requirement for lengthy manual input. For this reason, it is also a dynamic system which can acquire new knowledge and add it to the knowledge base immediately by automatically identifying new names, events, or concepts.

Type: Grant

Filed: March 29, 1999

Date of Patent: July 17, 2001

Assignee: Textwise LLC

Inventors: Woojin Paik, Elizabeth D. Liddy, Jennifer Heverin Liddy, Ian Harcourt Niles, Eileen E. Allen
Information extraction system and method using concept relation concept (CRC) triples

Patent number: 6076088

Abstract: An information extraction system that allows users to ask questions about documents in a database, and responds to queries by returning possibly relevant information which is extracted from the documents. The system is domain-independent, and automatically builds its own subject knowledge base. It can be applied to any new corpus of text with quick results, and no requirement for lengthy manual input. For this reason, it is also a dynamic system which can acquire new knowledge and add it to the knowledge base immediately by automatically identifying new names, events, or concepts.

Type: Grant

Filed: February 6, 1997

Date of Patent: June 13, 2000

Inventors: Woojin Paik, Elizabeth D. Liddy, Jennifer Heverin Liddy, Ian Harcourt Niles, Eileen E. Allen
User interface and other enhancements for natural language information retrieval system and method

Patent number: 6026388

Abstract: Techniques for generating sophisticated representations of the contents of both queries and documents in a retrieval system by using natural language processing (NLP) techniques to represent, index, and retrieve texts at the multiple levels (e.g., the morphological, lexical, syntactic, semantic, discourse, and pragmatic levels) at which humans construe meaning in writing. The user enters a query and the system processes the query to generate an alternative representation, which includes conceptual-level abstraction and representations based on complex nominals (CNs), proper nouns (PNs), single terms, text structure, and logical make-up of the query, including mandatory terms. After processing the query, the system displays query information to the user, indicating the system's interpretation and representation of the content of the query. The user is then given an opportunity to provide input, in response to which the system modifies the alternative representation of the query.

Type: Grant

Filed: August 14, 1996

Date of Patent: February 15, 2000

Assignee: Textwise, LLC

Inventors: Elizabeth D. Liddy, Woojin Paik, Mary E. McKenna, Michael L. Weiner, Edmund S. Yu, Theodore G. Diamond, Bhaskaran Balakrishnan, David L. Snyder
Multilingual document retrieval system and method using semantic vector matching

Patent number: 6006221

Abstract: A document retrieval system where a user can enter a query, including a natural language query, in a desired one of a plurality of supported languages, and retrieve documents from a database that includes documents in at least one other language of the plurality of supported languages. The user need not have any knowledge of the other languages. Each document in the database is subjected to a set of processing steps to generate a language-independent conceptual representation of the subject content of the document. This is normally done before the query is entered. The query is also subjected to a (possibly different) set of processing steps to generate a language-independent conceptual representation of the subject content of the query. The documents and queries can also be subjected to additional analysis to provide additional term-based representations, such as the extraction of information-rich terms and phrases (such as proper nouns).

Type: Grant

Filed: August 14, 1996

Date of Patent: December 21, 1999

Assignee: Syracuse University

Inventors: Elizabeth D. Liddy, Woojin Paik, Edmund S. Yu, Ming Li
Natural language information retrieval system and method

Patent number: 5963940

Abstract: Techniques for generating sophisticated representations of the contents of both queries and documents in a retrieval system by using natural language processing (NLP) techniques to represent, index, and retrieve texts at the multiple levels (e.g., the morphological, lexical, syntactic, semantic, discourse, and pragmatic levels) at which humans construe meaning in writing. The user enters a query and the system processes the query to generate an alternative representation, which includes conceptual-level abstraction and representations based on complex nominals (CNs), proper nouns (PNs), single terms, text structure, and logical make-up of the query, including mandatory terms. After processing the query, the system displays query information to the user, indicating the system's interpretation and representation of the content of the query. The user is then given an opportunity to provide input, in response to which the system modifies the alternative representation of the query.

Type: Grant

Filed: August 14, 1996

Date of Patent: October 5, 1999

Assignee: Syracuse University

Inventors: Elizabeth D. Liddy, Woojin Paik, Mary E. McKenna, Ming Li
Natural language processing system for semantic vector representation which accounts for lexical ambiguity

Patent number: 5873056

Abstract: A natural language processing system uses unformatted naturally occurring text and generates a subject vector representation of the text, which may be an entire document or a part thereof such as its title, a paragraph, clause, or a sentence therein. The subject codes which are used are obtained from a lexical database and the subject code(s) for each word in the text is looked up and assigned from the database. The database may be a dictionary or other word resource which has a semantic classification scheme as designators of subject domains. Various meanings or senses of a word may have assigned thereto multiple, different subject codes and psycholinguistically justified sense meaning disambiguation is used to select the most appropriate subject field code. Preferably, an ordered set of sentence level heuristics is used which is based on the statistical probability or likelihood of one of the plurality of codes being the most appropriate one of the plurality.

Type: Grant

Filed: October 12, 1993

Date of Patent: February 16, 1999

Assignee: The Syracuse University

Inventors: Elizabeth D. Liddy, Woojin Paik, Edmund Szu-li Yu

Information extraction system and method using concept-relation-concept (CRC) triples

Information extraction system and method using concept relation concept (CRC) triples

User interface and other enhancements for natural language information retrieval system and method

Multilingual document retrieval system and method using semantic vector matching

Natural language information retrieval system and method

Natural language processing system for semantic vector representation which accounts for lexical ambiguity