Patents by Inventor Ronald M. Kaplan

Ronald M. Kaplan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

System and method for distributing multilingual documents

Patent number: 7925495

Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.

Type: Grant

Filed: February 11, 2009

Date of Patent: April 12, 2011

Assignee: Xerox Corporation

Inventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
Systems and methods for using and constructing user-interest sensitive indicators of search results

Patent number: 7890500

Abstract: Techniques are provided to construct and use user-interest sensitive indicators of search results. A set of documents is determined based on one or more search terms. Passages within each selected document are identified based on the search terms. Condensation transformations applied to the passages to preferentially retain elements of the passage based on the search terms and user interest information. The resultant indicator is provides a user-interest sensitive signal of the meaning of the passage.

Type: Grant

Filed: June 11, 2007

Date of Patent: February 15, 2011

Assignee: Palo Alto Research Center Incorporated

Inventors: Daniel G. Bobrow, Ronald M. Kaplan
Systems and methods for user-interest sensitive condensation

Patent number: 7801723

Abstract: Techniques are presented to determine user-interest sensitive condensations of a passage. One or more passages are selected and user interest information, condensation transformations and optional meaning distortion constraints are identified. The foci of user interest within the selected passages are determined based the similarity of the elements in the selected passages to elements in the user interest information. The condensation transformations are applied to the selected passages to preferentially retain user foci while eliding less salient information. The resultant condensate provides signals the user-interest sensitive meaning of the passage. Meaning distortions constraints are optionally applied in conjunction with the condensation transformations or in creating the condensation transformations to reduce the likelihood of distorting the meaning of the passage.

Type: Grant

Filed: November 30, 2004

Date of Patent: September 21, 2010

Assignee: Palo Alto Research Center Incorporated

Inventors: Ronald M. Kaplan, Richard Crouch, Daniel G. Bobrow
SYSTEM AND METHOD FOR DISTRIBUTING MULTILINGUAL DOCUMENTS

Publication number: 20090171653

Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.

Type: Application

Filed: February 11, 2009

Publication date: July 2, 2009

Applicant: XEROX CORPORATION

Inventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
System and method for distributing multilingual documents

Patent number: 7493250

Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.

Type: Grant

Filed: December 18, 2000

Date of Patent: February 17, 2009

Assignee: Xerox Corporation

Inventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
Systems and methods for using and constructing user-interest sensitive indicators of search results

Patent number: 7401077

Abstract: Techniques are provided to construct and use user-interest sensitive indicators of search results. A set of documents is determined based on one or more search terms. Passages within each selected document are identified based on the search terms. Condensation transformations applied to the passages to preferentially retain elements of the passage based on the search terms and user interest information. The resultant indicator is provides a user-interest sensitive signal of the meaning of the passage.

Type: Grant

Filed: December 21, 2004

Date of Patent: July 15, 2008

Assignee: Palo Alto Research Center Incorporated

Inventors: Daniel G. Bobrow, Ronald M. Kaplan
System and method for distributing multilingual documents

Publication number: 20020077805

Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.

Type: Application

Filed: December 18, 2000

Publication date: June 20, 2002

Inventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
Word/number and number/word mapping

Patent number: 6233580

Abstract: A compactly stored word list that includes a directed graph data structure is used for word to number (W/N) and number to word (N/W) mapping. Each word accepted by the data structure is mapped to a unique corresponding number within a dense set of numbers ranging from zero to one less than the total number of acceptable words. Some common suffixes are collapsed into shared branches, which is possible because the numbers are not stored within the word list. In addition, some branches of the data structure can be skipped during mapping because of information associated with branch points. That information permits the mapping scan to continue with a next branch or with an alternative branch. That information also indicates the number of suffix endings in the next branch; this number is used to keep a count of the word endings during word to number mapping; it is also used both to determine whether to continue with the next branch and also to reduce the number being mapped during number to word mapping.

Type: Grant

Filed: May 18, 1998

Date of Patent: May 15, 2001

Assignee: Xerox Corporation

Inventors: Ronald M. Kaplan, Martin Kay
Method for creating a disjunctive edge graph from subtrees during unification

Patent number: 6064953

Abstract: A method of unifying edge data structures using a processor. The method begins with creation of a first edge data structure and unifying it with a second edge data structure. If during unification any of the contexted lazy copy links associated with the second graph data structure are activated, then second graph data structure is expanded. Expansion involves first selecting a contexted lazy copy link from among the contexted lazy copy links associated with the second graph data, selecting a selected attribute from the subtree feature structure pointed to by the selected contexted lazy copy link, making a copy of the selected attribute and storing the copy of the selected attribute in the second graph data structure as a second attribute, and adding a contexted lazy copy link from the second attribute to the selected attribute value.

Type: Grant

Filed: June 18, 1998

Date of Patent: May 16, 2000

Assignee: Xerox Corporation

Inventors: John T. Maxwell, III, Ronald M. Kaplan
Method of conjoining clauses during unification using opaque clauses

Patent number: 5903860

Abstract: A method of using a processor to conjoin a first clause and a second clause as part of a unification of a first graph. If the first clause is not associated with the first graph, then a third clause is created that is opaque and has a pointer to the first clause. Afterward, the third clause is conjoined with the second clause.

Type: Grant

Filed: June 21, 1996

Date of Patent: May 11, 1999

Assignee: Xerox Corporation

Inventors: John T. Maxwell, III, Ronald M. Kaplan
Method of lazy contexted copying during unification

Patent number: 5819210

Abstract: A method of unifying edge data structures using a processor. The method begins with creation of a first edge data structure and unifying it with a second edge data structure. If during unification any of the contexted lazy copy links associated with the second graph data structure are activated, then second graph data structure is expanded. Expansion involves first selecting a contexted lazy copy link from among the contexted lazy copy links associated with the second graph data, selecting a selected attribute from the subtree feature structure pointed to by the selected contexted lazy copy link, making a copy of the selected attribute and storing the copy of the selected attribute in the second graph data structure as a second attribute, and adding a contexted lazy copy link from the second attribute to the selected attribute value.

Type: Grant

Filed: June 21, 1996

Date of Patent: October 6, 1998

Assignee: Xerox Corporation

Inventors: John T. Maxwell, III, Ronald M. Kaplan
Compact encoding of multi-lingual translation dictionaries

Patent number: 5787386

Abstract: A computerized multilingual translation dictionary includes a set of word and phrases for each of the languages it contains, plus a mapping that indicates for each word or phrase in one language what the corresponding translations in the other languages are. The set of words and phrases for each language are divided up among corresponding concept groups based on an abstract pivot language. The words and phrases are encoded as token numbers assigned by a word-number mapper laid out in sequence that can be searched fairly rapidly with a simple linear scan. The complex associations of words and phrases to particular pivot language senses are represented by including a list of pivot-language sense numbers with each word or phrase. The preferred coding of these sense numbers is by means of a bit vector for each word, where each bit corresponds to a particular pivot element in the abstract language, and the bit is ON if the given word is a translation of that pivot element.

Type: Grant

Filed: June 3, 1996

Date of Patent: July 28, 1998

Assignee: Xerox Corporation

Inventors: Ronald M. Kaplan, Atty T. Mullins
Word/number and number/word mapping

Patent number: 5754847

Abstract: A compactly stored word list that includes a directed graph data structure is used for word to number (W/N) and number to word (N/W) mapping. Each word accepted by the data structure is mapped to a unique corresponding number within a dense set of numbers ranging from zero to one less than the total number of acceptable words. Some common suffixes are collapsed into shared branches, which is possible because the numbers are not stored within the word list. In addition, some branches of the data structure can be skipped during mapping because of information associated with branch points. That information permits the mapping scan to continue with a next branch or with an alternative branch. That information also indicates the number of suffix endings in the next branch; this number is used to keep a count of the word endings during word to number mapping; it is also used both to determine whether to continue with the next branch and also to reduce the number being mapped during number to word mapping.

Type: Grant

Filed: October 16, 1991

Date of Patent: May 19, 1998

Assignee: Xerox Corporation

Inventors: Ronald M. Kaplan, Martin Kay
Method and apparatus for supplementing significant portions of a document selected without document image decoding with retrieved information

Patent number: 5748805

Abstract: A method and apparatus for applying morphological image criteria that identify image units in an undecoded document image having significant information content, and for retrieving related data that supplements the document either from elsewhere within the document or a source external to the document. The retrieved data can result from character code recognition or template matching of the identified significant image units, or the retrieved data can result directly from an analysis of the morphological image characteristics of the identified significant image units. A reading machine can allow a user to browse and select documents or segments thereof, and to obtain interactive retrieval of documents and supplemental data.

Type: Grant

Filed: July 11, 1994

Date of Patent: May 5, 1998

Assignee: Xerox Corporation

Inventors: M. Margaret Withgott, William Newman, Steven C. Bagley, Daniel P. Huttenlocher, Ronald M. Kaplan, Todd A. Cass, Per-Kristian Halvorsen, John Seely Brown, Martin Kay
Finite-state encoding system for hyphenation rules

Patent number: 5737621

Abstract: Valid positions for hyphens in input strings are determined by reading in and processing the symbols of the input string through a finite state transducer which has a state-transition data structure determined by a compilation of a set of hyphenation rules. The output of the encoding system can include a hyphenated string, or can accept a hyphenated string and output an indication of whether the input hyphenation is proper according to the set of hyphenation rules.

Type: Grant

Filed: June 6, 1995

Date of Patent: April 7, 1998

Assignee: Xerox Corporation

Inventors: Ronald M. Kaplan, Lauri J. Karttunen
Method and apparatus for tokenizing text

Patent number: 5721939

Abstract: An efficient method and apparatus for tokenizing natural language text minimizes required data storage and produces guaranteed incremental output. Id (text) is composed with a tokenizer to create a finite state machine representing tokenization paths. The tokenizer itself is in the form of a finite state transducer. The process is carried out in a breadth-first manner so that all possibilities are explored at each character position before progressing. Output is produced incrementally and occurs only when all paths collapse into one. Output may be delayed until a token boundary is reached. In this manner, the output is guaranteed and will not be retracted unless the text is globally ill-formed. Each time output is produced, storage space is freed for subsequent text processing.

Type: Grant

Filed: August 3, 1995

Date of Patent: February 24, 1998

Assignee: Xerox Corporation

Inventor: Ronald M. Kaplan
Finite-state transduction of related word forms for text indexing and retrieval

Patent number: 5625554

Abstract: The present invention solves a number of problems in using stems (canonical indicators of word meanings) in full-text retrieval of natural language documents, and thus permits recall to be improved without sacrificing precision. It uses various arrangements of finite-state transducers to accurately encode a number of desirable ways of mapping back and forth between words and stems, taking into account both systematic aspects of a language's morphological rule system and also the word-by-word irregularities that also occur. The techniques described apply generally across the languages of the world and are not just limited to simple suffixing languages like English. Although the resulting transducers can have many states and transitions or arcs, they can be compacted by finite-state compression algorithms so that they can be used effectively in resource-limited applications.

Type: Grant

Filed: July 20, 1992

Date of Patent: April 29, 1997

Assignee: Xerox Corporation

Inventors: Douglass R. Cutting, Per-Kristian G. Halvorsen, Ronald M. Kaplan, Lauri Karttunen, Martin Kay, Jan O. Pedersen
Stored string data with element data units and pointer data units in distinct subranges of values

Patent number: 5613145

Abstract: An FSM data structure is encoded by generating a transition unit of data corresponding to each transition which leads ultimately to a final state of the FSM. Information about the states is included in the transition units, so that the encoded data structure can be written without state units of data. The incoming transition units to a final state each contain an indication of finality. The incoming transition units to a state which has no outgoing transition units each contain a branch ending indication. The outgoing transition units of each state are ordered into a comparison sequence for comparison with a received element, and all but the last outgoing transition unit contain an alternative indication of a subsequent alternative outgoing transition.

Type: Grant

Filed: May 25, 1995

Date of Patent: March 18, 1997

Assignee: Xerox Corporation

Inventors: Ronald M. Kaplan, Martin Kay
Finite-state transduction of related word forms for text indexing and retrieval

Patent number: 5594641

Abstract: The present invention solves a number of problems in using stems (canonical indicators of word meanings) in full-text retrieval of natural language documents, and thus permits recall to be improved without sacrificing precision. It uses various arrangements of finite-state transducers to accurately encode a number of desirable ways of mapping back and forth between words and stems, taking into account both systematic aspects of a language's morphological rule system and also the word-by-word irregularities that also occur. The techniques described apply generally across the languages of the world and are not just limited to simple suffixing languages like English. Although the resulting transducers can have many states and transitions or arcs, they can be compacted by finite-state compression algorithms so that they can be used effectively in resource-limited applications.

Type: Grant

Filed: June 8, 1994

Date of Patent: January 14, 1997

Assignee: Xerox Corporation

Inventors: Ronald M. Kaplan, Lauri Karttunen
Stored string data with next and alternative information a processor can use to reach next and alternative locations in memory

Patent number: 5581780

Abstract: An FSM data structure is encoded by generating a transition unit of data corresponding to each transition which leads ultimately to a final state of the FSM. Information about the states is included in the transition units, so that the encoded data structure can be written without state units of data. The incoming transition units to a final state each contain an indication of finality. The incoming transition units to a state which has no outgoing transition units each contain a branch ending indication. The outgoing transition units of each state are ordered into a comparison sequence for comparison with a received element, and all but the last outgoing transition unit contain an alternative indication of a subsequent alternative outgoing transition.

Type: Grant

Filed: May 25, 1995

Date of Patent: December 3, 1996

Assignee: Xerox Corporation

Inventors: Ronald M. Kaplan, Martin Kay, John Maxwell

1 2 next