Patents by Inventor Ronald M. Kaplan
Ronald M. Kaplan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 7925495Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.Type: GrantFiled: February 11, 2009Date of Patent: April 12, 2011Assignee: Xerox CorporationInventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
-
Patent number: 7890500Abstract: Techniques are provided to construct and use user-interest sensitive indicators of search results. A set of documents is determined based on one or more search terms. Passages within each selected document are identified based on the search terms. Condensation transformations applied to the passages to preferentially retain elements of the passage based on the search terms and user interest information. The resultant indicator is provides a user-interest sensitive signal of the meaning of the passage.Type: GrantFiled: June 11, 2007Date of Patent: February 15, 2011Assignee: Palo Alto Research Center IncorporatedInventors: Daniel G. Bobrow, Ronald M. Kaplan
-
Patent number: 7801723Abstract: Techniques are presented to determine user-interest sensitive condensations of a passage. One or more passages are selected and user interest information, condensation transformations and optional meaning distortion constraints are identified. The foci of user interest within the selected passages are determined based the similarity of the elements in the selected passages to elements in the user interest information. The condensation transformations are applied to the selected passages to preferentially retain user foci while eliding less salient information. The resultant condensate provides signals the user-interest sensitive meaning of the passage. Meaning distortions constraints are optionally applied in conjunction with the condensation transformations or in creating the condensation transformations to reduce the likelihood of distorting the meaning of the passage.Type: GrantFiled: November 30, 2004Date of Patent: September 21, 2010Assignee: Palo Alto Research Center IncorporatedInventors: Ronald M. Kaplan, Richard Crouch, Daniel G. Bobrow
-
Publication number: 20090171653Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.Type: ApplicationFiled: February 11, 2009Publication date: July 2, 2009Applicant: XEROX CORPORATIONInventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
-
Patent number: 7493250Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.Type: GrantFiled: December 18, 2000Date of Patent: February 17, 2009Assignee: Xerox CorporationInventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
-
Patent number: 7401077Abstract: Techniques are provided to construct and use user-interest sensitive indicators of search results. A set of documents is determined based on one or more search terms. Passages within each selected document are identified based on the search terms. Condensation transformations applied to the passages to preferentially retain elements of the passage based on the search terms and user interest information. The resultant indicator is provides a user-interest sensitive signal of the meaning of the passage.Type: GrantFiled: December 21, 2004Date of Patent: July 15, 2008Assignee: Palo Alto Research Center IncorporatedInventors: Daniel G. Bobrow, Ronald M. Kaplan
-
Publication number: 20020077805Abstract: A method and apparatus is disclosed for generating and distributing multilingual documents. The multilingual documents are comprised of primary information consisting of human-readable text and secondary information consisting of machine-readable data such that a translation of the text is accomplished by converting the human-readable text into a second language through the use of the decoded machine-readable data. The machine-readable data is comprised of a code that describes a set of editing operations that can be applied to the human-readable text to convert it into at least a second language. In a preferred embodiment, the machine-readable data is embedded in the image using an unobtrusive code on the document such as Xerox DATAGLYPH codes.Type: ApplicationFiled: December 18, 2000Publication date: June 20, 2002Inventors: David L. Hecht, Glen W. Petrie, Ronald M. Kaplan, Colin Luckman
-
Patent number: 6233580Abstract: A compactly stored word list that includes a directed graph data structure is used for word to number (W/N) and number to word (N/W) mapping. Each word accepted by the data structure is mapped to a unique corresponding number within a dense set of numbers ranging from zero to one less than the total number of acceptable words. Some common suffixes are collapsed into shared branches, which is possible because the numbers are not stored within the word list. In addition, some branches of the data structure can be skipped during mapping because of information associated with branch points. That information permits the mapping scan to continue with a next branch or with an alternative branch. That information also indicates the number of suffix endings in the next branch; this number is used to keep a count of the word endings during word to number mapping; it is also used both to determine whether to continue with the next branch and also to reduce the number being mapped during number to word mapping.Type: GrantFiled: May 18, 1998Date of Patent: May 15, 2001Assignee: Xerox CorporationInventors: Ronald M. Kaplan, Martin Kay
-
Patent number: 6064953Abstract: A method of unifying edge data structures using a processor. The method begins with creation of a first edge data structure and unifying it with a second edge data structure. If during unification any of the contexted lazy copy links associated with the second graph data structure are activated, then second graph data structure is expanded. Expansion involves first selecting a contexted lazy copy link from among the contexted lazy copy links associated with the second graph data, selecting a selected attribute from the subtree feature structure pointed to by the selected contexted lazy copy link, making a copy of the selected attribute and storing the copy of the selected attribute in the second graph data structure as a second attribute, and adding a contexted lazy copy link from the second attribute to the selected attribute value.Type: GrantFiled: June 18, 1998Date of Patent: May 16, 2000Assignee: Xerox CorporationInventors: John T. Maxwell, III, Ronald M. Kaplan
-
Patent number: 5903860Abstract: A method of using a processor to conjoin a first clause and a second clause as part of a unification of a first graph. If the first clause is not associated with the first graph, then a third clause is created that is opaque and has a pointer to the first clause. Afterward, the third clause is conjoined with the second clause.Type: GrantFiled: June 21, 1996Date of Patent: May 11, 1999Assignee: Xerox CorporationInventors: John T. Maxwell, III, Ronald M. Kaplan
-
Patent number: 5819210Abstract: A method of unifying edge data structures using a processor. The method begins with creation of a first edge data structure and unifying it with a second edge data structure. If during unification any of the contexted lazy copy links associated with the second graph data structure are activated, then second graph data structure is expanded. Expansion involves first selecting a contexted lazy copy link from among the contexted lazy copy links associated with the second graph data, selecting a selected attribute from the subtree feature structure pointed to by the selected contexted lazy copy link, making a copy of the selected attribute and storing the copy of the selected attribute in the second graph data structure as a second attribute, and adding a contexted lazy copy link from the second attribute to the selected attribute value.Type: GrantFiled: June 21, 1996Date of Patent: October 6, 1998Assignee: Xerox CorporationInventors: John T. Maxwell, III, Ronald M. Kaplan
-
Patent number: 5787386Abstract: A computerized multilingual translation dictionary includes a set of word and phrases for each of the languages it contains, plus a mapping that indicates for each word or phrase in one language what the corresponding translations in the other languages are. The set of words and phrases for each language are divided up among corresponding concept groups based on an abstract pivot language. The words and phrases are encoded as token numbers assigned by a word-number mapper laid out in sequence that can be searched fairly rapidly with a simple linear scan. The complex associations of words and phrases to particular pivot language senses are represented by including a list of pivot-language sense numbers with each word or phrase. The preferred coding of these sense numbers is by means of a bit vector for each word, where each bit corresponds to a particular pivot element in the abstract language, and the bit is ON if the given word is a translation of that pivot element.Type: GrantFiled: June 3, 1996Date of Patent: July 28, 1998Assignee: Xerox CorporationInventors: Ronald M. Kaplan, Atty T. Mullins
-
Patent number: 5754847Abstract: A compactly stored word list that includes a directed graph data structure is used for word to number (W/N) and number to word (N/W) mapping. Each word accepted by the data structure is mapped to a unique corresponding number within a dense set of numbers ranging from zero to one less than the total number of acceptable words. Some common suffixes are collapsed into shared branches, which is possible because the numbers are not stored within the word list. In addition, some branches of the data structure can be skipped during mapping because of information associated with branch points. That information permits the mapping scan to continue with a next branch or with an alternative branch. That information also indicates the number of suffix endings in the next branch; this number is used to keep a count of the word endings during word to number mapping; it is also used both to determine whether to continue with the next branch and also to reduce the number being mapped during number to word mapping.Type: GrantFiled: October 16, 1991Date of Patent: May 19, 1998Assignee: Xerox CorporationInventors: Ronald M. Kaplan, Martin Kay
-
Patent number: 5748805Abstract: A method and apparatus for applying morphological image criteria that identify image units in an undecoded document image having significant information content, and for retrieving related data that supplements the document either from elsewhere within the document or a source external to the document. The retrieved data can result from character code recognition or template matching of the identified significant image units, or the retrieved data can result directly from an analysis of the morphological image characteristics of the identified significant image units. A reading machine can allow a user to browse and select documents or segments thereof, and to obtain interactive retrieval of documents and supplemental data.Type: GrantFiled: July 11, 1994Date of Patent: May 5, 1998Assignee: Xerox CorporationInventors: M. Margaret Withgott, William Newman, Steven C. Bagley, Daniel P. Huttenlocher, Ronald M. Kaplan, Todd A. Cass, Per-Kristian Halvorsen, John Seely Brown, Martin Kay
-
Patent number: 5737621Abstract: Valid positions for hyphens in input strings are determined by reading in and processing the symbols of the input string through a finite state transducer which has a state-transition data structure determined by a compilation of a set of hyphenation rules. The output of the encoding system can include a hyphenated string, or can accept a hyphenated string and output an indication of whether the input hyphenation is proper according to the set of hyphenation rules.Type: GrantFiled: June 6, 1995Date of Patent: April 7, 1998Assignee: Xerox CorporationInventors: Ronald M. Kaplan, Lauri J. Karttunen
-
Patent number: 5721939Abstract: An efficient method and apparatus for tokenizing natural language text minimizes required data storage and produces guaranteed incremental output. Id (text) is composed with a tokenizer to create a finite state machine representing tokenization paths. The tokenizer itself is in the form of a finite state transducer. The process is carried out in a breadth-first manner so that all possibilities are explored at each character position before progressing. Output is produced incrementally and occurs only when all paths collapse into one. Output may be delayed until a token boundary is reached. In this manner, the output is guaranteed and will not be retracted unless the text is globally ill-formed. Each time output is produced, storage space is freed for subsequent text processing.Type: GrantFiled: August 3, 1995Date of Patent: February 24, 1998Assignee: Xerox CorporationInventor: Ronald M. Kaplan
-
Patent number: 5625554Abstract: The present invention solves a number of problems in using stems (canonical indicators of word meanings) in full-text retrieval of natural language documents, and thus permits recall to be improved without sacrificing precision. It uses various arrangements of finite-state transducers to accurately encode a number of desirable ways of mapping back and forth between words and stems, taking into account both systematic aspects of a language's morphological rule system and also the word-by-word irregularities that also occur. The techniques described apply generally across the languages of the world and are not just limited to simple suffixing languages like English. Although the resulting transducers can have many states and transitions or arcs, they can be compacted by finite-state compression algorithms so that they can be used effectively in resource-limited applications.Type: GrantFiled: July 20, 1992Date of Patent: April 29, 1997Assignee: Xerox CorporationInventors: Douglass R. Cutting, Per-Kristian G. Halvorsen, Ronald M. Kaplan, Lauri Karttunen, Martin Kay, Jan O. Pedersen
-
Patent number: 5613145Abstract: An FSM data structure is encoded by generating a transition unit of data corresponding to each transition which leads ultimately to a final state of the FSM. Information about the states is included in the transition units, so that the encoded data structure can be written without state units of data. The incoming transition units to a final state each contain an indication of finality. The incoming transition units to a state which has no outgoing transition units each contain a branch ending indication. The outgoing transition units of each state are ordered into a comparison sequence for comparison with a received element, and all but the last outgoing transition unit contain an alternative indication of a subsequent alternative outgoing transition.Type: GrantFiled: May 25, 1995Date of Patent: March 18, 1997Assignee: Xerox CorporationInventors: Ronald M. Kaplan, Martin Kay
-
Patent number: 5594641Abstract: The present invention solves a number of problems in using stems (canonical indicators of word meanings) in full-text retrieval of natural language documents, and thus permits recall to be improved without sacrificing precision. It uses various arrangements of finite-state transducers to accurately encode a number of desirable ways of mapping back and forth between words and stems, taking into account both systematic aspects of a language's morphological rule system and also the word-by-word irregularities that also occur. The techniques described apply generally across the languages of the world and are not just limited to simple suffixing languages like English. Although the resulting transducers can have many states and transitions or arcs, they can be compacted by finite-state compression algorithms so that they can be used effectively in resource-limited applications.Type: GrantFiled: June 8, 1994Date of Patent: January 14, 1997Assignee: Xerox CorporationInventors: Ronald M. Kaplan, Lauri Karttunen
-
Patent number: 5581780Abstract: An FSM data structure is encoded by generating a transition unit of data corresponding to each transition which leads ultimately to a final state of the FSM. Information about the states is included in the transition units, so that the encoded data structure can be written without state units of data. The incoming transition units to a final state each contain an indication of finality. The incoming transition units to a state which has no outgoing transition units each contain a branch ending indication. The outgoing transition units of each state are ordered into a comparison sequence for comparison with a received element, and all but the last outgoing transition unit contain an alternative indication of a subsequent alternative outgoing transition.Type: GrantFiled: May 25, 1995Date of Patent: December 3, 1996Assignee: Xerox CorporationInventors: Ronald M. Kaplan, Martin Kay, John Maxwell