Text Summarization Or Condensation Patents (Class 715/254)
  • Patent number: 7559021
    Abstract: An example of a solution provided here comprises receiving a text definition signal, defining a first portion of text for folding, receiving a signal for hiding, and in response to the signal for hiding, displaying to at least one user a text view without the first portion, and a clue as to what is hidden.
    Type: Grant
    Filed: January 20, 2005
    Date of Patent: July 7, 2009
    Assignee: International Business Machines Corporation
    Inventors: Yen Fu Chen, John H. Handy-Bosma, Mei Y. Selvage, Keith R. Walker
  • Patent number: 7549114
    Abstract: To reduce required display space, a text segment is reduced in size by successively eliminating portions of the text segment and by reducing a size of text of the text segment and/or a spacing between characters of the text segment. The reduction is thus visually represented in a step-wise manner, and recognizability of the text segment is maintained, even if the final representation of the text does not carry a full meaning and/or is not independently comprehensible, because of the impression left in the mind of the user by the step-wise reduction of the text segment. The reduction may be animated.
    Type: Grant
    Filed: February 21, 2003
    Date of Patent: June 16, 2009
    Assignee: Xerox Corporation
    Inventors: Benjamin B. Bederson, Lance E. Good, Mark J. Stefik
  • Publication number: 20090112720
    Abstract: In a computerized method of identifying and displaying messages containing an identifier, where the messages are privately stored for restricted access by a user, the identifier of a document displayed in a navigation area result window is identified. In addition, the message store is scanned to identify messages containing the identifier and the messages identified as containing the identifier are displayed in a messaging area window.
    Type: Application
    Filed: October 21, 2008
    Publication date: April 30, 2009
    Inventors: Tyler Close, John Recker, Craig Sayers, Ian R. Robinson
  • Patent number: 7509572
    Abstract: A summarization system generates summaries from documents. Text structure tags, in conformance with the Text Encoding Initiative (TEI), are inserted into the documents to generate encoded documents. The text structure tags, when associated with portions of the document, identify text types. A text type, such as an argumentative text type, provides meta-information about the associated portion of text. The documents are also encoded, via document type declaration (“DTD”) in the eXtensible mark-up language (“XML”), to generate a tree structure that depicts the text types and hierarchical relationships among the text types in the tree structure. The summarization system generates a summary of the documents by extracting portions of the document, associated with the text type tags, using the tree structure in accordance with user input. The summarization system may be used to generate summaries from multiple documents.
    Type: Grant
    Filed: July 16, 1999
    Date of Patent: March 24, 2009
    Assignee: Oracle International Corporation
    Inventors: Nicole M. Melander, Ophir Frieder
  • Publication number: 20090044104
    Abstract: To provide a user with an easily understandable help document, the MFP capable of executing a plurality of processes includes a process designation accepting portion to accept designation of at least one of the plurality of processes, a workflow generating portion to generate workflow definition data defining the one or more processes accepted, and a help document generating portion to generate a help document corresponding to the generated workflow. The help document generating portion includes a summary page generating portion to generate a summary page having listed thereon process names for identification of the one or more processes defined by the corresponding workflow definition data.
    Type: Application
    Filed: July 18, 2008
    Publication date: February 12, 2009
    Applicant: Konica Minolta Business Technologies Inc.
    Inventor: Masaya Hashimoto
  • Publication number: 20080313534
    Abstract: A system and method for translating received input from a sender to recipient in an instant messaging dialog is disclosed. The method comprises receiving instant messaging input from a sender for recipient, wherein the instant messaging input comprises at least one subculture specific term. A category is identified the defines a difference between the sender and the recipient and the received instant messaging input is modified from the sender by generating an output associated with the least one subculture specific term and based on the identified category. Multiple recipients in a chat session may also each receive a translated or annotated message according to characteristics of each individual recipient.
    Type: Application
    Filed: August 19, 2008
    Publication date: December 18, 2008
    Applicant: AT&T Corp.
    Inventors: Eric Cheung, Kermit Hal Purdy
  • Patent number: 7454698
    Abstract: A digital document browsing system includes: a layout engine for determining the layout of a digital document based on previously obtained historical data for a display form of the digital document, a summarization engine for preparing a summary for the sentences of the digital document based on the historical data for the digital document. Further included is a view generator for arranging the summary obtained by the summarization engine in accordance with the layout, and for generating data relating to the display form of the digital document. A user interface for displaying the digital document on a display device based on the data related to the display form is still further included.
    Type: Grant
    Filed: February 15, 2002
    Date of Patent: November 18, 2008
    Assignee: International Business Machines Corporation
    Inventors: Takenori Kohda, Moriyoshi Ohara, Katashi Nagao
  • Patent number: 7451395
    Abstract: Techniques for determining interactive topic-based summarization are provided. A text to be summarized is segmented. Discrete keyword, key-phrase, n-gram, sentence and other sentence constituent based summaries are generated based on statistical measures for each text segment. Interactive topic-based summaries are displayed with human sensible omitted text indicators such as alternate colors, fonts, sounds, tactile elements or other human sensible display characteristics useful in indicating omitted text. Individual and/or combinations of discrete keyword, key-phrase, n-gram, sentence, noun phrase and sentence constituent based summaries are dynamically displayed to provide an overview of topic and subtopic development within a text. A hierarchical and interactive display of texts based on the use of discrete sentence constituent based summaries which associates expansible and contractible displayed text provides contextualized access to an interactive topic-based text summary and to an original text.
    Type: Grant
    Filed: December 16, 2002
    Date of Patent: November 11, 2008
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Thorsten H. Brants, Francine R. Chen, Annie E. Zaenen
  • Patent number: 7447627
    Abstract: A method of determining the component words of a compound word is disclosed. The method identifies the component words, by comparing the word with a list of words found in a lexicon. If the word is not found in the lexicon the method proceeds to analyze the word on a character-by-character basis. After each character the method identifies any potential matches to the selected characters in the lexicon. If a match is found, it is added to a hypothesis trace in a lattice. Next, the method checks to see whether the remaining characters form a valid entry in the lexicon, and whether the entry is allowed to be a final segment.
    Type: Grant
    Filed: March 19, 2004
    Date of Patent: November 4, 2008
    Assignee: Microsoft Corporation
    Inventors: Andrea Maria Jessee, Miriam R. Eckert, Kevin R. Powell
  • Patent number: 7426689
    Abstract: A system and method are provided for identifying text information and making such information available for applications. The system includes an electronic device communicating text information and an application for performing a function. The system also includes memory storing known text formats and a processor for processing the text information. The processor compares the text information to the known text formats and determines a text format of the text information. The processor also tags the text information according to the determined text format and further makes the tagged text information available to one or more applications.
    Type: Grant
    Filed: December 15, 2004
    Date of Patent: September 16, 2008
    Assignee: Ford Motor Company
    Inventors: Craig Simonds, Garold Myers, Perry Macneille
  • Patent number: 7421652
    Abstract: A document summary which includes an assemblage of a plurality of summary entries is generated for an electronic document. In the generation of the document summary, a content structure or properties within the electronic document are analyzed. The plurality of summary entries are selected from the contents of the electronic document based on the analysis of the content structure or properties. The content structure within the electronic document may include a table of contents, a plurality of spreadsheet worksheets, a plurality of document pages, etc. The content properties within the electronic document may include text formatting, paragraph formatting, paragraph sizing, etc. Preferably, the best available content structure or properties within the electronic document is identified and utilized in the selection of the plurality of summary entries. The document summary is provided to a mobile communication device in response to a request for the electronic document.
    Type: Grant
    Filed: October 24, 2003
    Date of Patent: September 2, 2008
    Assignee: Arizan Corporation
    Inventors: Jianwei Yuan, Olav A. Sylthe
  • Patent number: 7406458
    Abstract: Techniques are provided for generating descriptions of matching resources in a manner that takes into account the kind, quality, and relevance of the available sources of information about the matching resources. For example, after the search engine identifies matching resources based on the query terms, the search engine determines the kinds of available sources of information about each matching resource. For each matching resource, based on the kinds of available sources of information about the matching resource, one of a plurality of processes is selected to generate a description for the matching resource. Using the content-sensitive description generation techniques described herein, a single result set may include abstracts that were generated using several different processes, where the difference in process corresponds to a difference in the kind, quality, and relevance of the available sources of information about each matching resource.
    Type: Grant
    Filed: February 11, 2003
    Date of Patent: July 29, 2008
    Assignee: Yahoo! Inc.
    Inventors: Chad Carson, Mohan V. Nibhanupudi, Robert Meyers, Dmitri Pavlovski, Douglas Cook
  • Publication number: 20080172606
    Abstract: A method of providing information related to content presented within a first window, the method comprising extracting primary information from the content in response to activation of an interactive mechanism, the primary information including entities mentioned in the content, obtaining related information from content sources based on the primary information, wherein related information includes connection paths between a user and the entities, and generating a summary page including items of primary information in association with the related information. At least some of the items from the primary information and items from the related information can be provided as user selectable links to corresponding detailed information.
    Type: Application
    Filed: March 1, 2007
    Publication date: July 17, 2008
    Applicant: GENERATE, INC.
    Inventor: Robert A. White
  • Patent number: 7398203
    Abstract: A text processor processes text in a message. The text processor generates a plurality of compressed forms of components of the message. The processor performs a linguistic analysis on the body of text to obtain a linguistic output indicative of linguistic components of the body of text. The processor then generates the plurality of compressed forms that can be used to compress the body of text. The plurality of compressed forms are generated based on the linguistic output. The invention can be implemented as a method of generating the compressed forms and as an apparatus.
    Type: Grant
    Filed: April 4, 2006
    Date of Patent: July 8, 2008
    Assignee: Microsoft Corporation
    Inventors: Simon H. Corston-Oliver, Sharad Mathur
  • Patent number: 7395501
    Abstract: An automatic reading assistance application for documents available in electronic form. An automatic annotator is provided which finds concepts of interest and keywords. The operation of the annotator is personalizable for a particular user. The annotator is also capable of improving its performance overtime by both automatic and manual feedback. The annotator is usable with any electronic document. Another available feature is a thumbnail image of all or part of a multi-page document wherein a currently displayed section of the document is highlighted in the thumbnail image. Movement of the highlighted area in the thumbnail image is then synchronized with scrolling through the document.
    Type: Grant
    Filed: August 6, 2002
    Date of Patent: July 1, 2008
    Assignee: Ricoh Company, Ltd.
    Inventors: Jamey Graham, David G. Stork
  • Patent number: 7392474
    Abstract: A method and system for classifying display pages based on automatically generated summaries of display pages. A web page classification system uses a web page summarization system to generate summaries of web pages. The summary of a web page may include the sentences of the web page that are most closely related to the primary topic of the web page. The summarization system may combine the benefits of multiple summarization techniques to identify the sentences of a web page that represent the primary topic of the web page. Once the summary is generated, the classification system may apply conventional classification techniques to the summary to classify the web page. The classification system may use conventional classification techniques such as a Naïve Bayesian classifier or a support vector machine to identify the classifications of a web page based on the summary generated by the summarization system.
    Type: Grant
    Filed: April 30, 2004
    Date of Patent: June 24, 2008
    Assignee: Microsoft Corporation
    Inventors: Zheng Chen, Dou Shen, Benyu Zhang, Hua-Jun Zeng, Wei-Ying Ma
  • Patent number: 7376893
    Abstract: Techniques for determining sentence based interactive topic-based summarization are provided. A text to be summarized is segmented. Discrete keyword, key-phrase, n-gram, sentence and other sentence constituent based summaries are generated based on statistical measures for each text segment. Interactive topic-based summaries are displayed with human sensible omitted text indicators such as alternate colors, fonts, sounds, tactile elements or other human sensible display characteristics useful in indicating omitted text. Individual and/or combinations of discrete keyword, key-phrase, n-gram, sentence, noun phrase and sentence constituent based summaries are dynamically displayed to provide an overview of topic and subtopic development within a text.
    Type: Grant
    Filed: December 16, 2002
    Date of Patent: May 20, 2008
    Assignee: Palo Alto Research Center Incorporated
    Inventors: Francine R. Chen, Thorsten H. Brants, Annie E. Zaenen
  • Publication number: 20080109716
    Abstract: A method and apparatus for editing and displaying a web document, and a browser are provided. The apparatus for editing and displaying a web document includes a browser which classifies the content of the web document received through the Internet into two or more classified parts; and a display which displays only a summary of the web document partially including the classified parts, wherein the browser additionally displays a portion of a non-displayed item selected from the summary of the web document displayed on the display. Accordingly, in consumer electronics employing a method of editing and displaying a web document, even if the consumer electronics do not include an input unit such as a mouse or a touch pen, it is possible to easily and rapidly browse a large amount of web documents by using only a direction key.
    Type: Application
    Filed: May 11, 2007
    Publication date: May 8, 2008
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Kyung-eun LEE
  • Publication number: 20080104506
    Abstract: A method for producing a document summary from a document. The method includes: associating with the document a specific category from a set of predetermined categories; performing a thematic segmentation of the document to produce a segmented document, the segmented document including a plurality of text segments; associating with each text segment from the plurality of text segments a theme selected from a set of predetermined themes; and summarizing the segmented document to produce the document summary by processing each text segment from the plurality of text segments to either select at least one summary textual unit from the text segment, the at least one summary textual unit including at least one word and being a textual unit considered important in summarizing the document; or extract no textual unit from the text segment. The summary textual units are used to form the document summary.
    Type: Application
    Filed: October 30, 2006
    Publication date: May 1, 2008
    Inventor: Atefeh Farzindar