Document Retrieval Systems (epo) Patents (Class 707/E17.008)
-
Publication number: 20100077005Abstract: Users may be presented with different viewing interfaces for a document based on a combination of factors relating to display rights possessed for the document and user specific information. In one implementation, the user's location is used to determine portions of the document that can be displayed to the user. More particularly, access privileges to a document for a user are determined based on geographical location information of the user and based on access rights possessed for the document. Portions of the document may then be formatted for display to the user based on the determined access privileges.Type: ApplicationFiled: November 30, 2009Publication date: March 25, 2010Applicant: GOOGLE INC.Inventors: Joseph O'Sullivan, Siraj Khaliq, Adam M. Smith, Alexander MacGillivray, Joe Sriver
-
Publication number: 20100076954Abstract: Duplicate documents are detected in a web crawler system. Upon receiving a newly crawled document, a set of documents, if any, sharing the same content as the newly crawled document is identified. Information identifying the newly crawled document and the selected set of documents is merged into information identifying a new set of documents. Duplicate documents are included and excluded from the new set of documents based on a query independent metric for each such document. A single representative document for the new set of documents is identified in accordance with a set of predefined conditions.Type: ApplicationFiled: December 1, 2009Publication date: March 25, 2010Inventors: Daniel Dulitz, Alexandre A. Verstak, Sanjay Ghemawat, Jeffrey A. Dean
-
Publication number: 20100076984Abstract: A system and method for query expansion allows the refinement and expansion of a keyword query search by combining a key concept with semantically related concepts proposed by the system and associated with that key concept. The semantically related concepts may be grouped together in a cluster, which is then presented to the user in the form of a tooltip. Once a semantically related concept is selected from the cluster, a new search is launched. This new search may use as query terms the combination of at least part of the key concept and the selected semantically related concept to thereby expand the original search.Type: ApplicationFiled: March 26, 2009Publication date: March 25, 2010Inventors: Alkis Papadopoullos, Claude Vogel, Matthias Howell
-
Publication number: 20100076999Abstract: In registering a new document file in an index, the accumulated percentage of the number of registered keys A from registered keys associated with one posting data, including registered data, is computed. The posting data of a registered key associated with the number of posting data items, which is at most a threshold N, is stored in a leaf page of a balanced-plus tree constituted of the registered keys, and the posting data of a registered key associated with the number of posting data items, which is greater than the threshold N, is stored in a page of a posting-storing unit. When the accumulated number i of registered documents is a predetermined document number, the threshold N of the number of posting data items is changed to the maximum number of the posting data items that are associated with a registered key where the accumulated percentage is less than 60 percent.Type: ApplicationFiled: September 26, 2007Publication date: March 25, 2010Applicant: Justsystems CorproationInventors: Yasuhisa Okazaki, Takanori Hino, Kyoko Fujita, Mikio Moriya
-
Publication number: 20100070482Abstract: Content search on a device involves receiving a search request at a search engine of a device. The search request is associated with a search category. In response to the search request, a plug-in module of the search engine is selected based at least in part on a search category of the request. An extensible markup-language-formatted definition document is received from the plug-in module. The definition document includes an arrangement of data particular to the search category. A results document is formed based on the definition document and sent from the search engine to a search result renderer operating concurrently on the device with the search engine.Type: ApplicationFiled: September 12, 2008Publication date: March 18, 2010Inventors: Murali-Krishna Punaganti Venkata, Kristian Luoma, Jussi-Pekka Partanen, Mikko Tapio Kankainen
-
Publication number: 20100070464Abstract: A workflow application allows users to store, manage, and perform tasks related to workflows comprised of ordered sets of documents. The application provides an interface for creating and managing a workflow. Each document added to the workflow is assigned to a particular position in the workflow. Via the interface, the user saves data defining the workflow. In this manner, the workflow may be shared or preserved for subsequent re-use. The application allows a user to perform various tasks with respect to the workflow. Via an interface control presented by the application, the user instigates such tasks. In response, the application causes the task to be performed for each document in the workflow in an order corresponding to the arrangement of the documents within the workflow. For example, the application may render and print each document in the workflow. Or, the application may generate a combined workflow report.Type: ApplicationFiled: September 9, 2009Publication date: March 18, 2010Inventors: Andrew Aymeloglu, Nicholas Miyake, Brandon Burr, Derek Cicerone, Kevin Simler, Garry Tan
-
Patent number: 7680852Abstract: A technique to maintain the fast search capability for the large-scale documents without causing the update delay is provided. This search processing method includes: causing an index search unit for carrying out an index search using a search index before document update to carry out the index search relating to a search request, and obtaining a first list of document IDs of pertinent documents; causing a string pattern matching unit having document contents after the document update to carry out a string pattern matching relating to the search request for the document contents after the document update, and obtaining a second list of document IDs of pertinent documents; and generating a search result for the search request by using the first and second lists and a third list of document IDs of documents relating to the document update.Type: GrantFiled: April 13, 2007Date of Patent: March 16, 2010Assignee: Fujitsu LimitedInventor: Isao Nanba
-
Publication number: 20100057752Abstract: A document management apparatus capable of processing a request from a user logged in from an information processing apparatus via a network, includes a user management unit configured to manage attribute information for specifying document information to be used by a user logging in from the information processing apparatus, a document management unit configured to associate document information having different document attributes obtained from the information processing apparatus with one another to manage the document information as integrated document information, a determination unit configured to determine whether the document information requested by the log-in user is an integrated document, a specifying unit configured to specify, when the determination unit determines that the document information requested by the log-in user is an integrated document, document information to be referred to by the user from the integrated document information based on the attribute information by the log-in user, aType: ApplicationFiled: August 26, 2009Publication date: March 4, 2010Applicant: CANON KABUSHIKI KAISHAInventor: Koji Kikuchi
-
Publication number: 20100057798Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system when presented with an input triplet comprising a user, a query, and a document determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.Type: ApplicationFiled: November 11, 2009Publication date: March 4, 2010Applicant: Microsoft CorporationInventors: Benyu Zhang, Gui-Rong Xue, Hua-Jun Zeng, Wei-Ying Ma, Xue-Mei Jiang, Zheng Chen
-
Publication number: 20100057691Abstract: The present invention relates to a method for storing annotations of non-XML documents (10) in an XML database (1), the XML database (1) being adapted for storing a corresponding shadow XML document (20) for each of the non-XML documents (10), the method comprising the steps of: a. receiving an annotation document (15) comprising the annotations and attaching the annotations to the corresponding shadow XML document (20) in the XML database (1); and b. receiving an updated non-XML document (10?) and attaching any existing annotations from the original shadow XML document (20) to an updated shadow XML document (20?) created by the XML database (1).Type: ApplicationFiled: November 12, 2008Publication date: March 4, 2010Applicant: SOFTWARE AGInventors: Julius Geppert, Michael Gesmann
-
Publication number: 20100057483Abstract: A computer readable medium configured to store instructions for executing the following steps (A) accepting a first number of user parameters to be organized as a patent idea, (B) accepting a second number of user parameters to add to the patent idea, and (C) automatically converting the patent idea into a patent disclosure. The patent disclosure may comply with a number of criteria specific to a particular organization.Type: ApplicationFiled: August 29, 2008Publication date: March 4, 2010Inventor: Michael L. Peterson
-
Publication number: 20100049708Abstract: A system and method for scoring concepts in a document set is provided. Concepts including two or more terms extracted from the document set are identified. Each document having one or more of the concepts is designated as a candidate seed document. A score is calculated for each of the concepts identified within each candidate seed document based on a frequency of occurrence, concept weight, structural weight, and corpus weight. A vector is formed for each candidate seed document. The vector is compared with a center of one or more clusters each comprising thematically-related documents. At least one of the candidate seed documents that is sufficiently distinct from the other candidate seed documents is selected as a seed document for a new cluster. Each of the unselected candidate seed documents is placed into one of the clusters having a most similar cluster center.Type: ApplicationFiled: October 26, 2009Publication date: February 25, 2010Inventors: Kenji Kawai, Lynne Marie Evans
-
Publication number: 20100049705Abstract: The present invention relates to a document search apparatus for searching a predetermined corpus for a document file whose content is related to text for searching. The apparatus stores index information that indicates the position in a document and the position in a morpheme for a respective gram. Upon the receipt of the input of text for searching, from a user, the document search apparatus extracts a morpheme and a gram. Then, upon the indexing of the rarity of the morpheme in the corpus and the detection of a document file that contains the morpheme, the number of times such a morpheme appears in the document file is counted as an appearance frequency. From the estimate number and the appearance frequency regarding the morpheme, the relevance of the contents between the text for searching and the document file is indexed as a relevance score.Type: ApplicationFiled: September 28, 2007Publication date: February 25, 2010Applicant: Justsystems CorporationInventors: Shingo Ochi, Takanori Hino
-
Publication number: 20100049773Abstract: A unique document handling facility on the scale of a Lotus Notes document. Preferably, the documents are stored in a relational database and served-up using Java servlets, with provisions for handling document content and group level security. The preferred implementation of the invention provides several specific features: (1) Presentation and control of heterogeneous document content through the service of the Enterprise Application Development Platform, (2) An efficient scheme for group level and user level security, (3) Presentation of heterogeneous document types, (4) Presentation of heterogeneous data types in the document, (5) A method to externalize definition of keyword selections, and (6) The ability to present document fields in any order, regardless of whether they originate in the head or body of the underlying document.Type: ApplicationFiled: October 28, 2009Publication date: February 25, 2010Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventor: James R. Wason
-
Publication number: 20100042617Abstract: In one embodiment, the invention provides a method for a system to provide information based on a query, the method comprising: performing a first search of at least one first source for information responsive to the query; providing a result of said search to a user; based on user input, performing a second search of at least one second source for information responsive to the query; and providing a result of said second search to the user.Type: ApplicationFiled: August 12, 2009Publication date: February 18, 2010Inventors: Anna Matveenko, Alexander Rylov
-
Publication number: 20100040220Abstract: A system for responding to queries has an interface for receiving request communications from requesters. An agent platform is configured to receive the request communications and to provide replies to the requesters. A tracking module tracks the request communications and the replies and a news module tracks news events. A correlation module correlates increases in request communications relative to a first news event over a set time frame. An analysis module generates a search assistance routine based on the correlated increases in request communications relative to the first news events over a set time frame, where the search assistance routine is activated when a second news event is detected, similar to the first news event.Type: ApplicationFiled: July 24, 2009Publication date: February 18, 2010Inventors: Faith McGary, Alexis Tabora Adorable
-
Publication number: 20100042599Abstract: Techniques are disclosed for allowing efficient updating of metadata and high performance searching through the use of a text index and a separate updateable metadata index. Generally, an updateable metadata index is used to store document metadata. A text index is used to store document text. Documents in the text index are stored in the same order as the corresponding metadata entries. Upon receiving a search query, a search engine decomposes the query into a metadata condition and a text condition. Search engine performs a parallel scan upon the metadata index and the text index. To increase performance, metadata entries are skipped over if the corresponding text entries do not match the text condition. During the scan, when a document in the metadata index matches a document in the text index, the document is stored in the search results. After the scan, search results are displayed.Type: ApplicationFiled: August 12, 2008Publication date: February 18, 2010Inventors: Tom William Jacopi, Andreas Neumann, Liem Gioi Tran
-
Publication number: 20100042657Abstract: Saving database storage space includes extracting a standard property unit from a database of commodity information and including the SPU in a SPU library, generating a sequence document of the standard property unit and sending the sequence document to a front-end device, determining whether a newly released commodity matches the standard property unit of the sequence document of the standard property unit and in the event that the newly released commodity matches the standard property unit of the sequence document, binding the new released commodity and the matched standard property unit.Type: ApplicationFiled: July 29, 2009Publication date: February 18, 2010Inventors: Xu Qiang Yue, Chen Zhu, Ke Jin, Hu Wei, Jing Feng Luo, Ling Cao
-
Publication number: 20100042593Abstract: In one embodiment, the invention provides a method, comprising: receiving a query from a user computer device; determining what custom messages are applicable based on the query; and delivering any applicable custom messages to the user computer device. The messages may be selected and customized based on a customization control parameter. In one embodiment, the customization control parameter may include the interface language, the regional settings, and the version of the product. Advantageously, trial versions of dictionaries may have different messages from paid or non-trial versions. For example, for the trial versions, a warning message will be sent informing the user that the trial period is about to expire. Additionally, the server device has the capability to detect whether the version used by a user is bootleg or not, and send customized messages to users of bootleg versions.Type: ApplicationFiled: August 12, 2009Publication date: February 18, 2010Inventors: Anna Matveenko, Alexander Rylov
-
Publication number: 20100036817Abstract: Disclosed herein is a control system. The control server includes a management server, a plurality of client terminals configured to includes first and second client terminals and communicate with the management server, and a file server configured to store documents shared by the first and second client terminals. Each of the first and second client terminals includes an external device recognition module that reads codes of external devices that are communicably connected to or separated from the first or second client terminal. The management server includes an external device DB that stores the codes of the external devices, and an external device verification module that searches the external device DB for information about a code, and performs control so that a document stored in the file server is stored in an external device and is then transferred from the file server if the information about the code is found to exist.Type: ApplicationFiled: December 11, 2007Publication date: February 11, 2010Inventors: Hwan Kuk Bae, Yang Jin Seo, Sang Hak Nah
-
Publication number: 20100036838Abstract: A search engine for retrieving documents from a database including a semantic document editor that allows a user to edit an existing document by creating searchable compound words that contains information contextually relevant to the contents of the document. The editor associates the created compound words with the document to produce an enhanced document having the compounds words associated therewith. A database is provided for storing enhanced documents and a semantic query editor is provided that enables a searcher to address the database of enhanced documents with a query. The query editor receives the query and converts it into one or more compound search words that contain contextually relevant information. A search module is provided that receives the searchable compound words and locates the relevant enhanced documents that have compound words associated with the document matching the searchable compound words. An output module presents any located documents to the searcher.Type: ApplicationFiled: August 14, 2009Publication date: February 11, 2010Inventor: Gerard Ellis
-
Publication number: 20100030798Abstract: The invention provides for techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating a review content with the document group. The techniques provide for ways to propagate the review content to one or more email documents associated with the document group and producing a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tag, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by information in meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example.Type: ApplicationFiled: July 29, 2008Publication date: February 4, 2010Applicant: Clearwell Systems, Inc.Inventors: Mohan Kumar, Gary Lehrman, Hari Krishna Dara
-
Publication number: 20100030805Abstract: A method, system, and computer usable program product for propagating information in a trust chain processing are provided in the illustrative embodiments. Upon a trust client invoking the trust chain processing, a mapped security information is received, the mapped security information being stored in a memory or a data storage associated with a data processing system. A set of security information attributes are located from the mapped security information according to a configuration. The set of security information attributes are packaged to form a packaged security information. The packaged security information is issued to a target system, the target system being distinct from the trust client that invoked the trust chain processing. The locating, the packaging, and the issuing collectively form monitoring the trust chain processing. A next component in the trust chain processing may be invoked. The invoking may occur before, after, or during the monitoring.Type: ApplicationFiled: July 30, 2008Publication date: February 4, 2010Applicant: International Business Machines CorporationInventors: Heather Maria Hinton, Sridhar R. Muppidi, David Eugene Cox
-
Publication number: 20100030773Abstract: An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are the indexed according to their included phrases. The document index is partitioned into multiple indexes, including a primary index and a secondary index. The primary index stores phrase posting lists with relevance rank ordered documents. The secondary index stores excess documents from the posting lists in document order.Type: ApplicationFiled: July 20, 2009Publication date: February 4, 2010Applicant: Google Inc.Inventor: Anna L. Patterson
-
Publication number: 20100030765Abstract: Systems and method for providing source attribution for a document are provided. A source attribution generator includes a source determiner and an attribution information generator. The source determiner is configured to determine a source for a section of content received in an electronic document by accessing a network-based search index. The attribution information generator is configured to generate attribution information that indicates the determined source in the electronic document, and to provide the generated attribution information to be included in the electronic document.Type: ApplicationFiled: July 30, 2008Publication date: February 4, 2010Applicant: YAHOO! INC.Inventors: Liang-Yu Chi, Ashley Hall
-
Publication number: 20100030749Abstract: The present inventors devised, among other things, an online legal research system with improved user controls. One exemplary system allows users to enter a query in a query input region that automatically expands to accommodate the length of the query field. The exemplary system also responds to the query by automatically directing it to an appropriate database, saving the user from having to choose among the myriad databases within the system. The exemplary system also provides user-specific folders for not only selected documents or excerpts from documents, but also annotating these documents with notes. The system enables the user to determine whether to make the notes private or publicly available.Type: ApplicationFiled: December 31, 2008Publication date: February 4, 2010Inventor: Michael Dahn
-
Publication number: 20100023311Abstract: System and method for analysis of an opinion expressed in documents on a particular topic computes opinion strength on a continuous numeric scale, or qualitatively. A variety of opinion scoring techniques are plugged in to score opinion expressing words and sentences in documents. These scores are aggregated to measure the opinion intensity of documents. Multilingual opinion analysis is supported by capability to concurrently identify and visualize the opinion intensity expressed in documents in multiple languages. A multi-dimensional representation of the measured opinion intensity is generated which is agreeable with multi-lingual domain.Type: ApplicationFiled: June 8, 2007Publication date: January 28, 2010Inventors: Venkatramanan Siva Subrahmanian, Antonio Picariello, Bonnie J. Dorr, Diego Recupero Reforgiato, Carmine Cesarano, Amelia Sagoff
-
Publication number: 20100023550Abstract: A system for handling meta data for describing one or more resources, wherein the one or more resources are deliverable to a common group of users at one or more user terminals, the system including: a resource server for storing the one or more resources for delivery to at least one of the common group of users at one or more user terminals, an administration server arranged to serve the common group of users at the one or more user terminals, for storing a set of meta data for describing the learning resources, the meta data having a format including a non-semantic tag which is customisable in accordance with the common group of users' requirements.Type: ApplicationFiled: October 6, 2009Publication date: January 28, 2010Applicant: SAP AGInventors: Martin Erhard, Andreas KREBS, Marcus Philipp
-
Publication number: 20100023561Abstract: A data restoration method comprising determining whether a restoration process is in progress, in response to receiving a read request to read contents from a track on a source volume (ST[i]); reading data from ST[i], in response to determining that the restoration process is not in progress; determining whether the read request was originated from a host, in response to determining the restoration process is in progress; reading the data directly from ST[i], in response to determining the read request was not originated from a host; determining whether ST[i] is designated as remote with respect to the restore operation, in response to determining the read request was originated from a host; reading the data directly from ST[i], in response to determining ST[i] is designated as local; and reading the data from a track on a target volume (TT[i]), in response to determining ST[i] is designated as remote.Type: ApplicationFiled: July 22, 2008Publication date: January 28, 2010Applicant: International Business Machines CorporationInventor: Aviad Zlotnick
-
Publication number: 20100023562Abstract: Methods, apparatus, and articles for creating a document revision history for a document imported into a first Electronic Document Management System (EDMS) from a second EDMS. Metadata and content from the second EDMS is “mirrored” within the first EDMS to create an artificial or mirrored revision history of a document within the first EDMS. Doing so allows users of the first EDMS to access any version of a document and its history, as though the document had always existed on the first EDMS. Content may be stored onto the first EDMS or a reference to the content may be stored instead. Rules may be developed to resolve conflicts between different document versions in the first and second EDMS.Type: ApplicationFiled: July 28, 2008Publication date: January 28, 2010Inventors: ROBERT M. KREUCH, Michael Seaman, Roger G. Bacalzo, Grace Smith, Eric L. Edeen
-
Publication number: 20100023505Abstract: Same document group creation means (11) acquires a ratio of common words and characters between documents in order to obtain a predetermined similarity greater than a predetermined threshold value between the documents. According to the ratio, words or characters are selected with a common priority in all the documents to be matched. The documents are correlated to the same document candidate group identified by the selected words or characters and stored in a same group candidate group storage unit (22).Type: ApplicationFiled: September 13, 2007Publication date: January 28, 2010Inventors: Kenji Tateishi, Dai Kusui
-
Publication number: 20100023512Abstract: Distribution of content between publishers and consumers is accomplished using an overlay network that may make use of XML language to facilitate content identification. The overlay network includes a plurality of routers that may be in communication with each other and the publishers and consumers on the Internet. Content and queries are identified by content descriptors that are routed from the originator to a nearest router in the overlay network. The nearest router, for each unique content descriptor, generates a hash identification of the content descriptor which is used by remaining routers in the overlay network to provide the appropriate functions with respect to the content descriptor. In particular, this allows all routers in the overlay network except the nearest router to properly route content without processing every content descriptor.Type: ApplicationFiled: October 7, 2009Publication date: January 28, 2010Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.Inventors: Kadangode Ramakrishnan, William Fenner, Michael Rabinovich, Divesh Srivastava, Yin Zhang
-
Publication number: 20100023500Abstract: A method and system for enabling users of a network to create, store, and provide access to relationships between document objects stored on the network. The method may include the steps for allowing a user of the network to create a link relationship between a first document object and a second document object; for storing the link relationship in one or more link directories; and for providing all users of the network access to the link relationships stored in the one or more link directories based upon the document object currently accessed by the users. The system may include one or more client devices that access document objects stored on the network and create link relationships between the first document object and the second document object; and one or more servers that store and filter the link relationships created by the client devices and transmit one or more link relationships and link references to the client devices.Type: ApplicationFiled: October 31, 2007Publication date: January 28, 2010Inventors: Thomas Layne Bascom, Tanya Jones
-
Publication number: 20100017364Abstract: The present inventors have devised one or more systems, methods, and software for distributed loading of information retrieval systems. One exemplary system includes two or more (at least two) load monitor servers that not only monitor and ensure completion of load tasks by individual load servers in a set of two or more load servers, but also provide for one load monitor to monitor performance of the another. Moreover, the exemplary system provides a service-level-agreement (SLA) data structure for each load server. The SLA data structure governs what types and priority levels of loading tasks will be performed for predetermined time periods.Type: ApplicationFiled: January 15, 2009Publication date: January 21, 2010Applicant: Thomson Reuters Global ResourcesInventors: Mark A. Bluhm, Jon Verreaux
-
Publication number: 20100017406Abstract: A switching information acquiring unit 110 acquires information of switching from a screen displayed by a first application program to a screen displayed by a second application program. A character string extracting unit 104 detects character strings from a document file displayed on a screen by the first application program by using a filter serving as a rule to detect a character string matched with a predetermined condition supposed to be used in a second application program and matched with a predetermined condition from a document file. A display control unit 106 presents a character string actually used in the second application from the character strings detected by a character string extracting unit in response to detection of the switching by a switching detecting unit in a display mode in which a user can select the character string.Type: ApplicationFiled: September 27, 2007Publication date: January 21, 2010Applicant: Access Co., Ltd.Inventor: Koji Yamamoto
-
Publication number: 20100017403Abstract: A set of index keys is included in an index search system that are associated with the scope of the search rather than the content of the documents that are the target of the search. These scope related index keys, or scope keys allows the scope of the search to be selected, reducing the number of documents that a search is required to sift through to obtain results. Furthermore, compound scopes are recognized and stored such that an index of complex search scopes is provided to eliminate rehashing of the searches based on these complex search scopes.Type: ApplicationFiled: September 29, 2009Publication date: January 21, 2010Applicant: MICROSOFT CORPORATIONInventors: Chadd Creighton Merrigan, Kyle G. Peltonen, Dmitriy Meyerzon, David J. Lee
-
Publication number: 20100017385Abstract: In some embodiments a method includes creating a bookmark of a deployable web archives In some embodiments, the bookmark includes deployment and runtime information of current and prior invocations of the deployable web archive, at least one user- and/or author-defined external specified tag describing the deployable web archive, and/or reference/link/access information to the deployable web archivesType: ApplicationFiled: July 16, 2008Publication date: January 21, 2010Applicant: INTERNATIONAL BUSINESS MACHINESInventors: LAUREN GABRIELLE WILCOX, MARSHALL ALLEN LAMB, CHRISTINA KAREN LAURIDSEN, MALCOLM CASEY ONG
-
Publication number: 20100017430Abstract: The subject application is directed to document processing job management. Login data is first received corresponding to an associated user. Electronic documents are then associatively stored with a user identifier in an associated data storage. A default menu is generated on a user interface associated with a document processing device, and an actively displayed default menu on the user interface is determined. Document set data is then generated corresponding to an association of electronic documents in the data storage and received login data based upon the correlation between the login data and the associated user identifier. Summary listing data is then generated that identifies each electronic document in the data storage based upon the generated document set data and upon the determination that a default menu is actively displayed on the user interface. Thereafter, a display is generated corresponding to the summary listing.Type: ApplicationFiled: July 21, 2008Publication date: January 21, 2010Inventor: Marianne L. Kodimer
-
Publication number: 20100017400Abstract: Document classification systems are valuable tools for searching and retrieving classified documents but can be prohibitively complex and cumbersome for users. A system for the indexing and retrieval of classified documents inserts keywords, titles or definitions of previously applied classifications into the document record and provides the resulting record to a search engine. Searchers are able to retrieve documents by searching on keywords from the classification system without looking up class coding.Type: ApplicationFiled: July 6, 2009Publication date: January 21, 2010Inventor: Alan Kent ENGEL
-
Publication number: 20100017390Abstract: Described is a next search keyword presentation apparatus, method and program for the presentation of the next recommended search keyword for use in conjunction with the search results. There is provided an apparatus of presenting relevant next search keywords, including an input unit for inputting a search keyword. A search control unit sends search keywords to a search system and receives search results which are displayed as documents on a display unit. A text body extraction unit extracts the text body and an analysis unit carries out a semantic attribute analysis of words contained within the text body. The search keywords are stored as user history data which is used with semantic attributes of each word to create document representative information. A cluster representative keyword extraction unit clusters document characteristic information and extracts cluster representative keywords which are displayed as search keyword candidates, providing recommended keywords based on browsing history.Type: ApplicationFiled: May 8, 2009Publication date: January 21, 2010Applicant: KABUSHIKI KAISHA TOSHIBAInventors: Tomohiro Yamasaki, Takahiro Kawamura
-
Publication number: 20100017392Abstract: Method and apparatus for a query based search engine that searches a database of linked documents. In some embodiments, the method and apparatus computes reliability degrees of the documents, abstracts each document to generate its abstracts, provides a search query interface so that a user can use to enter a search query, processes the search query to generate an intent match criterion, identifies matched documents according to the generated intent match criterion, computes relevance degrees of the matched documents, sets order of the matched documents, and presents the matched documents to the user according to the set order by displaying the following items for each matched document: a link to the matched document, an abstract of the matched document if there are abstracts of the matched document, and a match in the matched document if there are matches in the matched document.Type: ApplicationFiled: July 17, 2009Publication date: January 21, 2010Inventor: Jianwei Dian
-
Publication number: 20100010970Abstract: A document retrieval apparatus holds: index information in which data and an entity document are associated, with respect to a group of entity documents that are XML documents including entity information; and index information in which data and an annotation document are associated, with respect to a group of annotation documents including annotation information that corresponds to the entity information, respectively. Upon receiving an input of a retrieval query including the entity data for retrieval and the annotation data for retrieval, the document retrieval apparatus at first specifies an entity document including the entity data for retrieval. Further, the document retrieval apparatus specifies an annotation document including the annotation data for retrieval, and specifies an entity document corresponding to the specified annotation document.Type: ApplicationFiled: September 28, 2007Publication date: January 14, 2010Applicant: JUSTSYSTEMS CORPORATIONInventors: Jun Takeuchi, Takanori Hino
-
Publication number: 20100005103Abstract: A system and method is provided for remote administration and management of a computer network, by installation of distributed software agents in remote network components, such as software agents implemented using encapsulated reusable interfaces such as COM or CORBA interfaces. Remote network management is effected by communication with the distributed agents using a structured language-independent parsable text document, such as a markup language; e.g. XML.Type: ApplicationFiled: September 15, 2009Publication date: January 7, 2010Applicant: Global 360, Inc.Inventors: Geoffrey Hager, Robert Chang, Robert Tjia
-
Publication number: 20100004944Abstract: A computer implemented method and system is provided for compiling and publishing an online book in an online collaborative environment. Requirements are collected from one or more users in the online collaborative environment. The collected requirements comprise, for example, educational subjects, topical subjects, information on syllabus, an outline, etc. One or more content structures are created based on the collected requirements using, for example, one or more of search criteria, applicable context, curriculum guidelines, curriculum standards, degrees of difficulty, etc. The educational content is retrieved from multiple online sources, created through authoring, or a combination thereof. The retrieved educational content is compiled into the content structures for creation of the online book. The compilation is performed by automatic compilation or user declared specification. The online book is then published in the online collaborative environment.Type: ApplicationFiled: June 19, 2009Publication date: January 7, 2010Inventor: Murugan Palaniappan
-
Publication number: 20100005058Abstract: A computer-readable recording medium stores therein an information retrieving program that causes a computer to execute acquiring a document to be searched and having a hierarchical structure; generating a path schema related to the acquired document; receiving input of a retrieval keyword, a retrieval condition for the retrieval keyword, and a retrieval equation specifying a retrieval range for the retrieval keyword; generating a single automaton that includes a hierarchy retrieval automaton that retrieves a hierarchy of the generated path schema and a hit keyword retrieval automaton that retrieves a hit keyword satisfying the retrieval condition, the single automaton making state transition between a hit hierarchical node where the hit keyword in the hierarchy retrieval automaton is present and a set of nodes representing the hit keyword; retrieving, from the document and using the single automaton, the hit keyword within the retrieval range; and outputting a retrieval result.Type: ApplicationFiled: September 15, 2009Publication date: January 7, 2010Applicant: FUJITS LIMITEDInventors: Shinichiro TAGO, Seishi Okamoto, Hiroya Inakoshi, Tatsuya Asai
-
Publication number: 20100005054Abstract: Techniques and systems for indexing and retrieving data and documents stored in a record-based database management system (RDBMS) utilize a search engine interface. Search-engine indices are created from tables in the RDBMS and data from the tables is used to create “documents” for each record. Queries that require data from multiple tables may be parsed into a primary query and a set of one or more secondary queries. Join mappings and documents are created for the necessary tables. Documents matching the query string are retrieved using the search-engine indices and join mappings.Type: ApplicationFiled: June 16, 2009Publication date: January 7, 2010Inventors: Tim Smith, William Kimble Johnson, III, Rik Tamm-Daniels, Sid Probstein
-
Publication number: 20100005008Abstract: Systems and methods for generating and sharing interactive charts are described. The interactive charts are generated in an online portal that allows users to customize the interactive features of the chart. An exemplary interactive feature includes an interactive audio feature. The interactive chart can be shared by, for example, embedding the interactive chart in an external electronic document, such as a .ppt or PDF document, that can be shared with other users. The interactive chart and/or the data associated with the interactive chart may also be purchased through an online store environment or otherwise shared with other users.Type: ApplicationFiled: September 5, 2008Publication date: January 7, 2010Inventors: Seymour Duncker, Tyron Montgomery
-
Publication number: 20100005144Abstract: The present invention discloses a method system for transmitting a document over a Network including the steps of a document sender converts a sharing document to be transmitted into a GDI (Graph Device Interface) document by performing virtual printing. The document receiver receives the graph device interface document sent from the document sender through the network The document receiver restores the received GDI document. The contents of the restored GDI document are the same as that of the sharing document. The present invention also provides a system, a virtual printer apparatus and a restoration apparatus, the transmission of the document is not restricted by the application using the method, system and apparatus of the present invention.Type: ApplicationFiled: December 15, 2005Publication date: January 7, 2010Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITEDInventor: Haijun Wu
-
Publication number: 20100005083Abstract: Frequency based keyword extraction method and system utilizing a statistical measure is disclosed which generates keywords within a page and/or document that can distinguish the document from an average document. A simple frequency threshold parameter can be utilized to determine a number of common stop words if a word in the document possesses a frequency in a corpus that is more than the threshold parameter. A statistical confidence interval of the frequency in the document can be compared against a frequency confidence interval of the word in the corpus. The extracted keyword possesses a greater intra-document frequency confidence interval than the frequency confidence interval of the word within the corpus. A statistical hypothesis test can also be utilized to determine the keyword by calculating a test statistic and testing whether the test statistic is greater than some threshold.Type: ApplicationFiled: July 1, 2008Publication date: January 7, 2010Inventors: Stephen C. Morgana, John C. Handley
-
Publication number: 20090327326Abstract: A method for exporting native source documents (NSDs) from a document repository. The method includes identifying a first NSD to export, where the first NSD includes a first version of content and first metadata, and identifying a second NSD to export, where the second NSD comprises a second version of the content and second metadata. The method further includes generating a source content definition file (CDF) document that includes a global property, a first version-specific property for the first version of the content, a reference to the first version of the content, a second version specific-property for the second version of the content, and a reference to the second version of the content. The method further includes storing the source CDF document in a persistent storage device.Type: ApplicationFiled: April 30, 2008Publication date: December 31, 2009Applicant: ENTERPRISE CONTENT MANAGEMENT GROUP, LLC.Inventor: Ernest F. Bahr