Document Retrieval Systems (epo) Patents (Class 707/E17.008)
  • Publication number: 20100077005
    Abstract: Users may be presented with different viewing interfaces for a document based on a combination of factors relating to display rights possessed for the document and user specific information. In one implementation, the user's location is used to determine portions of the document that can be displayed to the user. More particularly, access privileges to a document for a user are determined based on geographical location information of the user and based on access rights possessed for the document. Portions of the document may then be formatted for display to the user based on the determined access privileges.
    Type: Application
    Filed: November 30, 2009
    Publication date: March 25, 2010
    Applicant: GOOGLE INC.
    Inventors: Joseph O'Sullivan, Siraj Khaliq, Adam M. Smith, Alexander MacGillivray, Joe Sriver
  • Publication number: 20100076954
    Abstract: Duplicate documents are detected in a web crawler system. Upon receiving a newly crawled document, a set of documents, if any, sharing the same content as the newly crawled document is identified. Information identifying the newly crawled document and the selected set of documents is merged into information identifying a new set of documents. Duplicate documents are included and excluded from the new set of documents based on a query independent metric for each such document. A single representative document for the new set of documents is identified in accordance with a set of predefined conditions.
    Type: Application
    Filed: December 1, 2009
    Publication date: March 25, 2010
    Inventors: Daniel Dulitz, Alexandre A. Verstak, Sanjay Ghemawat, Jeffrey A. Dean
  • Publication number: 20100076984
    Abstract: A system and method for query expansion allows the refinement and expansion of a keyword query search by combining a key concept with semantically related concepts proposed by the system and associated with that key concept. The semantically related concepts may be grouped together in a cluster, which is then presented to the user in the form of a tooltip. Once a semantically related concept is selected from the cluster, a new search is launched. This new search may use as query terms the combination of at least part of the key concept and the selected semantically related concept to thereby expand the original search.
    Type: Application
    Filed: March 26, 2009
    Publication date: March 25, 2010
    Inventors: Alkis Papadopoullos, Claude Vogel, Matthias Howell
  • Publication number: 20100076999
    Abstract: In registering a new document file in an index, the accumulated percentage of the number of registered keys A from registered keys associated with one posting data, including registered data, is computed. The posting data of a registered key associated with the number of posting data items, which is at most a threshold N, is stored in a leaf page of a balanced-plus tree constituted of the registered keys, and the posting data of a registered key associated with the number of posting data items, which is greater than the threshold N, is stored in a page of a posting-storing unit. When the accumulated number i of registered documents is a predetermined document number, the threshold N of the number of posting data items is changed to the maximum number of the posting data items that are associated with a registered key where the accumulated percentage is less than 60 percent.
    Type: Application
    Filed: September 26, 2007
    Publication date: March 25, 2010
    Applicant: Justsystems Corproation
    Inventors: Yasuhisa Okazaki, Takanori Hino, Kyoko Fujita, Mikio Moriya
  • Publication number: 20100070482
    Abstract: Content search on a device involves receiving a search request at a search engine of a device. The search request is associated with a search category. In response to the search request, a plug-in module of the search engine is selected based at least in part on a search category of the request. An extensible markup-language-formatted definition document is received from the plug-in module. The definition document includes an arrangement of data particular to the search category. A results document is formed based on the definition document and sent from the search engine to a search result renderer operating concurrently on the device with the search engine.
    Type: Application
    Filed: September 12, 2008
    Publication date: March 18, 2010
    Inventors: Murali-Krishna Punaganti Venkata, Kristian Luoma, Jussi-Pekka Partanen, Mikko Tapio Kankainen
  • Publication number: 20100070464
    Abstract: A workflow application allows users to store, manage, and perform tasks related to workflows comprised of ordered sets of documents. The application provides an interface for creating and managing a workflow. Each document added to the workflow is assigned to a particular position in the workflow. Via the interface, the user saves data defining the workflow. In this manner, the workflow may be shared or preserved for subsequent re-use. The application allows a user to perform various tasks with respect to the workflow. Via an interface control presented by the application, the user instigates such tasks. In response, the application causes the task to be performed for each document in the workflow in an order corresponding to the arrangement of the documents within the workflow. For example, the application may render and print each document in the workflow. Or, the application may generate a combined workflow report.
    Type: Application
    Filed: September 9, 2009
    Publication date: March 18, 2010
    Inventors: Andrew Aymeloglu, Nicholas Miyake, Brandon Burr, Derek Cicerone, Kevin Simler, Garry Tan
  • Patent number: 7680852
    Abstract: A technique to maintain the fast search capability for the large-scale documents without causing the update delay is provided. This search processing method includes: causing an index search unit for carrying out an index search using a search index before document update to carry out the index search relating to a search request, and obtaining a first list of document IDs of pertinent documents; causing a string pattern matching unit having document contents after the document update to carry out a string pattern matching relating to the search request for the document contents after the document update, and obtaining a second list of document IDs of pertinent documents; and generating a search result for the search request by using the first and second lists and a third list of document IDs of documents relating to the document update.
    Type: Grant
    Filed: April 13, 2007
    Date of Patent: March 16, 2010
    Assignee: Fujitsu Limited
    Inventor: Isao Nanba
  • Publication number: 20100057752
    Abstract: A document management apparatus capable of processing a request from a user logged in from an information processing apparatus via a network, includes a user management unit configured to manage attribute information for specifying document information to be used by a user logging in from the information processing apparatus, a document management unit configured to associate document information having different document attributes obtained from the information processing apparatus with one another to manage the document information as integrated document information, a determination unit configured to determine whether the document information requested by the log-in user is an integrated document, a specifying unit configured to specify, when the determination unit determines that the document information requested by the log-in user is an integrated document, document information to be referred to by the user from the integrated document information based on the attribute information by the log-in user, a
    Type: Application
    Filed: August 26, 2009
    Publication date: March 4, 2010
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Koji Kikuchi
  • Publication number: 20100057798
    Abstract: A method and system for adapting search results of a query to the information needs of the user submitting the query is provided. A search system analyzes click-through triplets indicating that a user submitted a query and that the user selected a document from the results of the query. To overcome the large size and sparseness of the click-through data, the search system when presented with an input triplet comprising a user, a query, and a document determines a probability that the user will find the input document important by smoothing the click-through triplets. The search system then orders documents of the result based on the probability of their importance to the input user.
    Type: Application
    Filed: November 11, 2009
    Publication date: March 4, 2010
    Applicant: Microsoft Corporation
    Inventors: Benyu Zhang, Gui-Rong Xue, Hua-Jun Zeng, Wei-Ying Ma, Xue-Mei Jiang, Zheng Chen
  • Publication number: 20100057691
    Abstract: The present invention relates to a method for storing annotations of non-XML documents (10) in an XML database (1), the XML database (1) being adapted for storing a corresponding shadow XML document (20) for each of the non-XML documents (10), the method comprising the steps of: a. receiving an annotation document (15) comprising the annotations and attaching the annotations to the corresponding shadow XML document (20) in the XML database (1); and b. receiving an updated non-XML document (10?) and attaching any existing annotations from the original shadow XML document (20) to an updated shadow XML document (20?) created by the XML database (1).
    Type: Application
    Filed: November 12, 2008
    Publication date: March 4, 2010
    Applicant: SOFTWARE AG
    Inventors: Julius Geppert, Michael Gesmann
  • Publication number: 20100057483
    Abstract: A computer readable medium configured to store instructions for executing the following steps (A) accepting a first number of user parameters to be organized as a patent idea, (B) accepting a second number of user parameters to add to the patent idea, and (C) automatically converting the patent idea into a patent disclosure. The patent disclosure may comply with a number of criteria specific to a particular organization.
    Type: Application
    Filed: August 29, 2008
    Publication date: March 4, 2010
    Inventor: Michael L. Peterson
  • Publication number: 20100049708
    Abstract: A system and method for scoring concepts in a document set is provided. Concepts including two or more terms extracted from the document set are identified. Each document having one or more of the concepts is designated as a candidate seed document. A score is calculated for each of the concepts identified within each candidate seed document based on a frequency of occurrence, concept weight, structural weight, and corpus weight. A vector is formed for each candidate seed document. The vector is compared with a center of one or more clusters each comprising thematically-related documents. At least one of the candidate seed documents that is sufficiently distinct from the other candidate seed documents is selected as a seed document for a new cluster. Each of the unselected candidate seed documents is placed into one of the clusters having a most similar cluster center.
    Type: Application
    Filed: October 26, 2009
    Publication date: February 25, 2010
    Inventors: Kenji Kawai, Lynne Marie Evans
  • Publication number: 20100049705
    Abstract: The present invention relates to a document search apparatus for searching a predetermined corpus for a document file whose content is related to text for searching. The apparatus stores index information that indicates the position in a document and the position in a morpheme for a respective gram. Upon the receipt of the input of text for searching, from a user, the document search apparatus extracts a morpheme and a gram. Then, upon the indexing of the rarity of the morpheme in the corpus and the detection of a document file that contains the morpheme, the number of times such a morpheme appears in the document file is counted as an appearance frequency. From the estimate number and the appearance frequency regarding the morpheme, the relevance of the contents between the text for searching and the document file is indexed as a relevance score.
    Type: Application
    Filed: September 28, 2007
    Publication date: February 25, 2010
    Applicant: Justsystems Corporation
    Inventors: Shingo Ochi, Takanori Hino
  • Publication number: 20100049773
    Abstract: A unique document handling facility on the scale of a Lotus Notes document. Preferably, the documents are stored in a relational database and served-up using Java servlets, with provisions for handling document content and group level security. The preferred implementation of the invention provides several specific features: (1) Presentation and control of heterogeneous document content through the service of the Enterprise Application Development Platform, (2) An efficient scheme for group level and user level security, (3) Presentation of heterogeneous document types, (4) Presentation of heterogeneous data types in the document, (5) A method to externalize definition of keyword selections, and (6) The ability to present document fields in any order, regardless of whether they originate in the head or body of the underlying document.
    Type: Application
    Filed: October 28, 2009
    Publication date: February 25, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: James R. Wason
  • Publication number: 20100042617
    Abstract: In one embodiment, the invention provides a method for a system to provide information based on a query, the method comprising: performing a first search of at least one first source for information responsive to the query; providing a result of said search to a user; based on user input, performing a second search of at least one second source for information responsive to the query; and providing a result of said second search to the user.
    Type: Application
    Filed: August 12, 2009
    Publication date: February 18, 2010
    Inventors: Anna Matveenko, Alexander Rylov
  • Publication number: 20100040220
    Abstract: A system for responding to queries has an interface for receiving request communications from requesters. An agent platform is configured to receive the request communications and to provide replies to the requesters. A tracking module tracks the request communications and the replies and a news module tracks news events. A correlation module correlates increases in request communications relative to a first news event over a set time frame. An analysis module generates a search assistance routine based on the correlated increases in request communications relative to the first news events over a set time frame, where the search assistance routine is activated when a second news event is detected, similar to the first news event.
    Type: Application
    Filed: July 24, 2009
    Publication date: February 18, 2010
    Inventors: Faith McGary, Alexis Tabora Adorable
  • Publication number: 20100042599
    Abstract: Techniques are disclosed for allowing efficient updating of metadata and high performance searching through the use of a text index and a separate updateable metadata index. Generally, an updateable metadata index is used to store document metadata. A text index is used to store document text. Documents in the text index are stored in the same order as the corresponding metadata entries. Upon receiving a search query, a search engine decomposes the query into a metadata condition and a text condition. Search engine performs a parallel scan upon the metadata index and the text index. To increase performance, metadata entries are skipped over if the corresponding text entries do not match the text condition. During the scan, when a document in the metadata index matches a document in the text index, the document is stored in the search results. After the scan, search results are displayed.
    Type: Application
    Filed: August 12, 2008
    Publication date: February 18, 2010
    Inventors: Tom William Jacopi, Andreas Neumann, Liem Gioi Tran
  • Publication number: 20100042657
    Abstract: Saving database storage space includes extracting a standard property unit from a database of commodity information and including the SPU in a SPU library, generating a sequence document of the standard property unit and sending the sequence document to a front-end device, determining whether a newly released commodity matches the standard property unit of the sequence document of the standard property unit and in the event that the newly released commodity matches the standard property unit of the sequence document, binding the new released commodity and the matched standard property unit.
    Type: Application
    Filed: July 29, 2009
    Publication date: February 18, 2010
    Inventors: Xu Qiang Yue, Chen Zhu, Ke Jin, Hu Wei, Jing Feng Luo, Ling Cao
  • Publication number: 20100042593
    Abstract: In one embodiment, the invention provides a method, comprising: receiving a query from a user computer device; determining what custom messages are applicable based on the query; and delivering any applicable custom messages to the user computer device. The messages may be selected and customized based on a customization control parameter. In one embodiment, the customization control parameter may include the interface language, the regional settings, and the version of the product. Advantageously, trial versions of dictionaries may have different messages from paid or non-trial versions. For example, for the trial versions, a warning message will be sent informing the user that the trial period is about to expire. Additionally, the server device has the capability to detect whether the version used by a user is bootleg or not, and send customized messages to users of bootleg versions.
    Type: Application
    Filed: August 12, 2009
    Publication date: February 18, 2010
    Inventors: Anna Matveenko, Alexander Rylov
  • Publication number: 20100036817
    Abstract: Disclosed herein is a control system. The control server includes a management server, a plurality of client terminals configured to includes first and second client terminals and communicate with the management server, and a file server configured to store documents shared by the first and second client terminals. Each of the first and second client terminals includes an external device recognition module that reads codes of external devices that are communicably connected to or separated from the first or second client terminal. The management server includes an external device DB that stores the codes of the external devices, and an external device verification module that searches the external device DB for information about a code, and performs control so that a document stored in the file server is stored in an external device and is then transferred from the file server if the information about the code is found to exist.
    Type: Application
    Filed: December 11, 2007
    Publication date: February 11, 2010
    Inventors: Hwan Kuk Bae, Yang Jin Seo, Sang Hak Nah
  • Publication number: 20100036838
    Abstract: A search engine for retrieving documents from a database including a semantic document editor that allows a user to edit an existing document by creating searchable compound words that contains information contextually relevant to the contents of the document. The editor associates the created compound words with the document to produce an enhanced document having the compounds words associated therewith. A database is provided for storing enhanced documents and a semantic query editor is provided that enables a searcher to address the database of enhanced documents with a query. The query editor receives the query and converts it into one or more compound search words that contain contextually relevant information. A search module is provided that receives the searchable compound words and locates the relevant enhanced documents that have compound words associated with the document matching the searchable compound words. An output module presents any located documents to the searcher.
    Type: Application
    Filed: August 14, 2009
    Publication date: February 11, 2010
    Inventor: Gerard Ellis
  • Publication number: 20100030798
    Abstract: The invention provides for techniques to process and produce email documents. The techniques provide for organizing a first plurality of email documents into a plurality of document groups, reviewing a document group from the plurality of document groups, and associating a review content with the document group. The techniques provide for ways to propagate the review content to one or more email documents associated with the document group and producing a second plurality of email documents. The techniques provide for annotating one or more email documents in accordance with the review content. Depending on the embodiment, review content may include text, graphics, audio, tag, and multimedia information. Produced documents can be searched and browsed in accordance with information in the review content. Email documents can be grouped by information in meta information and/or header information associated with the email documents into various groups, including threads or conversations, for example.
    Type: Application
    Filed: July 29, 2008
    Publication date: February 4, 2010
    Applicant: Clearwell Systems, Inc.
    Inventors: Mohan Kumar, Gary Lehrman, Hari Krishna Dara
  • Publication number: 20100030805
    Abstract: A method, system, and computer usable program product for propagating information in a trust chain processing are provided in the illustrative embodiments. Upon a trust client invoking the trust chain processing, a mapped security information is received, the mapped security information being stored in a memory or a data storage associated with a data processing system. A set of security information attributes are located from the mapped security information according to a configuration. The set of security information attributes are packaged to form a packaged security information. The packaged security information is issued to a target system, the target system being distinct from the trust client that invoked the trust chain processing. The locating, the packaging, and the issuing collectively form monitoring the trust chain processing. A next component in the trust chain processing may be invoked. The invoking may occur before, after, or during the monitoring.
    Type: Application
    Filed: July 30, 2008
    Publication date: February 4, 2010
    Applicant: International Business Machines Corporation
    Inventors: Heather Maria Hinton, Sridhar R. Muppidi, David Eugene Cox
  • Publication number: 20100030773
    Abstract: An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are the indexed according to their included phrases. The document index is partitioned into multiple indexes, including a primary index and a secondary index. The primary index stores phrase posting lists with relevance rank ordered documents. The secondary index stores excess documents from the posting lists in document order.
    Type: Application
    Filed: July 20, 2009
    Publication date: February 4, 2010
    Applicant: Google Inc.
    Inventor: Anna L. Patterson
  • Publication number: 20100030765
    Abstract: Systems and method for providing source attribution for a document are provided. A source attribution generator includes a source determiner and an attribution information generator. The source determiner is configured to determine a source for a section of content received in an electronic document by accessing a network-based search index. The attribution information generator is configured to generate attribution information that indicates the determined source in the electronic document, and to provide the generated attribution information to be included in the electronic document.
    Type: Application
    Filed: July 30, 2008
    Publication date: February 4, 2010
    Applicant: YAHOO! INC.
    Inventors: Liang-Yu Chi, Ashley Hall
  • Publication number: 20100030749
    Abstract: The present inventors devised, among other things, an online legal research system with improved user controls. One exemplary system allows users to enter a query in a query input region that automatically expands to accommodate the length of the query field. The exemplary system also responds to the query by automatically directing it to an appropriate database, saving the user from having to choose among the myriad databases within the system. The exemplary system also provides user-specific folders for not only selected documents or excerpts from documents, but also annotating these documents with notes. The system enables the user to determine whether to make the notes private or publicly available.
    Type: Application
    Filed: December 31, 2008
    Publication date: February 4, 2010
    Inventor: Michael Dahn
  • Publication number: 20100023311
    Abstract: System and method for analysis of an opinion expressed in documents on a particular topic computes opinion strength on a continuous numeric scale, or qualitatively. A variety of opinion scoring techniques are plugged in to score opinion expressing words and sentences in documents. These scores are aggregated to measure the opinion intensity of documents. Multilingual opinion analysis is supported by capability to concurrently identify and visualize the opinion intensity expressed in documents in multiple languages. A multi-dimensional representation of the measured opinion intensity is generated which is agreeable with multi-lingual domain.
    Type: Application
    Filed: June 8, 2007
    Publication date: January 28, 2010
    Inventors: Venkatramanan Siva Subrahmanian, Antonio Picariello, Bonnie J. Dorr, Diego Recupero Reforgiato, Carmine Cesarano, Amelia Sagoff
  • Publication number: 20100023550
    Abstract: A system for handling meta data for describing one or more resources, wherein the one or more resources are deliverable to a common group of users at one or more user terminals, the system including: a resource server for storing the one or more resources for delivery to at least one of the common group of users at one or more user terminals, an administration server arranged to serve the common group of users at the one or more user terminals, for storing a set of meta data for describing the learning resources, the meta data having a format including a non-semantic tag which is customisable in accordance with the common group of users' requirements.
    Type: Application
    Filed: October 6, 2009
    Publication date: January 28, 2010
    Applicant: SAP AG
    Inventors: Martin Erhard, Andreas KREBS, Marcus Philipp
  • Publication number: 20100023561
    Abstract: A data restoration method comprising determining whether a restoration process is in progress, in response to receiving a read request to read contents from a track on a source volume (ST[i]); reading data from ST[i], in response to determining that the restoration process is not in progress; determining whether the read request was originated from a host, in response to determining the restoration process is in progress; reading the data directly from ST[i], in response to determining the read request was not originated from a host; determining whether ST[i] is designated as remote with respect to the restore operation, in response to determining the read request was originated from a host; reading the data directly from ST[i], in response to determining ST[i] is designated as local; and reading the data from a track on a target volume (TT[i]), in response to determining ST[i] is designated as remote.
    Type: Application
    Filed: July 22, 2008
    Publication date: January 28, 2010
    Applicant: International Business Machines Corporation
    Inventor: Aviad Zlotnick
  • Publication number: 20100023562
    Abstract: Methods, apparatus, and articles for creating a document revision history for a document imported into a first Electronic Document Management System (EDMS) from a second EDMS. Metadata and content from the second EDMS is “mirrored” within the first EDMS to create an artificial or mirrored revision history of a document within the first EDMS. Doing so allows users of the first EDMS to access any version of a document and its history, as though the document had always existed on the first EDMS. Content may be stored onto the first EDMS or a reference to the content may be stored instead. Rules may be developed to resolve conflicts between different document versions in the first and second EDMS.
    Type: Application
    Filed: July 28, 2008
    Publication date: January 28, 2010
    Inventors: ROBERT M. KREUCH, Michael Seaman, Roger G. Bacalzo, Grace Smith, Eric L. Edeen
  • Publication number: 20100023505
    Abstract: Same document group creation means (11) acquires a ratio of common words and characters between documents in order to obtain a predetermined similarity greater than a predetermined threshold value between the documents. According to the ratio, words or characters are selected with a common priority in all the documents to be matched. The documents are correlated to the same document candidate group identified by the selected words or characters and stored in a same group candidate group storage unit (22).
    Type: Application
    Filed: September 13, 2007
    Publication date: January 28, 2010
    Inventors: Kenji Tateishi, Dai Kusui
  • Publication number: 20100023512
    Abstract: Distribution of content between publishers and consumers is accomplished using an overlay network that may make use of XML language to facilitate content identification. The overlay network includes a plurality of routers that may be in communication with each other and the publishers and consumers on the Internet. Content and queries are identified by content descriptors that are routed from the originator to a nearest router in the overlay network. The nearest router, for each unique content descriptor, generates a hash identification of the content descriptor which is used by remaining routers in the overlay network to provide the appropriate functions with respect to the content descriptor. In particular, this allows all routers in the overlay network except the nearest router to properly route content without processing every content descriptor.
    Type: Application
    Filed: October 7, 2009
    Publication date: January 28, 2010
    Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventors: Kadangode Ramakrishnan, William Fenner, Michael Rabinovich, Divesh Srivastava, Yin Zhang
  • Publication number: 20100023500
    Abstract: A method and system for enabling users of a network to create, store, and provide access to relationships between document objects stored on the network. The method may include the steps for allowing a user of the network to create a link relationship between a first document object and a second document object; for storing the link relationship in one or more link directories; and for providing all users of the network access to the link relationships stored in the one or more link directories based upon the document object currently accessed by the users. The system may include one or more client devices that access document objects stored on the network and create link relationships between the first document object and the second document object; and one or more servers that store and filter the link relationships created by the client devices and transmit one or more link relationships and link references to the client devices.
    Type: Application
    Filed: October 31, 2007
    Publication date: January 28, 2010
    Inventors: Thomas Layne Bascom, Tanya Jones
  • Publication number: 20100017364
    Abstract: The present inventors have devised one or more systems, methods, and software for distributed loading of information retrieval systems. One exemplary system includes two or more (at least two) load monitor servers that not only monitor and ensure completion of load tasks by individual load servers in a set of two or more load servers, but also provide for one load monitor to monitor performance of the another. Moreover, the exemplary system provides a service-level-agreement (SLA) data structure for each load server. The SLA data structure governs what types and priority levels of loading tasks will be performed for predetermined time periods.
    Type: Application
    Filed: January 15, 2009
    Publication date: January 21, 2010
    Applicant: Thomson Reuters Global Resources
    Inventors: Mark A. Bluhm, Jon Verreaux
  • Publication number: 20100017406
    Abstract: A switching information acquiring unit 110 acquires information of switching from a screen displayed by a first application program to a screen displayed by a second application program. A character string extracting unit 104 detects character strings from a document file displayed on a screen by the first application program by using a filter serving as a rule to detect a character string matched with a predetermined condition supposed to be used in a second application program and matched with a predetermined condition from a document file. A display control unit 106 presents a character string actually used in the second application from the character strings detected by a character string extracting unit in response to detection of the switching by a switching detecting unit in a display mode in which a user can select the character string.
    Type: Application
    Filed: September 27, 2007
    Publication date: January 21, 2010
    Applicant: Access Co., Ltd.
    Inventor: Koji Yamamoto
  • Publication number: 20100017403
    Abstract: A set of index keys is included in an index search system that are associated with the scope of the search rather than the content of the documents that are the target of the search. These scope related index keys, or scope keys allows the scope of the search to be selected, reducing the number of documents that a search is required to sift through to obtain results. Furthermore, compound scopes are recognized and stored such that an index of complex search scopes is provided to eliminate rehashing of the searches based on these complex search scopes.
    Type: Application
    Filed: September 29, 2009
    Publication date: January 21, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Chadd Creighton Merrigan, Kyle G. Peltonen, Dmitriy Meyerzon, David J. Lee
  • Publication number: 20100017385
    Abstract: In some embodiments a method includes creating a bookmark of a deployable web archives In some embodiments, the bookmark includes deployment and runtime information of current and prior invocations of the deployable web archive, at least one user- and/or author-defined external specified tag describing the deployable web archive, and/or reference/link/access information to the deployable web archives
    Type: Application
    Filed: July 16, 2008
    Publication date: January 21, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES
    Inventors: LAUREN GABRIELLE WILCOX, MARSHALL ALLEN LAMB, CHRISTINA KAREN LAURIDSEN, MALCOLM CASEY ONG
  • Publication number: 20100017430
    Abstract: The subject application is directed to document processing job management. Login data is first received corresponding to an associated user. Electronic documents are then associatively stored with a user identifier in an associated data storage. A default menu is generated on a user interface associated with a document processing device, and an actively displayed default menu on the user interface is determined. Document set data is then generated corresponding to an association of electronic documents in the data storage and received login data based upon the correlation between the login data and the associated user identifier. Summary listing data is then generated that identifies each electronic document in the data storage based upon the generated document set data and upon the determination that a default menu is actively displayed on the user interface. Thereafter, a display is generated corresponding to the summary listing.
    Type: Application
    Filed: July 21, 2008
    Publication date: January 21, 2010
    Inventor: Marianne L. Kodimer
  • Publication number: 20100017400
    Abstract: Document classification systems are valuable tools for searching and retrieving classified documents but can be prohibitively complex and cumbersome for users. A system for the indexing and retrieval of classified documents inserts keywords, titles or definitions of previously applied classifications into the document record and provides the resulting record to a search engine. Searchers are able to retrieve documents by searching on keywords from the classification system without looking up class coding.
    Type: Application
    Filed: July 6, 2009
    Publication date: January 21, 2010
    Inventor: Alan Kent ENGEL
  • Publication number: 20100017390
    Abstract: Described is a next search keyword presentation apparatus, method and program for the presentation of the next recommended search keyword for use in conjunction with the search results. There is provided an apparatus of presenting relevant next search keywords, including an input unit for inputting a search keyword. A search control unit sends search keywords to a search system and receives search results which are displayed as documents on a display unit. A text body extraction unit extracts the text body and an analysis unit carries out a semantic attribute analysis of words contained within the text body. The search keywords are stored as user history data which is used with semantic attributes of each word to create document representative information. A cluster representative keyword extraction unit clusters document characteristic information and extracts cluster representative keywords which are displayed as search keyword candidates, providing recommended keywords based on browsing history.
    Type: Application
    Filed: May 8, 2009
    Publication date: January 21, 2010
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventors: Tomohiro Yamasaki, Takahiro Kawamura
  • Publication number: 20100017392
    Abstract: Method and apparatus for a query based search engine that searches a database of linked documents. In some embodiments, the method and apparatus computes reliability degrees of the documents, abstracts each document to generate its abstracts, provides a search query interface so that a user can use to enter a search query, processes the search query to generate an intent match criterion, identifies matched documents according to the generated intent match criterion, computes relevance degrees of the matched documents, sets order of the matched documents, and presents the matched documents to the user according to the set order by displaying the following items for each matched document: a link to the matched document, an abstract of the matched document if there are abstracts of the matched document, and a match in the matched document if there are matches in the matched document.
    Type: Application
    Filed: July 17, 2009
    Publication date: January 21, 2010
    Inventor: Jianwei Dian
  • Publication number: 20100010970
    Abstract: A document retrieval apparatus holds: index information in which data and an entity document are associated, with respect to a group of entity documents that are XML documents including entity information; and index information in which data and an annotation document are associated, with respect to a group of annotation documents including annotation information that corresponds to the entity information, respectively. Upon receiving an input of a retrieval query including the entity data for retrieval and the annotation data for retrieval, the document retrieval apparatus at first specifies an entity document including the entity data for retrieval. Further, the document retrieval apparatus specifies an annotation document including the annotation data for retrieval, and specifies an entity document corresponding to the specified annotation document.
    Type: Application
    Filed: September 28, 2007
    Publication date: January 14, 2010
    Applicant: JUSTSYSTEMS CORPORATION
    Inventors: Jun Takeuchi, Takanori Hino
  • Publication number: 20100005103
    Abstract: A system and method is provided for remote administration and management of a computer network, by installation of distributed software agents in remote network components, such as software agents implemented using encapsulated reusable interfaces such as COM or CORBA interfaces. Remote network management is effected by communication with the distributed agents using a structured language-independent parsable text document, such as a markup language; e.g. XML.
    Type: Application
    Filed: September 15, 2009
    Publication date: January 7, 2010
    Applicant: Global 360, Inc.
    Inventors: Geoffrey Hager, Robert Chang, Robert Tjia
  • Publication number: 20100004944
    Abstract: A computer implemented method and system is provided for compiling and publishing an online book in an online collaborative environment. Requirements are collected from one or more users in the online collaborative environment. The collected requirements comprise, for example, educational subjects, topical subjects, information on syllabus, an outline, etc. One or more content structures are created based on the collected requirements using, for example, one or more of search criteria, applicable context, curriculum guidelines, curriculum standards, degrees of difficulty, etc. The educational content is retrieved from multiple online sources, created through authoring, or a combination thereof. The retrieved educational content is compiled into the content structures for creation of the online book. The compilation is performed by automatic compilation or user declared specification. The online book is then published in the online collaborative environment.
    Type: Application
    Filed: June 19, 2009
    Publication date: January 7, 2010
    Inventor: Murugan Palaniappan
  • Publication number: 20100005058
    Abstract: A computer-readable recording medium stores therein an information retrieving program that causes a computer to execute acquiring a document to be searched and having a hierarchical structure; generating a path schema related to the acquired document; receiving input of a retrieval keyword, a retrieval condition for the retrieval keyword, and a retrieval equation specifying a retrieval range for the retrieval keyword; generating a single automaton that includes a hierarchy retrieval automaton that retrieves a hierarchy of the generated path schema and a hit keyword retrieval automaton that retrieves a hit keyword satisfying the retrieval condition, the single automaton making state transition between a hit hierarchical node where the hit keyword in the hierarchy retrieval automaton is present and a set of nodes representing the hit keyword; retrieving, from the document and using the single automaton, the hit keyword within the retrieval range; and outputting a retrieval result.
    Type: Application
    Filed: September 15, 2009
    Publication date: January 7, 2010
    Applicant: FUJITS LIMITED
    Inventors: Shinichiro TAGO, Seishi Okamoto, Hiroya Inakoshi, Tatsuya Asai
  • Publication number: 20100005054
    Abstract: Techniques and systems for indexing and retrieving data and documents stored in a record-based database management system (RDBMS) utilize a search engine interface. Search-engine indices are created from tables in the RDBMS and data from the tables is used to create “documents” for each record. Queries that require data from multiple tables may be parsed into a primary query and a set of one or more secondary queries. Join mappings and documents are created for the necessary tables. Documents matching the query string are retrieved using the search-engine indices and join mappings.
    Type: Application
    Filed: June 16, 2009
    Publication date: January 7, 2010
    Inventors: Tim Smith, William Kimble Johnson, III, Rik Tamm-Daniels, Sid Probstein
  • Publication number: 20100005008
    Abstract: Systems and methods for generating and sharing interactive charts are described. The interactive charts are generated in an online portal that allows users to customize the interactive features of the chart. An exemplary interactive feature includes an interactive audio feature. The interactive chart can be shared by, for example, embedding the interactive chart in an external electronic document, such as a .ppt or PDF document, that can be shared with other users. The interactive chart and/or the data associated with the interactive chart may also be purchased through an online store environment or otherwise shared with other users.
    Type: Application
    Filed: September 5, 2008
    Publication date: January 7, 2010
    Inventors: Seymour Duncker, Tyron Montgomery
  • Publication number: 20100005144
    Abstract: The present invention discloses a method system for transmitting a document over a Network including the steps of a document sender converts a sharing document to be transmitted into a GDI (Graph Device Interface) document by performing virtual printing. The document receiver receives the graph device interface document sent from the document sender through the network The document receiver restores the received GDI document. The contents of the restored GDI document are the same as that of the sharing document. The present invention also provides a system, a virtual printer apparatus and a restoration apparatus, the transmission of the document is not restricted by the application using the method, system and apparatus of the present invention.
    Type: Application
    Filed: December 15, 2005
    Publication date: January 7, 2010
    Applicant: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Haijun Wu
  • Publication number: 20100005083
    Abstract: Frequency based keyword extraction method and system utilizing a statistical measure is disclosed which generates keywords within a page and/or document that can distinguish the document from an average document. A simple frequency threshold parameter can be utilized to determine a number of common stop words if a word in the document possesses a frequency in a corpus that is more than the threshold parameter. A statistical confidence interval of the frequency in the document can be compared against a frequency confidence interval of the word in the corpus. The extracted keyword possesses a greater intra-document frequency confidence interval than the frequency confidence interval of the word within the corpus. A statistical hypothesis test can also be utilized to determine the keyword by calculating a test statistic and testing whether the test statistic is greater than some threshold.
    Type: Application
    Filed: July 1, 2008
    Publication date: January 7, 2010
    Inventors: Stephen C. Morgana, John C. Handley
  • Publication number: 20090327326
    Abstract: A method for exporting native source documents (NSDs) from a document repository. The method includes identifying a first NSD to export, where the first NSD includes a first version of content and first metadata, and identifying a second NSD to export, where the second NSD comprises a second version of the content and second metadata. The method further includes generating a source content definition file (CDF) document that includes a global property, a first version-specific property for the first version of the content, a reference to the first version of the content, a second version specific-property for the second version of the content, and a reference to the second version of the content. The method further includes storing the source CDF document in a persistent storage device.
    Type: Application
    Filed: April 30, 2008
    Publication date: December 31, 2009
    Applicant: ENTERPRISE CONTENT MANAGEMENT GROUP, LLC.
    Inventor: Ernest F. Bahr