Document Retrieval Systems (epo) Patents (Class 707/E17.008)
  • Publication number: 20100162099
    Abstract: Dynamic web page generation is optimized by reducing the processing overhead required to parse the web page HTML code for tokens and insert dynamic content. Using the invention, an HTML file for a dynamic web page need be read and parsed only once throughout the life of the server. A software object parses the HTML, decomposes the page into constituent pieces and saves them to data structures as byte streams, which are cached, along with the software object, rendering multiple disk accesses unnecessary when the page is reconstituted. For subsequent requests, the dynamic page is created from the cached version, which is shareable across users and across requests. The optimization reduces server resource usage for dynamic page generation to near zero. The invention is also applicable to other documents combining static and dynamic content that require composition tools for editing.
    Type: Application
    Filed: March 8, 2010
    Publication date: June 24, 2010
    Inventors: Keith BERNSTEIN, Robert KIEFFER
  • Publication number: 20100161993
    Abstract: A notary document processing system and related methods are described. The system receives files uploaded by users, processes them by applying a document ID, time stamp, etc. to pages of the document, and converts them to a read only format for storage. Once the documents are processed and stored in the system, they cannot be changed by any user including the owner of the document. The system makes stored documents available to the owner or other users upon the owner's request or permission. The system also processes files generated from short messages inputted by users and annotated versions of existing documents. The system provides a way of preserving original versions of documents to be used later for purposes of evidencing the dates and contents of documents, evidencing agreement between parties as to the contents of documents, etc. Electronic notary, electronic signature, tamper watermarking, etc. functions are also provided.
    Type: Application
    Filed: December 29, 2009
    Publication date: June 24, 2010
    Inventor: Darcy Mayer
  • Publication number: 20100161625
    Abstract: An information retrieval system uses phrases to index, retrieve, organize and describe documents. Phrases are identified that predict the presence of other phrases in documents. Documents are the indexed according to their included phrases. Related phrases and phrase extensions are also identified. Phrases in a query are identified and used to retrieve and rank documents. Phrases are also used to cluster documents in the search results, create document descriptions, and eliminate duplicate documents from the search results, and from the index.
    Type: Application
    Filed: March 4, 2010
    Publication date: June 24, 2010
    Applicant: GOOGLE INC.
    Inventor: Anna Lynn Patterson
  • Publication number: 20100161970
    Abstract: A user terminal and a method of managing user information are provided. The method includes issuing a request for issuance of a certificate for a user to a certification authority; generating a document including at least part of user information using a certificate issued by the certification authority; and issuing a subscription request to a desired web service provider by providing the document including the at least part of the user information to the desired web service provider. Therefore, it is possible to strengthen the user's right to self-determination and control over the exposure and use of his or her personal information. In addition, it is possible to improve the reliability of user information provided to each website by the user.
    Type: Application
    Filed: October 20, 2009
    Publication date: June 24, 2010
    Applicant: Electronics and Telecommunications Research Institute
    Inventors: Yun Kyung LEE, Byung Ho CHUNG, Jeong Nyeo KIM, Seung Wan HAN, Sok Joon LEE
  • Publication number: 20100161737
    Abstract: Techniques to manage email personal archives are described. A computer-implemented system may comprise a primary mailbox component, associated with a user, and operative to receive and send email. The computer-implemented system may further comprise an alternate mailbox component separate from the primary mailbox component and associated with the user and the primary mailbox, operative to store email. The computer-implemented system may also include a mail client operative on a client computer to access and display contents of the primary and alternate mailboxes substantially simultaneously. Other embodiments are described and claimed.
    Type: Application
    Filed: December 23, 2008
    Publication date: June 24, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Ashish Consul, Yogesh Bansal, Karim M. Batthish, Harvey Rook, Lauren B. Lavoie
  • Publication number: 20100153438
    Abstract: A method and apparatus for allowing a computer to search a hierarchical structure document by creating a list in which a true flag indicating that conditions of a predicate of a search formula are satisfied or a false flag indicating that the conditions of the predicate of the search formula are not satisfied is set to a predicate node of the document data based on the search formula, and scanning the list to search for data designated by the search formula from the document data.
    Type: Application
    Filed: December 9, 2009
    Publication date: June 17, 2010
    Applicant: FUJITSU LIMITED
    Inventors: Tatsuya ASAI, Shinichiro Tago, Seishi Okamoto, Masahiko Nagata
  • Publication number: 20100153435
    Abstract: A document stored at each of a number of database replicas communicatively connected with a current database replica is desired to be opened. The current database replica is initially opened. The current database replica stores an indicator for the document denoting the database replicas at which the document is stored. A probable time to retrieve the document from each database replica is retrieved from the current database replica. For each database replica, a real-time analysis of network parameters in relation to the database replica is performed based on the probable time retrieved. This analysis yields an updated probable time to retrieve the document from the database replica. The document is thus retrieved from the database replica having the lowest updated probable time.
    Type: Application
    Filed: December 15, 2008
    Publication date: June 17, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Pranav Anand Kuber, Kannepalli Venkata Sreekanth
  • Publication number: 20100153356
    Abstract: A document retrieving apparatus can retrieve a target document and output the retrieved target documents according to ranking when a retrieval keyword or retrieval expression is input. However, it requires a skilful technique to narrow a retrieval range since an appropriate retrieval keyword or retrieval expression needs to be created. A document retrieving apparatus of the present invention reads out and compiles a document list included in a designated area when a user designates an area of a document to be read on a two-dimensional map. When the user designates an area of a document to be read on the two-dimensional map, the document retrieving apparatus of the present invention combines query vectors of a plurality of documents included in a designated area and extracts documents based on a combined query vector.
    Type: Application
    Filed: May 15, 2008
    Publication date: June 17, 2010
    Applicant: SO-TI, INC.
    Inventors: Tatsuo Nakamura, Yoshio Takaeda
  • Publication number: 20100153416
    Abstract: Various technologies and techniques are disclosed for creating and managing persistent document collections. A data store is used for storing one or more persistent document collections. A content management application is used for managing documents for users, for creating one or more persistent document collections of a sub-set of the documents upon user request, and for storing the one or more persistent document collections in the data store. Users can create one or more persistent document collections from a sub-set of the documents. Users can also modify the one or more persistent document collections. A requested portion of one or more persistent document collections can be output upon request from an external application so that the external application can download one or more of the documents that are represented by the persistent document collection for further modification by the user.
    Type: Application
    Filed: December 17, 2008
    Publication date: June 17, 2010
    Applicant: Microsoft Corporation
    Inventors: Ethan Gur-esh, Nathan Fink, Dustin Friesenhahn, Nithya Ramkumar, Maura J. FitzGerald
  • Publication number: 20100153402
    Abstract: One embodiment of a non-word-based information retrieval system includes searching stock or image documents in a huge data source. A non-word-based document is first divided into a series of elements or an array of cells. Each element or cell is matched against a series of predefined token patterns, so that a match will generate a token having a name. The collection of the generated named tokens is a word-based representation of the non-word-based document.
    Type: Application
    Filed: February 4, 2010
    Publication date: June 17, 2010
    Inventor: Sizhe Tan
  • Publication number: 20100146381
    Abstract: The present invention provides a method of establishing a plain text document from a HTML document. The method including the steps of (A) acquiring a HTML document defined by HTML elements, each composed of tags and content between the tags; (B) pre-processing the HTML document by omitting some of the tags (including the content between those tags), whereby the rest of the HTML document comprises at least one target tag (including content between the target tags); (C) using a data structure to store the remaining tags of the pre-processed HTML document; (D) grouping the remaining tags (including the content between the remaining tags) stored in the data structure of the pre-processed HTML document into at least one target group according to the target tag(s); and (E) identifying the target group(s) most related to a title of the HTML document by comparing correlation(s) between the target group(s) and the title, and establishing a plain text document having the content of the identified target group.
    Type: Application
    Filed: December 1, 2009
    Publication date: June 10, 2010
    Applicant: ESOBI INC.
    Inventors: HONG-YANG TSAI, CHI-HAU HUNG
  • Publication number: 20100146439
    Abstract: An information processing apparatus includes: a first reception module configured to acquire first information of a character string selected through an input module from character strings displayed on a display module; a candidate creation module configured to create a plurality of character strings relevant to the selected character string as candidates based on the first information and to display the candidates on the display module; a second reception module configured to acquire second information of a character string determined through the input module from the candidates; and a retrieval module configured to: receive the second information from the second reception module; perform information retrieval based on the second information; and display a result of the information retrieval on the display module.
    Type: Application
    Filed: May 21, 2009
    Publication date: June 10, 2010
    Applicant: KABUSHIKI KAISHA TOSHIBA
    Inventor: Toshio ARIGA
  • Publication number: 20100145985
    Abstract: A document management device (100) comprises a document information storage unit (114) for storing document information including at least one keyword associated with a document so as to search for the document, a sectoral keyword list storage unit (122) for storing a sectoral keyword option list containing a technical keyword relating to each of sections, a common keyword list storage unit (120) for storing a common keyword option list containing a general keyword common to the sections, an authenticating unit (108) for authenticating the login by the user into a section, a presenting unit (128) for displaying the sectoral keyword option list of the section and the common keyword option list on a screen, a keyword receiving unit (130) for receiving the in keyword selected from the sectoral or common keyword option list, a searching unit (132) for searching the document information storage unit (114) by using the received keyword, and a presenting unit (128) for presenting the search result to the user.
    Type: Application
    Filed: June 13, 2008
    Publication date: June 10, 2010
    Applicant: TOYO ENGINEERING CORPORATION
    Inventor: Masanari Takahashi
  • Publication number: 20100145904
    Abstract: A computer-based architecture and system provides operational control of the document management process, permitting a client seamless access to sophisticated document production operations. Direct client control over SEC compliant document management permits faster more accurate document production operations.
    Type: Application
    Filed: November 6, 2009
    Publication date: June 10, 2010
    Applicant: Bowne & Co., Inc.
    Inventors: Constantino L. Riviello, Yuriy Bildeyenko
  • Publication number: 20100146593
    Abstract: A method for providing secure document management includes receiving a document from a user having an associated security access profile and generating a security label to be stored as an attribute of the document. The security label includes a clearance component selected from an authorized subset of clearance components that are determined based on the security access profile associated with the user, and also includes one or more secondary security components selected from an authorized subset of secondary security components that are determined based on the clearance component of the security label and the security access profile associated with the user. The method includes storing the document in a document repository storing a plurality of documents each having an associated security label, and determining whether a third-party user is authorized to access the document based on a comparison of a security access profile of the third-party user and the security label associated with the document.
    Type: Application
    Filed: December 12, 2008
    Publication date: June 10, 2010
    Applicant: Raytheon Company
    Inventors: Noah Z. Stahl, Wendy S. Bartlett, Randall S. Brooks
  • Publication number: 20100138426
    Abstract: In an index generating device, a similarity calculating unit calculates access similarities indicating similarities of access histories between documents, based on history information indicating the access histories to the respective documents by users, and a similar document specifying unit specifies a similar document similar to a given document as a retrieval target, based on the access similarities or the like. A retrieval index generating unit generates a retrieval index for the given document as the retrieval target from words appearing in a document set consisting of the similar document and the given document as the retrieval target.
    Type: Application
    Filed: November 27, 2009
    Publication date: June 3, 2010
    Applicant: NTT DoCoMo, Inc.
    Inventors: Takehiro NAKAYAMA, Daisuke Torii
  • Publication number: 20100138457
    Abstract: Documents having a structured nature such as contracts, legislation, etc. can be graphically depicted to emphasize their logical structure. In a document mapping method, a set of logical operator classes each representing a logical operator may be defined. Each logical operator class may also have a dedicated mapping symbol. Through a mapping interface, logically structured document sections may be mapped. Each document section will typically include a logical operator and one or more requirements logically associated with the logical operator. The mapping symbol may depict the logical operator in a node structure and the unique requirements in branch structures extending from the logical operator node. Multiple document sections may be graphically and logically linked, including embedding document structures within the requirement fields of parent structures. The document map may be used to determine compliance with a document, costs of compliance, etc.
    Type: Application
    Filed: February 9, 2010
    Publication date: June 3, 2010
    Inventor: Nathan Joel McDonald
  • Publication number: 20100138442
    Abstract: To provide an information processing apparatus, a database system, an information processing method, and a program which ensure efficient database accesses by partitioning.
    Type: Application
    Filed: November 6, 2009
    Publication date: June 3, 2010
    Applicant: International Business Machines Corporation
    Inventors: Kaoru Shinkawa, Issei Yoshida
  • Publication number: 20100138747
    Abstract: The present invention allows for a content provider to interact with a user on a Digital Media Frame (DMF) in real time is disclosed. From a DMF coupled with a network, content is displayed. The user inputs data on DMF with regard to the displayed content. A content provider hosting receives the user's input with the identification information of the DMF. From the content provider's network server, in response to receiving the input data, actions are performed. The content provider hosting sends new content to the DMF. Features of the present invention will be apparent from the accompanying drawings and from the detailed description which follows.
    Type: Application
    Filed: November 28, 2008
    Publication date: June 3, 2010
    Inventors: Heikwan Cheng, Yueqing Zhang
  • Publication number: 20100131551
    Abstract: In various embodiments, a computer-implemented method and system of designating and/or protecting confidential information in an original document includes receiving a file containing the original document through a computer-network interface. The original document contains confidential information, and the original document may be stored in one or more structured databases configured in one or more memories. A user interface is provided between the processor and the user associated with the original document. The user identifies at least a portion of the information considered to be confidential through the user interface. The processor may identify each occurrence of the confidential information contained in the original document, and may selectively generate one or more redacted or confidential files in which each occurrence of confidential information in the original document is obscured or redacted. The user may select to not have any confidential information redacted.
    Type: Application
    Filed: November 18, 2009
    Publication date: May 27, 2010
    Applicant: THELADDERS.COM, INC.
    Inventors: Alain BENZAKEN, Gregg DONOVAN, Selena HADZIBABIC
  • Publication number: 20100131534
    Abstract: An information providing system comprises: an associated document determining unit that determines at least one piece of associated document data that includes an expression equal or similar to a cited section based on a cited section in document data; a limiting expression extraction unit that extracts an expression that corresponds to a condition for, correction for, addition to, or annotation to an expression equal or similar to the cited section from associated document data determined by the associated document determining unit; an information creation unit that creates an expression extracted by the limiting expression extraction unit or information regarding this expression as information to be displayed; and a display unit that displays information created by the information creation unit.
    Type: Application
    Filed: April 9, 2008
    Publication date: May 27, 2010
    Inventors: Toshio Takeda, Susumu Akamine, Satoshi Nakazawa, Kai Ishikawa
  • Publication number: 20100131495
    Abstract: Disclosed are methods and apparatus for executing a search query. In accordance with one embodiment, a search query is obtained. The search query is classified into one or more of a plurality of categories. The search query is executed for each of the one or more of the plurality of categories. Search results corresponding to the search query are obtained for each of the one or more of the plurality of categories. The search results are then provided for each of the one or more of the plurality of categories.
    Type: Application
    Filed: November 25, 2008
    Publication date: May 27, 2010
    Inventors: Vanessa Murdock, Lluis Garcia, Barbara Poblete, Vassilis Plachouras
  • Publication number: 20100131459
    Abstract: In a system that realizes a workflow by a plurality of devices (workflow execution apparatuses) while cooperating with one another in serverless environment, the system enables search for a file used in the workflow, even if a file name used in the workflow of each device is changed.
    Type: Application
    Filed: November 20, 2009
    Publication date: May 27, 2010
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Shinji Todaka
  • Publication number: 20100125570
    Abstract: Approaches and techniques are discussed for ranking the documents indicated in search results for a query based on click-through information collected for the query in previous query sessions. According to an embodiment of the invention, when calculating a relevance score for a particular document, one may overcome positional bias by utilizing click-through information about other documents previously returned in the same search results as the particular document. According to an embodiment, one may utilize Dynamic Bayesian Network, based on said click-through information, to model relevance. According to an embodiment of the invention, one may utilize click-through information to generate targets for learning a ranking function.
    Type: Application
    Filed: November 18, 2008
    Publication date: May 20, 2010
    Inventors: Olivier Chapelle, Anne Ya Zhang
  • Publication number: 20100121816
    Abstract: A content management system (CMS) provides a way to add a phase property to synchronization rules. In one suitable implementation, each of the synchronization rules has a corresponding phase value. In another suitable implementation, there are default synchronization rules and only synchronization rules other than the default synchronization rules have a phase value. A phase synchronization mechanism uses the phase property of the synchronization rules to evaluate only appropriate synchronization rules at each step as required.
    Type: Application
    Filed: November 13, 2008
    Publication date: May 13, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: John Edward Petri
  • Publication number: 20100118346
    Abstract: A method of retrieving a document from a database of hierarchical electronic document versions is disclosed. Each document version is associated with a unique document instance. In the method a search form is printed. The search form includes a search instruction input field relating to at least one parameter of a search to be carried out within the database and a plurality of coded data tags. Each coded data tag encodes a location of that coded data tag on the search form. The coded data tags are sensed by a sensing device as the sensing device is used to handwrite at least one search term on the search form. Data representing the parameter and the at least one search term is generated, with the data representing the at least one search term being generated from the locations of the coded data tags. Next, a search is carried out within the database based on the at least one search term and parameter in order to identify document versions.
    Type: Application
    Filed: January 17, 2010
    Publication date: May 13, 2010
    Inventors: Kia Silverbrook, Paul Lapstun, Jacqueline Anne Lapstun
  • Publication number: 20100121859
    Abstract: A workflow management system for managing a constructive workflow includes a storing unit, to which a user inputs a search condition to search for documents relevant to a target task and tasks neighboring to the target task, and which stores the input search condition in a query database by causing the condition to relate to the target task, a search condition obtaining unit which obtains a search condition to search for the documents relevant to the target task from the query database when the documents relevant to the target task are requested to be searched for, a restructuring unit which restructures the obtained search condition to a search condition having a predetermined format by considering a type and a weighting factor of the type included in the obtained search condition, and a searching unit which obtains a list of documents from a document database based on the restructured search condition.
    Type: Application
    Filed: September 4, 2009
    Publication date: May 13, 2010
    Inventors: Kaoru Maeda, Takeshi Suzuki, Heiko Maus, Harald Holz, Oleg Rostanin
  • Publication number: 20100114929
    Abstract: A computer-implemented method provides suggested search queries based on an input search query. The input search query is received. A first list of documents is determined that correspond to processing the query by a search engine determining the list of result queries, including processing the first list of documents to determine clusters of documents and determining potential queries that correspond to the determined clusters by comparing results of the potential queries with documents in the determined clusters. A list of result queries is determined, wherein executing the list of result queries would correspond to a second list of documents, that result from presenting the result queries to the search engine; and the documents of the second list of documents cover the documents of the first list of documents. The list of result queries based on the potential queries determined to correspond to the determined clusters.
    Type: Application
    Filed: November 6, 2008
    Publication date: May 6, 2010
    Applicant: YAHOO! INC.
    Inventors: Francesco Bonchi, Aristides Gionis, Debora Donato
  • Publication number: 20100114928
    Abstract: A computer-implemented method is such that suggested search queries are provided based on an input search query. The search query is received (such as from a user providing the search query to a search engine service) and a first list of documents is determined that correspond to processing the query by a search engine. A list of result queries is determined, wherein executing the list of result queries would correspond to a second list of documents, that result from presenting the result queries to the search engine, and the documents of the second list of documents cover the documents of the first list of documents. The list of result queries is returned as the suggested queries.
    Type: Application
    Filed: November 6, 2008
    Publication date: May 6, 2010
    Applicant: YAHOO! INC.
    Inventors: Francesco Bonchi, Aristides Gionis, Debora Donato
  • Publication number: 20100114913
    Abstract: A document processing apparatus according to the present embodiment handles a structured document file described in XML, XHTML, and HTML, etc., as a document to be processed. The document processing apparatus selects a base tag and a comparison tag from a structured document file, and computes a positional proximity between the two tags in a hierarchical structure as a tag-proximity degree. The apparatus specifies a comparison tag with a tag-proximity degree of a predetermined threshold value or more with respect to the base tag, as a proximity-tag. The apparatus outputs the data specified by one or more of the proximity-tags, as the proximity-data with respect to the base tag.
    Type: Application
    Filed: September 28, 2007
    Publication date: May 6, 2010
    Applicant: JUSTSYSTEMS CORPORATION
    Inventors: Shingo Ochi, Takanori Hino, Shingo Hada
  • Publication number: 20100115468
    Abstract: The subject application is directed to a system and method for hierarchical electronic file navigation. Electronic files, of documents or folders, are first stored in an associated data storage. Upon receipt of user identification, a default subset of stored files is retrieved and indicia corresponding to the files are displayed to an associated user. The files are displayed with a folder icon or a document thumbnail image. Selection data is then received of a selected electronic folder listed on the display and indicia are generated on the display corresponding to contents of electronic files of the selected document folder. Shortcut selection data is received from the user corresponding to at least one selected electronic file and the at least one selected electronic file is added to the default listing in accordance with received shortcut selection data.
    Type: Application
    Filed: November 6, 2008
    Publication date: May 6, 2010
    Inventor: Marianne L. KODIMER
  • Publication number: 20100106522
    Abstract: A system and method for processing data includes organizing medical information in a concept frame data structure, which is adapted to include medical measurements and related metadata. The medical information is analyzed to extract further information using information extractors and to store extracted medical information in the concept frame data structure. References are stored to appropriate visualization methods along with an associated concept in the concept frame data structure.
    Type: Application
    Filed: October 23, 2008
    Publication date: April 29, 2010
    Inventors: James W. Cooper, Youssef Drissi, Shahram Ebadollahi, Anthony Tom Levas
  • Publication number: 20100106699
    Abstract: A client generates a list associating a document, an operation for the document, and a user and transmits the list to a server. When an operation for the document is received from a user, the client acquires information including a document, an operation for the document, and a user and transmits the information to the server. The server receives and stores the list from the client, receives the information from the client, searches storage unit for a list including information matching the information, and transmits the list to the client. The client receives and displays the searched list.
    Type: Application
    Filed: October 5, 2009
    Publication date: April 29, 2010
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Ryutaro Watanabe
  • Publication number: 20100106701
    Abstract: An electronic document retrieval system is disclosed. It has particular utility to World-Wide Web searching. The system requires webmasters to put forward categories into which the pages on their web-site might sensibly be divided, and to provide a list of those categories together with a list of popular keywords associated with those categories to a global search engine. The global search engine is then able to augment one or more of its search results with links to category-heading pages which most closely relate to the query provided by the user. In this way, a user is able to find the page most relevant to his query more rapidly than has hitherto been possible.
    Type: Application
    Filed: March 26, 2008
    Publication date: April 29, 2010
    Inventors: Gery M. Ducatel, N. Azarmi, Zhan Cui
  • Publication number: 20100106741
    Abstract: A method and device for searching for a music file of a mobile terminal are provided. The method of searching for a music file of a mobile terminal includes, receiving at least one input key signal in an idle state of the mobile terminal, setting a search word by combining characters mapped to the received at least one input key signal, determining whether a music file that includes the search word exists within the mobile terminal, and requesting, if it is determined that a music file that includes the search word does not exist within the mobile terminal, a search for a music file that includes the search word to a preset web server.
    Type: Application
    Filed: October 26, 2009
    Publication date: April 29, 2010
    Applicant: SAMSUNG ELECTRONICS CO. LTD.
    Inventor: Jung Jin YOO
  • Publication number: 20100100523
    Abstract: A system and method that allows a publisher to create a document series, associate a document with a document series and present to an end-user a series nugget displaying at least one document associated with the document series. Document series information may be inherited by each document in the document series. Document series information is stored in an asset database.
    Type: Application
    Filed: November 17, 2009
    Publication date: April 22, 2010
    Applicant: Barclays Capital Inc.
    Inventors: Wayne Marcy, Hood Qa'lm-maqami
  • Publication number: 20100100555
    Abstract: A computer-readable medium includes instructions for causing at least one processor to perform a method. The method may include receiving a symbol sequence into a document, identifying another symbol sequence in the document whose probability of matching the received symbol sequence is above a threshold, and replacing the received symbol sequence with the other symbol sequence.
    Type: Application
    Filed: December 22, 2009
    Publication date: April 22, 2010
    Inventor: John Eric Harrity
  • Publication number: 20100100543
    Abstract: System, device and method for using user-generated metadata to arrive at a modified search index that emphasizes a relationship between documents selected by a user during a prior search session and salient terms of those documents. An initial search index is modified by adding a synthetic term and a synthetic document to terms and documents that are used to arrive at the elements of the index and by modifying the relevance scores to highlight one or more of the search terms, the synthetic term, and the synthetic document. Synthetic term ties a cluster of related documents together and synthetic document ties terms of these documents together. Synthetic term is not found in any other documents and synthetic document does not belong to any normal corpus of documents. Modified index aids in re-generating prior user choices because it contains artifacts reflecting associations that user perceived between various terms and documents.
    Type: Application
    Filed: October 21, 2009
    Publication date: April 22, 2010
    Inventor: James Brady
  • Publication number: 20100094856
    Abstract: System and method for querying multiple websites using keywords entered into a search box on a browser. The search box can receive a list of keywords that are placed into multiple websites. After a process receives its start command, each process can work on retrieving information from a number of websites, for example, Google and EBay. The processes can be managed so that the system is not bogged down and the central processing unit is available for other tasks. The process can store a set of query results from the website in a database. The system can create a final grid using the query results in the database and display the final grid to the user. In addition to or separate thereto, the system can generate folders containing the query results. Links can be saved within the query results. The system does not use robots, crawlers, harvesters, agents, or scrapers. Instead, the system takes advantage of the raw HTML source freely offered by searched websites.
    Type: Application
    Filed: October 12, 2009
    Publication date: April 15, 2010
    Inventors: Eric Rodrick, Benjamin Woodard
  • Publication number: 20100095202
    Abstract: System, apparatus and method for managing documents and metadata generated by a plurality of software application systems are provided.
    Type: Application
    Filed: October 14, 2008
    Publication date: April 15, 2010
    Applicants: RICOH COMPANY, LTD.
    Inventor: Hiroaki ISHIZUKA
  • Publication number: 20100088298
    Abstract: A system, method, and computer-readable medium that facilitate management of data skew during a parallel multiple join operation are provided. Portions of tables involved in the join operation are distributed among a plurality of processing modules, and each of the processing modules is provided with a list of skewed values of a join column of a larger table involved in the join operation. Each of the processing modules scans the rows of first and second tables distributed to the processing modules and compares values of the join columns of both tables with the list of skewed values. Rows of a larger table having non-skewed values in the join column are redistributed, and rows of the larger table having skewed values in the join column are maintained locally at the processing modules. Rows of the smaller table that have non-skewed values in the join column are redistributed, and rows of the smaller table that have skewed values in the join column are duplicated among the processing modules.
    Type: Application
    Filed: October 6, 2008
    Publication date: April 8, 2010
    Inventor: Yu Xu
  • Publication number: 20100088269
    Abstract: Embodiments of the present invention address deficiencies of the art in respect to data backup and archival tools and provide a method, system and computer program product for the dispersal and retrieval of fragments in a peer-to-peer data backup and archival network. In an embodiment of the invention, a method for the dispersal and retrieval of fragments in a peer-to-peer data backup and archival network can include partitioning a file into multiple, different fragments for storage in a peer-to-peer data backup and archival network, selecting different peer hosts in the peer-to-peer data backup and archival network to store different ones of the fragments, and storing each of the fragments in at least one of the selected different peer hosts. Optionally, the fragments can be encrypted before storage in the different peer hosts.
    Type: Application
    Filed: October 2, 2008
    Publication date: April 8, 2010
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Steven J. Buller, Richard C. Garrett, Richard Hutzler
  • Publication number: 20100082605
    Abstract: The present invention is directed to a method and system for determining user interaction patterns. The method and system comprises generating a plurality of atomic sessions by grouping search events related to a user and a query string using a search engine. The method and system includes using the atomic sessions, constructing a first query chain based on actions of the user to satisfy an information need. The method and system includes dividing the first query chain into at least one smaller chain by both a time factor and a query similarity factor. And the method and system includes determining user-interaction patterns relating to the search engine using the at least one smaller chain.
    Type: Application
    Filed: September 30, 2008
    Publication date: April 1, 2010
    Applicant: Yahoo! Inc.
    Inventors: Georges Dupret, Benjamin Piwowarski
  • Publication number: 20100082626
    Abstract: A method for filtering out identical or similar documents includes storing a plurality of documents to be filtered as a pat tree (PT) data structure profile based on a pat tree data structure, searching for all string nodes with a consecutive character length reaching a lower threshold in the PT profile and all documents to which the string nodes belong, and finding documents having identical consecutive characters with a length reaching a higher threshold from the documents. Another technical solution includes searching for all string nodes with a consecutive character length reaching a lower threshold in the PT profile and all documents to which the string nodes belong, and finding documents having identical consecutive characters with such a length that a ratio of the length of the identical consecutive characters to a total character length of the original document reaches a ratio threshold from the documents, these documents are similarity.
    Type: Application
    Filed: September 17, 2009
    Publication date: April 1, 2010
    Applicant: Esobi Inc.
    Inventors: Hong Yang Tsai, Hsun Hsueh Cho
  • Publication number: 20100082539
    Abstract: In accordance with an example embodiment of the present invention, an electronic device, server or service may update at least one contact widget in real time. Further, the electronic device, server or service may display the at least one contact widget.
    Type: Application
    Filed: September 23, 2008
    Publication date: April 1, 2010
    Applicant: NOKIA CORPORATION
    Inventors: Jani Petri Bostrom, Matti Keltanen
  • Publication number: 20100082677
    Abstract: Disclosed herein are systems and methods for controlling access to content, and/or regions thereof, as well as controlling access to annotations to the content, or regions thereof. An audience can be specified for a region of content and one or more associated annotations. In response to a request for a content region, a content region definition, an audience definition for the content region, and at least one annotation for the content region and audience can be obtained, and the content region and the at least one annotation can be transmitted in response to the request if it is determined that the request is from a member of the audience, so that the content region and annotation can be experienced at an audience member's device.
    Type: Application
    Filed: September 30, 2008
    Publication date: April 1, 2010
    Inventors: Athellina Athsani, Elizabeth F. Churchill
  • Publication number: 20100082709
    Abstract: This invention is directed to a document processing system and control method thereof. The system stores a plurality of items of document data each containing metadata pertaining to the contents of each item of document data, and relation information representing the relations between the plurality of items of document data. When scanned image data or facsimile-received image data is input, document data related to the input image data is specified among the plurality of items of stored document data, based on the metadata contained in each item of document data. Relation information representing the relation between the input image data and the specified related document data is stored. Even document data obtained from a paper document is able to be stored as document data subjected to search processing.
    Type: Application
    Filed: September 17, 2009
    Publication date: April 1, 2010
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Masahito Yamamoto
  • Publication number: 20100077218
    Abstract: According to the present invention, there is provided a system and method for the management, organization, collaboration, and submission of electronic files and documents associated with a clinical trial. The system of the present invention enables users to create and easily access a central document repository. The system of the present invention includes various tools for the management, organization, collaboration, and editing of the documents and files stored within the system, as well as tools which enable automated regulatory submissions of required documents and files.
    Type: Application
    Filed: September 25, 2009
    Publication date: March 25, 2010
    Inventors: Jules T. Mitchel, Joyce B. Hays
  • Publication number: 20100076946
    Abstract: A method for sharing documents between on-demand services is provided. In an embodiment, a user of a first on-demand service may be able to view a list of content that includes content stored at the first on-demand service and content stored at a second on-demand service. The content of the second on-demand service may be associated with information about the content, allowing the content to be shared among multiple users of the first on-demand service. The user wanting to view the content, select or click on an indicator identifying the content, a connection to the second on-demand service is established, and images of the content are sent from the second on-demand service to the first on-demand service.
    Type: Application
    Filed: September 14, 2009
    Publication date: March 25, 2010
    Applicant: Salesforce.com Inc.
    Inventors: Timothy J. Barker, Jonathan Levine, James Johnson
  • Publication number: 20100073713
    Abstract: A document transfer method, document transfer apparatus and document transfer system. The document transfer method includes transferring a document and metadata required to request an additional transfer of the document from a sending device to a receiving device, editing the transferred metadata to request an additional transfer of the document, storing the edited metadata in the receiving device, and additionally transferring the document based on the stored metadata by the sending device.
    Type: Application
    Filed: June 16, 2009
    Publication date: March 25, 2010
    Applicant: Samsung Electronics Co., Ltd
    Inventor: Kyoung-youl CHAE