Document Retrieval Systems (EPO) Patents (Class 707/E17.008)
  • Publication number: 20120041929
    Abstract: Systems and methods for controlling document storage track and model dynamic attributes of a document in a time-varying manner, and enable reconstruction of a document's state at any point in time. The time-varying model identifies dynamic components of documents, where dynamic components include properties that change over time. A set of validity parameters is associated with each state of a document (a state corresponds to a new version of the document that results from changes to the document), and the set of validity parameters defines a validity period for the state. Instead of archiving each new version of the document that corresponds to each new state of the document, the document is archived in storage by archiving information of the states along with the set of validity parameters corresponding to each state.
    Type: Application
    Filed: August 16, 2010
    Publication date: February 16, 2012
    Applicant: MIMOSA SYSTEMS, INC.
    Inventor: Rahul KAPOOR
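A minimal sketch of the idea in the abstract above, assuming each dynamic attribute is stored as a list of values tagged with a validity period. The class and method names (`TimeVaryingDocument`, `set`, `state_at`) and the integer timestamps are illustrative assumptions, not taken from the application.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class AttributeState:
    value: str
    valid_from: int            # inclusive start of the validity period
    valid_to: Optional[int]    # exclusive end; None means "still current"


class TimeVaryingDocument:
    """Stores attribute states with validity periods instead of full versions."""

    def __init__(self):
        # attribute name -> list of AttributeState in increasing time order
        self._history = {}

    def set(self, name, value, at):
        states = self._history.setdefault(name, [])
        if states:
            last = states[-1]
            if at <= last.valid_from:
                raise ValueError("changes must be recorded in time order")
            # Close the validity period of the previous state.
            states[-1] = AttributeState(last.value, last.valid_from, at)
        states.append(AttributeState(value, at, None))

    def state_at(self, t):
        """Reconstruct the document's attribute values as of time t."""
        snapshot = {}
        for name, states in self._history.items():
            for s in states:
                if s.valid_from <= t and (s.valid_to is None or t < s.valid_to):
                    snapshot[name] = s.value
                    break
        return snapshot
```

For example, after `doc.set("folder", "Inbox", at=10)` and `doc.set("folder", "Archive", at=20)`, calling `doc.state_at(15)` reconstructs the earlier state without a full copy of the document having been archived for it.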
  • Publication number: 20120041977
    Abstract: A data structure of index information for retrieving pair character strings on a computer at high speed is provided. A method of using the index information to retrieve, at high speed, pair character strings appearing in close proximity to each other in a document is also provided. Bits of a suffix array of reference document data are rearranged, thereby creating index information LSA that is localizable, that is, usable as an index for a subregion of the document. Using this index, a process of dichotomizing a region, where the entire document is designated as the initial region, is repeated and the positions of index information for a query character string in the reference document data are progressively refined. The distance between the pair is evaluated and candidates are narrowed down. Finally, positions where the pair character strings occur in close proximity to each other are identified.
    Type: Application
    Filed: April 5, 2010
    Publication date: February 16, 2012
    Applicant: HITACHI, LTD.
    Inventor: Kouichi Kimura
  • Patent number: 8116888
    Abstract: A parameter setting system for a programmable logic controller (PLC) includes a document module, a main program module, a parameter storage module, an interface display module, and a display interface. The document module stores application documents and an extensible markup language (XML) document of the PLC. The main program module reads names and scale values of the parameters from the XML document, and outputs a control signal. The parameter storage module stores the names and scale values of the parameters from the main program module. The interface display module receives the control signal from the main program module and reads names and scale values of the parameters from the parameter storage module according to the control signal. The display interface displays the names and scale values of the parameters from the interface display module.
    Type: Grant
    Filed: April 27, 2009
    Date of Patent: February 14, 2012
    Assignee: Foxnum Technology Co., Ltd.
    Inventor: Ming-Chieh Tsai
  • Patent number: 8115952
    Abstract: A document managing system includes: an image forming device that includes an image forming part, and forms a document on a prescribed output medium; a discarding device that includes a document discarding part that discards the document formed on the output medium by the image forming device; and a discard certificate issuing device that includes a discard certificate issuing part, and issues a certificate of a discarding process in the discarding device.
    Type: Grant
    Filed: July 13, 2007
    Date of Patent: February 14, 2012
    Assignee: Fuji Xerox Co., Ltd.
    Inventor: Nobuo Inoue
  • Publication number: 20120036121
    Abstract: In general, the subject matter described in this specification can be embodied in methods, systems, and program products for receiving user input that defines a search query, and providing the search query to a server system. Information that a search engine system determined was responsive to the search query is received at a computing device. The computing device is identified as in a first state, and a first output mode for audibly outputting at least a portion of the information is selected. The first output mode is selected from a collection of the first output mode and a second output mode. The second output mode is selected in response to the computing device being in a second state and is for visually outputting at least the portion of the information and not audibly outputting the at least portion of the information. At least the portion of information is audibly output.
    Type: Application
    Filed: August 6, 2010
    Publication date: February 9, 2012
    Inventors: John Nicholas Jitkoff, Michael J. Lebeau, William J. Byrne, David P. Singleton
  • Publication number: 20120036142
    Abstract: The present invention concerns a satellite image retrieving system that comprises electronic input means, electronic retrieving means, and electronic storing means. The electronic input means are coupled with the electronic retrieving means and are configured to generate input data and to provide the electronic retrieving means with said input data. The input data is indicative of a given geographic area. The electronic retrieving means are coupled with the electronic storing means that are configured to store satellite images, each satellite image stored on the electronic storing means representing a corresponding area of the earth's surface and being associated with corresponding telemetry data generated and associated with the satellite image by a satellite that has remotely sensed the satellite image.
    Type: Application
    Filed: December 10, 2008
    Publication date: February 9, 2012
    Applicant: TELESPAZIO S.P.A.
    Inventors: Alissa Ioannone, Gian Luca Eusebi Borzelli
  • Publication number: 20120036115
    Abstract: An apparatus, comprising a processor, memory including computer program code, the memory and the computer program code configured to, working with the processor, cause the apparatus to perform at least determining at least one significant location, receiving a first information associated with the significant location from a first information repository, retrieving a second information associated with the significant location from a second information repository, and generating a third information based at least in part on the first information and the second information is disclosed.
    Type: Application
    Filed: December 30, 2010
    Publication date: February 9, 2012
    Inventors: Hiroshi Horii, Agathe Battestini, Timothy Sohn
  • Publication number: 20120030201
    Abstract: Techniques are disclosed for searching a set of documents using search terms. In one embodiment, a summary is provided for each document in the set. Search terms are received, and the set of documents are parsed using the received search terms. A first relevance value is calculated using only the summary of each document. A subset of documents having the highest relevance is provided by using the first relevance value. The subset of documents is parsed using the received search terms, to calculate a second relevance value for each document using the respective document. Query results are provided, the query results including documents having the highest relevance according to the second relevance value.
    Type: Application
    Filed: July 15, 2011
    Publication date: February 2, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: John B. Pickering, Fenglian Xu
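The two-pass ranking described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the relevance measure (a plain term-occurrence count), the function names, and the `shortlist_size` and `result_size` parameters are hypothetical and are not specified by the application.

```python
def relevance(text, terms):
    """Toy relevance value: number of occurrences of the search terms."""
    words = text.lower().split()
    return sum(words.count(term.lower()) for term in terms)


def two_stage_search(docs, terms, shortlist_size=2, result_size=1):
    """docs maps a document id to a (summary, full_text) pair.

    Pass 1 scores every document using only its summary and keeps a subset.
    Pass 2 rescores only that subset using the full document text.
    """
    shortlist = sorted(
        docs, key=lambda d: relevance(docs[d][0], terms), reverse=True
    )[:shortlist_size]
    ranked = sorted(
        shortlist, key=lambda d: relevance(docs[d][1], terms), reverse=True
    )
    return ranked[:result_size]
```

The design trade-off is visible in the sketch: a document whose summary does not mention the terms is never examined in full, which saves work in the second pass at the cost of possibly missing a relevant document.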
  • Publication number: 20120023094
    Abstract: To address problems related to interface differences and disunity among on-line services, such as newsgroups, message boards, and forums, the present inventors devised systems, methods, and software for automating the posting and retrieval of content across different on-line services as well as encouraging growth of active on-line communities. One exemplary system includes a posting module, a retrieval module, and a web server. The posting module allows users to create and initiate data postings that are sent automatically to several newsgroups, message boards, and/or other on-line information sources. The retrieval module automatically retrieves replies to the postings at each of the on-line sources and presents them through the web server for user review and further reply, eliminating the need for users to repeatedly visit posting sites in search of reply messages.
    Type: Application
    Filed: June 24, 2011
    Publication date: January 26, 2012
    Applicant: Body1
    Inventors: Christopher P. Messina, Nabeil O. Sarhan, Shinpei Kuga
  • Publication number: 20120023063
    Abstract: A method of naming documents according to a document naming convention (DNC) includes receiving, from any of a plurality of data processing applications (DPAs) within an enterprise, a request for a document name and generating a document name “infix” portion in compliance with the DNC for inclusion in the document name. The prefix may be descriptive of a characteristic of the document and the suffix may indicate a document type, a document format, or both. The infix may include a fixed portion and a modifiable portion. The fixed portion may include a datestamp and a document unification identifier. The infix may include a variable portion that can be modified by a DPA that creates a new or modified document, but the name of the modified document maintains the fixed portion of the original document.
    Type: Application
    Filed: July 23, 2010
    Publication date: January 26, 2012
    Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
    Inventor: Charles Stanley Fenton
  • Publication number: 20120023133
    Abstract: A document search system and a method are provided to enable a writer (document provider) of a document to detect metadata, which is included in web/WAP documents, and to offer summary or detailed information on the corresponding product (including multimedia content), which is indicated by the metadata, to a document reader. According to the present invention, a client detects the metadata inserted into the corresponding document from the document that is written by the document provider, extracts a reference value, requests product information corresponding to a product ID from a server to collect the product information, and offers the product information to the document reader. Additionally, the client collects the reference value and information on the document reader who reads the document into which the metadata is inserted, and stores the collected data in the server.
    Type: Application
    Filed: March 31, 2010
    Publication date: January 26, 2012
    Applicant: WOODT INC.
    Inventor: Chan Ik Yeon
  • Publication number: 20120016893
    Abstract: A system and method for modifying publication data in a publication system are described. An example embodiment includes receiving proposed publication data and accessing a success measurement associated with past publications within a publication system. The success measurement may indicate a measurement of success associated with the past publications. An example system and method may generate modification data to be used to modify the proposed publication data. The modification data may be based on the success measurement and proposed publication data.
    Type: Application
    Filed: September 28, 2011
    Publication date: January 19, 2012
    Inventors: Brian Scott Johnson, Alvaro Bolivar
  • Publication number: 20120016871
    Abstract: A system may determine an extent to which a document is selected when the document is included in a set of search results, generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Application
    Filed: September 26, 2011
    Publication date: January 19, 2012
    Applicant: GOOGLE INC.
    Inventors: Anurag Acharya, Matt Cutts, Jeffrey DEAN, Paul Haahr, Monika Henzinger, Urs Hoelzle, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong
  • Publication number: 20120016888
    Abstract: A system may determine an extent to which a document is selected when the document is included in a set of search results, generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Application
    Filed: September 26, 2011
    Publication date: January 19, 2012
    Applicant: GOOGLE INC.
    Inventors: Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong
  • Publication number: 20120016889
    Abstract: A system may determine an extent to which a document is selected when the document is included in a set of search results, generate a score for the document based, at least in part, on the extent to which the document is selected when the document is included in a set of search results; and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Application
    Filed: September 26, 2011
    Publication date: January 19, 2012
    Applicant: GOOGLE INC.
    Inventors: Jeffrey DEAN, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Olcan Sercinoglu, Simon Tong
  • Publication number: 20120016887
    Abstract: Systems and methods for identifying inadequate search content are provided. Inadequate search content, for example, can be identified based on statistics associated with the search queries related to the content.
    Type: Application
    Filed: September 23, 2011
    Publication date: January 19, 2012
    Applicant: GOOGLE INC.
    Inventors: Jeffrey David Oldham, Hal R. Varian, Matthew D. Cutts, Matt Rosencrantz
  • Publication number: 20120011131
    Abstract: A computer-implemented method of realizing an associative memory capable of storing a set of documents and retrieving one or more stored documents similar to an inputted query document, said method comprising: coding each document or a part of it through a corresponding feature vector consisting of a series of bits which respectively code for the presence or absence of certain features in said document; arranging the feature vectors in a matrix; generating a query feature vector based on the query document and, according to the rules used for generating the feature vectors corresponding to the stored documents such that the query vector corresponds in its length to the width of the matrix; storing the matrix column-wise; for those columns of the matrix where the query vector indicates the presence of a feature, bitwise performing one or more of preferably hardware supported logical operations between the columns of the matrix to obtain one or more, additional result columns coding for a similarity measure bet
    Type: Application
    Filed: February 9, 2011
    Publication date: January 12, 2012
    Applicant: BDGB ENTERPRISE SOFTWARE S.A.R.L
    Inventors: Gannady Lapir, Harry Urbshat
  • Publication number: 20120005199
    Abstract: A system may determine a measure of how a content of a document changes over time, generate a score for the document based, at least in part, on the measure of how the content of the document changes over time, and rank the document with regard to at least one other document based, at least in part, on the score.
    Type: Application
    Filed: September 14, 2011
    Publication date: January 5, 2012
    Applicant: GOOGLE INC.
    Inventors: Anurag Acharya, Jeffrey Dean, Paul Haahr, Monika Henzinger, Steve Lawrence, Karl Pfleger, Simon Tong
  • Publication number: 20120005157
    Abstract: According to a first aspect of the present invention there is provided a method of operating an XML Document Management Server in an IP Multimedia Subsystem. The method comprises receiving a message from a user terminal requesting an XML document, the XML document conforming to a given structure and containing one or more service rules relating to a service, retrieving an XML document containing said service rules from a data storage entity. If the structure of the retrieved XML document differs from said given structure, adapting a rule or rules of the retrieved XML document such that the XML document conforms to said given structure, and sending the adapted XML document to the user terminal.
    Type: Application
    Filed: March 10, 2009
    Publication date: January 5, 2012
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Mikael Forsberg, Lennart Norell
  • Publication number: 20110313985
    Abstract: Method and system for enabling a user of a query based search engine to have some control over the presentation of search results. In some embodiments, the method and system provide an associating interface for a user of a query based search engine to associate documents with a search query, with the documents being termed associated documents. The method and system store the associated documents and cause the associated documents to be presented in a special area.
    Type: Application
    Filed: June 20, 2011
    Publication date: December 22, 2011
    Inventor: Jianwei Dian
  • Publication number: 20110302176
    Abstract: Disclosed are a document ranking system and method based on contribution scoring. The document ranking system includes: a content score calculating unit for calculating content scores for documents with respect to at least one word contained in the documents, with regard to each such word; a contribution score calculating unit for calculating contribution scores for the documents with respect to jointly occurring words; and a ranking unit for ranking the documents with respect to the at least one word, with regard to each such word, by using the content scores and the contribution scores.
    Type: Application
    Filed: December 15, 2009
    Publication date: December 8, 2011
    Applicant: NHN CORPORATION
    Inventors: Dong Jin Kim, Sang-Wook Kim
  • Publication number: 20110295857
    Abstract: A system and method for aligning multilingual content and indexing multilingual documents, a computer readable data storage medium having stored thereon computer code means for indexing multilingual documents, and a system for presenting multilingual content. The method for aligning multilingual content and indexing multilingual documents comprises the steps of generating multiple bilingual terminology databases, wherein each bilingual terminology database associates respective terms in a pivot language with one or more terms in another language; and combining the multiple bilingual terminology databases to form a multilingual terminology database, wherein the multilingual terminology database associates terms in different languages via the pivot language terms.
    Type: Application
    Filed: June 20, 2008
    Publication date: December 1, 2011
    Inventors: Ai Ti Aw, Min Zhang, Lian Hau Lee, Thuy Vu, Fon Lin Lai
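The combining step in the abstract above can be illustrated with a small sketch. It assumes each bilingual terminology database is a mapping from a pivot-language term to a list of terms in one other language; the function names and the dictionary layout are hypothetical and chosen only for illustration.

```python
from collections import defaultdict


def combine_via_pivot(bilingual_dbs):
    """Merge bilingual databases into one multilingual database.

    bilingual_dbs maps a language code to {pivot_term: [terms in that language]}.
    The result maps each pivot term to {language code: [terms]}.
    """
    multilingual = defaultdict(dict)
    for language, db in bilingual_dbs.items():
        for pivot_term, terms in db.items():
            multilingual[pivot_term][language] = list(terms)
    return dict(multilingual)


def translate(multilingual, term, source, target):
    """Look up target-language terms linked to `term` through a pivot term."""
    for entry in multilingual.values():
        if term in entry.get(source, []):
            return entry.get(target, [])
    return []
```

With an English pivot, a French database `{"house": ["maison"]}` and a German database `{"house": ["Haus"]}` become associated through the shared pivot term, even though no French-German database was ever built.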
  • Publication number: 20110295949
    Abstract: Distributed computing using communities is described. In an embodiment computations in a distributed computing system are driven and controlled by a document storing a distributed computing graph, a graph layout view of that graph and visualization elements. For example, the document is replicated and synchronized at each of a plurality of entities in the distributed computing system. In examples a community may be drawn as a rectangle or other shape in the graph layout view and represents one or more computing resources in the distributed computing system. For example by placing graphical elements representing currently executing processes into the community on the graph layout view a user is able to ensure that those processes execute using the computing resources of the community. In examples communities may be nested and may have parameters specifying conditions which are to be met by the computing resources they represent.
    Type: Application
    Filed: May 28, 2010
    Publication date: December 1, 2011
    Applicant: Microsoft Corporation
    Inventors: Martin Calsyn, Alexander Brandle, Vassily Lyutsarev, Andreas Heil
  • Publication number: 20110295861
    Abstract: A technique is provided for processing search results. The technique includes executing a search based on a user input entered via a graphical user interface using a processor, identifying relevant documents based on the search, and obtaining a standard classification for each relevant document. The standard classification is a classification within a standard classification system. The technique also includes reclassifying each relevant document, based on the relevant document's standard classification, into an interpretive classification within an interpretive classification system. The interpretive classification comprises at least a primary class and a secondary class. The technique further includes grouping the relevant documents into each relevant document's primary class and secondary class, and displaying the primary classes of the relevant documents and a number of relevant documents grouped in each displayed primary class via the graphical user interface on a display device.
    Type: Application
    Filed: May 26, 2010
    Publication date: December 1, 2011
    Applicant: CPA GLOBAL PATENT RESEARCH LIMITED
    Inventor: Randy W. Lacasse
  • Publication number: 20110289118
    Abstract: Architecture that maps document data (e.g., XML, the extensible markup language) into columns of one table, thereby avoiding schema normalization problems through special data storage. Moreover, an algorithm is described that can translate a query (e.g., in XPath (XML path language), a query language for navigating through document elements and attributes of an XML document) into a relational algebra query of the document column representation. Based on the characteristics of the new mapping, query rewriting rules are provided that optimize the relational algebra query by minimizing the number of joins. The mapping of XML documents to the table is based on a summary structure and a hierarchical labeling scheme (e.g., ordpath) to enable a high-fidelity representation. Annotations are employed on the summary structure nodes to assist in mapping XML elements and attributes to the table.
    Type: Application
    Filed: May 20, 2010
    Publication date: November 24, 2011
    Applicant: MICROSOFT CORPORATION
    Inventors: Liang Chen, Nikita Shamgunov, Philip A. Bernstein, Michael Rys, James F. Terwilliger, Peter Alan Carlin, Dragan Tomic
  • Publication number: 20110282829
    Abstract: A system and method for workflow task routing based on cardinality of task data, or the structure of elements in a business object associated with a task. In accordance with an embodiment, a system such as a human workflow system, which allows for the definition of human workflow tasks, can include a forEach construct within a human task routing definition and a payload. In scenarios that require a plurality of tasks of similar type to be undertaken, such as a purchase order approval involving a plurality of items and potentially different approvers, the system allows for modeling a separate routing for each of those task items (e.g., the lines in the purchase order). In each of the branches of the forEach construct, complex routing patterns, such as parallel routing, can be used. The forEach construct allows looping constructs to be created at any level of nesting.
    Type: Application
    Filed: May 14, 2010
    Publication date: November 17, 2011
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Ravi Rangaswamy, Will Stallard, David C. Lam
  • Publication number: 20110282855
    Abstract: A method, system, and computer program product for scoring relationships between objects in information retrieval are provided. The method includes: receiving a query object as an input in a search, wherein the query object is a query for a searchable entity type; identifying indexed document objects associated with the query object; and identifying facet objects referenced in the indexed document objects, which facet objects share a defined relationship type with the query object. The method calculates a weight of relationship for each relationship between a facet object and the query object. A query object, document object, and facet object can each represent any searchable entity. The weight of relationship is calculated as the weight of relationships over all document objects divided by a selected normalization.
    Type: Application
    Filed: May 12, 2010
    Publication date: November 17, 2011
    Applicant: International Business Machines Corporation
    Inventors: Inbal Ronen, Sivan Yogev
  • Publication number: 20110264639
    Abstract: A document selector selects and ranks documents that are relevant to a query. The document selector executes an instance of a multi-armed bandits algorithm to select a document for each slot of a results page according to one or more strategies. The documents are selected in an order defined by the results page and documents selected for previous slots are used to guide the selection of a document for a current slot. If a document in a slot is subsequently selected, the strategy used to select the document is rewarded with positive feedback. When the uncertainty in an estimate of the utility of a strategy is less than the variation between documents associated with the strategy, the strategy is subdivided into multiple strategies. The document selector is able to “zoom in” on effective strategies and provide more relevant search results.
    Type: Application
    Filed: April 21, 2010
    Publication date: October 27, 2011
    Applicant: Microsoft Corporation
    Inventors: Aleksandrs Slivkins, Sreenivas Gollapudi, Filip Radlinski
  • Publication number: 20110264628
    Abstract: A data collector may monitor a data source and identify updated data, which may be processed and prepared for inclusion into a search database. The data collector may have various handlers that may interact with a data source, which may be a database, web service, file system, collaboration system, or other source, and may store an identifying signature and a content signature for each document or item. The signatures may be used to identify new, changed, or deleted items, and a payload may be created containing the updates.
    Type: Application
    Filed: April 26, 2010
    Publication date: October 27, 2011
    Applicant: Microsoft Corporation
    Inventors: PATRICK SOKOLAN, Dennis Doherty, Claude Duguay, William Radcliffe, Virgil Bourassa
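The signature comparison in the abstract above can be sketched as follows. This assumes the identifying signature is simply the item's key and the content signature is a SHA-256 hash of its text; both choices, and the function names, are illustrative assumptions rather than details from the application.

```python
import hashlib


def content_signature(content: str) -> str:
    """Hash of the item's content (SHA-256 is an assumed choice)."""
    return hashlib.sha256(content.encode("utf-8")).hexdigest()


def build_payload(previous, current):
    """Compare stored signatures with the data source's current items.

    previous: {item id: content signature} saved from the last crawl.
    current:  {item id: content} as read from the data source now.
    Returns (payload, new_signatures), where payload lists the ids of
    new, changed, and deleted items.
    """
    now = {item_id: content_signature(text) for item_id, text in current.items()}
    payload = {
        "new": sorted(i for i in now if i not in previous),
        "changed": sorted(i for i in now if i in previous and previous[i] != now[i]),
        "deleted": sorted(i for i in previous if i not in now),
    }
    return payload, now
```

Storing only signatures between crawls means the collector never needs to keep a copy of the source content in order to detect what changed.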
  • Publication number: 20110264672
    Abstract: The invention relates to a method and a system for detecting a similarity of documents. The similarity of documents is detected with the help of an analysis of citations in one or more citation document(s), wherein the distance between the individual citations is used as criterion of the analysis. On the basis of the determined distance between two citations, respectively, a similarity value is determined, which is characteristic of the cited documents. A small distance between two citations leads to a high similarity of the cited documents. In case of several citations with regard to documents from several citation documents, the similarity values for the citation pairs from the individual citation documents are used for determining a final similarity value.
    Type: Application
    Filed: July 1, 2011
    Publication date: October 27, 2011
    Inventors: Bela Gipp, Joeran Beel
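A minimal sketch of the citation-distance idea in the abstract above. The abstract does not give a formula, so two assumptions are made here and should be read as such: the similarity value for one citation pair is taken as `1 / (1 + distance)`, and the final similarity value is the sum over all citing documents. The input layout (a list of `(position, cited_id)` tuples per citing document) is also hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def citation_proximity(citing_docs):
    """Score pairs of cited documents by how close their citations appear.

    citing_docs: list of citing documents, each a list of
    (position, cited_id) tuples, where position is e.g. a sentence index.
    Returns {(cited_a, cited_b): similarity}, with each pair key sorted.
    """
    scores = defaultdict(float)
    for citations in citing_docs:
        for (pos_a, doc_a), (pos_b, doc_b) in combinations(citations, 2):
            if doc_a == doc_b:
                continue  # two citations of the same document form no pair
            pair = tuple(sorted((doc_a, doc_b)))
            # Assumed weighting: a smaller distance yields a higher similarity.
            scores[pair] += 1.0 / (1 + abs(pos_a - pos_b))
    return dict(scores)
```

Any monotonically decreasing function of distance would fit the abstract's description equally well; the reciprocal is used only because it is simple to inspect.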
  • Publication number: 20110264654
    Abstract: A computer-implemented method is disclosed. The method includes receiving from a remote device a search query, generating a local result set and one or more non-local result sets for the search query, and determining a display location for the local result set relative to the non-local result set based on a position of the search query in a local relevance indicium.
    Type: Application
    Filed: June 2, 2011
    Publication date: October 27, 2011
    Applicant: GOOGLE INC.
    Inventors: Gabriel WOLOSIN, Charity Yueh-Chwen LU
  • Publication number: 20110258229
    Abstract: Techniques for utilizing data mining technology to extract universal topics with multilingual representations from a multilingual database, and to organize existing or new documents in different languages by analyzing their respective topic distributions.
    Type: Application
    Filed: April 15, 2010
    Publication date: October 20, 2011
    Applicant: Microsoft Corporation
    Inventors: Xiaochuan Ni, Jian-Tao Sun, Zheng Chen, Jian Hu
  • Publication number: 20110258170
    Abstract: In a document analysis system that receives and processes jobs from a plurality of users, in which each job may contain multiple electronic documents, to extract data from the electronic documents, a method of automatically correcting the extracted data using known constraints amongst semantics of extracted data elements is provided. The method includes: analyzing each electronic document in a job to automatically extract data; automatically analyzing the extracted data to identify incorrectly extracted data elements using rules defining constraints amongst semantics of extracted data elements; and automatically attempting to correct the incorrectly extracted data elements using the rules.
    Type: Application
    Filed: January 14, 2011
    Publication date: October 20, 2011
    Inventors: Matthew DUGGAN, Janice O'NEIL, Girish WELLING, Depankar NEOGI, Steven K. LADD
  • Publication number: 20110258222
    Abstract: A method and system for providing a query using an image is disclosed. A search keyword is determined using an image and position information of a terminal that are received from the terminal. Search queries associated with the determined search keyword are provided to the terminal. Lower search queries may be provided to the terminal if one of the provided search queries is selected.
    Type: Application
    Filed: April 13, 2011
    Publication date: October 20, 2011
    Applicant: NHN CORPORATION
    Inventors: Gunhan PARK, Byounghak KIM, Dong Wook KIM
  • Patent number: 8042053
    Abstract: A method for making one or more digital documents browseable. In one implementation, the digital documents may be automatically, topically segmented into one or more topical segments. A topical segment may be selected from the topical segments. One or more topical segments that are substantially similar to the selected topical segment may be identified. One or more links between the selected topical segment and the identified topical segment may be established. The established links may be displayed.
    Type: Grant
    Filed: September 24, 2007
    Date of Patent: October 18, 2011
    Assignee: Microsoft Corporation
    Inventors: Kareem Mohamed Darwish, Ahmed Morsy
  • Publication number: 20110252029
    Abstract: A computer-implemented system and method for providing a legitimacy rating of a content source are provided. A request for a document is received. An electronic document associated with a content source is passed by a document provider in response to the request. A legitimacy rating of the content source is passed. Examples of legitimacy rating information include, for example, a history rating of the content source based on the length of time the document provider has published documents associated with the content source and a transaction volume rating of the content source based on the number of electronic documents associated with the content source that are passed by the document provider.
    Type: Application
    Filed: June 24, 2011
    Publication date: October 13, 2011
    Applicant: GOOGLE INC.
    Inventors: Johnny Chen, Mohit Aron
  • Patent number: 8036497
    Abstract: A document/image retrieval method for retrieving a document/image corresponding to a captured digital image from a database by comparing features calculated based on feature points of the captured digital image with features preliminarily calculated based on feature points of each of documents and/or images stored in the database, the method comprising: extracting the feature points from the captured digital image; defining a local set of feature points for each of the extracted feature points; selecting feature points from the defined local set to define a feature point subset of the local set; determining invariant values as values characterizing the defined subset for combinations of the feature points in the subset, the invariant values being invariant to a geometric transformation; calculating a feature by combining the determined invariant values; and performing a voting process on the documents and/or images in the database based on the preliminarily calculated features of the documents and/or images;
    Type: Grant
    Filed: February 15, 2006
    Date of Patent: October 11, 2011
    Assignee: Osaka Prefecture University Public Corporation
    Inventors: Koichi Kise, Tomohiro Nakai, Masakazu Iwamura
  • Publication number: 20110246520
    Abstract: Methods and systems for automatically determining, from a body of emails, blogs, and other documents, authors of the documents who are authorities on certain subjects, and what those subjects are. An intersection of the semantic footprints of documents by an author is deemed to be the derived skills footprint of the author. The derived skills footprints of many authors are compared with a user's query to determine who is the best person that could respond to the user.
    Type: Application
    Filed: January 10, 2011
    Publication date: October 6, 2011
    Applicant: salesforce.com, inc.
    Inventors: Jari Koister, Mike Micucci
  • Publication number: 20110246439
    Abstract: A query is annotated with a small sketch (e.g. a Bloom filter) that approximates a set of interest that is related to the query. The query and sketch may be forwarded to index servers that each store a portion of a search engine corpus. Each of the index servers may filter documents using the sketch before returning results for aggregation. The sketch is designed so there may be false positives (results returned by authors not in the set), but no false negatives (all relevant results are returned). The final aggregated results set may be checked against the full set to remove false positives before returning the final results to the user.
    Type: Application
    Filed: April 6, 2010
    Publication date: October 6, 2011
    Applicant: Microsoft Corporation
    Inventors: Michael A. Isard, Marc A. Najork, Sean A. Suchter, Eric R. Scheel
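The filtering flow in the abstract above can be sketched as follows. This is a minimal illustration, not the patented implementation: the `BloomFilter` class, the `search` function, the bit-array size, the hash count, and the document layout (`author` and `text` keys) are all assumptions made for the example.

```python
import hashlib


class BloomFilter:
    """Small Bloom filter: may report false positives, never false negatives."""

    def __init__(self, num_bits=1024, num_hashes=3):
        # Sizes are illustrative assumptions, not values from the abstract.
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = 0  # a Python int used as a bit array

    def _positions(self, item):
        # Derive num_hashes bit positions from salted SHA-256 digests.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits |= 1 << pos

    def __contains__(self, item):
        return all((self.bits >> pos) & 1 for pos in self._positions(item))


def search(query, authors_of_interest, index_servers):
    """Hypothetical driver: annotate the query with a sketch, filter per shard, then verify."""
    sketch = BloomFilter()
    for author in authors_of_interest:
        sketch.add(author)

    # Each index server filters its own portion of the corpus with the sketch.
    candidates = []
    for shard in index_servers:
        candidates.extend(
            doc for doc in shard
            if query in doc["text"] and doc["author"] in sketch
        )

    # The aggregator checks against the full set to drop false positives.
    return [doc for doc in candidates if doc["author"] in authors_of_interest]
```

Only the compact sketch travels to the index servers; the exact set of interest is consulted once, at aggregation time, which is why false positives are tolerable and false negatives are not.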
  • Publication number: 20110246526
    Abstract: A method and a system implementing a service level agreement based storage access system. A service level agreement based storage access system presents a single interface for data storage consumers and translates generic data operation requests to data operation requests specific to a storage server. The SLA based storage access system also monitors storage server performance and may throttle processes to ensure service level agreements are not violated.
    Type: Application
    Filed: March 31, 2010
    Publication date: October 6, 2011
    Inventors: Yuri Finkelstein, Kumar Rethinakaleeswaran, John Helm, Zheng Xu
  • Publication number: 20110238617
    Abstract: A document management apparatus includes a document storing portion to store a document, a designated part accepting portion to accept any part designated by a user within the stored document as a designated part, an associating portion to associate the accepted designated part with a notification destination, an altering portion to alter the stored document, and a notifying portion to notify the notification destination associated with the designated part when at least a portion of an altered part altered within the stored document by the altering portion is included in the designated part.
    Type: Application
    Filed: March 18, 2011
    Publication date: September 29, 2011
    Applicant: Konica Minolta Business Technologies, Inc.
    Inventors: Kaitaku OZAWA, Hiroaki KUBO, Jun KUNIOKA, Ayumi ITOH
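The notification rule in the abstract above amounts to an overlap test between the altered part and each designated part. The sketch below assumes, purely for illustration, that parts are half-open character ranges `(start, end)` and that notification destinations are plain strings; the function names are hypothetical and not taken from the application.

```python
def overlaps(a, b):
    # Two half-open ranges (start, end) overlap when each starts before the other ends.
    return a[0] < b[1] and b[0] < a[1]


def destinations_to_notify(designated_parts, altered_range):
    """Return the destinations whose designated part contains any of the altered range.

    designated_parts: list of ((start, end), destination) pairs.
    altered_range: (start, end) of the altered part of the document.
    """
    return sorted(
        {dest for part, dest in designated_parts if overlaps(part, altered_range)}
    )
```

For example, with parts `(0, 100)` and `(200, 300)` designated, an alteration covering `(90, 120)` touches only the first part, so only its destination is returned.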
  • Publication number: 20110238681
    Abstract: An apparatus and method for creating an association between a word and an object comprising creating an object identification (ID); assigning a link ID to the object ID; determining whether a word in the object is part of a word list; performing either a) adding the word to the word list and creating a unique word ID for the word, or b) gathering a word ID associated with the word; and associating either the unique word ID or the word ID to the link ID. In one aspect, the apparatus and method search for an object based on word search and visual image by searching for a word ID associated with a word in a word list; searching for at least one link ID associated with the word ID; associating the at least one link ID with an object ID; and visually displaying the object associated with the object ID.
    Type: Application
    Filed: March 24, 2010
    Publication date: September 29, 2011
    Inventors: Basker S. Krishnan, Hanoz J. Kateli, Bryan Heesch
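The word ID, link ID, and object ID associations described in the abstract above can be sketched with a few dictionaries. This is a simplified illustration under assumed names (`WordObjectIndex`, `add_object`, `search`); integer IDs and in-memory storage are assumptions, and "displaying" an object is reduced to returning it.

```python
from itertools import count


class WordObjectIndex:
    """Toy index: word -> word ID -> link IDs -> object IDs -> objects."""

    def __init__(self):
        self._object_ids = count(1)
        self._link_ids = count(1)
        self._word_ids = count(1)
        self.word_list = {}       # word -> word ID
        self.word_links = {}      # word ID -> set of link IDs
        self.link_to_object = {}  # link ID -> object ID
        self.objects = {}         # object ID -> object

    def add_object(self, obj, words):
        # Create an object ID and assign a link ID to it.
        object_id = next(self._object_ids)
        link_id = next(self._link_ids)
        self.objects[object_id] = obj
        self.link_to_object[link_id] = object_id
        for word in words:
            # Either add the word with a new unique ID or reuse its existing ID.
            if word not in self.word_list:
                self.word_list[word] = next(self._word_ids)
            word_id = self.word_list[word]
            self.word_links.setdefault(word_id, set()).add(link_id)
        return object_id

    def search(self, word):
        # Word -> word ID -> link IDs -> object IDs -> objects.
        word_id = self.word_list.get(word)
        if word_id is None:
            return []
        link_ids = sorted(self.word_links[word_id])
        return [self.objects[self.link_to_object[link_id]] for link_id in link_ids]
```

Because each word is stored once in the word list and referenced by ID, adding an object with already-known words only adds link entries rather than duplicating the words.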
  • Publication number: 20110238664
    Abstract: A region based information retrieval system improves on conventional information retrieval systems by breaking down documents into one or more regions and processing the additional information available at a region level of analysis. When looking at regions, it becomes possible to quickly distinguish between groups of related documents, quickly ignore or focus on certain information, track recent evolutions of documents, as well as understand the historical relationships, heritage, and versions of these documents. This is all possible whether or not the document publishers specify where the content originally came from.
    Type: Application
    Filed: March 25, 2011
    Publication date: September 29, 2011
    Inventor: Palle M. Pedersen
  • Publication number: 20110238668
    Abstract: In a document management system that manages index item definition and document data by cabinet, an index can be easily provided. A user that can log into a first database can use an index item defined by the first database to provide an index value to document data stored in a second database.
    Type: Application
    Filed: March 16, 2011
    Publication date: September 29, 2011
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Yoshitaka Matsumoto
  • Publication number: 20110238618
    Abstract: Some embodiments of the invention provide a method of medical collaboration. Some embodiments include a server application receiving and storing an image via an uploading application. In some embodiments, the image can be stored in a database, and upon receiving a request to view the image from a plurality of client applications, the image can be transmitted to the plurality of client applications so that each of the client applications can display the image. Some embodiments can include displaying an application interface on each of the plurality of client applications substantially simultaneously with the image.
    Type: Application
    Filed: March 25, 2011
    Publication date: September 29, 2011
    Inventors: Michael Valdiserri, Warren Goble
  • Publication number: 20110238682
    Abstract: When a document searcher transmits E-mail including a search condition to a document management system and searches for a document, a document search result can be obtained irrespective of an access right which the document searcher has. In order to solve this problem, the document management system searches a database for the document corresponding to document search information received from the document searcher, and transmits a message requesting a permission for the document searcher to obtain the document to a creator of the searched document.
    Type: Application
    Filed: March 17, 2011
    Publication date: September 29, 2011
    Applicant: CANON KABUSHIKI KAISHA
    Inventor: Yousuke Ootaki
  • Patent number: 8028231
    Abstract: A method for storing, organizing and providing remote electronic access to documents. A cover sheet including a standard set of identification data characterizing each document is developed and stored. A digital version of each document is created and stored by scanning each contract. Each digital version includes a scanned image and a searchable text file, wherein the text is overlaid with the image. An index of bookmarks identifying sections of the digital version of each document is generated. Selected fields of information are captured from the digital version of the document. The documents are organized and cross-referenced in a database that includes the captured information and additional information related to each document. Designated parties are alerted of critical dates associated with each document. Remote electronic access to the documents is provided over the internet.
    Type: Grant
    Filed: March 20, 2007
    Date of Patent: September 27, 2011
    Assignee: TractManager, Inc.
    Inventors: Scott R. Jeffery, Thomas A. Rizk
  • Publication number: 20110231385
    Abstract: An object oriented search mechanism extracts structural metadata and data based on type of document contents and data sources connected to the documents. Relationships between textual and non-textual elements within documents as well as metadata associated with the elements and data sources are utilized to generate a unified object model with the addition of semantic information derived from metadata and taxonomy, which are used to enhance search indexing, ranking of search results, and dynamic adjustment of result rendering user interface with fine tuned relevancy. Additional data from data sources connected to the documents may also be used to unlock hidden data such as data that has been filtered out in an original document.
    Type: Application
    Filed: March 16, 2010
    Publication date: September 22, 2011
    Applicant: Microsoft Corporation
    Inventors: Luming Wang, Xiaohong Yang, Hailei Zhang, Sonal Jain
  • Publication number: 20110225155
    Abstract: A system and method are provided for refining a user's query. An entity index, generated from a corpus of text documents, is provided. The entity index includes a set of entity structures, each including a plurality of terms. Each of the terms of an entity structure is a feature of the same entity. Entity structures can be retrieved from the entity index which match at least a portion of the user's query. Clusters of the retrieved entity structures are identified which have at least one of their terms in common. A cluster hierarchy is generated from the identified clusters in which nodes of the hierarchy are defined by one or more of the terms of the retrieved entity structures. At least a portion of the cluster hierarchy is presented to the user for facilitating refinement of the user's query through user selection of a node which, when formulated as a search, retrieves one or more responsive documents from the corpus of documents.
    Type: Application
    Filed: March 10, 2010
    Publication date: September 15, 2011
    Applicant: Xerox Corporation
    Inventors: Frederic Roulland, Stefania Castellani, Antonietta Grasso, Caroline Brun
  • Publication number: 20110225139
    Abstract: User role based customizable searches, where crawled documents may be evaluated against user roles or attributes during crawl time, are provided. Metadata retrieved from searched documents may also be evaluated against the user roles and/or attributes such that customized search results ranking documents based on their content beyond textual content may be provided.
    Type: Application
    Filed: March 11, 2010
    Publication date: September 15, 2011
    Applicant: Microsoft Corporation
    Inventors: Luming Wang, Xiaohong Yang, Anton Amirov, Malik Hussain