Document Retrieval Systems (epo) Patents (Class 707/E17.008)
-
Publication number: 20090327261Abstract: A computing device includes one or more rich internet application (RIA) client engines. Each RIA client engine includes a corresponding private RIA storage area. The computing device also includes a per-RIA public storage area for each RIA. The per-RIA public storage area including a subset of data items in the private RIA storage area of the corresponding RIA client engine.Type: ApplicationFiled: June 25, 2008Publication date: December 31, 2009Applicant: Microsoft CorporationInventor: Jonathan C. Hawkins
-
Publication number: 20090327293Abstract: An information processing apparatus includes a document information storage unit that stores a derivation relationship designating a first document as a parent and a second document generated after an operation as a child, and an operator that performed the operation; an organization information storage unit that stores a structure of an organization hierarchy and members belonging to each of organizations; and a document output permission/prohibition determination unit that, upon receiving a document output request, determines whether or not to permit output of the requested document, by checking an operator of the requested document or an operator of a document corresponding to an ancestor of the requested document in a tree structure of the derivation relationships against members belonging to an organization including a requesting person as a member or an organization being located at a higher level than the organization including the requesting person in the organization hierarchy.Type: ApplicationFiled: April 30, 2008Publication date: December 31, 2009Applicant: FUJI XEROX CO., LTD.Inventor: Taro Takashima
-
Publication number: 20090327276Abstract: A data handling device has access to a store of existing metadata pertaining to existing documents having associated metadata terms. It analyses the metadata to generate statistical data as to the co-occurrence of pairs of terms in the metadata of one and the same document. When a fresh document is received, it is analysed to assign to it a set of terms and determine for each a measure of their strength of association with the document. Then, for each term of the set, a score is generated that is a monotonically increasing function of (a) the strength of association with the document and of (b) the relative frequency of co-occurrence of that term and another term that occurs in the set; metadata for the fresh document are then selected as the subset of the terms in the set having the highest scores.Type: ApplicationFiled: July 5, 2007Publication date: December 31, 2009Inventors: Ian Thurlow, Richard Thurlow, Nicholas J. Davies
-
Publication number: 20090327271Abstract: Information retrieval with unified search between heterogeneous objects is described. The method includes: indexing a first object as a document in a search index; referencing a second object related to the first object in a facet of the document; and storing a relationship strength between the first and second objects in the facet of the document in the search index. Multiple heterogeneous objects can be related to the first object and referenced in multiple facets of the document, each with its relationship strength to the first object. Scoring an indirect object by indirect relation to a query object can be carried out by aggregating the relationship strengths between the indirect object and the retrieved objects multiplied by the retrieved objects' direct scores of relationship strength to the query object.Type: ApplicationFiled: June 30, 2008Publication date: December 31, 2009Inventors: Einat Amitay, David Carmel, Nadav Golbandi, Nadav Y. Har'el, Shila Ofek-Koifman, Sivan Yogev
-
Publication number: 20090327636Abstract: A software transactional memory system is provided that generates and stores compressed transactional locks in a portion of object headers. The software transactional memory system allocates preferred write log memory with a predefined size of memory that corresponds to a number of bits in the compressed transactional locks. The compressed transactional locks identify write log entries in corresponding write logs in the preferred write log memory. If the preferred write log memory becomes full, additional write log memory is allocated for write log entries and subsequent transactional locks are stored uncompressed in an auxiliary memory. A pointer that may be used to locate the uncompressed transactional lock is stored in the header. If an object header with a compressed transactional lock is needed for another use, the compressed transactional lock is uncompressed and stored in the auxiliary memory. A pointer that may be used to locate the uncompressed transactional lock is stored in the header.Type: ApplicationFiled: June 27, 2008Publication date: December 31, 2009Applicant: Microsoft CorporationInventors: David L. Detlefs, Vinod K. Grover, Yosseff Levanoni, Michael M. Magruder
-
Publication number: 20090327259Abstract: A method of identifying thematic groups of nodes by analysis of a corpus of documents. The method uses a distance metric based on connectedness of nodes, which is derived from a co-occurrence measure. The invention is also embodied as a computer-implemented visualization tool that generates a display of nodes and thematic groupings. The invention is useful for ‘data mining’ a large corpus of documents, particularly textual documents, to extract relevant information.Type: ApplicationFiled: April 26, 2006Publication date: December 31, 2009Applicant: THE UNIVERSITY OF QUEENSLANDInventor: Andrew Smith
-
Publication number: 20090319517Abstract: Apparatus, systems and methods for predictive query identification for advertisements are disclosed. Candidate query are identified from queries stored in a query log. Relevancy scores for a plurality of web documents are generated, each relevancy score associated with a corresponding web document and being a measure of the relevance of the candidate query to the web document. A web document having an associated relevancy score that exceeds a relevancy threshold is selected. The selected web document is associated with the candidate query.Type: ApplicationFiled: June 17, 2009Publication date: December 24, 2009Applicant: GOOGLE INC.Inventors: Ramananthan V. Guha, Shivakumar Venkataraman, Vineet Gupta, Gokay Baris Gultekin, Pradnya Karbhari, Abhinav Jalan
-
Publication number: 20090313212Abstract: A relational database system. The system includes a relational database configured to store and present data in a plurality of tables and a database application operatively coupled with the relational database. The system is configured to execute in an intermediate language runtime environment that supports native treatment of user data type definitions, and the database application and the relational database are configured to populate the plurality of tables with records that are each uniquely identified by a key. For each record, the key is constructed in accordance with a compound user-defined data type, such that the key includes: identification of an originating table and additional record-identifying information for the record.Type: ApplicationFiled: June 17, 2008Publication date: December 17, 2009Applicant: MICROSOFT CORPORATIONInventor: Brian Aust
-
Publication number: 20090313225Abstract: The current invention provides an interactive system or platform suitable for performing a research project by individuals associated with a research company and a client company seeking the research. The interactive system provides a centralized electronic exchange for managing the research project and enhancing interaction between individuals carrying out the research and individuals associated with the client. Further, the current invention provides a method for using the interactive system which improves the research process.Type: ApplicationFiled: May 9, 2006Publication date: December 17, 2009Inventor: Walter O'Neal Nordlinger
-
Publication number: 20090313247Abstract: A method and a system for providing snippets of source documents of an answer to a fact query are disclosed. Snippets of source documents may be provided in response to a user request for the source documents from which the fact answer to a fact query was extracted. The snippets include the terms of the fact query and terms of the answer. The snippets may be displayed along with Uniform Resource Locators (URL's) of the source documents.Type: ApplicationFiled: August 24, 2009Publication date: December 17, 2009Inventor: Andrew William Hogue
-
Publication number: 20090307086Abstract: Systems and methods for sharing albums are provided. An album has a plurality of links to documents, such as static graphic representations of web pages, or web pages themselves. Users may graphically add links to documents to albums by dragging graphic representations of such documents to a designated place on a computer interface. When a user shares an album, a link to the album is sent to a recipient. When the recipient activates the album link, documents linked by the album are presented as a graphical presentation can then be reviewed. Each of these documents can be in a document repository addressable and each of these documents may be categorized. Advertisements can be served when albums are shared. Such advertisements are chosen from an advertisement repository based on a characterization of the album or a characterization of one or more documents linked by the album.Type: ApplicationFiled: May 31, 2008Publication date: December 10, 2009Inventors: Randy Adams, Mark David Kvamme, John Lawrence Holland, JR.
-
Publication number: 20090307244Abstract: A statistical tree representing an extensible Markup Language (XML) Schema document (XSD) is generated. The statistical tree captures information defined by the XSD by representing elements, attributes, and enumerations of the XSD as branches, nodes, and leaves of the statistical tree. The statistical tree has bits corresponding to nodes of the statistical tree. An XML document defined by the XSD is adaptively encoded, or compressed, as a number of bits based on the statistical tree that has been generated. The number of bits encoding the XML document are decoded, or decompressed, to yield the XML document also based on the statistical tree that has been generated.Type: ApplicationFiled: June 8, 2008Publication date: December 10, 2009Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Umesh Kumar B. Balegar, Rohit Shetty
-
Publication number: 20090307188Abstract: A system sends a search query to a search engine and receives from the search engine, responsive to the search query, a document comprising a first search result item and a second search result item. The system visually renders a portion that includes less than an entirety of the first search result item and includes the second search result item, where the portion is visually rendered in a region of the document. The system receives a selection of the first search result item from a user and visually expands the region of the document to a size sufficient to render an entirety of the first search result item based on the selection. The system visually renders the entirety of the first search result item within the expanded region of the document.Type: ApplicationFiled: November 15, 2006Publication date: December 10, 2009Applicant: GOOGLE INC.Inventors: Jeffrey Oldham, Joshua D. Mittleman, Alex Cook
-
Publication number: 20090307195Abstract: A document management apparatus is provided in a system that includes a project management unit capable of managing progress of a project to which a user belongs.Type: ApplicationFiled: June 4, 2009Publication date: December 10, 2009Applicant: CANON KABUSHIKI KAISHAInventor: Makoto Anno
-
Publication number: 20090300006Abstract: Keyword frequency data for a plurality of document-derived segments is represented in a matrix form in which each segment is represented as a vector of dimensionality equal to the number of keywords. The matrix may be subdivided into a plurality of sub-matrices, each preferably corresponding to a non-overlapping portion of the plurality of keywords. When determining a similarity measurement between any pair of segments, at least a portion of the keyword frequency data for each sub-matrix's non-overlapping keywords are used to determine a sub-matrix dot product for the pair of segments. The resulting plurality of sub-matrix dot products are then summed together in order to provide the similarity measurement. Keywords that are synonyms of each other may be accommodated through the modification of keyword frequency data. Where the keyword frequency data in the matrix representation is relative sparse, compressed views of the matrix representation may be provided.Type: ApplicationFiled: May 28, 2009Publication date: December 3, 2009Applicant: ACCENTURE GLOBAL SERVICES GMBHInventors: Jagadeesh Chandra Bose Rantham Prabhakara, Ashwin Nayak, Anitha Chandran
-
Publication number: 20090300478Abstract: An image forming apparatus includes multiple executing units; multiple Webpage generating units each corresponding to an executing unit and configured to execute a process corresponding to an HTTP request and generate a Web page for displaying information indicating the process result; multiple menu-information integrating units, each corresponding to an executing unit and configured to obtain, from each Webpage generating unit corresponding to the executing unit, a URL of the Webpage generating unit and menu-item display information provided for allowing use of the Webpage generating unit, integrate and store the menu-item display information in a first file specific to the executing unit, and merge, with the first file, information obtained from another first file specific to another executing unit; and a menu-page generating unit configured to generate, based on information stored in the merged first file, a Web page including menu items provided for allowing use of the Webpage generating units.Type: ApplicationFiled: April 20, 2009Publication date: December 3, 2009Inventor: DAISUKE KONDO
-
Publication number: 20090300007Abstract: An information processing apparatus for creating a retrieval result displaying a list of retrieval documents is disclosed. Retrieval documents corresponding to a retrieval condition are classified into groups based on scores indicating degrees of relevance to the retrieval condition. A clustering process is conducted with respect to the retrieval documents in a group, for each of groups to which the retrieval documents belong.Type: ApplicationFiled: May 28, 2009Publication date: December 3, 2009Inventor: Takuya Hiraoka
-
Publication number: 20090292713Abstract: A computationally implemented method includes, but is not limited to: acquiring data indicative of an inferred mental state of an authoring user in connection with at least a particular item of an electronic message, and associating the data indicative of the inferred mental state of the authoring user with the particular item. In addition to the foregoing, other method aspects are described in the claims, drawings, and text forming a part of the present disclosure.Type: ApplicationFiled: July 30, 2008Publication date: November 26, 2009Inventors: Edward K.Y. Jung, Eric C. Leuthardt, Royce A. Levien, Robert W. Lord, Mark A. Malamud, John D. Rinaldo, JR., Lowell L. Wood, JR.
-
Publication number: 20090292595Abstract: A new generation online e-commerce and networking system is disclosed. According to the embodiments of the present invention, the online e-commerce and networking system is based on: 1) a new model of meeting individuals' work and life needs and integrating entities' processes to create large online networks; 2) a new advertising model of associating online advertisements with goods services, activities, and incentives that meet users' needs to allow users to solicit or request sponsorships so that direct interaction between users and advertisers is enabled; and 3) a new community model to enable entities to build deeper and better relationships with users through entities' direct involvement in online community building and real and virtual event hosting for users to participate.Type: ApplicationFiled: May 21, 2008Publication date: November 26, 2009Inventors: Wenxuan Tonnison, James Ian Tonnison
-
Publication number: 20090287993Abstract: A main control unit determines whether or not a previous version with which comparison is to be made exists regarding a document stored in a session storage unit. Next, confirmation is made regarding whether or not there is difference between both compared objects, and the control unit determines whether or not there is difference. The main control unit then executes an action embedded in the attributes of the object data. The object data with difference that has been saved in the session information storage unit is saved in a document information storage unit.Type: ApplicationFiled: May 17, 2009Publication date: November 19, 2009Applicant: CANON KABUSHIKI KAISHAInventor: Atsushi Kashioka
-
Publication number: 20090287686Abstract: A playback device includes a communication component, an operation component and a playback control component. The communication component is configured to communicate with a network device via a network. The operation component is configured to select a random playback of a plurality of content items that is stored in the network device. The playback control component is configured to control the random playback of the content items. The playback control component acquires only numerical information of the content items from the network device when the operation component selects the random playback of the content items with the numerical information indicating number of the content items. The playback control component randomly determines one of the content items based on the numerical information. The playback control component acquires the one of the content items from the network device to play the one of the content items.Type: ApplicationFiled: April 14, 2009Publication date: November 19, 2009Applicant: FUNAI ELECTRIC CO., LTD.Inventor: Masaki OKAZAKI
-
Publication number: 20090288006Abstract: A method of visualizing and manipulating data on a display of a computer is provided, the method comprising retrieving a plurality of documents from a file system operably connected to the computer, each said document having at least one attribute associated therewith; graphically organizing the plurality of documents retrieved from the file system along a first substantially linear axis on the display; selecting a selected attribute associated with a selected document, the selected document being selected from among the plurality of documents organized along the first substantially linear axis; and graphically organizing a subset of the plurality of documents retrieved from the file system along a second substantially linear axis on the display, the first and second axes being organized such that the first and second substantially linear axes are non-parallel to one another, wherein the subset of the plurality of documents only includes documents having attributes associated therewith that match the selectedType: ApplicationFiled: June 27, 2009Publication date: November 19, 2009Inventors: Mathieu Audet, Yves Berthiaume
-
Publication number: 20090282033Abstract: A client system provides to a server system a fill-the-blank query comprising one or more term segments and one or more missing term identifiers signifying missing information sought by a user. The client system receives from the server system a response to the query, the response including at least one or more potential answers corresponding to the one or more missing term identifiers in the fill-the-blank query, and then displays the response to the query, including displaying the one or more potential answers. Optionally, the client system displays a ranked list of documents containing the one or more potential answers. Optionally, the response to the query further includes snippets of text from one or more documents containing the one or more potential answers. Optionally, the fill-the blank query includes a respective missing term identifier located between two respective term segments.Type: ApplicationFiled: July 21, 2009Publication date: November 12, 2009Inventor: Hiyan Alshawi
-
Publication number: 20090281996Abstract: A solution for generating a Service-Oriented Architecture (SOA) policy based on a context model is provided, which generates an application scope of the SOA policy; generates a context model; generates an action list for the context model based on action semantic modules customized by a user; generates a condition part of the SOA policy according to the context module; generates an action part of the SOA policy according to the action list; and combines the condition part and the action part to generate the SOA policy.Type: ApplicationFiled: November 11, 2008Publication date: November 12, 2009Inventors: Xin Peng Liu, Xi Ning Wang, Yu Chen Zhou
-
Publication number: 20090282065Abstract: A method includes receiving a user input in a design environment indicating at least one software component for which a technical design document is desired, identifying files in which information associated with the component is located, accessing the information, and formatting the information according to a technical design document template. A system has a design environment for development of software components, and a document generator to automatically generate documentation for the software components on demand. A method to design software components includes producing a design of a software component in a design environment residing upon a computer, selecting the software component for design review, and generating a design document according to a template from the design environment.Type: ApplicationFiled: May 8, 2008Publication date: November 12, 2009Applicant: ORACLE INTERNATIONAL CORPORATIONInventors: Paul Brimble, Samir Buche
-
Publication number: 20090281993Abstract: In accordance with one or more embodiments, a system for facilitating transfer of data and information over a network includes a database component, a communication component adapted to communicate with a user via a portable communication device over the network, and a processing component adapted to receive a request for data and information from the user via the portable communication device over the network and process the request by accessing one or more documents from the database component related to at least one component of a machine specified by the user passed with the request. The communication component transfers the one or more documents from the database component to the portable communication device for viewing by the user.Type: ApplicationFiled: May 9, 2008Publication date: November 12, 2009Inventors: Brent L. Hadley, Patrick J. Eames
-
Publication number: 20090282032Abstract: A method and system for generating a search result for a query of hierarchically organized documents based on retrieval of subtrees that are key resources for topic distillation is provided. The retrieval system may identify documents relevant to a query using conventional searching techniques. The retrieval system then calculates a subtree feature for subtrees that have an identified document as their root. After the retrieval system calculates the subtree feature for the subtrees, the retrieval system may generate a subtree relevance score for each subtree based on its subtree feature. The retrieval system may then order the identified documents based on their corresponding subtree relevances.Type: ApplicationFiled: July 17, 2009Publication date: November 12, 2009Applicant: Microsoft CorporationInventors: Tie-Yan Liu, Tao Qin, Wei-Ying Ma
-
Publication number: 20090276435Abstract: A software module is presented that enables a person to determine the relevance of a document while preventing the person from making a copy of the entire document. In one embodiment, this is accomplished by programmatically controlling which portions of a document will be presented to a user and which portions will not be presented to the user. In one embodiment, the software module is used in conjunction with a search engine to present a document search result.Type: ApplicationFiled: July 13, 2009Publication date: November 5, 2009Applicant: GOOGLE INC.Inventors: Alma W. Whitten, Joseph K. O'Sullivan
-
Publication number: 20090276421Abstract: A method for re-ranking search results, includes: generating the search results from a data source based on a search query from a user; retriving a re-ranking expression; re-ranking all or part of the documents in the search results based on the re-ranking expression; and displaying all or part of the documents in the search results with the re-ranked order.Type: ApplicationFiled: May 4, 2009Publication date: November 5, 2009Inventor: Gang Qiu
-
Publication number: 20090276531Abstract: The present invention provides for systems and methods for communicating media files and creating a collection of media files, also referred to herein as a master media file. In addition, the systems and methods of the present invention provide for the creation of automatic metadata and compilation of metadata associated with the collection of media files. The present invention is able to bond devices, referred to herein as slave devices, such as media capture devices, presence devices and/or sensor devices and instruct the slave devices, particularly the media capture devices, to communicate captured media files with a specified set of metadata included.Type: ApplicationFiled: July 9, 2009Publication date: November 5, 2009Applicant: Nokia CorporationInventors: Andreas Myka, Christian Lindholm
-
Publication number: 20090271370Abstract: Embodiments are directed towards providing a list of potential friends to a user based on an analysis of friends' contact lists. The user may provide a subset of friends within a contact list for analysis, along with a degree of separation over which to perform the analysis, and/or a minimum threshold number of occurrences for identifying a candidate friend. The subset of friends' contact lists may then be recursively traversed and merged, where common friends may be identified as members of a candidate set for suggesting friends to the user. In one embodiment, the candidate members may be retained within the candidate set if there is a commonality between the friends and the candidate that exceeds the minimum threshold. The candidate list may also be rank order using various approaches, including a weighted energy diffusion model based in part on a number of communications between the candidates.Type: ApplicationFiled: April 28, 2008Publication date: October 29, 2009Applicant: YAHOO! INC.Inventors: Sunil Jagadish, Jignashu Parikh
-
Publication number: 20090265330Abstract: Techniques for locating information in a document relevant to an interest of a user are provided. Information defined by the user of a document browser is collected. A context model is generated using the collected information. A document selected by the user is obtained. The document is divided into one or more segments. A relevance value is computed for each of the one or more segments by comparing each of the one or more segments to the context model. The relevance value represents a relationship to an interest of the user. Each of the one or more segments with the computed relevance value is presented in a defined organizational area of a display. The one or more segments presented on the display are linked to a corresponding one or more segments in the document.Type: ApplicationFiled: April 18, 2008Publication date: October 22, 2009Inventors: Wen-Huang Cheng, David Haim Gotz
-
Publication number: 20090265344Abstract: An object of the present invention is to provide a document processing device and document processing method that can provide a search result satisfactory to a user with respect to WWW documents in which a number of links among WWW documents is low and a number of accesses by users is low. An access pattern collection unit 101 generates an access user vector uj of one WWW document Dj and an access user vector uje of another document Dje. A user similarity computing unit 105 computes a document similarity sim (uj, uje) which indicates a user similarity between the WWW document Dj and WWW document Dje. A keyword vector smoothing unit 106 acquires a smoothed keyword weight vector w?j by correcting a keyword weight vector wj in one document, using the computed document similarity sim (uj, uje). An rearranging unit 110 calculates an evaluation value B_SCORE for input information for searching, based on the smoothed keyword weight vector w?j.Type: ApplicationFiled: April 21, 2009Publication date: October 22, 2009Applicant: NTT DoCoMo, Inc.Inventors: Minoru Etoh, Takehiro Nakayama, Yoshikazu Akinaga
-
Publication number: 20090265187Abstract: Certain embodiments of the present invention provide a clinical document storage and locator system including a clinical database component, a document retriever component, and a query mapper component. The clinical database component is adapted to store a plurality of clinical documents. The document retriever component is adapted to receive a request. The document retriever component is further adapted to determine an LOINC code using the received request. The query mapper component is adapted to generate a document query using the determined LOINC code. The document retriever component is adapted to retrieve a relevant clinical document from the clinical database component using the generated document query.Type: ApplicationFiled: April 21, 2008Publication date: October 22, 2009Applicant: GENERAL ELECTRIC COMPANYInventors: Keith W. Boone, Pradip Kumar Parida, Trivedi Bodlapati
-
Publication number: 20090265372Abstract: The present invention relates to a method of storing attributes describing documents in a document management system software application which is configured to be run on a computer with an operating system and a file system, wherein the documents to be managed are stored in said file system and said documents are described by attributes such as identifiers and properties. The attributes are stored as separate elements attached to said documents. Furthermore, the present invention relates to a document management system software application which is configured to be run on a computer with an operating system and a file system, wherein the document management system is for managing documents stored in said file system, wherein said documents are described by attributes such as identifiers and properties. The document management system is configured to handle document attributes being separately and directly attached to said documents.Type: ApplicationFiled: February 21, 2007Publication date: October 22, 2009Inventor: Arne Esmann-Jensen
-
Publication number: 20090265654Abstract: A method provides, as part of a computer administration system, an administration interface that can operate on almost any computerized device that has a user interface. The computer administration system manages components of a computer system and the administration interface is operable to configure the components and to provide dynamic performance and configuration information of the components to the user as the components operate. The method provides a “commentary input” area on the administration interface while providing the performance and configuration information of a specific component or a set of components. Thus, the method can receive comment(s) about the specific component(s) of the computerized system in the commentary input area. When this occurs, the method stores the comment(s) in a data store in a manner that associates the comment(s) with the specific component(s) that was being monitored.Type: ApplicationFiled: April 22, 2008Publication date: October 22, 2009Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Andreas Dieberger, Eben M. Haber, Eser Kandogan, Gilly Leshed
-
Publication number: 20090265313Abstract: This description provides tools and techniques for automatically extracting data from semi-structured documents. A computer-readable storage medium may contain computer-executable instructions that, when executed by a computer, cause the computer to receive a request for data representing an inferred structure of an input document. For the request, the computer may determine whether a repository containing mined information includes the requested data. If the repository contains the requested data, the computer may return the data representing the inferred structure of the input document in response to the request.Type: ApplicationFiled: April 18, 2008Publication date: October 22, 2009Inventor: Yanxin Wang
-
Patent number: 7606741Abstract: A system, apparatus and method for filling forms, including using a graphical capture device, are described herein.Type: GrantFiled: April 1, 2005Date of Patent: October 20, 2009Assignee: Exbibuo B.V.Inventors: Martin T. King, James Q. Stafford-Fraser, Clifford A. Kushler, Dale L. Grover
-
Patent number: 7607076Abstract: Methods and apparatuses that synchronize a paper document to an associated digital document by establishing a mapping. An embedded interactive code (EIC) Document is created as a digital file that serves as an intermediate tier between the paper document and the digital document. Both the paper document and the EIC document are generated while printing the paper document. The EIC document records the corresponding EIC array allocations and a unique document identification number. An image capturing pen may generate a stroke on any page of paper document. With the EIC document, the methods and apparatuses inform an application the page and location on the page of the stroke.Type: GrantFiled: February 18, 2005Date of Patent: October 20, 2009Assignee: Microsoft CorporationInventors: Jian Wang, Liyong Chen, Youjun Liu, Jiang Wu
-
Publication number: 20090254524Abstract: Data may be provided in a language chosen by a user. A data record may be stored in a database using symbols to represent data. These symbols may be converted into various languages. A resource file for a given language defines a correspondence between the symbols in the record and the words in that language. A user indicates a choice of language in which to receive data. The user's choice is stored in the database, and conversion information from the resource file for the user's chosen language is copied to the database. When a program connects to a database and requests data records, the user's language choice is retrieved from the data, and the conversion information stored in the database is used to convert the data records into the chosen language. The requested data is then provided to the application in the user's chosen language.Type: ApplicationFiled: April 7, 2008Publication date: October 8, 2009Applicant: MICROSOFT CORPORATIONInventors: Soren Francker, Jorn Lindhard Mortensen, Srinivasan Parthasarathy, Hans Jorgen Gron
-
Publication number: 20090254838Abstract: A data processing system for delivering an open profile personalization system based profile data structures that contain one or more interest nodes. The interest nodes include respective sets of targets and qualifiers, where the targets and qualifiers comprise typed-attributes to be used in the filtering of information files for delivery as a result set for the interest nodes. Targets and qualifiers are applied the typed-attributes of available information files to produce the filtered set. Web pages showing the personalized results include tools based on sophisticated content analysis to assist the user in creation and editing of the open profile. A method for presenting and updating the web pages is responsive to the use of these tools.Type: ApplicationFiled: April 2, 2009Publication date: October 8, 2009Applicant: iCurrent, Inc.Inventors: RAMANA B. RAO, Todd A. Cass, Moshe Cohen, Brian L. Neumann, Linda Uyechi
-
Publication number: 20090254523Abstract: Methods and apparatuses relate to hosting an inverted index for term-based document searching. According to disclosed aspects, each bank of a plurality of banks receives a plurality of Document IDentifiers (DocIDs) in the inverted index, and within each bank, posting lists for each term are determined large or small. DocIDs for large posting lists are distributed among computers in a bank while responsibility for producing DocIDs identifiers in a small posting list are distributed by term to one or fewer computers in the bank. During operation, each term of a query is distributed to each bank, and then for small terms, only those computers assigned responsibility for a given term need to search for responsive DocIDs. DocIDs can be redistributed among computers in a bank such that results are presented from the computers that would have produced those results in a cluster having a pure DocIDs distribution scheme.Type: ApplicationFiled: April 4, 2008Publication date: October 8, 2009Applicant: Yahoo! Inc.Inventors: Kevin Lang, Swee Lim, Choongsoon Chang
-
Publication number: 20090254549Abstract: Embodiments of methods and apparatuses for searching contents, including structured search are described herein. Embodiments of the present invention use tree structures (or more generally, graph structures), layout structures, and/or content category information to capture within search results relevant content that would otherwise be missed, to reduce the incidence of false positives within search results, and to improve the accuracy of rankings within search results. Embodiments of the present invention further use tree structures (or more generally, graph structures), layout structures, and/or content category information to extend search results to include sub-document constituents. Embodiments of the present invention also support the use of distribution properties as criteria for ranking search results.Type: ApplicationFiled: April 10, 2009Publication date: October 8, 2009Applicant: Zalag CorporationInventor: Samuel S. Epstein
-
Publication number: 20090254575Abstract: Methods for packing and unpacking files in a multi-level hierarchy in single actions. The methods operate in memory through using one file pointer for the archive file in recursive calls to the packing and unpacking methods, for accessing files in multiple nested levels. The packing and unpacking are performed in memory, and no temporary files are written to a storage device, thus saving on storage and processing time. A user can also store or retrieve files selectively from an archive file.Type: ApplicationFiled: April 2, 2008Publication date: October 8, 2009Applicant: SAP PORTALS ISRAEL LTD.Inventor: Pavel KRAVETS
-
Publication number: 20090248678Abstract: A document set, and history documents including documents, etc., browsed by a user are input. The document set and the history documents are each analyzed to obtain characteristic vectors. A plurality of topic clusters and a plurality of sub-topic clusters are obtained by clustering the document set. A transition structure showing transitions of topics among the sub-topic clusters is generated, and a characteristic attribute is extracted from each topic cluster and each sub-topic cluster. An cluster-of-interest is extracted in comparison among characteristic vectors of the history documents and a characteristic vector of each document included in the document set, a sub-topic cluster having transition relations with the cluster-of-interest is obtained on the basis of a transition structure owned by the cluster-of-interest, and a document included in the sub-topic cluster is extracted as a recommended document to be presented together with the characteristic attribute.Type: ApplicationFiled: March 20, 2009Publication date: October 1, 2009Inventors: Masayuki Okamoto, Masaaki KIKUCHI
-
Publication number: 20090248674Abstract: A search keyword improvement apparatus includes a unit extracting a word as an additional keyword candidate from a new document, number of times of appearance of the word in the new document being greater than number of times of appearance of the word in each of a first documents except for the new document, if the new document and a new search target identification information item which is used to search the new document are accumulated, a unit generating a first search query based on an input keyword, a second search target associated with the input keyword, and one of the additional keywords, and generating a second search query, a unit moving the additional keyword candidate and the third search target identification information item, if the desired search result is selected from a third search result list corresponding to the second search query.Type: ApplicationFiled: February 17, 2009Publication date: October 1, 2009Inventors: Masaru SUZUKI, Tomoharu Kokubu
-
Publication number: 20090248620Abstract: Extraction methods can interact on a common data source using identifiers that correspond to events or other actions. These identifiers can be updated, whenever appropriate, once the corresponding data has been summarized, in order to provide for multiple extraction methods to operate only on the data of interest, and obtain a lock only on the data within the scope of extraction. High water marks, such as identifiers in the sequent, can be used to further designate which data has previously been extracted. Similarly, summarization methods can interact by utilizing corresponding persistent tables in the flows for the methods, but utilizing separate intermediate tables to allow for data transformations and application of various business rules and tuning techniques. The ability to switch between different methods can accommodate business, performance, or other such needs, and can provide for the dynamic extraction and summarization of different volumes of data.Type: ApplicationFiled: March 31, 2008Publication date: October 1, 2009Applicant: Oracle International CorporationInventors: Shane Robert Vermette, Vijay Manguluru, Manoj Kumar
-
Publication number: 20090248510Abstract: An apparatus and method for providing relevant search result and query terms are disclosed herein. Natural language processing of the documents and previous search session history are used to dynamically determine document relevance, queries relevant to search categories prior to start of a search session, and query to query correlations.Type: ApplicationFiled: March 31, 2008Publication date: October 1, 2009Applicant: Yahoo! Inc.Inventor: Ashwinder Ahluwalia
-
Publication number: 20090248756Abstract: In general, embodiments of the invention relate to reading data from and writing data to a storage system. Specifically, embodiments of the invention relate to a read only mode for a portion of a storage system. In one embodiment, a selective read-only mode for a portion of a storage system is implemented by monitoring a condition that may affect a subset of persistent storage in a storage system, by detecting the condition, by entering a read-only mode for the subset, and by enforcing a policy of processing write requests and read requests to the storage system, which includes processing the write requests without modifying user data stored on the subset and processing the read requests, including requests for user data stored on the subset.Type: ApplicationFiled: March 27, 2008Publication date: October 1, 2009Inventors: Tyler A. Akidau, Neal T. Fachan, Aaron J. Passey
-
Publication number: 20090248666Abstract: An apparatus and method for providing relevant search result and query terms are disclosed herein. Natural language processing of the documents and previous search session history are used to dynamically determine document relevance, queries relevant to search categories prior to start of a search session, and query to query correlations.Type: ApplicationFiled: March 31, 2008Publication date: October 1, 2009Applicant: Yahoo! Inc.Inventor: Ashwinder AHLUWALIA