Document Retrieval Systems (epo) Patents (Class 707/E17.008)
  • Patent number: 9367659
    Abstract: A system and method for reverse synthesizing an integrated circuit from a netlist. A netlist extracted from a device under review is received and converted to a connected graph. Blocks of cells are identified within the connected graph and a circuit model is formed from the blocks of cells, wherein forming includes iteratively building more complex blocks of cells from simpler blocks of cells.
    Type: Grant
    Filed: August 7, 2014
    Date of Patent: June 14, 2016
    Assignee: Raytheon Company
    Inventors: Parviz Saghizadeh, Thomas Allen Spargo, Robert T. Narumi, Mark W. Redekopp
  • Patent number: 9367604
    Abstract: The present invention makes legal research more efficient by selecting clusters in response to the behavior of a user (e.g., a legal professional such as a paralegal, lawyer, or judge). The clusters, which are formed prior to the user accessing a legal document (and thus, providing user behavior to a system), are identified to the based upon a set of metadata associated with the legal document. At least two clusters are identified and a signal associated therewith is transmitted to the user. Each cluster is associated with a unique legal topic. Further, each cluster may comprise primary and/or secondary authority.
    Type: Grant
    Filed: October 16, 2015
    Date of Patent: June 14, 2016
    Assignee: Thomson Reuters Global Resources
    Inventors: Qiang Lu, Jack G. Conrad, Michael Dahn, William M. Keenan
  • Patent number: 9304649
    Abstract: Embodiments of the present invention address deficiencies of the art in respect to hierarchical tree file browsers and provide a method, system and computer program product for selectably flattening a hierarchical tree object structure in a hierarchical tree object browser. In one embodiment of the invention, a method of flattening an object hierarchy in an object browser can be provided. The method can include selecting a node in an object hierarchy in an object browser and, responsive to selecting the node, displaying content for the selected node and content for at least one node below the selected node in the object browser.
    Type: Grant
    Filed: February 20, 2014
    Date of Patent: April 5, 2016
    Assignee: International Business Machines Corporation
    Inventors: Josef Scherpa, Andrew L. Schirmer
  • Patent number: 9246919
    Abstract: A portable information terminal apparatus includes an obtaining unit that causes an information apparatus shared by multiple users to obtain an access token that contains access right information indicating that the information apparatus has an access right to an external service utilization system, and obtains the obtained access token, and a request unit that transmits the access token obtained by the obtaining unit to the information apparatus upon requesting the external service utilization system to execute a service operation, and causes the information apparatus to perform data communication with the external service utilization system in response to a request for execution of the service operation, using the access token.
    Type: Grant
    Filed: September 6, 2013
    Date of Patent: January 26, 2016
    Assignee: FUJI XEROX CO., LTD.
    Inventor: Shigeki Ishino
  • Patent number: 9201956
    Abstract: The present subject matter provides systems, methods, software, and data structures for patent mapping, storage, and searching. Some such embodiments include mapping patent documents, claims, and claim limitations. Some further embodiments provide for searching a universe of patent documents by patent document, claim, limitation, class, element, or concept.
    Type: Grant
    Filed: February 2, 2012
    Date of Patent: December 1, 2015
    Assignee: Schwegman Lundberg & Woessner, P.A.
    Inventors: Steven W. Lundberg, Janal M. Kalis, Pradeep Sinha
  • Patent number: 8996561
    Abstract: A method, system and computer program product are disclosed for searching for data. In one embodiment, the invention provides a method comprising identifying a query and a search scope including a set of specified entities; and for each of these entities, estimating a number of documents that would be identified in a search through the entity to answer the query. On the basis of this estimating, a subset of the entities is formed. The query and this subset of entities are sent to a search engine to search the subset of entities to answer the query. In one embodiment, the estimating includes collecting statistical information from queries to build up a historical cache using heuristics or machine learning techniques, wherein the query includes a key word and a scope, and the historical cache contains a maximum number of returned results for an entity given the queries executed.
    Type: Grant
    Filed: August 4, 2009
    Date of Patent: March 31, 2015
    Assignee: International Business Machines Corporation
    Inventors: Yu Deng, Murthy V. Devarakonda, Rafah A. Hosn, Nithya Rajamani, Norbert G. Vogl
  • Patent number: 8977689
    Abstract: A system facilitates collaborative communications and information sharing in a network defined by a model. The model and a portion of the system are stored on a storage component coupled to a terminal. The system captures context information and user-defined data, the user-defined data provided during user interaction of the user in a first domain of the network, and dynamically stores the context information as metadata associated with the user-defined data, the user-defined data and the metadata stored on the storage component; a tracking component for tracking a change of the user from the first domain to a second domain of the network and dynamically updating the stored metadata based on the change, where the user accesses the user-defined data from the second domain; and an interface to the system that permits the user to create and view the user-defined data according to the model of the network.
    Type: Grant
    Filed: June 5, 2014
    Date of Patent: March 10, 2015
    Assignee: VirtualAgility Inc.
    Inventor: Douglas F. Beaven
  • Patent number: 8775246
    Abstract: A comprehensive platform for merchandising intellectual property (IP) and conducting IP transactions is disclosed. A standardized data collection method enables IP assets to be characterized, rated and valuated in a consistent manner. Project management, workflow and data security functionality enable consistent, efficient and secure interactions between the IP Marketplace participants throughout the IP transaction process. Business rules, workflows, valuation models and rating methods may be user defined or based upon marketplace, industry or technology standards.
    Type: Grant
    Filed: July 29, 2011
    Date of Patent: July 8, 2014
    Assignee: American Express Travel Related Services Company, Inc.
    Inventor: Tracey R. Thomas
  • Patent number: 8775667
    Abstract: A message routing system that allows applications at either end of the system to run as-is without modification. The system functions in a multithreaded environment and is capable of handling complex routing rules and message transformation. It is also capable of learning and executing new routing rules and message transformations in formats previously unrecognized by the system. The system enables precise and reliable logging of messages throughout processing and supports publication of enterprise-wide broadcast messages. The system further preferably employs cooperating inbound and outbound transport processes for consuming, routing, processing, safely storing and publishing messages in batches of logical units of work to ensure that the logical units of work are not lost in system transactions. The system also preferably utilizes a replay server for preserving and replaying messages that might otherwise fail to reach their intended destinations.
    Type: Grant
    Filed: April 4, 2008
    Date of Patent: July 8, 2014
    Assignee: Goldman, Sachs & Co.
    Inventors: Carl J. Reed, Michael R. Marzo, Tomozumi Kanayama, Konstantin A. Krasheninnikov, Julien George Beguin
  • Publication number: 20140172915
    Abstract: One illustrative embodiment involves receiving a content request for accessing a piece of content, the content request identifying the piece of content, the content request received by a first computer device, and the content request requesting access to the piece of content by a content requester. The embodiment further involves receiving information about the content requester and sending from the first computer device a requester-specific information request requesting additional information from the content requester based at least in part on information about the content requester. The embodiment further involves receiving the additional information at the first computer device and selectively, at the first computer device, providing access to the piece of content based at least in part on the additional information.
    Type: Application
    Filed: February 16, 2011
    Publication date: June 19, 2014
    Applicant: ADOBE SYSTEMS INCORPORATED
    Inventor: Jonathan Herbach
  • Publication number: 20140136487
    Abstract: A computer-implemented system and method for content management targeted rollback, including receiving at least one change to be made to a field on a document. A rollback document representing the document is stored. Metadata associated with the change and the rollback document is stored. The change is executed. A rollback request is received containing targeting metadata designating the rollback document. The document is rolled back to the rollback document that is associated with the stored metadata that corresponds to the targeting metadata.
    Type: Application
    Filed: November 14, 2012
    Publication date: May 15, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Randy E. Oyarzabal, Jeffery A. Turner
  • Patent number: 8719278
    Abstract: In one exemplary embodiment, a set of attributes derived from an element of a first digital document is obtained. The element is identified from eye-tracking data of a user viewing the digital document. A search query of a database comprising at least one query term is received. A set of documents in the database is identified according to the search query. An attribute score is determined for each document. The set of documents are sorted according to the attribute score. Optionally, a commonality between the query term and at least one member of the set of attributes may be determined. The search query may be generated by the user. The database may be a hypermedia database.
    Type: Grant
    Filed: September 15, 2011
    Date of Patent: May 6, 2014
    Assignee: Buckyball Mobile Inc.
    Inventors: Amit V. Karmarkar, Sharada Karmarkar, Richard R. Peters
  • Publication number: 20140108397
    Abstract: Systems and methods are provided for providing document data. A host application is displayed on an interface of a computer system, where the host application includes an interface field that is linked to a document field of documents in a document management system. An enabler application captures a field value for the interface field and an operation identification from the host application. A context rule database contains a plurality of context rules that are accessed based upon the operation identification, where the context rule identifies a type of document that is relevant to the identified operation. A document management system is configured to be queried based on the field value and the relevant document type, where the document management system is configured to return document data based on said query, and where the interface of the computer system is configured to be updated based on the returned document data.
    Type: Application
    Filed: October 12, 2012
    Publication date: April 17, 2014
    Applicant: HYLAND SOFTWARE, INC.
    Inventors: Miguel A. Zubizarreta, Alejandro Vanegas
  • Publication number: 20140089251
    Abstract: A computer receives one or more files having configuration information that includes data that defines a plurality of stages of an extract, transform, and load (ETL) job, wherein the plurality of stages comprise a read stage that is preceded by a write stage, and wherein the read stage reads data from a source location, and wherein the data that is read or a modified version of the data that is read is being written by the write stage that writes data to the source location. The computer replaces the read stage with a decompressor stage. The computer replaces the write stage with a compressor stage. The computer executes the decompressor stage and compressor stage on a field-programmable gate array that is programmatically customized with data compression and data decompression functionality to enhance the performance of the ETL job.
    Type: Application
    Filed: September 21, 2012
    Publication date: March 27, 2014
    Applicant: International Business Machines Corporation
    Inventors: Manish A. Bhide, Krishna K. Bonagiri, Srinivas K. Mittapalli, Sumit Negi
  • Publication number: 20140089308
    Abstract: A method and system for tracking documents in computing devices is provided. The method includes creating a unique identifier for a document in a computing device based on contents of the document and meta-data associated with the document. The unique identifier is then used for tracking the document in one or more computing devices. The tracking of the document can be done for all the documents in plurality of computing devices. The method further tracks propagation of the document in the plurality computing devices.
    Type: Application
    Filed: September 26, 2012
    Publication date: March 27, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Arun Ramakrishnan, Rohit Shetty
  • Publication number: 20140081904
    Abstract: A system, comprised of at least two computing appliances, is provided for use by a plurality of users. Each of the at least two computing appliances provides a presentation of an underlying image. Each input apparatus is responsive to an input made by a respective user thereof, to generate respective annotation data representative of annotations made by at an associated location on the presentation display of the underlying image. An event generator, responsive to the annotation data, generates events comprising event content generated in an entry order of input that is stored in event storage. Selection logic defines a selected set of events (that can comprise less than all the events within the entry order of input). Display assembly logic generates display data responsive to processing the event content for the events in said selected set of events. A display presentation is generated responsive to the display data.
    Type: Application
    Filed: September 14, 2012
    Publication date: March 20, 2014
    Inventors: David H. Sitrick, Russell T. Fling
  • Publication number: 20140067819
    Abstract: A method and apparatus are provided for building and using a persistent XML tree index for navigating an XML document. The XML tree index is stored separately from the XML document content, and thus is able to optimize performance through the use of fixed-sized index entries. The XML document hierarchy need not be constructed in volatile memory, so creating and using the XML tree index scales even for large documents. To evaluate a path expression including descendent or ancestral syntax, navigation links can be read from persistent storage and used directly to find the nodes specified in the path expression. The use of an abstract navigational interface allows applications to be written that are independent of the storage implementation of the index and the content. Thus, the XML tree index can index documents stored at least in a database, a persistent file system, or as a sequence of in memory.
    Type: Application
    Filed: September 5, 2012
    Publication date: March 6, 2014
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Anguel Novoselsky, Zhen Hua Liu, Thomas Baby
  • Publication number: 20140046947
    Abstract: A method for question/answer creation for a document is described. The method includes importing a document having a set of questions based on content in the document. The method also includes automatically creating a candidate question from the content in the document. The method also includes automatically generating answers for the set of questions and the candidate question using the content in the document. The method also includes presenting the set of questions, the candidate question, and the answers to a content creator for user verification of accuracy. The method also includes storing a verified set of questions in the document. The verified set of questions includes the candidate question.
    Type: Application
    Filed: August 9, 2012
    Publication date: February 13, 2014
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Jana H. Jenkins, David C. Steinmetz, Wlodek W. Zadrozny
  • Patent number: 8650197
    Abstract: A system determines documents that are associated with a location, identifies a group of signals associated with each of the documents, and determines authoritativeness of the documents for the location based on the signals.
    Type: Grant
    Filed: March 9, 2012
    Date of Patent: February 11, 2014
    Assignee: Google Inc.
    Inventors: Daniel Egnor, Geeta Chaudry
  • Publication number: 20140032558
    Abstract: A computer implemented system and method are provided for refining category scores for pages of a sequence of document pages that potentially includes document boundaries. The method uses initial category scores provided by a categorizer that considers one page at a time or concatenated pairs of pages (called bipages). The category scores represent the probability that a page belongs to a particular category. The method uses anisotropic diffusion to refine the initial page category scores using the scores of neighboring pages as a function of the probability that there is a boundary between the pages. The method may be performed iteratively.
    Type: Application
    Filed: July 26, 2012
    Publication date: January 30, 2014
    Applicant: Xerox Corporation
    Inventors: Jean-Michel Renders, François Ragnet, Damien Cramet
  • Publication number: 20140032489
    Abstract: A method and apparatus for viewing a collaborative document and a portable document at a device in a network. The collaborative document is hosted on a server and accessible through a network. The device hosts a corresponding portable document. The document processing application allows viewing of the portable document and the collaborative document on the device, wherein the user may select the desired view. In one embodiment, each view is displayed as a tabbed window, and switching views is enabled by selection of a tab. When the device is disconnected from the network, the user may view and process the portable document.
    Type: Application
    Filed: January 22, 2009
    Publication date: January 30, 2014
    Applicant: Adobe Systems Incorporated
    Inventors: Vivek Hebbar, Robert K. McAfee
  • Publication number: 20140032488
    Abstract: A method and apparatus for processing collaborative documents providing a portable document version which may be processed when not connected to the collaborative document. The collaborative document is accessible to users through a network. Updates to the collaborative document are provided to the portable document, which may be modified to include the updates or replaced with an updated version of the collaborative document.
    Type: Application
    Filed: January 22, 2009
    Publication date: January 30, 2014
    Applicant: Adobe Systems Incorporated
    Inventors: Robert K. McAfee, Vivek Hebbar
  • Publication number: 20140006341
    Abstract: An online document management system is disclosed. In one embodiment, the online document management system comprises: one or more editorial computers operated by one or more administrators or editors, the editorial computers send invitations and manage peer review of document submissions; one or more system computers, the system computers maintain journals, records of submitted documents and user profiles, and issue notifications; and one or more user computers; the user computers submit documents or revisions to the document management system; wherein one or more of the editorial computers coordinate with one or more of the system computers to migrate one or more documents between journals maintained by the online document management system.
    Type: Application
    Filed: June 28, 2012
    Publication date: January 2, 2014
    Inventor: Robin Jason Lopulalan
  • Publication number: 20140006424
    Abstract: The present invention provides systems, methods, and software for automatically processing data included in a document and identifying and recommending citations matching the processed data. The system allows a user to select and submit text segment(s) for analysis and to select from a set of recommended citations a citation(s) that matches the text segment as well as profile data for inclusion in the document. One or more citation libraries or authority databases are queried to find citations for recommendation which best match the text segment selected and submitted by the author. The system automatically processes data submitted by an author to generate a set of recommended citations for consideration and for inclusion within a document while the document is presented by a document rendering application. A selected citation is then formatted and inserted in the document.
    Type: Application
    Filed: June 29, 2012
    Publication date: January 2, 2014
    Inventors: Khalid Al-Kofahi, Charles Macomber, Jason Rollins, Ellen Rotenberg, Christine Killian
  • Publication number: 20130346421
    Abstract: A targeted disambiguation system is described herein which determines true mentions of a list of named entities in a collection of documents. The list of named entities is homogenous in the sense that the entities pertain to the same subject matter domain. The system determines the true mentions by leveraging the homogeneity in the list, and, more specifically by applying a context similarity hypothesis, a co-mention hypothesis, and an interdependency hypothesis. In one implementation, the system executes its analysis using a graph-based model. The system can operate without the existence of additional information regarding the entities in the list; nevertheless, if such information is available, the system can integrate it into its analysis.
    Type: Application
    Filed: June 22, 2012
    Publication date: December 26, 2013
    Applicant: Microsoft Corporation
    Inventors: Chi Wang, Kaushik Chakrabarti, Tao Cheng, Surajit Chaudhuri
  • Publication number: 20130346424
    Abstract: Technologies pertaining to computing a respective TF-IDF value for each term in each document of a relative large document corpus are described herein. TF-IDF values are computed with respect to terms in documents of a large document corpus by in a single pass over the document corpus. Secondary sorting functionality of a distributed computing framework is exploited to compute TF-IDF values efficiently.
    Type: Application
    Filed: June 21, 2012
    Publication date: December 26, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Xiong Zhang, Hung-chih Yang, Danny Lange
  • Patent number: 8615721
    Abstract: A thumbnail image generating unit generates thumbnail images from a plurality of images having a sequential relation. A thumbnail image displaying unit displays the thumbnail images generated according to the sequential relation. A thumbnail image designating unit receives a designation of a thumbnail image from among the thumbnail images. An enlarging unit generates an enlarged image of a designated thumbnail image. An enlarged image displaying unit displays the enlarged image. A forward-advance designating unit designates a forward advance of the enlarged image displayed by the enlarged image displaying unit according to the sequential relation.
    Type: Grant
    Filed: December 17, 2008
    Date of Patent: December 24, 2013
    Assignee: Ricoh Company, Ltd.
    Inventor: Junichi Hara
  • Publication number: 20130332461
    Abstract: A computer-based apparatus for searching confidential documents, including a computer with a memory element and a processor to execute instructions stored in the memory to receive a confidential document and related non-confidential information from a source entity. The processor executes the instructions to: store the confidential document and non-confidential information in the memory element; and restrict access to the confidential document stored in the memory element to the source entity and a library entity only, or to the first source entity only. The processor executes the computer readable instructions to: receive a search request from a searching entity including a search parameter; identify the search parameter as being applicable to the confidential document; and transmit for access by the searching entity, the non-confidential information. The library entity is different from the source entity.
    Type: Application
    Filed: June 8, 2012
    Publication date: December 12, 2013
    Applicant: IP.COM I, LLC
    Inventors: James T. Shea, Samuel C. Baxter, Natalia V. Britvikhina, John E. Meczynski, JR.
  • Publication number: 20130318055
    Abstract: Methods, systems, and computer program products for cache conflict detection are provided. A computer-implemented method may include providing a partial graph of data to an application executing on a mobile device where the partial graph is derived from a document comprising a graph of data having a plurality of nodes, receiving a modified partial graph from the application where the modified partial graph includes one or more changes to the partial graph, and determining a document version used to derive the partial graph is no longer a most recent version of the document.
    Type: Application
    Filed: May 23, 2012
    Publication date: November 28, 2013
    Applicant: Sybase, Inc.
    Inventors: Brian Keith Lorenz, Johannes Alberti, Lance Waterman
  • Patent number: 8593663
    Abstract: An image forming apparatus executes functional processing according to specific functional processing information stored in a storage unit. The image forming apparatus determines whether any data remains in the storage unit when new functional processing information is set. The image forming apparatus prevents the specific functional processing information stored in the storage unit from being updated based on the new functional processing information if any data remains in the storage unit, and replaces the specific functional processing information stored in the storage unit with the new functional processing information if no data remains in the storage unit.
    Type: Grant
    Filed: August 23, 2007
    Date of Patent: November 26, 2013
    Assignee: Canon Kabushiki Kaisha
    Inventor: Toru Yoshida
  • Publication number: 20130290266
    Abstract: Versioning of an archived document having at least one of a first element, a second element, and a third element, is managed. The first element is mapped to a source set identifier, the second element is mapped to a first source identifier, and/or the third element is mapped to a second source identifier. The source set identifier, the first source identifier, and the second source identifier are agnostic to a type of the document and a method in which the document is captured. A determination is made as to whether the document comprises a copy of an existing document in an archive, a new version of an existing document in the archive, or a new document to be stored in the archive based upon an analysis of the mapped at least one of the source set identifier, the first source identifier, and the second source identifier.
    Type: Application
    Filed: April 26, 2012
    Publication date: October 31, 2013
    Inventor: Rahul Kapoor
  • Publication number: 20130290299
    Abstract: Content-based navigation of an electronic device includes receiving supplemental content to an electronic book. The supplemental content is created separately from the electronic book. The content-based navigation also includes associating an identifier of the electronic book with the supplemental content, storing the supplemental content with the identifier in a storage device, and creating an index to the supplemental content that is searchable by the identifier of the electronic book. The content-based navigation further includes providing end user devices with access to the supplemental content in the storage device via the index.
    Type: Application
    Filed: April 25, 2012
    Publication date: October 31, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Guillaume Hoareau, Althea Hookens, John Musial, Sandeep R. Patil
  • Publication number: 20130275392
    Abstract: Computer program products and systems, determine solutions to a problem experienced by a data processing system user. A query is received from the user. The query includes a problem description of the problem experienced by the user with respect to the data processing system. One or more keywords are extracted from the received problem description. An index of problems and associated solutions is searched using the one or more extracted keywords. The index of problems and associated solutions is created by analyzing a document collection describing problems and associated solutions with a text analytics application. One or more documents are returned that contains words or phrases that are similar to the keywords used for searching the index of problems and associated solutions. The documents relevant for the problem and associated solutions are presented to the user.
    Type: Application
    Filed: April 12, 2012
    Publication date: October 17, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Dhruv A. Bhatt, Kristin E. McNeil, Nitaben A. Patel
  • Publication number: 20130275475
    Abstract: An information handling system comprises a connection via a network interface for receiving data representing business process data from an integrated business process running at a location, the business process data comprising at least data indicating from where documents are received. The system also comprises a storage device for storing data representing an aggregate of business process data for an integrated business process, and a processor adapted to determine from the aggregate business process data whether the integrated business process running at the location receives documents from an external trading partner that, if the integrated business process receives documents from the external trading partner, then the processor prepares instructions to select an information handling system environment for running the integrated business process having additional disk space or solid state drive resources.
    Type: Application
    Filed: April 13, 2012
    Publication date: October 17, 2013
    Applicant: DELL PRODUCTS, LP
    Inventor: James T. Ahlborn
  • Publication number: 20130262373
    Abstract: A comment infrastructure for managing co-authoring conflict resolutions is provided. During co-authoring, multiple users may make edits to a document at the same time or users may merge edits to a document. Embodiments determine if changes submitted by a user conflict with previously submitted changes. If a conflict is found, the conflicting change may be saved to the document as a comment, allowing for the user to choose when to resolve the conflict. The original content and the different co-authoring edits may be displayed side-by-side, allowing users to make an informed decision about a desired resolution of a conflict. Additional commenting functionalities may be provided for allowing users to leave comments, replies, or messages associated with a co-authoring conflict, providing communication and collaboration between users about a best way to resolve a co-authoring conflict.
    Type: Application
    Filed: March 30, 2012
    Publication date: October 3, 2013
    Applicant: MICROSOFT CORPORATION
    Inventor: Benjamin Edward Rampson
  • Publication number: 20130262439
    Abstract: A method includes identifying at least one document associated with content from at least one digital content source. The at least one document includes information identifying at least one aspect of the content. The method also includes determining a document index for the at least one document based on keywords included in the at least one document. An activity field is inserted into the document index. The method includes accessing activity information. The activity information identifies at least one activity keyword associated with at least one activity. The method further includes identifying at least one present activity keyword in the document based on the activity information. An indicator of at least one present activity is determined based on the at least one present activity keyword. The method includes indexing the indicator of the at least one present activity in the activity field of the document index.
    Type: Application
    Filed: March 27, 2012
    Publication date: October 3, 2013
    Applicant: VERIZON PATENT AND LICENSING INC.
    Inventors: Jack Jianxiu Hao, Zhiying Jin, Martin Busse, Jimena Velarde
  • Publication number: 20130262465
    Abstract: A method for clustering documents is provided. Each document is represented by a multidimensional data point. The data points are initially assigned to a respective cluster and serve as their initial representative points. Thereafter, in an iterative process, the data points are clustered among the clusters, by assigning the data points to the clusters based on a comparison measure of each data point with the cluster or its representative point, and a threshold of the comparison measure. Based on this clustering, a new representative point for each of the clusters can be computed. Optionally, overlapping clusters are merged. For the next iteration, the new representative points are used as the representative points. An assignment of the documents to the clusters is output, based on a clustering of the data points in the latest iteration. Multiple batches may be processed, retaining the initial clusters to which the original batch was assigned.
    Type: Application
    Filed: April 2, 2012
    Publication date: October 3, 2013
    Applicant: Xerox Corporation
    Inventors: Matthias Galle, Jean-Michel Renders
  • Publication number: 20130254178
    Abstract: An apparatus and method of retrieving relevant documents having medical research evidence receives a request to access a plurality of documents in a database stored in a memory device. Each of the plurality of documents contains information relating to medical research evidence and has an associated relational expression. The method then causes display of a user interface with a plurality of fields (a set of these fields are selectable, prescribed terms), and receives a relational expression based on information received from the user interface. The received relational expression includes at least one of the selectable, prescribed terms in the user interface. Next, the method compares the received relational expression with the relational expressions associated with at least one of the plurality of documents, and causes the display of information relating to a set of documents in the database as a function of the comparison of relational expressions.
    Type: Application
    Filed: March 23, 2012
    Publication date: September 26, 2013
    Applicant: NAVYA NETWORK INC.
    Inventors: Gitika Srivastava, Naresh Ramarajan
  • Publication number: 20130246467
    Abstract: A documentation inventory manager which assigns a protection key to each piece of documentation that is received. More specifically, when providing information to a receiving company, a client provides their files to a common FTP server. As a support team of the receiving company accesses the files and stores some or all of the files to a local storage system, the files are modified to include an imbedded header record. In certain embodiments, the imbedded header record includes information regarding an original file name sent by the client, a key value that is assigned to that version of the downloaded file, permissions such as whether the file can be copied, and the inventory manager location. Each time a version of the file is downloaded to a different location within the receiving company, that file name, location, and new unique key is updated in the documentation inventory manager.
    Type: Application
    Filed: March 16, 2012
    Publication date: September 19, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Joseph V. Malinowski, David C. Reed, Max D. Smith
  • Publication number: 20130238639
    Abstract: The present invention provides a method and system for comparing a first XML document with a second XML document. An XML event is parsed from the first XML document or the second XML document based on a plurality of parameters. The parsed XML event is stored as a node in a first data structure or a second data structure, and compared with one or more nodes stored in the second data structure or the first structure, respectively. A comparison result is outputted, when the one or more nodes is a comparable node of the stored node, and on outputting the comparison result the comparable node and the stored node are deleted from the first data structures and the second data structures. Aforementioned steps are repeated till the first XML document and the second XML document are completely parsed and compared.
    Type: Application
    Filed: June 26, 2012
    Publication date: September 12, 2013
    Applicant: INFOSYS LIMITED
    Inventors: Ganapathy Raman Venkatasubramanian, Sriram Hariharasubramanian, Saravanan Sakthivel, Anantasrinivas Lakshmanan, Bhuvanalakshmi Kadapakkam Nandabalan
  • Publication number: 20130226934
    Abstract: The subject disclosure is directed towards ranking electronic documents in sub-linear time complexity. An advertising provider may perform such a ranking in order to identify one or more electronic document to advertise a product or service. A ranking mechanism may execute a number of random walks around the Internet by navigating the electronic documents via embedded links from a starting document and an ending document that are within a pre-determined distance. After finishing the random walks, an estimate of rank contribution information associated with each electronic document is provided. The estimated rank contribution information is used to determine an exposure level with respect to a network for one or more of the electronic documents. The exposure value of an example electronic document may correspond to a ranking value that may be computed using a sample of the rank contribution information related to that document.
    Type: Application
    Filed: February 27, 2012
    Publication date: August 29, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Michael A. Brautbar, Christian Herwarth Borgs, Jennifer Tour Chayes, Shanghua Teng
  • Patent number: 8515964
    Abstract: Method, system, and programs for computing similarity. Input data is first received from one or more data sources and then analyzed to obtain an input feature vector that characterizes the input data. An index is then generated based on the input feature vector and is used to archive the input data, where the value of the index is computed based on an improved Johnson-Lindenstrass transformation (FJLT) process. With the improved FJLT process, first, the sign of each feature in the input feature vector is randomly flipped to obtain a flipped vector. A Hadamard transformation is then applied to the flipped vector to obtain a transformed vector. An inner product between the transformed vector and a sparse vector is then computed to obtain a base vector, based on which the value of the index is determined.
    Type: Grant
    Filed: July 25, 2011
    Date of Patent: August 20, 2013
    Assignee: Yahoo! Inc.
    Inventors: Shanmugasundaram Ravikumar, Anirban Dasgupta, Tamas Sarlos
  • Publication number: 20130212063
    Abstract: A system for managing litigation documents including a computer-readable medium having a litigation management application. The application has computer-executable instructions for managing documents. The system includes a server that executes the instructions for managing documents on the computer-readable medium. The system has a database containing document information. The database is arranged with the server such that the server can access and modify document information in the database. The application includes a first documents tab that receives inputs from a first user to upload documents to the database for display in the first documents tab. The application has a second documents tab that receives inputs from a second user. The application generates a notification for delivery to the second user when the first user uploads a document to the database. The application can display the document in the second documents tab when the first user uploads the document to the database.
    Type: Application
    Filed: February 9, 2012
    Publication date: August 15, 2013
    Applicant: MERCURY HOLDINGS LLC
    Inventors: Jonathan ROTH, Aaron YAFFA
  • Publication number: 20130212123
    Abstract: In one embodiment, the invention provides a method for a system to provide information based on a query, the method comprising: performing a first search of at least one first source for information responsive to the query; providing a result of the search to a user; searching documents using at least a part of the result of the search; providing the user with at least one example of usage of the result of the search obtained from the searching of stored documents; based on user input, performing a second search of at least one second source for information responsive to the query; and providing a result of said second search to the user.
    Type: Application
    Filed: February 14, 2012
    Publication date: August 15, 2013
    Inventors: Anna Matveenko, Alexander Rylov, Tatiana Parfentieva
  • Publication number: 20130212062
    Abstract: Methods and apparatus, including computer program products, to assemble a collection of documents according to a document list. The document list represents documents to be included in the collection, and includes multiple entries that identify document templates. Each document template includes instructions that a web server can execute to generate a web document based on one or more parameters. A web document corresponding to each of the multiple entries is requested; the requested web documents are received and stored in the collection of documents. Links in the received web documents can be identified and updated. The collection of documents can be accessed as part of a web site.
    Type: Application
    Filed: February 7, 2008
    Publication date: August 15, 2013
    Inventors: Philip Levy, Naoki Hada
  • Publication number: 20130212118
    Abstract: The present disclosure is generally directed to a system and method for managing and querying historical document use within a litigation history. In one illustrative embodiment, each document used in any litigation case for a specific client, corporation or individual can be included in a historical database for that client, corporation or individual. If, and when a document is tagged, redacted or produced, the history for that document can be updated with case information. A link can be provided that would enable the user to return back to the original case database. Each document can be identified by a hash value that allows efficient tracking of the same document throughout the litigation history of the client.
    Type: Application
    Filed: February 13, 2012
    Publication date: August 15, 2013
    Inventors: James M. KING, Richard T. RUYLE
  • Publication number: 20130198123
    Abstract: Systems, methods, and media for extracting and processing entity data included in an electronic document are provided herein. Methods may include executing one or more extractors to extract entity data within an electronic document based upon an extraction model for the document, selecting extracted entity data via one or more experts, each of the experts applying at least one business rule to organize at least a portion of the selected entity data into a desired format, and providing the organized entity data for use by an end user.
    Type: Application
    Filed: January 27, 2012
    Publication date: August 1, 2013
    Inventors: Jan Stadermann, Denis Jager, Uri Zernik
  • Publication number: 20130191389
    Abstract: Embodiments of the present disclosure provide for analyzing paragraphs in a fixed format document to determine style clusters or groupings of each paragraph. In certain embodiments, the paragraphs are grouped into style clusters based on a first property. Each style cluster is then further divided into sub-groups based on a second property. Once the sub-groups have been determined, a third property associated with each paragraph in each sub-group is normalized based on a dominant one of the at least the third property.
    Type: Application
    Filed: January 23, 2012
    Publication date: July 25, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Milos Lazarevic, Milos Raskovic
  • Publication number: 20130185317
    Abstract: A computer system for extracting address information from PDF documents to create a database of address information that can be used to generate address sheets for mail. It is preferred that the mail be accountable mail requiring feedback on the mailing process.
    Type: Application
    Filed: January 18, 2012
    Publication date: July 18, 2013
    Inventors: Nathan J. Welton, Jerry E. Staddon
  • Publication number: 20130185307
    Abstract: A method of evaluating a semantic relatedness of terms. The method comprises providing a plurality of text segments, calculating, using a processor, a plurality of weights each for another of the plurality of text segments, calculating a prevalence of a co-appearance of each of a plurality of pairs of terms in the plurality of text segments, and evaluating a semantic relatedness between members of each the pair according to a combination of a respective the prevalence and a weight of each of the plurality of text segments wherein a co-appearance of the pair occurs.
    Type: Application
    Filed: January 18, 2012
    Publication date: July 18, 2013
    Applicant: Technion Research & Development Foundation Ltd.
    Inventors: Ran EL-YANIV, David Yanay