Abstract: For adaptive similarity search resolution in a data deduplication system using a processor device in a computing environment, input data is partitioned into data chunks. Input similarity elements are calculated for an input chunk. The input similarity elements are used to find similar data in a repository of data using a similarity search structure. A resolution level is calculated for storing the input similarity elements. The input similarity elements are stored in the calculated resolution level in the similarity search structure.
Type:
Grant
Filed:
July 17, 2013
Date of Patent:
September 11, 2018
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for identifying topical entities. In one aspect, a method includes obtaining a plurality of entities that are associated with a first resource; for one or more of the identified entities, receiving search results for a search query derived from the entity; determining that search results for a search query including a particular entity include a specific type of search results; and determining that the particular entity is a topical entity of the first resource based at least in part on the particular entity appearing in a title or a resource locator of the first resource, wherein the topical entity of the first resource represents a predominant topic of the first resource.
Abstract: A system and method for operating a user device includes a receiver that receives a first data object and a second data object. A first memory stores the first data object and a second memory stores the second data object. The second memory is separate from the first memory. A user interface module generates a user input command. An application interface selects a memory location from the first memory or the second memory and obtaining stored data from the first memory or the second memory based on the user input commands. A display displays the stored data.
Type:
Grant
Filed:
September 30, 2010
Date of Patent:
August 14, 2018
Assignee:
The DIRECTV Group, Inc.
Inventors:
Leo Wu, Huy Q. Tran, Flemming R. Hansen, Peter Leong, Eric H. Chang, Gordon H. Chen
Abstract: A search engine for searching based on related scientific or technological concepts, comprises: a learning module for learning about relationships between technical phrases based on their rates of occurrence in related documents, therefrom to form concepts from groupings of related phrases, and a search module for searching for related documents to a query document based on occurrence in said related documents of concepts present in said query document, the learning module carrying out said learning based on a training set of documents and inter-document relations.
Abstract: A method and system for performing a graph search, includes constructing an abstract representation of the graph using state-space abstraction. The abstract representation of the graph includes one or more abstract nodes having duplicate detection scopes and one or more abstract edges having operator groups. The duplicate detection scopes of the abstract nodes are partitioned into smaller duplicate detection scopes using edge partitioning. The abstract edges include the smaller duplicate detection scopes. Nodes in the current search layer are expanded using the operator groups of outgoing abstract edges of the abstract nodes the nodes map to. The operator groups associated with abstract edges having disjoint duplicate detection scopes are used to expand the nodes in parallel. Once all the operator groups in the current search layer have been used for node expansion the method progresses to the next search layer.
Type:
Grant
Filed:
July 23, 2010
Date of Patent:
May 29, 2018
Assignee:
PALO ALTO RESEARCH CENTER INCORPORATED
Inventors:
Rong Zhou, Tim Schmidt, Minh Binh Do, Serdar Uckun
Abstract: Managing data set objects for graph-based data processing includes: storing a group of one or more data set objects in a data storage system, the data set objects each representing a respective data set; and generating an association between at least a first data set object in the group and at least a first node of a dataflow graph for processing data in a data processing system, the first node representing a source or sink of data in a flow of data represented by a link in the dataflow graph, and the first data set object including a plurality of modes in which different transformational logic is applied to data processed by the first node.
Type:
Grant
Filed:
October 25, 2011
Date of Patent:
May 22, 2018
Assignee:
Ab Initio Technology LLC
Inventors:
Brond Larson, Richard A. Shapiro, Craig W. Stanfill, Adam Harris Weiss
Abstract: In a previous storage apparatus, differential JNLs are reflected in order of the sequential numbers, to the data volumes thereof. If a first storage apparatus is suspended, it is determined which is newer: the sequential number which the journal recently reflected in a second storage apparatus or the sequential number reflected in a third storage apparatus. In the newer storage apparatus having the newer sequential number, it is determined whether one or more JNLs from the journal having the sequential number next to the sequential number which is not determined to be the newer to the journal having the sequential number determined to be the newer exist, or not. If the result of the determination is positive, from the newer storage apparatus to the previous storage apparatus which is not the newer of the second and the third storage apparatuses, one or more differential JNLs are copied.
Abstract: A method, system and computer program product for generation and management of incremental backups of VEE file system using bitmaps. The proposed method allows users to roll back to any previous version or state of the VEE file system and to instantiate this version using the data encapsulated in virtual disk storage (i.e. file system) of the VEE. A number of VEEs run on a Host Operating System of the computer system. One of the VEEs implemented on the computer system is designated to generation and management of backups of the virtual disk data of the other VEEs without freezing the file system during the entire backup process. A special tracing application runs on the designated VEE for generating the bitmap of a file system snapshot. The user can also set up a time for generating a backup or create a schedule for automatically generating the backups at critical points.
Type:
Grant
Filed:
February 26, 2013
Date of Patent:
December 26, 2017
Assignee:
Parallels IP Holdings GmbH
Inventors:
Alexay N. Kuznetzov, Alexander G. Tormasov, Kirill S. Korotaev, Dmitry I. Monakhov
Abstract: A system and method for reporting a user's behavior and patterns when engaged in use of an electronic consumable. In a preferred embodiment, an electronic consumable such as an electronic book or library includes detectors for collecting biological information from a user. This information is analyzed to identify the user's interests in and reactions to the electronic consumable.
Type:
Grant
Filed:
July 31, 2003
Date of Patent:
December 5, 2017
Assignee:
International Business Machines Corporation
Inventors:
John R. Hind, Steven Michael Miller, Patrick P. Reynolds, Abdolreza Salahshour
Abstract: System and method to compress data records by providing data records with a binary structure; dividing the data records into several bit vectors; reducing the size of each bit vector by dividing the bit vector into consecutive partial areas of equal size, each partial area consisting of n bits, classifying the partial areas as trivial partial areas, quasi-trivial partial areas and non-trivial partial areas, combining one non-trivial or several consecutive non-trivial partial areas into one so named R block, and removing the trivial partial areas; as well as combining one quasi-trivial or several consecutive quasi-trivial partial areas into one so named O block.
Type:
Grant
Filed:
February 4, 2011
Date of Patent:
October 31, 2017
Assignee:
PARSTREAM GMBH
Inventors:
Jorg Bienert, Michael Hummel, Norbert Heusser
Abstract: An apparatus and method for searching and displaying using cognitive pattern recognition including searching for document(s) with at least one search text, wherein each search text is associated with a highlight option; selecting to enable or disable the highlight option for each of the search text; displaying a progressive relationship of the document(s) in scaled common image format (CIF) by displaying: a first display presenting the document(s), wherein each of the document(s) includes all of the search text; a second display presenting only pages from the document(s) where the only pages presented include one or more of the search text with its associated highlight option enabled; and a third display presenting one page from the only pages wherein all occurrences of the search text where the highlight option for the search text is enabled are displayed simultaneously on the page.
Type:
Grant
Filed:
November 28, 2011
Date of Patent:
September 26, 2017
Assignee:
ImageScan, Inc.
Inventors:
Basker S. Krishnan, Hanoz J. Kateli, Bryan Heesch
Abstract: A digital analytics system comprises a data management system including data extraction modules and a data storage system. The data extraction modules extract data from data sources and store the data in storage units. An analytics engine system including analytics engines and interfaces to retrieve data relevant to the analytics engines from the storage units. The analytics engines may perform prescriptive or descriptive analytics on the retrieved data. An applications interface and storage stores applications. The applications may be executed using information generated by the prescriptive or descriptive analytics performed by the analytics engines.
Type:
Grant
Filed:
October 25, 2011
Date of Patent:
August 15, 2017
Assignee:
ACCENTURE GLOBAL SERVICES LIMITED
Inventors:
Leonidas Michael Barrett, Tzuu-Wang Shein