Abstract: Mechanisms are provided for adjusting a configuration of data stored in a storage system. According to various embodiments, a storage module may be configured to store a configuration of data. A processor may be configured to identify an estimated performance level for the storage system based on a configuration of data stored on the storage system. The processor may also be configured to transmit an instruction to adjust the configuration of data on the storage system to meet the service level objective when the estimated performance level fails to meet a service level objective for the storage system.
Abstract: Methods and apparatus to partition data are discloses. An example method includes generating, with a processor, an indicator matrix for a set of panelist data based on a set of matrix criteria corresponding to the panelist data. The entries in the indicator matrix are determined based on a conditional probability of a panelist having one or more characteristics. An indicator is placed in a panelist vector of the matrix if the panelist has the one or more characteristics. A set of unique panelist vectors is determined and redundant panelist vectors that are not unique are removed to determine a minimum set of mutually exclusive partitions of the panelist data.
Type:
Grant
Filed:
September 21, 2015
Date of Patent:
October 2, 2018
Assignee:
The Nielsen Company (US), LLC
Inventors:
Michael Sheppard, Jonathan Sullivan, Peter Lipa, Alejandro Terrazas
Abstract: The present invention provides a method and apparatus for representing content information. The method and apparatus for representing content information according to embodiment of the present invention notifies the user employing a mobile environment (mobile terminal or smart terminal) in a tactile, auditory, or visual manner about existence of contents having a score larger than or equal to a particular score within a predetermined distance to represented contents in a predetermined order and enables the user to perform a search for the contents desired by the user based on the user's input, providing such an advantage that the user can find the contents of interest intuitively without examining searched contents one after another.
Abstract: An information processing apparatus, backup method, and program product that enable efficient differential backup. In one embodiment, an information processing apparatus for files stored in a storage device includes: a metadata management unit for managing metadata of files stored in the storage device; a map generation unit for generating a map which indicates whether metadata associated with an identification value uniquely identifying a file in the storage device is present or absent; and a backup management unit for scanning the metadata to detect files that have been created, modified, or deleted since the last backup, and storing at least a data block and the metadata for a detected file in a backup storage device as backup information in association with the identification value.
Type:
Grant
Filed:
September 19, 2017
Date of Patent:
August 14, 2018
Assignee:
International Business Machines Corporation
Abstract: An information processing apparatus, backup method, and program product that enable efficient differential backup. In one embodiment, an information processing apparatus for files stored in a storage device includes: a metadata management unit for managing metadata of files stored in the storage device; a map generation unit for generating a map which indicates whether metadata associated with an identification value uniquely identifying a file in the storage device is present or absent; and a backup management unit for scanning the metadata to detect files that have been created, modified, or deleted since the last backup, and storing at least a data block and the metadata for a detected file in a backup storage device as backup information in association with the identification value.
Type:
Grant
Filed:
September 19, 2017
Date of Patent:
August 14, 2018
Assignee:
International Business Machines Corporation
Abstract: A method for interacting with a database stored in an object grid is described. The database is given attributes of a spreadsheet. Elements stored in the database are represented and addressed as cells of a spreadsheet. Cells can store data objects, including formulas, and executable scripts. The spreadsheet can evaluate formulas, carry out the program instructions of executable scripts, and perform complex event processing. Interaction with the spreadsheet is accomplished through the use of structured data messages which include instructions, spreadsheet and cell addressing and, optionally, data elements.
Type:
Grant
Filed:
June 30, 2014
Date of Patent:
July 24, 2018
Assignee:
International Business Machines Corporation
Abstract: Provided are techniques for invoking with a processor executing on a computer a source code parser to obtain source information that includes a first location of an Application Programming Interface (API) call and parameters of the API call in source code of a client application, where the parameters the API call do not include query text for a query that is to be used to access a database; examining a stack trace to determine a second location of the API call in the stack trace; and deriving the query of the API call and a third location of the query in the source code by identifying the query in the stack trace at the location of the API call in the stack trace.
Type:
Grant
Filed:
March 20, 2015
Date of Patent:
July 3, 2018
Assignee:
International Business Machines Corporation
Inventors:
Stephen A. Brodsky, Zeus O. Courtois, Tom W. Jacopi, Michael Y. Kwong, Tony K. Leung, Sonali Surange
Abstract: Disclosed is a system, method, and computer program product for performing theme analysis and creating topics with regards to social data. A user interface is provided that allows the user to view and interact with to view and control the process/mechanism or creating topics. The topic creation process can be facilitated and automated using a volatility index.
Type:
Grant
Filed:
November 26, 2014
Date of Patent:
June 19, 2018
Assignee:
ORACLE INTERNATIONAL CORPORATION
Inventors:
Timothy P. McCandless, Mehrshad Setayesh
Abstract: Disclosed is a system, method, and computer program product for performing dynamic theme analysis with regards to social data. A user interface is provided that allows the user to view and interact with to view and control the process/mechanism for performing theme analysis.
Type:
Grant
Filed:
November 26, 2014
Date of Patent:
June 12, 2018
Assignee:
ORACLE INTERNATIONAL CORPORATION
Inventors:
Timothy P. McCandless, Mehrshad Setayesh
Abstract: A multi-user search system with methodology for personal searching. In one embodiment, for example, a system for personal searching includes a plurality of index servers storing a plurality of index shards. Each index shard of the plurality of index shards indexes a plurality of documents. Each document of the plurality of documents belongs to one of a plurality of document namespaces assigned to the index shard. The system further includes a front-end server computer for receiving a search query from an authenticated user; an access control server for determining an authorized document namespace the authenticated user is authorized to access; and a query processor for answering the search query and restricting, based on an identifier of the authorized document namespace, an answer to the search query to identifying only documents satisfying the search query and belonging to the authorized document namespace.
Abstract: Techniques for managing big data include tagging of documents and subsequent retrieval using per-subject dictionaries having entries with some entries specially designated as entities. An entity indicates that the term in the entry has special meaning, e.g., brands (trademarks/service marks), trade names, geographic identifiers or other classes of terms. A dictionary may include a non-entity entry for a term and one or more entity entries, for different entity types. The entries may also include subject-determining-power scores. The subject-determining-power scores provide an indication of the descriptive power of the term with respect to the subject of the dictionary containing the term. The same term may have entries in multiple dictionaries with different subject-determining-power scores in each of the dictionaries. The entity distinctions for a term can then be used in tagging documents and processing retrieval requests.
Type:
Grant
Filed:
October 13, 2015
Date of Patent:
May 15, 2018
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors:
Anne Elizabeth Gattiker, Fadi H. Gebara, Anthony N. Hylick, Rouwaida N. Kanj
Abstract: Techniques for managing big data include tagging of documents and subsequent retrieval using per-subject dictionaries having entries with subject-determining-power scores. The subject-determining-power scores provide an indication of the descriptive power of the term with respect to the subject of the dictionary containing the term. The same term may have entries in multiple dictionaries with different subject-determining-power scores in each of the dictionaries. A retrieval request for one or more documents containing search terms descriptive of the one or more documents can be processed identifying a set of candidate documents tagged with subjects and optional terms, and then applying subject-determining-power scores from the multiple dictionaries for the search term to determine a subject for the search term. The method then selects the one or more documents from the candidate documents according to the subject.
Type:
Grant
Filed:
October 20, 2015
Date of Patent:
May 15, 2018
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors:
Anne Elizabeth Gattiker, Fadi H. Gebara, Anthony N. Hylick, Rouwaida N. Kanj, Jian Li
Abstract: A system in which data stored in a first information processing apparatus is migrated to a second information processing apparatus, wherein the first information processing apparatus comprises: an export unit configured to export migration target data that is stored in a storing unit; and a recording unit configured to record time information indicating the time of exporting performed by the export unit, and the export unit furthermore compares the time information and information regarding an update date of the data stored in the storing unit and exports data updated at or subsequent to the time indicated by the time information as a difference migration target.
Abstract: Methods and systems for determining schema element types are shown that include pooling potential annotations for an element of an unlabeled schema from a plurality of heterogeneous sources, scoring the pool of potential annotations according to relevancy using information using instance information from the plurality of heterogeneous sources to produce a relevancy score, and annotating the element of the unlabeled schema using the most relevant potential annotations.
Type:
Grant
Filed:
March 23, 2011
Date of Patent:
May 1, 2018
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors:
Songyun Duan, Achille B. Fokoue-Nkoutche, Oktie Hassanzadeh, Anastasios Kementsietsidis, Kavitha Srinivas, Michael J. Ward
Abstract: The content recommendation system that acquires attribute information of a given user, acquires at least one list from among lists of content sequentially generated over time on the basis of the attribute information of the given user, acquires user preference information, which is feature information of content preferred by the given user, extracts some content from content included in the list acquired on the basis of the user preference information, and presents the content to the given user.
Abstract: A high-performance gridded database protocol for storing, arranging, and extracting gridded data includes associating values for a single grid cell and storing them together to extract as many useful values as possible from a single read operation. Gridded data is stored in a geographically-indexed cylindrical grid that permits efficient data extraction for a particular location while maximizing efficiency of read operations. Cylinders of values are built by grouping grids that are related to each other so that when data for a location is to be extracted, a minimal number of read operations is needed to retrieve an entire stack of data relevant to the location.
Type:
Grant
Filed:
July 16, 2014
Date of Patent:
March 20, 2018
Assignee:
CLEARAG, INC.
Inventors:
Douglas K. Rand, John J. Mewes, Leif Pedersen, Kristopher A. Zarns, Dustin Salentiny
Abstract: A data processing system performs query progress estimation based on processed value packets. In the illustrative data processing system, a database query processor comprises a query optimizer that creates a query plan, and a database plan executor that executes the query plan and observes intermediate result streams processed as the query plan is executed. A value packet manager anticipates value packets during query optimization, creates value packets as the intermediate result streams are processed, and compares anticipated value packets with created value packets to determine accuracy of the anticipated value packets and estimate query progress.
Type:
Grant
Filed:
June 30, 2009
Date of Patent:
December 5, 2017
Assignee:
Hewlett Packard Enterprise Development LP
Abstract: According to an embodiment of the present invention, a computer-implemented method of cleansing data is provided that comprises determining a criticality score and a complexity score for identified attributes of an enterprise, wherein the criticality score represents a relevance of an attribute to one or more enterprise dimensions and the complexity score represents complexity of cleansing data for an attribute. The identified attributes for data cleansing based on the criticality and complexity scores are prioritized, and data of the identified attributes is cleansed in accordance with priority of the identified attributes. Embodiments further include a system, apparatus and computer readable media to cleanse data in substantially the same manner as described above.
Type:
Grant
Filed:
November 25, 2014
Date of Patent:
December 5, 2017
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors:
Carl M. Marrelli, Ram S. Narayanan, Martin Oberhofer, Solmaz Rashidi
Abstract: An information retrieval and analysis system for numeric data which provides high precision and recall for numeric search and uses a methodology for determining contextualization of the extracted data. The capabilities include extracting, parsing, and contextualizing numeric data including both a numeric value and an accompanying unit. This system facilitates the organization of largely unstructured numeric data into an inverted index and other database formats. An information retrieval system which enables the exploration and refinement of an extracted numeric data set defined by a search input that may be precise or initially vague. This system also facilitates analyzing and portraying numeric data graphically, creating knowledge by combining data from multiple sources, extracting correlations between seemingly disparate variables, and recognizing numeric data trends.