Data Extraction, Transformation, And Loading (etl) Patents (Class 707/602)
  • Patent number: 9269353
    Abstract: A content is divided into tokens. Tokens can represent sounds, symbols, letters, words, sentences, paragraphs, or documents, among other possibilities. Probable measures of meaning (PMMs) are determined for each token. Given a target token, a proximity function for other tokens can be calculated, which in turn can be used to modify the PMMs for the other tokens. These modified PMMs for the other tokens can then be used in conjunction with the PMM for the target token to generate a relative measure of meaning (RMM) for the target token.
    Type: Grant
    Filed: December 5, 2012
    Date of Patent: February 23, 2016
    Inventors: Manu Rehani, Warren L. Wolf
  • Patent number: 9258317
    Abstract: The present invention discloses a device and method for data matching and a device and method for network intrusion detection. The method for data matching includes: searching in a regular expression set one or more complex regular expressions causing a sharp increase in number of states generated based on a regular expression during interaction; constructing a corresponding simplified expression for each complex regular expression; compiling a simplified state machine; compiling one or more substate machines, wherein each of the one or more substate machines is compiled based on a corresponding one of the one or more complex regular expressions; and matching data based on the simplified state machine and the one or more substate machines. The present invention further discloses a device for data matching employing the method for data matching and a device and method for intrusion detection employing the device and method for data matching.
    Type: Grant
    Filed: March 12, 2013
    Date of Patent: February 9, 2016
    Assignee: NSFOCUS INFORMATION TECHNOLOGY CO., LTD.
    Inventors: Gang Yao, Xiao Han, Tao Zhang, Peng Han, Lijun Cheng
  • Patent number: 9258340
    Abstract: Secure digital remediation methods and systems are provided for managing a reputation. A method for managing a reputation includes receiving a complaint from a victim regarding an unfavorable item published by one or more third parties, extracting information from the unfavorable item, and providing a communication to the one or more third parties of the complaint including the information, the communication requesting a response from the one or more third parties. The method also includes scoring the unfavorable item on a severity scale based on the information, responses to queries and/or a ruleset to produce a severity score, the ruleset including instructions for responding to the complaint based on the severity score.
    Type: Grant
    Filed: June 19, 2013
    Date of Patent: February 9, 2016
    Assignee: IPA (Cayman) Limited
    Inventor: Jeff Ervine
  • Patent number: 9251485
    Abstract: A data transformation system receives data from one or more external source systems and stores and transforms the data for providing to reporting systems. The data transformation system maintains multiple versions of data received from an external source system. The data transformation system can combine data from different versions of data and provide to the reporting system. As a result, external source systems that do not maintain data in a format appropriate for reporting systems and/or do not maintain sufficient historical data to generate different types of reports are able to generate these reports. The data transformation system can also enhance older versions of data stored in the system or exclude portions of data from reports. The data transformation system can purge older versions of data so that older data that is less frequently requested is maintained at a lower frequency than recent data.
    Type: Grant
    Filed: May 18, 2015
    Date of Patent: February 2, 2016
    Assignee: GoodData Corporation
    Inventor: Pavel Kolesnikov
  • Patent number: 9251226
    Abstract: Approaches for data integration between multiple IT sources using automated data processing based on target metadata are provided. Specifically, an integration component is configured to load a mapped data set into a table with delta processing based on a configuration object containing, e.g., the source data location, target table name, and source to target mapping. The integration component uses the target metadata to validate data, identify changes, generate the necessary database programming language (e.g., structured query language (SQL)), and run the database programming language with data binding to perform the actual data updates. The integration component leverages the data target metadata to automate the processing of source data, thus providing a way to validate the data, and identify delta changes at the field level between the source and target. This significantly reduces the overall development effort, while providing consistency in record handling and error reporting.
    Type: Grant
    Filed: March 15, 2013
    Date of Patent: February 2, 2016
    Assignee: International Business Machines Corporation
    Inventors: Gautam Majumdar, Tarun K. Saha, Michael Q. Wang
  • Patent number: 9244988
    Abstract: A system and method provide for an application for matching metadata from a transaction in an online transaction processing subsystem of an enterprise system to query metadata in an online analytical processing subsystem. A business object type is identified from a transaction in the online transaction subsystem using an object repository key and a corresponding data object attribute is determined from a mapping table. A new query of metadata from the online analytical processing subsystem is run, using the value of the data object attribute as a filter. This may allow for the generation of dynamic related report results for the transaction.
    Type: Grant
    Filed: March 26, 2012
    Date of Patent: January 26, 2016
    Assignee: SAP SE
    Inventors: Sumanth Hegde, Santosh V, Tarun Shetty
  • Patent number: 9244809
    Abstract: An approach for validating code for an extract, transform and load (ETL) tool is provided. Naming, coding, and performance standards for the code is received. The code is exported to a job definition file and parsed. Violations of the standards are determined by determining the parsed code does not match the standards. A report identifying the violations is generated. Based on a review of the report and a rework of the code to comply with the standards, the reworked code is exported to another job definition file and parsed, the parsed reworked code is determined to not include the violations of the standards, and a second report is generated that indicates that the reworked code does not include the violations. An approval of the reworked code is received based on the second report.
    Type: Grant
    Filed: July 15, 2014
    Date of Patent: January 26, 2016
    Assignee: International Business Machines Corporation
    Inventor: Rokky Vilakkumadathil
  • Patent number: 9244990
    Abstract: Various arrangements for creating a pivot table are presented. A set of stored records may be accessed. A plurality of stored records from the set of stored records may be selected to be represented in a plurality of cells of the pivot table. A graphical representation may be generated for display for each stored record of the plurality of stored records. Each graphical representation may be generated for display within a cell of the plurality of cells of the pivot table. Each graphical representation of the pivot table is actively linked with the corresponding stored record.
    Type: Grant
    Filed: October 7, 2011
    Date of Patent: January 26, 2016
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventor: Hendrik van den Broek
  • Patent number: 9158831
    Abstract: The present disclosure describes a standardized logical model architecture, for use in building a business intelligent systems, and a system and method of using the logical model architecture to populate a data repository (e.g., a data mart/warehouse) with the data to satisfy reporting and data analysis needs.
    Type: Grant
    Filed: June 30, 2009
    Date of Patent: October 13, 2015
    Assignee: YAHOO! INC.
    Inventors: Yue Zhuge, Mahmoud Alnahlawi
  • Patent number: 9152627
    Abstract: In an example embodiment, a method of automatically generating data validation rules from data stored in a column of a table is provided. Outliers for the data are determined by analyzing a profiling statistic for the data, the profiling statistic having a type. Then it is determined if a predefined limit is exceeded, based on a quantity of the outliers determined for the data through the analysis of the profiling statistic. A data validation rule is then automatically generated based on non-outliers detected in the data through the analysis of the profiling statistic, the generated data validation rule also being based on the type of the profiling statistic. The data validation rule can then be applied to data subsequently entered for the column, causing at least a portion of the data subsequently entered for the column to be rejected.
    Type: Grant
    Filed: September 20, 2012
    Date of Patent: October 6, 2015
    Assignee: Business Objects Software Ltd
    Inventors: David Kung, Min He, Mengdan Ma
  • Patent number: 9141728
    Abstract: Methods and systems are provided for creating an adaptive thesaurus. A term pair including an index term and an expansion term is received. A recall gain, an expansion independence, and a semantic similarity of the term pair are calculated using a processor. Whether to store the term pair is determined based on the recall gain, the expansion independence, and the semantic similarity. The term pair is stored based on the determination. Methods and systems are provided for searching using an adaptive thesaurus. A search query including a query term is received. An expansion term stored in association with an index term matching the query term in the adaptive thesaurus is retrieved. Using a processor, the search query is expanded using the expansion term based on a recall gain, an expansion independence, and a semantic similarity.
    Type: Grant
    Filed: May 17, 2013
    Date of Patent: September 22, 2015
    Assignee: LexisNexis, a division of Reed Elsevier Inc.
    Inventors: Donald Loritz, Shian-jung Chen, Narasimha Edala, Harry Silver
  • Patent number: 9135071
    Abstract: A method for data flow processing includes determining values for each of a set of parameters associated with a task within a data flow processing job, and applying a set of rules to determine one of a set of processing techniques that will be used to execute the task. The set of rules is determined through a set of benchmark tests for the task using each of the set of processing techniques while varying the set of parameters.
    Type: Grant
    Filed: August 19, 2011
    Date of Patent: September 15, 2015
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventors: Alkiviadis Simitsis, William Kevin Wilkinson
  • Patent number: 9110994
    Abstract: A method of performing a query on a cube of data is provided. A cube of data is distributed onto a plurality of computing devices with each computing device of the plurality of computing devices storing a different portion of the cube of data. A perturbation rule configured for application to the cube of data and associated with a user is received. A process space associated with the user is created. The received perturbation rule is compiled in association with the created process space. A query on the portion of the cube of data stored at the computing device is received. The received query is associated with the created process space. The query is processed while applying the compiled perturbation rule to data extracted from the portion of the cube of data stored at the computing device. A result of the processed query is sent to a requesting computing device.
    Type: Grant
    Filed: July 24, 2013
    Date of Patent: August 18, 2015
    Assignee: SAS Institute Inc.
    Inventors: Stacey M. Christian, Donald James Erdman, Scott T. Gray
  • Patent number: 9082107
    Abstract: One embodiment of the invention includes a method of associating a file with an intellectual asset record. The method includes reading information such as the file name and metadata pertaining to a file. The file name has a predetermined format made up of fields. The fields in the file name and the metadata are extracted. Based upon the fields, the file is linked to intellectual asset record. The intellectual asset record includes a set of related data organized into meaningful units pertaining to an intellectual asset.
    Type: Grant
    Filed: October 24, 2011
    Date of Patent: July 14, 2015
    Assignee: ANAQUA, INC.
    Inventors: Denise Cheung, Kelly Crawford, Erik Bailey, David Bartle
  • Publication number: 20150149402
    Abstract: A probabilistic data structure is generated for efficient query processing using a histogram for unsorted data in a column of a columnar database. A bucket range size is determined for multiples buckets of a histogram of a column in a columnar database table. In at least some embodiments, the histogram may be a height-balanced histogram. A probabilistic data structure is generated to indicate for which particular buckets in the histogram there is a data value stored in the data block. When an indication of a query directed to the column for select data is received, the probabilistic data structure for each of the data blocks storing data for the column may be examined to determine particular ones of the data blocks which do not need to be read in order to service the query for the select data.
    Type: Application
    Filed: February 2, 2015
    Publication date: May 28, 2015
    Applicant: AMAZON TECHNOLOGIES, INC.
    Inventor: ANURAG WINDLASS GUPTA
  • Publication number: 20150149401
    Abstract: A new dictionary can be created for a result column in a query plan operation executed on a database. The result column can be generated by multiple worker jobs running in parallel to read tasks from a shared queue as part of a query plan operation that includes a group-by column within an input set of input columns. The group-by column can include an original dictionary for all values contained within the group-by column If the new dictionary has fewer entries than the original dictionary for the group-by column such that mapping is required between old value identifiers within the group-by column and new value identifiers within the result column, the old value identifiers are renamed to the new value identifiers using a mapping vector.
    Type: Application
    Filed: June 10, 2014
    Publication date: May 28, 2015
    Inventors: Stephan Kottler, Nico Bohnsack, Kai Stammerjohann
  • Patent number: 9043273
    Abstract: A data warehouse is constructed using the relational mapping of a transactional database without reconstructing the data relationships of the transactional database. First, an application programmer analyzes an object model in order to describe facts and dimensions using the objects, attributes, and paths of the object model. Each of the dimensions has an identifier that correlates an item in the transactional database to a dimension record in the data warehouse. The fact and dimension descriptions are saved to a description file. Second, a Data Warehouse Engine (DWE) then access the description file and uses the object model, fact and dimension descriptions, and object-relational mapping to map transactional data to the data warehouse.
    Type: Grant
    Filed: April 25, 2008
    Date of Patent: May 26, 2015
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lawrence Scott Rich, Rafik Jaouani
  • Patent number: 9043272
    Abstract: A system for determining the start of a match of a regular expression has a special state table which contains start state entries and terminal state entries; a plurality of start state registers for storing offset information indicative of the start of a match of the regular expression; a deterministic finite state automaton (DFA) next state table which, given the current state and an input character, returns the next state. The DFA next state table includes a settable indicator for any next state table entry which indicates whether to perform a lookup into the special state table. A compiler loads values into the special state table based on the regular expression.
    Type: Grant
    Filed: September 18, 2007
    Date of Patent: May 26, 2015
    Assignee: INSIDE SECURE
    Inventors: Daniel Wyschogrod, Leonid Leibman
  • Publication number: 20150142724
    Abstract: The various embodiments herein provide a method for representing OLAP (Online Analytical Processing) queries in a data grid in real time. The method comprises of providing a new OLAP query, building a directed acyclic graph structure for the OLAP query, initializing the graph structure by generating signals for one or more business entities, processing the generated signals of one or more entities, linking entities based on matching of query filter parameters specified in the OLAP query and calculating aggregate values at one or more nodes. The nodes in the directed acyclic graph comprises: a plurality of blue nodes adapted to maintain links to one or more business entities stored in a datagrid and to hold the aggregate values and a plurality of grey nodes adapted to hold the calculated aggregate values. The aggregate values comprise at least one of SUM, ACCOUNT and AVG.
    Type: Application
    Filed: November 18, 2013
    Publication date: May 21, 2015
    Applicant: Nuwafin Holdings Ltd
    Inventor: NANDAKUMAR KRISHNAN NAIR
  • Publication number: 20150142725
    Abstract: An event tap associated with a server, such as a Web server, at a machine can transform a server event into a tuple, select a database node for the tuple, and place the tuple in a queue for that database node, and then flush the queue periodically directly into database notes. The use of an event tap can thus reduce the computational burden on the database while keeping the server event data in the database relatively fresh.
    Type: Application
    Filed: September 10, 2014
    Publication date: May 21, 2015
    Inventors: George Candea, Anastasios Argyros, Mayank Bawa
  • Patent number: 9037534
    Abstract: A data transformation system receives data from one or more external source systems and stores and transforms the data for providing to reporting systems. The data transformation system maintains multiple versions of data received from an external source system. The data transformation system can combine data from different versions of data and provide to the reporting system. As a result, external source systems that do not maintain data in a format appropriate for reporting systems and/or do not maintain sufficient historical data to generate different types of reports are able to generate these reports. The data transformation system can also enhance older versions of data stored in the system or exclude portions of data from reports. The data transformation system can purge older versions of data so that older data that is less frequently requested is maintained at a lower frequency than recent data.
    Type: Grant
    Filed: December 12, 2014
    Date of Patent: May 19, 2015
    Assignee: GoodData Corporation
    Inventor: Pavel Kolesnikov
  • Patent number: 9037533
    Abstract: A database appliance in which two or more internetworked data storage units are used to coordinate the storage and retrieval of database records. One or more central database processing units are also associated with the data storage units. A network infrastructure provides the ability for the central database processors and storage processors to communicate as network nodes, with the network infrastructure using a communication protocol. A software application for executing database operations executes in a distributed fashion with portions of the database application executing on at least one central database processor and other portions executing on the data storage processors. At least a portion of the database application is implemented within and/or coordinated by a communication process that is executing the communication protocol.
    Type: Grant
    Filed: September 11, 2007
    Date of Patent: May 19, 2015
    Assignee: IBM International Group B.V.
    Inventors: Foster D. Hinshaw, Steven T. Almy, David A. Utter, Barry M. Zane
  • Patent number: 9037585
    Abstract: A system and method for mapping an input uniform resource identifier (URI) to an entry in a database. The system cleans an input URI to produce a prime URI that references an entry in a database. The prime URI is created by applying a regular expression determined for a particular domain to the input URI. Once the prime URI is generated, the prime URI can be used to retrieve information from a database.
    Type: Grant
    Filed: December 29, 2010
    Date of Patent: May 19, 2015
    Inventors: Kristopher Kubicki, Lawrence Hsieh
  • Publication number: 20150134594
    Abstract: Systems and methods for data warehouse management for medical information is provided. The system receives a set of medical record documents. These documents include evidence for one or more findings which are identified using natural language processing of evidence. The data set is probabilistically transformed into a structured data set (often as a table). This structured data set includes embedded links that reference the source document where the evidence is located. If the finding is supported by multiple articles of evidence, the link will direct the user to the source document with the highest confidence ranking. Evidence in the source document is highlighted and may include associated annotations. Evidence, findings and annotations may be updated by the user.
    Type: Application
    Filed: November 11, 2014
    Publication date: May 14, 2015
    Inventors: Vishnuvyas Sethumadhavan, John O. Schneider, Darren Matthew Schulte, Robert Derward Rogers
  • Publication number: 20150134592
    Abstract: Various of the disclosed embodiments present systems and methods for generating consolidated job postings from disparate originating sources and formats. Applying an Extraction Transform Load (ETL) framework to the incoming data, a parallel and asynchronous as well as scalable approach to distributing job posting information is presented. “Extraction” may involve the recognition of salient information in the disparate formats (e.g., in employment listings on company webpages). During “transformation”, the information may be reformatted into a universal format or into a format suitable for use at a given destination system. During “loading”, the reformatted data may be supplied to a suitable destination system, e.g., the Application Programming Interface (APIs) of a job board system. Applications in related domains and various optimizations are also discussed.
    Type: Application
    Filed: October 29, 2014
    Publication date: May 14, 2015
    Inventors: Maxime VERGER-DEL BOVE, Paul Clais, Olivier Le Floch
  • Publication number: 20150134589
    Abstract: A computer-implemented method for processing information related to an extract-transform-load (ETL) data migration, including aggregating operational metadata and determining: a plurality of metrics, organized by business object, corresponding to the migration; a number of business object instances not successfully loaded; a first end-to-end execution time for at least one business object; relevant input metadata; load readiness status per business object; impact of a business object that is not load ready by analyzing business process hierarchies; business object load readiness by reference to incomplete development status or data defects; scope per test cycle based, at least in part, upon business object load readiness; and high-priority defects of business objects that stop testing based, at least in part, upon analysis of business process hierarchies.
    Type: Application
    Filed: November 8, 2013
    Publication date: May 14, 2015
    Applicant: International Business Machines Corporation
    Inventors: Carl M. Marrelli, Ram S. Narayanan, Martin Oberhofer, Solmaz Rashidi
  • Publication number: 20150134590
    Abstract: A method, system, and computer program product for normalizing amorphous query result sets are provided in the illustrative embodiments. A property of data in a portion of the result set is identified. the property is usable for normalizing the portion into a structured data. Based on the property, the portion is categorized into a first category as a candidate for normalization using a first structure specification. The portion is transformed, responsive to the first category being selected for normalizing the portion over a second category in an evaluation, into the structured data according to the first structure specification of the first category. The structured data and a metadata of structure specification are added to a normalized result set. The normalized result set is output to a consumer application.
    Type: Application
    Filed: November 11, 2013
    Publication date: May 14, 2015
    Applicant: International Business Machines Corporation
    Inventors: TAMER E. ABUELSAAD, Gregory Jensen Boss, Craig Matthew Trim, Albert Tien-Yuen Wong
  • Publication number: 20150134591
    Abstract: A system is disclosed to evaluate data quality in a big data environment. An example method performed by the system includes retrieving one or more rules from an asset catalog. The method further includes retrieving, based on the one or more rules, counter information from a data set, and generating, by a processor, one or more metrics based on the one or more rules and the counter information. In addition, the method includes evaluating, by the processor, the one or more metrics based on the one or more rules. In an instance in which evaluation of a particular metric of the one or more metrics identifies an attribute value that exceeds a predetermined threshold, the method includes causing a notification message regarding the particular metric to be output. A corresponding apparatus and computer program product are also provided.
    Type: Application
    Filed: September 23, 2014
    Publication date: May 14, 2015
    Inventors: Carla Staeben, Cristina Maier, Bob Savard, Alex Wilbur
  • Publication number: 20150134593
    Abstract: A system and method for optimizing the computation of metrics facilitating monitoring of service level agreements (SLA) in an IT-enabled framework to reduce computational overhead. A receiving module receives data associated with activities performed by one or more service providers. A transforming module transforms the data into a user-defined data structure using a transforming technique. A selecting module selects an attribute and parameters associated with a key performance indicator (KPI) of the one or more service providers. A view creating module creates a materialized view of the user-defined data structure based on the selected attribute. A query generating module m generates a query to be executed on the materialized view. A metric executing module executes the query on the materialized view in order to compute the metric.
    Type: Application
    Filed: November 11, 2014
    Publication date: May 14, 2015
    Inventors: Rajini Ramesh, Srikanth Tupakula, Reshma Sherly, Dhinakaran Pandian
  • Publication number: 20150134595
    Abstract: A data feeds platform, comprising a data feed using a data feed protocol. The data feed protocol comprises, a feed header defining at least one identification attribute, a data header defining a data structure comprising data nodes, each data node corresponding to a subset of the data, and a data section, the data section storing the data according to the data structure. A plurality of heterogeneous data protocols is stored. A data input is received in one of the plurality of heterogeneous data protocols. The data input is converted to the data feed protocol, creating the data feed.
    Type: Application
    Filed: January 22, 2015
    Publication date: May 14, 2015
    Inventors: Bhavin P. Dave, Amit S. Modi
  • Patent number: 9031901
    Abstract: Various systems and methods can provide a flexible database schema. One method can store information identifying a first entity in a first table. A unified data model includes several tables, including the first table and a metadata table. Each of the tables stores information describing one or more entities belonging to a respective archetype. The first table already stores information identifying a second entity when the information identifying the first entity is stored. The second entity is already related to another entity when the information identifying the first entity is stored. The first entity is a new type of entity not already stored in the first table when the information identifying the first entity is stored. The method then stores metadata associated with the first entity in a metadata table. The metadata then identifies the new type of entity.
    Type: Grant
    Filed: October 3, 2011
    Date of Patent: May 12, 2015
    Assignee: Symantec Corporation
    Inventors: Craig Robert King, Richard K. Freeman
  • Patent number: 9031988
    Abstract: A system and method for the automatic generation of a website. The website is created upon a user request, where the user provides identification information that is used to search an information database. The information will provide a reference to a code that relates to a template that is used to specify the layout, style and content of the website that is to be generated, and the website is then generated in accordance with the structure of the template, based on content contained in the information database. The user may engage the system and method by means of e-mail or a dedicated website or other such suitable means.
    Type: Grant
    Filed: November 13, 2012
    Date of Patent: May 12, 2015
    Assignee: Site Technologies Inc.
    Inventors: Oles Tymoshenko, Stephen Curry
  • Patent number: 9031902
    Abstract: A computer-implemented method, computer program product and a system for identifying and handling slowly changing dimension (SCD) attributes for use with an Extract, Transform, Load (ETL) process, comprising importing a data model for dimensional data into a data integration system, where the dimensional data comprises a plurality of attributes, identifying via a data discovery analyzer one or more attributes in the data model as SCD attributes, importing the identified SCD attributes into the data integration system, selecting a data source comprising dimensional data, automatically generating an ETL job for the dimensional data utilizing the imported SCD attributes, and executing the automatically generated ETL to extract the dimensional data from the data source and loading the dimensional data into the imported SCD attributes in a target data system.
    Type: Grant
    Filed: November 10, 2011
    Date of Patent: May 12, 2015
    Assignee: International Business Machines Corporation
    Inventors: Manish Anand Bhide, Srinivas Kiran Mittapalli, Sriram K. Padmanabhan
  • Publication number: 20150127600
    Abstract: A report data management server includes a report data storing unit storing position information and report data in association with each other. The report data management server receives a request for supply of information containing position information, extracts information corresponding to position information contained in the request for supply from the report data storing unit, and supplies extracted information to a user terminal.
    Type: Application
    Filed: October 28, 2014
    Publication date: May 7, 2015
    Applicant: IPS CO., LTD.
    Inventor: Toshifumi AKITA
  • Publication number: 20150127599
    Abstract: A cache and method of caching for a database are provided, in which the cache and the database are in the same state. Responsive to a query for data, the method extracts search terms from the query and maps filter settings to an appropriate format. The method also (a) outputs data retrieved from a cache, if there is a cache hit and (b) builds cache and retrieves the data from the database, if there is a cache miss. If there are multiple users and/or instances, the cache may be updated for all users and/or instances by determining whether all changes are to be committed, in which case the changes are pushed to all users and/or instances.
    Type: Application
    Filed: November 7, 2013
    Publication date: May 7, 2015
    Inventor: Dirk Schiebeler
  • Publication number: 20150120642
    Abstract: A system and method for realtime snapshot indices is presented. A query is calculated on all target data of a data warehouse, with all variable combinations, to generate a result. The result is stored in a snapshot index associated with the data warehouse. The result is recalcualated to generate a subresult, and the snapshot index is updated with the subresult. A conversion routine is generated to recalculate the subresult into a separate table, and the separate table is then recalculated by a background job to recalculate the subresult.
    Type: Application
    Filed: October 28, 2013
    Publication date: April 30, 2015
    Inventors: Alex Gruener, Klaus Steinbach, Lars Spielberg
  • Publication number: 20150120644
    Abstract: A data analytics system includes processing circuitry that receives one or more objects from one or more data sources, and the one or more objects are described based on a common ontology that defines the one or more objects as data objects, manipulation objects, visualization objects, and utility objects. The one or more objects are self-referencing and self-validating. Data pipelines are defined based on input from a user. The data pipelines are executed to perform a runtime instance.
    Type: Application
    Filed: October 28, 2014
    Publication date: April 30, 2015
    Applicant: Edge Effect, Inc.
    Inventors: John Stephen Eberhardt, III, Richard King, Amalio Escobar, Michael Garcia
  • Publication number: 20150120643
    Abstract: This invention relates to a system, method and computer program product for replicating triplestore data from generic data records including: a plurality of triplestore mapping operations; a record reader for listening for data events from the generic data record and for matching each data event to one or more triplestore mapping operations; a processor for generating equivalent triples for each located data event by applying the matched triplestore mapping operation; and a data sink for receiving generated equivalent triples so that the triplestore data is synchronized in real time with the generic data records.
    Type: Application
    Filed: September 13, 2014
    Publication date: April 30, 2015
    Inventors: Patrick DANTRESSANGLE, Charles D. WOLFSON
  • Patent number: 9020944
    Abstract: Embodiments of the invention provide systems and methods for applying Business Process Management (BPM) to enterprises having business processes documented in multiple representations. Embodiments of the invention reconcile and organize documented information about processes into groups that convey inter-process similarity. The discovered knowledge can be used by embodiments of the invention for many applications to find process clusters that significantly boost performance.
    Type: Grant
    Filed: October 29, 2009
    Date of Patent: April 28, 2015
    Assignee: International Business Machines Corporation
    Inventors: Biplav Srivastava, Debdoot Mukherjee
  • Patent number: 9020882
    Abstract: Embodiments relate to systems and methods for a middleware database hosting dimensional transforms. A set of applications can access a combined data of a set of data sources via a database system. The set of data sources can be configured with diverse dimensions and associated data. The requesting application(s) can generate a request to the database system to extract data in compatible formats for the application(s). In embodiments, the database system can transform selected dimensional data (e.g., company, year, sales dimensions) from the data sources into transformed dimensional data. The requesting application(s) can therefore perform searches, sorts, and/or other operations on the transformed dimensional results.
    Type: Grant
    Filed: November 26, 2008
    Date of Patent: April 28, 2015
    Assignee: Red Hat, Inc.
    Inventor: Eric Williamson
  • Patent number: 9020883
    Abstract: A system and method are disclosed for providing BPEL support for correlation aggregation. The system can comprise a business process engine, executing on one or more application servers organized in a cluster. The system can comprise a database including a table which stores records relating to message groups and business process instances. When a message is received by the business process engine, the business process engine can determine an identity associated with the message, and check the database to determine if there is an entry associated with the identity. If there is no entry associated with the identity, then the business process engine can instantiate a first business process instance associated with the identity, and if there is an entry associated with the identity, then the business process engine can route the message to a business process instance associated with the identity.
    Type: Grant
    Filed: May 22, 2012
    Date of Patent: April 28, 2015
    Assignee: Oracle International Corporation
    Inventors: Glenn Mi, Yogesh Kumar
  • Patent number: 9020901
    Abstract: A method for balancing database workloads among a plurality of database servers includes when a new server is available, selecting a set of master and slave databases to be migrated to the new server; and migrating the selected databases to result in a balanced new workload among all servers while minimizing migration cost; and during operation, if workload unbalance is detected in real time from a workload change in a certain database, iteratively selecting one database at a time to migrate to a different server to balance the workload.
    Type: Grant
    Filed: April 3, 2013
    Date of Patent: April 28, 2015
    Assignee: NEC Laboratories America, Inc.
    Inventors: Yun Chi, Vahit Hakan Hacigumus
  • Publication number: 20150112921
    Abstract: Optimizing job scheduling in a data processing system includes determining a time factor associated with a report, determining one or more jobs that contribute to the report, and narrowing execution frequency of the one or more jobs based on the time factor of the report. The time factor provides information about timing details for data included in the report. The jobs are executed in a computing environment according to a time schedule.
    Type: Application
    Filed: June 3, 2014
    Publication date: April 23, 2015
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Michael Frankhauser, Benjamin Halberstadt, Roger K. Hecker
  • Patent number: 9015104
    Abstract: A form is received and loaded. It is determined if a schema structure definition defines one or more choice elements for data of the form. A data structure is created for the data, the data structure includes both a schema structure valid component and a non-validated component. When a user selects one of the choice elements, a first set of zero or more data elements corresponding to the selected choice element are loaded into the schema structure valid component of the data structure. The previous set of zero or more data elements and values associated with the previous set of data elements are then stored in the non-validated component of the data structure.
    Type: Grant
    Filed: October 10, 2011
    Date of Patent: April 21, 2015
    Assignee: International Business Machines Corporation
    Inventor: John M. Boyer
  • Patent number: 9015105
    Abstract: A form is received and loaded. It is determined if a schema structure definition defines one or more choice elements for data of the form. A data structure is created for the data. The data structure includes both a schema structure valid component and a non-validated component. When a user selects one of the choice elements, a first set of zero or more data elements corresponding to the selected choice element are loaded into the schema structure valid component of the data structure. The previous set of zero or more data elements and values associated with the previous set of data elements are then stored in the non-validated component of the data structure.
    Type: Grant
    Filed: April 25, 2012
    Date of Patent: April 21, 2015
    Assignee: International Business Machines Corporation
    Inventor: John M. Boyer
  • Patent number: 9009099
    Abstract: Methods, systems and articles of manufacture are provided for migrating entity relationship data residing in an object oriented program environment to a relational database schema. Further, functionality is provided to facilitate more efficient searching and reconstruction of the entity relationship data in the relational database.
    Type: Grant
    Filed: January 15, 2013
    Date of Patent: April 14, 2015
    Assignee: Google Inc.
    Inventors: Judy I Djugash, Hoa T Tran, Travis M Drucker
  • Patent number: 9009098
    Abstract: A system extracts data from at least one database. The data is extracted into a data set. The system performs a transformation function on the data extracted into the data set. The transformation function is performed for the purpose of determining a preferred version of data from at least one database. The database is capable of being updated with the preferred version of data such that the database is capable of being synchronized with the centralized data store. The system loads the preferred version of data into the centralized data store.
    Type: Grant
    Filed: June 30, 2008
    Date of Patent: April 14, 2015
    Assignee: EMC Corporation
    Inventor: Dan Hushon
  • Publication number: 20150100542
    Abstract: According to one embodiment of the present invention, a method automatically generates one or more Extract, Transform and Load (ETL) jobs. Input data in a source format and output data in a target format is received. The input data and output data is analyzed to determine properties and relationships thereof. One or more mapping models are automatically generated using the properties and relationships, wherein the mapping models describe the mapping and transformation of the input data to the output data. One or more ETL jobs are generated using the mapping models. Embodiments further include a system and program product apparatus for automatically generating one or more ETL jobs.
    Type: Application
    Filed: June 6, 2014
    Publication date: April 9, 2015
    Inventors: Jeff J. Li, Wendi L. Nusbickel
  • Publication number: 20150100541
    Abstract: According to one embodiment of the present invention, a method automatically generates one or more Extract, Transform and Load (ETL) jobs. Input data in a source format and output data in a target format is received. The input data and output data is analyzed to determine properties and relationships thereof. One or more mapping models are automatically generated using the properties and relationships, wherein the mapping models describe the mapping and transformation of the input data to the output data. One or more ETL jobs are generated using the mapping models. Embodiments further include a system and program product apparatus for automatically generating one or more ETL jobs.
    Type: Application
    Filed: October 3, 2013
    Publication date: April 9, 2015
    Applicant: International Business Machines Corporation
    Inventors: Jeff J. Li, Wendi L. Nusbickel
  • Patent number: 9002779
    Abstract: To provide an apparatus and method for efficiently displaying image content, which is sent and received between a server and a client, in a correct direction. An image content providing server stores, as property information, original angle information indicating correct directions of image content as metadata of the image content. The server generates content information including the original angle information and sends the generated content information to a client. This allows the client to understand the correct direction of the image content received from the server, and to obtain and display image data in the correct direction by performing rotation processing in the client device based on the obtained information or to request the server to perform the rotation processing.
    Type: Grant
    Filed: December 16, 2003
    Date of Patent: April 7, 2015
    Assignee: Sony Corporation
    Inventors: Masaaki Hamada, Norifumi Kikkawa, Yoshiki Tanaka, Takehiko Morita