Data Extraction, Transformation, And Loading (etl) Patents (Class 707/602)
  • Patent number: 9002780
    Abstract: A method for loading data into an alternate evaluator for evaluating directory service operations includes loading data into a list of entries. Loading data into a list of normalized values and associated pointers. Loading data into a list of indexes and associated pointers.
    Type: Grant
    Filed: May 20, 2005
    Date of Patent: April 7, 2015
    Assignee: CA, Inc.
    Inventors: Richard H. Harvey, Ronald W. Ramsay
  • Patent number: 9001182
    Abstract: In one embodiment, source data for a communication session may be split into an audio portion for transmission on a phone channel and a non-audio portion for transmission on a data channel. A server and a phone may accordingly establish an audio portion of a communication session on the phone channel. In response to a trigger, the server may provide a push notification on the data channel to the phone, where the push notification is associated with an application executing on the phone that is configured to participate in the non-audio portion of the communication session on the data channel with the server. Upon obtaining the push notification on the data channel during the audio portion on the phone channel, the application may correspondingly activate on the phone to participate in the non-audio portion of the communication session during the phone's participation in the audio portion (e.g., merging the portions).
    Type: Grant
    Filed: April 24, 2013
    Date of Patent: April 7, 2015
    Assignee: Cisco Technology, Inc.
    Inventors: Yongjian Tian, Zheng Yuan, Tejas Bhandarkar
  • Patent number: 8996450
    Abstract: A system and method for allocating storage devices within a database system including fast solid state storage devices (SSDs) and traditional hard drive storage device (HDDs). The method repetitively collects temperature and queue depth data for each storage device, and determines from the collected data points a service time for each storage device. The method then selects, in response to an allocation request, a storage device having a minimum total wait time determined from the current queue depth and service time for each storage device.
    Type: Grant
    Filed: December 30, 2012
    Date of Patent: March 31, 2015
    Assignee: Teradata US, Inc.
    Inventor: Paul Anthony Rubio
  • Patent number: 8996545
    Abstract: The embodiments provide an in-memory database system having an extraction module configured to extract data (e.g., business data and address data) from one or more external data sources and transform the data into a standard format, a geocoder configured to geocode the address data including obtaining spatial data based on the address data using an internal reference table, and an internal database configured to store the internal reference table, the business data, the address data, and the spatial data.
    Type: Grant
    Filed: November 13, 2012
    Date of Patent: March 31, 2015
    Assignee: SAP SE
    Inventors: Yingyu Chen, Bin Dong, Yongyuan Shen
  • Publication number: 20150088806
    Abstract: A system and method for supporting multi-tenancy in a federated data management system are provided herein. The method includes receiving a query from a client at a data management server, wherein the query includes a tenant property condition corresponding to the client. The method also includes identifying an external data store including data specified by the query and determining whether the external data store is multi-tenant enabled. The method further includes removing the tenant property condition and executing the query to retrieve the specified data if the external data store is not multi-tenant enabled.
    Type: Application
    Filed: July 31, 2012
    Publication date: March 26, 2015
    Inventors: Yalon Lotan, Gil Tzadikevitch, Hadas Avraham
  • Publication number: 20150088807
    Abstract: A method of resolving data queries in a data processing system. The method comprises receiving in the data processing system a data query, where the data processing system stores a plurality of information units describing pluralities of data elements, a first information unit having a retrieval subunit that includes information for retrieving all unique data elements in a first plurality of data elements and a summary subunit including summarized information about data elements in the first plurality of data elements. The method further includes deriving, via the data processing system, a result of the data query, wherein the result of the data query comprises a plurality of new data elements. The data processing system uses summary subunits of information units to select a set of information units describing data elements that are sufficient to resolve the data query.
    Type: Application
    Filed: September 25, 2014
    Publication date: March 26, 2015
    Inventors: Graham Toppin, Janusz Borkowksi, Dominik Slezak, Shengli Shi, Piotr Synak, Jakub Wroblewski, Todd Joseph Wongkee, George Charalabopoulos
  • Patent number: 8990410
    Abstract: Certain exemplary embodiments can provide a method, which can comprise, responsive to a determination that a Web-based transaction has caused an exceedance of a maximum threshold percentage of transactions processed offline, at a user interface of an information device, rendering an automatic alert indicative of the exceedance. The Web-based transaction can be requested from a browser.
    Type: Grant
    Filed: December 11, 2008
    Date of Patent: March 24, 2015
    Assignee: IBM International Group BV
    Inventor: Suresh Damodaran
  • Patent number: 8990146
    Abstract: According to some embodiments, a system, method, means, and/or computer program code are provided to facilitate a display of information on a client device. For example, a server may retrieve first enterprise data from an enterprise database and store the first enterprise data into a first client based cache at the server, the first client based cache being associated with a first user. Similarly, the server may retrieve second enterprise data from the enterprise database and store the second enterprise data into a second client based cache at the server, the second client based cache being associated with a second user. Subsequent to the storing of the first enterprise data, the server may receive a display request from a first client device associated with the first user and transmit the first enterprise data to the first client device.
    Type: Grant
    Filed: December 22, 2010
    Date of Patent: March 24, 2015
    Assignee: SAP SE
    Inventors: Karl-Peter Nos, Andreas Riehl, Belenki Michael
  • Patent number: 8990148
    Abstract: A method is provided for parsing a configuration file. The method comprises determining whether an element in a sequence of configuration file elements is a type of element expected at the configuration file element's position in the sequence; when the configuration file element is not the expected type, consulting a set of rules for handling unexpected configuration file elements; and following a rule in the rule set. The rules cause different actions to be taken at a given position in the configuration file at different times of executing the method for parsing. One of the rules specifies that subsequent configuration file elements in the sequence are to be analyzed and, if at least one subsequent configuration file element is an expected type, the at least one subsequent configuration file element is to be considered valid and is to be parsed in a manner appropriate for the subsequent configuration file element.
    Type: Grant
    Filed: January 8, 2013
    Date of Patent: March 24, 2015
    Assignee: Sprint Communications Company L.P.
    Inventors: Bruce Edward Ziegler, William Peter Bryan
  • Patent number: 8990904
    Abstract: Mechanisms are provided for collecting configuration data from components of a managed computing system environment. A portion of code is obtained, in a data processing system, from a data collection system that does not have security credentials to allow the data collection system to directly access to the managed computing system environment. The portion of code is executed by the data processing system using security credentials maintained in the data processing system. Executing the portion of code causes the data processing system to access the managed computing system environment and collect configuration data from the managed computing system environment. The data processing system, via the portion of code, provides the configuration data collected from the managed computing system to the data collection system which stores the collected configuration data in a data storage.
    Type: Grant
    Filed: June 29, 2012
    Date of Patent: March 24, 2015
    Assignee: International Business Machines Corporation
    Inventors: Joel W. Branch, Michael E. Nidd, Ruediger Rissmann
  • Patent number: 8990147
    Abstract: Technologies are described herein for defining the scope of a sub-cube based upon a database query. A request is received to perform a query of a multi-dimensional database, such as a cube, stored at a server computer. In response to the request, the scope for a sub-portion, called a sub-cube, of the multi-dimensional database is determined based upon the query. The scope of the sub-cube may be determined by identifying the minimum sub-portion of the cube necessary to perform the query. Once the scope has been determined, the sub-cube is generated at the server computer and transmitted to a client computer. The client computer stores the sub-cube locally and performs the requested database query on the sub-cube stored at the client computer. A user interface may be provided through which a user can modify the scope of the sub-cube determined based upon the query.
    Type: Grant
    Filed: November 29, 2011
    Date of Patent: March 24, 2015
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Xiaohong Yang, Luming Wang
  • Publication number: 20150081620
    Abstract: A method facilitating interoperability across the integrating platforms is provided by composing purpose specific integrating tools using conceptual modeling framework and a set of primitive foundational components. The model driven architecture provides easy migration from one solution architecture to another. Also, the present invention discloses a method to migrate from one implementation to another using set of composed purpose specific integration tools.
    Type: Application
    Filed: November 24, 2014
    Publication date: March 19, 2015
    Applicant: Tata Consultancy Services Limited
    Inventors: Sreedhar Sannareddy Reddy, Prasenjit Das, Raghavendra Yeddula Reddy
  • Publication number: 20150081619
    Abstract: Systems and methods for interest-driven business intelligence systems including geo-spatial data in accordance with embodiments of the invention are illustrated.
    Type: Application
    Filed: June 24, 2014
    Publication date: March 19, 2015
    Inventors: Hans-Frederick Brown, James Mulholland, Kevin Scott Beyer
  • Publication number: 20150081616
    Abstract: According to one embodiment of the present invention, a system inserts data into a database object. The system associates the database object with a parameter specifying materialization of data for the database object. The system inserts data into the database object and materializes the data in accordance with the parameter to provide access to the data from the database object, wherein the parameter specifies a portion of the data to be materialized upon insertion. Embodiments of the present invention further include a method and computer program product for inserting data into a database object in substantially the same manners described above.
    Type: Application
    Filed: September 16, 2013
    Publication date: March 19, 2015
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Daniel Kozin, Meichi M. Lin, Arthur Marais, Nigel G. Slinger, John B. Tobler, Wen J. Zhu
  • Publication number: 20150081617
    Abstract: A system may include at least one data source configured to provide network usage data indicative of the existence of communications with subscriber devices, and web and application usage data indicative of data usage of the subscriber network by the subscriber devices.
    Type: Application
    Filed: September 18, 2013
    Publication date: March 19, 2015
    Applicant: CELLCO PARTNERSHIP (D/B/A VERIZON WIRELESS)
    Inventors: Altaff Mohiddin Shaik, Narendra Polavarapu, Dustin Logan Applegate, Satish Kumar Hosanagara, Richard Stemper
  • Publication number: 20150081618
    Abstract: Systems and methods for interest-driven business intelligence systems including event-oriented data in accordance with embodiments of the invention are illustrated. In one embodiment, an interest-driven business intelligence system includes raw data storage configured to contain raw data and perform ETL processes, a data mart configured to contain metadata that describes the raw data, and an intermediate processing layer, wherein the intermediate processing layer is configured to compile an interest-driven data pipeline configured to generate ETL processing jobs to generate event-oriented data from the raw data by filtering the raw data based on the metadata describing the raw data, determining ordering data based on the metadata describing the raw data, aligning the filtered raw data based on the ordering data, and generating event-oriented data based on the aligned filtered raw data, and storing the event-oriented data in the data mart.
    Type: Application
    Filed: March 5, 2014
    Publication date: March 19, 2015
    Applicant: Platfora, Inc.
    Inventors: John Schuster, Benjamin Mark Werther, Brian Babcock, Kevin Scott Beyer, Hans-Frederick Brown
  • Patent number: 8984598
    Abstract: Mechanisms are provided for collecting configuration data from components of a managed computing system environment. A portion of code is obtained, in a data processing system, from a data collection system that does not have security credentials to allow the data collection system to directly access to the managed computing system environment. The portion of code is executed by the data processing system using security credentials maintained in the data processing system. Executing the portion of code causes the data processing system to access the managed computing system environment and collect configuration data from the managed computing system environment. The data processing system, via the portion of code, provides the configuration data collected from the managed computing system to the data collection system which stores the collected configuration data in a data storage.
    Type: Grant
    Filed: June 27, 2012
    Date of Patent: March 17, 2015
    Assignee: International Business Machines Corporation
    Inventors: Joel W. Branch, Michael E. Nidd, Ruediger Rissmann
  • Patent number: 8983896
    Abstract: Disclosed is a method, system, and program for providing access to spatial data. A request for data is received. Enterprise and third party data are integrated. The integrated data is processed. Spatially referenced results are generated using the processed data. The spatially referenced results are returned in response to the request.
    Type: Grant
    Filed: July 7, 2011
    Date of Patent: March 17, 2015
    Assignee: The Paradigm Alliance, Inc.
    Inventors: Tim A. von Kaenel, David Neil Dyrnaes, C. Suresh Kumar, Jared Paul Wayman, Jonathan David Goodwin, Craig Evan Trivelpiece, Joseph Mihalich, Anthony Page Jenkins, Richard Hoyt Odom, Jr., Mark Andrew Stier, Anne Janetta Obee
  • Patent number: 8984014
    Abstract: The invention provides a system and method for retrieving and storing industrial data, the system comprising a data retriever that includes a data retrieval manager and one or more watchers for monitoring data associated with one or more industrial devices, wherein if the data associated with the one or more industrial devices is new or modified, the one or more watchers notifies the data retrieval manager thereof and the data retrieval manager uploads the new or modified data. The system further includes a database manager for receiving the new or modified data in a first form from the data retrieval manager and for storing the new or modified data in a structural data form in one or more databases.
    Type: Grant
    Filed: May 2, 2013
    Date of Patent: March 17, 2015
    Assignee: Rockwell Automation Technologies, Inc.
    Inventors: Marek Obitko, Ivan Havel, Michal Fortik, Robert Mavrov, Radek Marik
  • Patent number: 8984157
    Abstract: Aspects of the invention are directed to a method and system for discovering business content transfer paths in a network using file transfer information, and for calculating business risk per network component in a network. A method according to an embodiment includes: obtaining file transfer information for a plurality of file transfers between a plurality of nodes within a network; generating a confidence of correlation for each pair of file transfers in the plurality of file transfers; determining interdependencies between the plurality of file transfers based on the confidence of correlation for each pair of file transfers; and determining a business content transfer path based on the interdependencies between the plurality of file transfers.
    Type: Grant
    Filed: July 18, 2012
    Date of Patent: March 17, 2015
    Assignee: International Business Machines Corporation
    Inventors: Vedika Dalmia, Thomas I. Lewin, James W. Smith, Peter F. Weller
  • Publication number: 20150074037
    Abstract: An in memory database system includes a random access memory to store an in memory database table, a stored set of defined views over the database tables, a stored set of analytical services adapted to provide transaction services and database warehouse services, an engine to run the analytical services on the database table, and an interface to interact with users and provide access to the engine, analytical services, and views to perform transactions and data warehouse functions on the database table.
    Type: Application
    Filed: September 12, 2013
    Publication date: March 12, 2015
    Applicant: SAP AG
    Inventor: Siar Sarferaz
  • Publication number: 20150074038
    Abstract: A method includes processing a transaction on an in memory database where data being processed has a validity time, updating a time dependent data view responsive to the transaction being processed to capture time validity information regarding the data, and storing the time validity information in a historization table to provide historical access to past time dependent data following expiration of the validity time.
    Type: Application
    Filed: September 12, 2013
    Publication date: March 12, 2015
    Applicant: SAP AG
    Inventor: Siar Sarferaz
  • Patent number: 8977643
    Abstract: The disclosed architecture leverages realtime continuous event processing (CEP) to address using a general input interface framework to import a dynamic set of event types (e.g., assets), and using a declarative, expressive query model to implement monitoring and management tasks on an asset level. This is in contrast to looking separately at single values from static databases and/or realtime streams as is common conventionally. The architecture uses the CEP data model to model assets as realtime event types. Thus, queries can be formulated per asset and not just per single stream. The architecture uses the query capabilities of CEP to formulate asset management and monitoring tasks as standing, declarative queries, and uses the input interface of a CEP platform to correlate data from different data sources with different dynamic properties.
    Type: Grant
    Filed: June 30, 2010
    Date of Patent: March 10, 2015
    Assignee: Microsoft Corporation
    Inventors: Roman Schindlauer, Balan Sethu Raman, Torsten W. Grabs, Beysim Sezgin
  • Patent number: 8977587
    Abstract: A log file contains operation records, each operation record is of a certain type, and each operation record is associated with a transaction. A plurality of operation records is read from the log file into a record store. Records of the plurality of operation records of each operation record type are sampled at a predefined sampling rate. Operation records in the plurality of operations records are identified that are associated with completed transactions of which the sampled operation records are associated. The identified operation records are then extracted from the record store into a data store.
    Type: Grant
    Filed: January 3, 2013
    Date of Patent: March 10, 2015
    Assignee: International Business Machines Corporation
    Inventors: Manoj K. Agarwal, Curt L. Cotner, Amitava Kundu
  • Publication number: 20150066838
    Abstract: In an embodiment a method is provided. The method includes receiving forecast information in a computer from a first group of users. The method also includes integrating the forecast information into a database and corresponding OLAP cube accessible by the computer. The method further includes receiving comments on specific entries of the forecast information in the computer from a second group of users. The method also includes receiving changes of the forecast information in the computer from a second group of users The method further includes integrating comments and changes into the database and OLAP cube. Moreover, the method includes providing a display of the comments and the changes to the first group of users in an interface to the computer.
    Type: Application
    Filed: November 3, 2014
    Publication date: March 5, 2015
    Applicant: RIGHT90, INC.
    Inventors: Kim Orumchian, Art Stabenow, Dean Skelton, David Petiot
  • Patent number: 8972334
    Abstract: A method includes monitoring, on a computing device, data events corresponding to manipulation of data by an application. The monitoring is performed by a data library service that is embedded in the application. The method includes, in response to the monitoring indicating first data on the computing device is modified by the application, synchronizing, by the computing device under control at least in part of the data library service, second data stored on one or more storage providers in a network with the first data modified by the application and stored on the computing device. Apparatus and computer program products are also disclosed.
    Type: Grant
    Filed: December 21, 2012
    Date of Patent: March 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: Paul C. Castro, Marco Pistoia, John Ponzo
  • Patent number: 8972337
    Abstract: A bloom filter is generated for efficient query processing for unsorted data in a column of a columnar database. Bloom filters represented as bitmaps are generated for data blocks storing data for a column of a columnar database table. An indication of a query directed toward the column is received and the bloom filter for each data block is examined to determine which ones of the data blocks do not need to be read in order to service the query for the select data. Data is then read from the data blocks storing data for the column excepting the ones which do not need to be read.
    Type: Grant
    Filed: February 21, 2013
    Date of Patent: March 3, 2015
    Assignee: Amazon Technologies, Inc.
    Inventor: Anurag Windlass Gupta
  • Patent number: 8972338
    Abstract: A log file contains operation records, each operation record is of a certain type, and each operation record is associated with a transaction. A plurality of operation records is read from the log file into a record store. Records of the plurality of operation records of each operation record type are sampled at a predefined sampling rate. Operation records in the plurality of operations records are identified that are associated with completed transactions of which the sampled operation records are associated. The identified operation records are then extracted from the record store into a data store.
    Type: Grant
    Filed: March 11, 2014
    Date of Patent: March 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: Manoj K. Agarwal, Curt L. Cotner, Amitava Kundu
  • Patent number: 8972336
    Abstract: A system and method for mapping columns from a source file to a target file. The header for each source column is evaluated heuristically to see if the header matches a predefined entity. The contents of a group of cells in the source column are evaluated probabilistically to determine a probability that the cell contents correspond to at least one of the predefined entities. A score is assigned to the likelihood that the column corresponds to one or more predefined entities. If the score meets a threshold, then the correspondence between the source column and one or more predefined entities is mapped. If the score fails to meets the threshold, then the correspondence between the source column and one or more undefined entities is mapped. Finally, each source column is transformed into a target column in accord with the map.
    Type: Grant
    Filed: February 21, 2013
    Date of Patent: March 3, 2015
    Assignee: salesforce.com, inc.
    Inventor: Arun Kumar Jagota
  • Publication number: 20150058278
    Abstract: A computer implemented system for automatically determining a reliability score of a data report. The system includes a data structure indicative of data lineage that includes data assets flowing into a data report. The system further includes a processor configured to calculate trust scores for data assets based on characteristics of the data assets as well as a data report trust density score indicative of the reliability score of the report based on the calculated trust scores. The system further includes an output device configured to output the calculated data report trust density score.
    Type: Application
    Filed: August 20, 2013
    Publication date: February 26, 2015
    Applicant: International Business Machines Corporation
    Inventors: Michael Fankhauser, Benjamin Halberstadt, Roger Hecker, Ilan Prager, Erel Sharf
  • Publication number: 20150058280
    Abstract: A computer implemented method, computer program product and system for data quality monitoring includes measuring a data quality of loaded data relative to a predefined data quality metric. The measuring the data quality includes identifying delta changes in at least one of the loaded data and the data quality rules relative to a previous measurement of the data quality of the loaded data. Logical calculus defined in the data quality rules is applied to the identified delta changes.
    Type: Application
    Filed: October 21, 2014
    Publication date: February 26, 2015
    Inventors: Sebastian Nelke, Martin Oberhofer, Yannick Saillet, Jens Seifert
  • Publication number: 20150058279
    Abstract: A computational device receives input data and control data, where the control data includes instructions to modify one or more operations performed during a runtime execution associated with the input data. The control data is processed to modify the one or more operations during the runtime execution associated of the input data.
    Type: Application
    Filed: September 30, 2014
    Publication date: February 26, 2015
    Inventors: Shruti Choudhary, Jeff J. Li, Wendi L. Nusbickel
  • Patent number: 8965838
    Abstract: A computer implemented method, system and computer program product for managing data in databases generates a control file for a database by utilizing a definition file associated with the database and predefined column definitions, organizes data within a data file in a predefined format, wherein the data is desirable to be loaded in the database, and loads the data from the data file to the database by utilizing the control file and the data file.
    Type: Grant
    Filed: January 17, 2012
    Date of Patent: February 24, 2015
    Assignee: International Business Machines Coporation
    Inventor: Kartika Garg
  • Patent number: 8965836
    Abstract: Some embodiments provide a system for managing multidimensional data storage (MDDS) cubes. The system includes a content pack manager for processing a content pack that specifies (1) a set of multidimensional data storage (MDDS) cube definitions for a set of MDDS cubes, (2) a set of data sources that includes data for each of the MDDS cubes in the set of MDDS cubes, and (3) a content map that includes a set of relationships between data for the set of MDDS cubes and the set of data sources. The system includes an MDDS cube manager for provisioning, based on the content pack, the set of MDDS cubes to a set of MDDS clients for viewing the set of MDDS cubes.
    Type: Grant
    Filed: October 8, 2011
    Date of Patent: February 24, 2015
    Assignee: Bay Dynamics
    Inventors: Ryan G. Stolte, Firas S. Rifai
  • Patent number: 8965877
    Abstract: A system and methods for automatically assigning of classification codes to a business based on information about the business collected from the Internet are provided in which data extracted from trawling the Internet is compared to a node structure based on a taxonomy of a selected business classification code system.
    Type: Grant
    Filed: March 14, 2013
    Date of Patent: February 24, 2015
    Assignee: Glenbrook Networks
    Inventors: Julia Komissarchik, Edward Komissarchik
  • Patent number: 8966456
    Abstract: A class definition syntax that leverages the array and math-oriented nature of a dynamically typed array-based programming language is discussed. Some embodiments of the present invention provides a mechanism for using meta-data to define a class implemented in the dynamically typed array-based programming language. The meta-data provide an easy way to provide class definitions and allow a user to access the class definitions from objects. The meta-data also support extension of class definitions without introducing backward incompatibility.
    Type: Grant
    Filed: July 17, 2006
    Date of Patent: February 24, 2015
    Assignee: The MathWorks, Inc.
    Inventor: David A. Foti
  • Patent number: 8959123
    Abstract: Generally described, the present invention is directed to a network operating system that provides more effective ways of leveraging the connectivity of computer networks. In one embodiment, an XML virtual machine is implemented that accepts high-level application code written in an XML programming language as input. Functionality is provided to interpret or translate the application code written in an XML programming language into code that is suitable for execution across computer platforms. Moreover, the XML virtual machine supports the Model View Controller (MVC) design paradigm that facilitates true data abstraction from applications to a common data model. Multi-instance applications, with each instance potentially supporting multiple views, may be created and executed.
    Type: Grant
    Filed: September 29, 2008
    Date of Patent: February 17, 2015
    Assignee: Xcerion Aktiebolag
    Inventors: Daniel Arthursson, Marcus Bristav
  • Publication number: 20150046390
    Abstract: In one embodiment, a method receives a document including metadata in a serial format and parses the document to determine components in the metadata, a nested structure of the components, and an order of the components in the data. The metadata is stored in a table format where columns of the table define the nested structure of the components and the order of the components in the data. A query is received for the data. The method determines components from the metadata stored in the table format from the columns that satisfy the query where the components are determined based on the nested structure of the determined components and the order of the determined components from information in the columns of the table. A result for the query is output including the determined components based on the nested structure of the determined components and the order of the determined components.
    Type: Application
    Filed: September 5, 2013
    Publication date: February 12, 2015
    Applicant: SAP AG
    Inventors: Enping Tu, Yanlun Ma, Hung-Liang Yang, Yung-Yin Chen
  • Publication number: 20150046389
    Abstract: Handling extract-transform-load (ETL) job mismatches as “exceptions.” Exception handling may include the following steps: (i) determining a mismatch while running an extract-transform-load job with the mismatch being a mismatch of at least one of the following types: design time information mismatch, and/or operational metadata mismatch; and (ii) responsive to determining the mismatch, handling the mismatch as an exception.
    Type: Application
    Filed: August 6, 2013
    Publication date: February 12, 2015
    Applicant: International Business Machines Corporation
    Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Ajay Sood
  • Patent number: 8954377
    Abstract: A system, a method and a computer readable storage medium for pre-processing data collected from one or more data sources more accurately summarize data. During the pre-processing, multiple raw data are summarized into a pre-processed datum. By using the pre-processed data entries, more accurate trend data may be generated. Alternatively, data entries are indexed and selectively retrieved based on indices. Decimation of data points are performed based on the indices without retrieving all the data sets from the database, reducing the data access time for returning a query result. Additional data sets may also be retrieved efficiently from the database using the indices.
    Type: Grant
    Filed: May 25, 2012
    Date of Patent: February 10, 2015
    Assignee: Moog Inc.
    Inventors: Martin A. Turon, Rahul Kapur
  • Patent number: 8954375
    Abstract: A method and system for developing data integration applications with reusable semantic types to represent and process application data. Methods include creating schemas to describe external data, creating semantic types to describe internal data, mapping schemas to semantic types, developing dataflows that configure input and output operations using schemas, mappings, and semantic types and all other transformation operations and functions based solely on semantic types, and executing dataflows defined in this manner.
    Type: Grant
    Filed: October 13, 2011
    Date of Patent: February 10, 2015
    Assignee: QlikTech International AB
    Inventors: William V. Kehoe, Jerry L. Callen, Steven R. Frechette
  • Patent number: 8954378
    Abstract: A name of one or more entity classes of the data model may be refined to conform to a naming convention. A semantic meaning of each of the names and one or more attributes of each entity class may be determined. It may be determined that the name of a first entity class is semantically similar to the name of a second entity class based on a semantic distance between the semantic meaning of the names, where a substantial similarity may be determined between the first entity class and the second entity class by comparing the semantic meaning of the one or more attributes of the first entity class to the semantic meaning of the one or more attributes of the second entity class. The data model may be normalized based on the substantial similarity.
    Type: Grant
    Filed: August 28, 2012
    Date of Patent: February 10, 2015
    Assignee: SAP AG
    Inventors: Gunther Stuhec, Florian Gessner, Jens Lemcke
  • Patent number: 8954376
    Abstract: Techniques are disclosed for identifying transcoding tables in an Extract-Transform-Load (ETL) process, by identifying, by operation of one or more computer processors, records passing through an operator configured to replace values in the records with values from at least one table linked to the operator before being sent to an output table, wherein the operator specifies an operation for extracting, transforming, or loading data stored in one or more source systems into storage by a target system, and evaluating at least a first table linked to the operator to determine whether the first table is a transcoding table by assigning a score to the first table, wherein the score is indicative of the likelihood that the first table is a transcoding table, wherein a transcoding table is used to harmonize values from a plurality of tables in the one or more source systems to a table in the target.
    Type: Grant
    Filed: March 8, 2012
    Date of Patent: February 10, 2015
    Assignee: International Business Machines Corporation
    Inventors: Dan J. Mandelstein, Ivan M. Milman, Martin Oberhofer, Sushain Pandit
  • Patent number: 8949204
    Abstract: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.
    Type: Grant
    Filed: August 29, 2012
    Date of Patent: February 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam
  • Patent number: 8949175
    Abstract: A generic approach for automatically ingesting data into an HDFS (Hadoop File System) based data warehouse includes a datahub server, a generic pipelined data loading framework, and a meta-data model that, together, address data loading efficiency, data source heterogeneities, and data warehouse schema evolvement. The loading efficiency is achieved via the MapReduce scale-out solution. The meta-data model is comprised of configuration files and a catalog. The configuration file is setup per ingestion task. The catalog manages the data warehouse schema. When a scheduled data loading task is executed, the configuration files and the catalog collaboratively drive the datahub server to load the heterogeneous data to their destination schemas automatically.
    Type: Grant
    Filed: May 8, 2012
    Date of Patent: February 3, 2015
    Assignee: Turn Inc.
    Inventors: Mingxi Wu, Songting Chen
  • Patent number: 8943222
    Abstract: A system, method, and computer readable medium for employing data obtained from an electronic network, such as the Internet, as an emulation of sensor data. A virtual sensor bridge mechanism may be interfaced to a data source, such as the Internet, to receive Web data or remote sensor data. The virtual sensor bridge may convert the received data into a format usable by a logic controller, thereby emulating sensor data and enabling the logic controller to treat the networked data source as a sensor. Furthermore, the virtual sensor bridge may interface directly with one or more sensors and may serve as an adapter or data relay between a sensor and the logic controller.
    Type: Grant
    Filed: August 30, 2011
    Date of Patent: January 27, 2015
    Assignee: ioBridge, Inc.
    Inventors: Peter Hans Scharler, Robert Stanley Mawrey, Jason Thomas Winters
  • Patent number: 8943472
    Abstract: A system and method for reading and writing datasets (SAS datasets) written in a proprietary runtime macro language (SAS, owned by the SAS Institute, Inc.) directly, without going through a system that uses the SAS proprietary runtime macro language. The system employs a JAVA Database Connectivity (JDBC) application program interface (API) driver usable by JDBC enabled JAVA applications. The driver extracts from the SAS dataset information from a header page that enables identification of information on metadata pages which in turn are used by the driver to present to the JAVA application a view of the data as a database table, from which the JAVA application can request reading and writing of the SAS dataset.
    Type: Grant
    Filed: May 27, 2013
    Date of Patent: January 27, 2015
    Assignee: Dulles Research LLC
    Inventor: Pavel Ganelin
  • Publication number: 20150026115
    Abstract: A computer software implemented method for transforming a first extract transform load (ETL) job having at least some unload transform load (UTL) portions. The method includes the following steps: (i) decomposing the first ETL job into an intermediate set of one or more jobs; and (ii) for each job of the intermediate set, transforming the job into a transactionally equivalent job to yield a final set of one or more jobs. The decomposing is performed so that each job of the intermediate jobs set is a Simple UTL job. The transforming is performed so that each job of the final set includes no UTL portions.
    Type: Application
    Filed: July 19, 2013
    Publication date: January 22, 2015
    Inventors: Alexander Gorelik, Sriram K. Padmanabhan, James D. Spyker
  • Publication number: 20150026116
    Abstract: Embodiments of the present invention relate to a method, program product, and system for managing the percentage of unpopulated cells in a multidimensional data structure during the servicing of multidimensional analytical queries. In an embodiment, a multidimensional data structure is stored in a memory location of a computing device, wherein the multidimensional data structure includes a dimension that is a subset of a plurality of dimensions included in a relational database. The computing device determines a frequency of utilization of a dimension to service a multidimensional query request. The computing device combines the dimension with the multidimensional data structure responsive to the frequency of utilization.
    Type: Application
    Filed: September 17, 2014
    Publication date: January 22, 2015
    Inventors: Vijay Bharti, Jeremiah Joseph, Hemant K. Sivaswamy, Kamalpreet Sapna, Rohit Jain
  • Publication number: 20150026117
    Abstract: A data warehouse system comprising a warehouse database and a data warehouse manager communicatively coupled to the database. The warehouse manager is to store data to and retrieve data from the warehouse database and perform metadata reconciliation to update metadata of objects in the data warehouse to reflect a change in metadata of a first object in the warehouse by identifying a first set of objects impacted by the change to the metadata of the first object, identifying a second set of objects on which the first object depends, identifying at least one task that has to be completed to propagate the impact of the change in the metadata of the first object to objects in the first and second sets of objects, performing the task, and dynamically computing additional tasks based on whether or not definitions of objects in the first or second sets of objects changes based upon completion of the at least one task.
    Type: Application
    Filed: October 6, 2014
    Publication date: January 22, 2015
    Inventor: Winnie Tak Yu Wan