Data Extraction, Transformation, and Loading (ETL) Patents (Class 707/602)
-
Patent number: 9002780
Abstract: A method for loading data into an alternate evaluator for evaluating directory service operations includes loading data into a list of entries, loading data into a list of normalized values and associated pointers, and loading data into a list of indexes and associated pointers.
Type: Grant
Filed: May 20, 2005
Date of Patent: April 7, 2015
Assignee: CA, Inc.
Inventors: Richard H. Harvey, Ronald W. Ramsay
-
Patent number: 9001182
Abstract: In one embodiment, source data for a communication session may be split into an audio portion for transmission on a phone channel and a non-audio portion for transmission on a data channel. A server and a phone may accordingly establish an audio portion of a communication session on the phone channel. In response to a trigger, the server may provide a push notification on the data channel to the phone, where the push notification is associated with an application executing on the phone that is configured to participate in the non-audio portion of the communication session on the data channel with the server. Upon obtaining the push notification on the data channel during the audio portion on the phone channel, the application may correspondingly activate on the phone to participate in the non-audio portion of the communication session during the phone's participation in the audio portion (e.g., merging the portions).
Type: Grant
Filed: April 24, 2013
Date of Patent: April 7, 2015
Assignee: Cisco Technology, Inc.
Inventors: Yongjian Tian, Zheng Yuan, Tejas Bhandarkar
-
Patent number: 8996450
Abstract: A system and method for allocating storage devices within a database system including fast solid state storage devices (SSDs) and traditional hard drive storage devices (HDDs). The method repetitively collects temperature and queue depth data for each storage device, and determines from the collected data points a service time for each storage device. The method then selects, in response to an allocation request, a storage device having a minimum total wait time determined from the current queue depth and service time for each storage device.
Type: Grant
Filed: December 30, 2012
Date of Patent: March 31, 2015
Assignee: Teradata US, Inc.
Inventor: Paul Anthony Rubio
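As a rough illustration of the selection step in the abstract above, the sketch below picks the device with the smallest estimated wait. It is not the patented implementation: the `Device` fields, the function names, and the formula `(queue_depth + 1) * service_time_ms` are assumptions chosen for the example, since the abstract only says the wait time is derived from queue depth and service time.

```python
from dataclasses import dataclass


@dataclass
class Device:
    name: str
    queue_depth: int        # requests currently queued (hypothetical field)
    service_time_ms: float  # estimated time to serve one request (hypothetical field)


def total_wait_ms(device: Device) -> float:
    # Assumed model: every queued request, plus the new one, costs one service time.
    return (device.queue_depth + 1) * device.service_time_ms


def pick_device(devices: list[Device]) -> Device:
    # Choose the device whose estimated total wait is smallest.
    return min(devices, key=total_wait_ms)


devices = [
    Device("ssd0", queue_depth=8, service_time_ms=0.2),
    Device("hdd0", queue_depth=1, service_time_ms=6.0),
    Device("ssd1", queue_depth=3, service_time_ms=0.25),
]
chosen = pick_device(devices)
```

Under this model a busy SSD can still beat an idle HDD, which is why the comparison uses the product of the two measurements rather than queue depth alone.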
-
Patent number: 8996545
Abstract: The embodiments provide an in-memory database system having an extraction module configured to extract data (e.g., business data and address data) from one or more external data sources and transform the data into a standard format, a geocoder configured to geocode the address data including obtaining spatial data based on the address data using an internal reference table, and an internal database configured to store the internal reference table, the business data, the address data, and the spatial data.
Type: Grant
Filed: November 13, 2012
Date of Patent: March 31, 2015
Assignee: SAP SE
Inventors: Yingyu Chen, Bin Dong, Yongyuan Shen
-
Publication number: 20150088806
Abstract: A system and method for supporting multi-tenancy in a federated data management system are provided herein. The method includes receiving a query from a client at a data management server, wherein the query includes a tenant property condition corresponding to the client. The method also includes identifying an external data store including data specified by the query and determining whether the external data store is multi-tenant enabled. The method further includes removing the tenant property condition and executing the query to retrieve the specified data if the external data store is not multi-tenant enabled.
Type: Application
Filed: July 31, 2012
Publication date: March 26, 2015
Inventors: Yalon Lotan, Gil Tzadikevitch, Hadas Avraham
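The query-rewriting step in the abstract above can be pictured with a small sketch. It assumes, purely for illustration, that a query's conditions are a dictionary and that the tenant condition is identified by a key; `prepare_query` and its parameters are hypothetical names, not taken from the application.

```python
def prepare_query(conditions: dict, tenant_key: str, store_is_multi_tenant: bool) -> dict:
    """Return the conditions to send to an external data store."""
    if store_is_multi_tenant:
        # The store understands tenants, so the condition is passed through.
        return dict(conditions)
    # Otherwise drop the tenant condition before executing the query.
    return {key: value for key, value in conditions.items() if key != tenant_key}


query = {"tenant_id": "acme", "status": "open"}
for_legacy_store = prepare_query(query, "tenant_id", store_is_multi_tenant=False)
for_tenant_store = prepare_query(query, "tenant_id", store_is_multi_tenant=True)
```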
-
Publication number: 20150088807
Abstract: A method of resolving data queries in a data processing system. The method comprises receiving in the data processing system a data query, where the data processing system stores a plurality of information units describing pluralities of data elements, a first information unit having a retrieval subunit that includes information for retrieving all unique data elements in a first plurality of data elements and a summary subunit including summarized information about data elements in the first plurality of data elements. The method further includes deriving, via the data processing system, a result of the data query, wherein the result of the data query comprises a plurality of new data elements. The data processing system uses summary subunits of information units to select a set of information units describing data elements that are sufficient to resolve the data query.
Type: Application
Filed: September 25, 2014
Publication date: March 26, 2015
Inventors: Graham Toppin, Janusz Borkowksi, Dominik Slezak, Shengli Shi, Piotr Synak, Jakub Wroblewski, Todd Joseph Wongkee, George Charalabopoulos
-
Patent number: 8990410
Abstract: Certain exemplary embodiments can provide a method, which can comprise, responsive to a determination that a Web-based transaction has caused an exceedance of a maximum threshold percentage of transactions processed offline, at a user interface of an information device, rendering an automatic alert indicative of the exceedance. The Web-based transaction can be requested from a browser.
Type: Grant
Filed: December 11, 2008
Date of Patent: March 24, 2015
Assignee: IBM International Group BV
Inventor: Suresh Damodaran
-
Patent number: 8990146
Abstract: According to some embodiments, a system, method, means, and/or computer program code are provided to facilitate a display of information on a client device. For example, a server may retrieve first enterprise data from an enterprise database and store the first enterprise data into a first client based cache at the server, the first client based cache being associated with a first user. Similarly, the server may retrieve second enterprise data from the enterprise database and store the second enterprise data into a second client based cache at the server, the second client based cache being associated with a second user. Subsequent to the storing of the first enterprise data, the server may receive a display request from a first client device associated with the first user and transmit the first enterprise data to the first client device.
Type: Grant
Filed: December 22, 2010
Date of Patent: March 24, 2015
Assignee: SAP SE
Inventors: Karl-Peter Nos, Andreas Riehl, Belenki Michael
-
Patent number: 8990148
Abstract: A method is provided for parsing a configuration file. The method comprises determining whether an element in a sequence of configuration file elements is a type of element expected at the configuration file element's position in the sequence; when the configuration file element is not the expected type, consulting a set of rules for handling unexpected configuration file elements; and following a rule in the rule set. The rules cause different actions to be taken at a given position in the configuration file at different times of executing the method for parsing. One of the rules specifies that subsequent configuration file elements in the sequence are to be analyzed and, if at least one subsequent configuration file element is an expected type, the at least one subsequent configuration file element is to be considered valid and is to be parsed in a manner appropriate for the subsequent configuration file element.
Type: Grant
Filed: January 8, 2013
Date of Patent: March 24, 2015
Assignee: Sprint Communications Company L.P.
Inventors: Bruce Edward Ziegler, William Peter Bryan
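The look-ahead rule described in the abstract above can be sketched as follows. This is an illustrative toy rather than the patented parser: it models only that one rule, represents elements as `(type, value)` pairs, and the names `parse_config` and `max_lookahead` are assumptions made for the example.

```python
def parse_config(elements, expected_types, max_lookahead=2):
    """Parse (type, value) pairs, skipping unexpected elements when possible.

    Returns the parsed values keyed by type and the list of skipped elements.
    """
    parsed, skipped = {}, []
    pos = 0
    for expected in expected_types:
        # Look at the current element and up to max_lookahead elements after it.
        window = elements[pos:pos + 1 + max_lookahead]
        offset = next(
            (k for k, (kind, _) in enumerate(window) if kind == expected), None
        )
        if offset is None:
            raise ValueError(f"no {expected!r} element near position {pos}")
        # Anything before the match was unexpected at this position.
        skipped.extend(elements[pos:pos + offset])
        parsed[expected] = elements[pos + offset][1]
        pos += offset + 1
    return parsed, skipped


elements = [("host", "db1"), ("comment", "x"), ("port", "5432")]
parsed, skipped = parse_config(elements, ["host", "port"])
```

Here the stray `comment` element is recorded as skipped, and the `port` element that follows it is still parsed as valid.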
-
Patent number: 8990904
Abstract: Mechanisms are provided for collecting configuration data from components of a managed computing system environment. A portion of code is obtained, in a data processing system, from a data collection system that does not have security credentials to allow the data collection system to directly access the managed computing system environment. The portion of code is executed by the data processing system using security credentials maintained in the data processing system. Executing the portion of code causes the data processing system to access the managed computing system environment and collect configuration data from the managed computing system environment. The data processing system, via the portion of code, provides the configuration data collected from the managed computing system to the data collection system which stores the collected configuration data in a data storage.
Type: Grant
Filed: June 29, 2012
Date of Patent: March 24, 2015
Assignee: International Business Machines Corporation
Inventors: Joel W. Branch, Michael E. Nidd, Ruediger Rissmann
-
Patent number: 8990147
Abstract: Technologies are described herein for defining the scope of a sub-cube based upon a database query. A request is received to perform a query of a multi-dimensional database, such as a cube, stored at a server computer. In response to the request, the scope for a sub-portion, called a sub-cube, of the multi-dimensional database is determined based upon the query. The scope of the sub-cube may be determined by identifying the minimum sub-portion of the cube necessary to perform the query. Once the scope has been determined, the sub-cube is generated at the server computer and transmitted to a client computer. The client computer stores the sub-cube locally and performs the requested database query on the sub-cube stored at the client computer. A user interface may be provided through which a user can modify the scope of the sub-cube determined based upon the query.
Type: Grant
Filed: November 29, 2011
Date of Patent: March 24, 2015
Assignee: Microsoft Technology Licensing, LLC
Inventors: Xiaohong Yang, Luming Wang
-
Publication number: 20150081620
Abstract: A method facilitating interoperability across the integrating platforms is provided by composing purpose specific integrating tools using a conceptual modeling framework and a set of primitive foundational components. The model driven architecture provides easy migration from one solution architecture to another. Also, the present invention discloses a method to migrate from one implementation to another using a set of composed purpose specific integration tools.
Type: Application
Filed: November 24, 2014
Publication date: March 19, 2015
Applicant: Tata Consultancy Services Limited
Inventors: Sreedhar Sannareddy Reddy, Prasenjit Das, Raghavendra Yeddula Reddy
-
Publication number: 20150081619
Abstract: Systems and methods for interest-driven business intelligence systems including geo-spatial data in accordance with embodiments of the invention are illustrated.
Type: Application
Filed: June 24, 2014
Publication date: March 19, 2015
Inventors: Hans-Frederick Brown, James Mulholland, Kevin Scott Beyer
-
Publication number: 20150081616
Abstract: According to one embodiment of the present invention, a system inserts data into a database object. The system associates the database object with a parameter specifying materialization of data for the database object. The system inserts data into the database object and materializes the data in accordance with the parameter to provide access to the data from the database object, wherein the parameter specifies a portion of the data to be materialized upon insertion. Embodiments of the present invention further include a method and computer program product for inserting data into a database object in substantially the same manners described above.
Type: Application
Filed: September 16, 2013
Publication date: March 19, 2015
Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Daniel Kozin, Meichi M. Lin, Arthur Marais, Nigel G. Slinger, John B. Tobler, Wen J. Zhu
-
Publication number: 20150081617
Abstract: A system may include at least one data source configured to provide network usage data indicative of the existence of communications with subscriber devices, and web and application usage data indicative of data usage of the subscriber network by the subscriber devices.
Type: Application
Filed: September 18, 2013
Publication date: March 19, 2015
Applicant: CELLCO PARTNERSHIP (D/B/A VERIZON WIRELESS)
Inventors: Altaff Mohiddin Shaik, Narendra Polavarapu, Dustin Logan Applegate, Satish Kumar Hosanagara, Richard Stemper
-
Publication number: 20150081618
Abstract: Systems and methods for interest-driven business intelligence systems including event-oriented data in accordance with embodiments of the invention are illustrated. In one embodiment, an interest-driven business intelligence system includes raw data storage configured to contain raw data and perform ETL processes, a data mart configured to contain metadata that describes the raw data, and an intermediate processing layer, wherein the intermediate processing layer is configured to compile an interest-driven data pipeline configured to generate ETL processing jobs to generate event-oriented data from the raw data by filtering the raw data based on the metadata describing the raw data, determining ordering data based on the metadata describing the raw data, aligning the filtered raw data based on the ordering data, and generating event-oriented data based on the aligned filtered raw data, and storing the event-oriented data in the data mart.
Type: Application
Filed: March 5, 2014
Publication date: March 19, 2015
Applicant: Platfora, Inc.
Inventors: John Schuster, Benjamin Mark Werther, Brian Babcock, Kevin Scott Beyer, Hans-Frederick Brown
-
Patent number: 8984598
Abstract: Mechanisms are provided for collecting configuration data from components of a managed computing system environment. A portion of code is obtained, in a data processing system, from a data collection system that does not have security credentials to allow the data collection system to directly access the managed computing system environment. The portion of code is executed by the data processing system using security credentials maintained in the data processing system. Executing the portion of code causes the data processing system to access the managed computing system environment and collect configuration data from the managed computing system environment. The data processing system, via the portion of code, provides the configuration data collected from the managed computing system to the data collection system which stores the collected configuration data in a data storage.
Type: Grant
Filed: June 27, 2012
Date of Patent: March 17, 2015
Assignee: International Business Machines Corporation
Inventors: Joel W. Branch, Michael E. Nidd, Ruediger Rissmann
-
Patent number: 8983896
Abstract: Disclosed is a method, system, and program for providing access to spatial data. A request for data is received. Enterprise and third party data are integrated. The integrated data is processed. Spatially referenced results are generated using the processed data. The spatially referenced results are returned in response to the request.
Type: Grant
Filed: July 7, 2011
Date of Patent: March 17, 2015
Assignee: The Paradigm Alliance, Inc.
Inventors: Tim A. von Kaenel, David Neil Dyrnaes, C. Suresh Kumar, Jared Paul Wayman, Jonathan David Goodwin, Craig Evan Trivelpiece, Joseph Mihalich, Anthony Page Jenkins, Richard Hoyt Odom, Jr., Mark Andrew Stier, Anne Janetta Obee
-
Patent number: 8984014
Abstract: The invention provides a system and method for retrieving and storing industrial data, the system comprising a data retriever that includes a data retrieval manager and one or more watchers for monitoring data associated with one or more industrial devices, wherein if the data associated with the one or more industrial devices is new or modified, the one or more watchers notifies the data retrieval manager thereof and the data retrieval manager uploads the new or modified data. The system further includes a database manager for receiving the new or modified data in a first form from the data retrieval manager and for storing the new or modified data in a structural data form in one or more databases.
Type: Grant
Filed: May 2, 2013
Date of Patent: March 17, 2015
Assignee: Rockwell Automation Technologies, Inc.
Inventors: Marek Obitko, Ivan Havel, Michal Fortik, Robert Mavrov, Radek Marik
-
Patent number: 8984157
Abstract: Aspects of the invention are directed to a method and system for discovering business content transfer paths in a network using file transfer information, and for calculating business risk per network component in a network. A method according to an embodiment includes: obtaining file transfer information for a plurality of file transfers between a plurality of nodes within a network; generating a confidence of correlation for each pair of file transfers in the plurality of file transfers; determining interdependencies between the plurality of file transfers based on the confidence of correlation for each pair of file transfers; and determining a business content transfer path based on the interdependencies between the plurality of file transfers.
Type: Grant
Filed: July 18, 2012
Date of Patent: March 17, 2015
Assignee: International Business Machines Corporation
Inventors: Vedika Dalmia, Thomas I. Lewin, James W. Smith, Peter F. Weller
-
Publication number: 20150074037
Abstract: An in memory database system includes a random access memory to store an in memory database table, a stored set of defined views over the database tables, a stored set of analytical services adapted to provide transaction services and database warehouse services, an engine to run the analytical services on the database table, and an interface to interact with users and provide access to the engine, analytical services, and views to perform transactions and data warehouse functions on the database table.
Type: Application
Filed: September 12, 2013
Publication date: March 12, 2015
Applicant: SAP AG
Inventor: Siar Sarferaz
-
Publication number: 20150074038
Abstract: A method includes processing a transaction on an in memory database where data being processed has a validity time, updating a time dependent data view responsive to the transaction being processed to capture time validity information regarding the data, and storing the time validity information in a historization table to provide historical access to past time dependent data following expiration of the validity time.
Type: Application
Filed: September 12, 2013
Publication date: March 12, 2015
Applicant: SAP AG
Inventor: Siar Sarferaz
-
Patent number: 8977643
Abstract: The disclosed architecture leverages realtime continuous event processing (CEP) to address using a general input interface framework to import a dynamic set of event types (e.g., assets), and using a declarative, expressive query model to implement monitoring and management tasks on an asset level. This is in contrast to looking separately at single values from static databases and/or realtime streams as is common conventionally. The architecture uses the CEP data model to model assets as realtime event types. Thus, queries can be formulated per asset and not just per single stream. The architecture uses the query capabilities of CEP to formulate asset management and monitoring tasks as standing, declarative queries, and uses the input interface of a CEP platform to correlate data from different data sources with different dynamic properties.
Type: Grant
Filed: June 30, 2010
Date of Patent: March 10, 2015
Assignee: Microsoft Corporation
Inventors: Roman Schindlauer, Balan Sethu Raman, Torsten W. Grabs, Beysim Sezgin
-
Patent number: 8977587
Abstract: A log file contains operation records, each operation record is of a certain type, and each operation record is associated with a transaction. A plurality of operation records is read from the log file into a record store. Records of the plurality of operation records of each operation record type are sampled at a predefined sampling rate. Operation records in the plurality of operation records are identified that are associated with completed transactions of which the sampled operation records are associated. The identified operation records are then extracted from the record store into a data store.
Type: Grant
Filed: January 3, 2013
Date of Patent: March 10, 2015
Assignee: International Business Machines Corporation
Inventors: Manoj K. Agarwal, Curt L. Cotner, Amitava Kundu
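One way to picture the sampling and extraction steps in the abstract above is the sketch below. It is a simplified illustration under stated assumptions, not the patented method: records are plain dictionaries with hypothetical `txn` and `type` keys, sampling is a deterministic "every n-th record of each type", and the set of completed transactions is supplied by the caller.

```python
from collections import defaultdict


def extract_sampled_transactions(records, every_nth, completed_txns):
    """Return all records of completed transactions touched by a sampled record."""
    seen_per_type = defaultdict(int)
    sampled_txns = set()
    for record in records:
        seen_per_type[record["type"]] += 1
        # Sample the 1st, (n+1)-th, (2n+1)-th ... record of each operation type.
        if (seen_per_type[record["type"]] - 1) % every_nth == 0:
            sampled_txns.add(record["txn"])
    wanted = sampled_txns & set(completed_txns)
    # Keep whole transactions, not just the sampled records themselves.
    return [record for record in records if record["txn"] in wanted]


log = [
    {"txn": 1, "type": "insert"},
    {"txn": 2, "type": "insert"},
    {"txn": 1, "type": "update"},
    {"txn": 3, "type": "insert"},
    {"txn": 2, "type": "update"},
]
extracted = extract_sampled_transactions(log, every_nth=2, completed_txns={1, 2})
```

Transaction 3 is sampled but not completed, and transaction 2 is completed but never sampled, so only the records of transaction 1 are extracted.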
-
Publication number: 20150066838
Abstract: In an embodiment a method is provided. The method includes receiving forecast information in a computer from a first group of users. The method also includes integrating the forecast information into a database and corresponding OLAP cube accessible by the computer. The method further includes receiving comments on specific entries of the forecast information in the computer from a second group of users. The method also includes receiving changes of the forecast information in the computer from a second group of users. The method further includes integrating comments and changes into the database and OLAP cube. Moreover, the method includes providing a display of the comments and the changes to the first group of users in an interface to the computer.
Type: Application
Filed: November 3, 2014
Publication date: March 5, 2015
Applicant: RIGHT90, INC.
Inventors: Kim Orumchian, Art Stabenow, Dean Skelton, David Petiot
-
Patent number: 8972334
Abstract: A method includes monitoring, on a computing device, data events corresponding to manipulation of data by an application. The monitoring is performed by a data library service that is embedded in the application. The method includes, in response to the monitoring indicating first data on the computing device is modified by the application, synchronizing, by the computing device under control at least in part of the data library service, second data stored on one or more storage providers in a network with the first data modified by the application and stored on the computing device. Apparatus and computer program products are also disclosed.
Type: Grant
Filed: December 21, 2012
Date of Patent: March 3, 2015
Assignee: International Business Machines Corporation
Inventors: Paul C. Castro, Marco Pistoia, John Ponzo
-
Patent number: 8972337
Abstract: A bloom filter is generated for efficient query processing for unsorted data in a column of a columnar database. Bloom filters represented as bitmaps are generated for data blocks storing data for a column of a columnar database table. An indication of a query directed toward the column is received and the bloom filter for each data block is examined to determine which ones of the data blocks do not need to be read in order to service the query for the select data. Data is then read from the data blocks storing data for the column excepting the ones which do not need to be read.
Type: Grant
Filed: February 21, 2013
Date of Patent: March 3, 2015
Assignee: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta
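The block-skipping idea in the abstract above can be illustrated with a generic Bloom filter. The sketch is a textbook construction rather than the patented design: the class, the filter size, the number of hash functions, and the use of SHA-256 from Python's standard `hashlib` are all assumptions made for the example.

```python
import hashlib


class BloomFilter:
    """A small Bloom filter whose bitmap is stored in a Python integer."""

    def __init__(self, size_bits=256, num_hashes=3):
        self.size_bits = size_bits
        self.num_hashes = num_hashes
        self.bits = 0

    def _positions(self, value):
        # Derive num_hashes bit positions by salting the value with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{value}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, value):
        for position in self._positions(value):
            self.bits |= 1 << position

    def might_contain(self, value):
        # False means "definitely absent"; True means "possibly present".
        return all((self.bits >> position) & 1 for position in self._positions(value))


def blocks_to_read(filters, value):
    # Only blocks whose filter cannot rule the value out need to be read.
    return [index for index, f in enumerate(filters) if f.might_contain(value)]


blocks = [["oslo", "lima"], ["berlin", "quito"], ["cairo", "hanoi"]]
filters = []
for block in blocks:
    bloom = BloomFilter()
    for city in block:
        bloom.add(city)
    filters.append(bloom)

candidates = blocks_to_read(filters, "berlin")
```

A Bloom filter never produces false negatives, so the block that holds the value is always among the candidates; occasional false positives only cost an unnecessary block read.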
-
Patent number: 8972338
Abstract: A log file contains operation records, each operation record is of a certain type, and each operation record is associated with a transaction. A plurality of operation records is read from the log file into a record store. Records of the plurality of operation records of each operation record type are sampled at a predefined sampling rate. Operation records in the plurality of operation records are identified that are associated with completed transactions of which the sampled operation records are associated. The identified operation records are then extracted from the record store into a data store.
Type: Grant
Filed: March 11, 2014
Date of Patent: March 3, 2015
Assignee: International Business Machines Corporation
Inventors: Manoj K. Agarwal, Curt L. Cotner, Amitava Kundu
-
Patent number: 8972336
Abstract: A system and method for mapping columns from a source file to a target file. The header for each source column is evaluated heuristically to see if the header matches a predefined entity. The contents of a group of cells in the source column are evaluated probabilistically to determine a probability that the cell contents correspond to at least one of the predefined entities. A score is assigned to the likelihood that the column corresponds to one or more predefined entities. If the score meets a threshold, then the correspondence between the source column and one or more predefined entities is mapped. If the score fails to meet the threshold, then the correspondence between the source column and one or more undefined entities is mapped. Finally, each source column is transformed into a target column in accord with the map.
Type: Grant
Filed: February 21, 2013
Date of Patent: March 3, 2015
Assignee: salesforce.com, inc.
Inventor: Arun Kumar Jagota
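The scoring and threshold steps in the abstract above might look something like the following sketch. Everything specific in it is an assumption for illustration: the two entities, the regular expressions, the header hints, the 0.4/0.6 weighting, and the 0.5 threshold are invented here and are not taken from the patent.

```python
import re

# Hypothetical predefined entities with a content pattern and header hints each.
ENTITY_PATTERNS = {
    "email": re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$"),
    "phone": re.compile(r"^\+?[\d\s().-]{7,}$"),
}
HEADER_HINTS = {"email": ("email", "e-mail"), "phone": ("phone", "tel")}


def score_column(header, cells, entity):
    # Header heuristic: does the header text mention the entity?
    header_score = 1.0 if any(h in header.lower() for h in HEADER_HINTS[entity]) else 0.0
    # Content estimate: fraction of sampled cells that look like the entity.
    matches = sum(1 for cell in cells if ENTITY_PATTERNS[entity].match(cell))
    content_score = matches / len(cells) if cells else 0.0
    return 0.4 * header_score + 0.6 * content_score


def map_column(header, cells, threshold=0.5):
    best = max(ENTITY_PATTERNS, key=lambda entity: score_column(header, cells, entity))
    if score_column(header, cells, best) >= threshold:
        return best
    return "undefined"  # below threshold: map to an undefined entity


contact_mapping = map_column("Contact", ["a@x.org", "b@y.com"])
notes_mapping = map_column("Notes", ["hello", "call me"])
```

Combining the two signals lets a column with an unhelpful header such as "Contact" still be mapped when its contents are unambiguous.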
-
Publication number: 20150058278
Abstract: A computer implemented system for automatically determining a reliability score of a data report. The system includes a data structure indicative of data lineage that includes data assets flowing into a data report. The system further includes a processor configured to calculate trust scores for data assets based on characteristics of the data assets as well as a data report trust density score indicative of the reliability score of the report based on the calculated trust scores. The system further includes an output device configured to output the calculated data report trust density score.
Type: Application
Filed: August 20, 2013
Publication date: February 26, 2015
Applicant: International Business Machines Corporation
Inventors: Michael Fankhauser, Benjamin Halberstadt, Roger Hecker, Ilan Prager, Erel Sharf
-
Publication number: 20150058280
Abstract: A computer implemented method, computer program product and system for data quality monitoring includes measuring a data quality of loaded data relative to a predefined data quality metric. Measuring the data quality includes identifying delta changes in at least one of the loaded data and the data quality rules relative to a previous measurement of the data quality of the loaded data. Logical calculus defined in the data quality rules is applied to the identified delta changes.
Type: Application
Filed: October 21, 2014
Publication date: February 26, 2015
Inventors: Sebastian Nelke, Martin Oberhofer, Yannick Saillet, Jens Seifert
-
Publication number: 20150058279
Abstract: A computational device receives input data and control data, where the control data includes instructions to modify one or more operations performed during a runtime execution associated with the input data. The control data is processed to modify the one or more operations during the runtime execution associated with the input data.
Type: Application
Filed: September 30, 2014
Publication date: February 26, 2015
Inventors: Shruti Choudhary, Jeff J. Li, Wendi L. Nusbickel
-
Patent number: 8965838
Abstract: A computer implemented method, system and computer program product for managing data in databases generates a control file for a database by utilizing a definition file associated with the database and predefined column definitions, organizes data within a data file in a predefined format, wherein the data is desirable to be loaded in the database, and loads the data from the data file to the database by utilizing the control file and the data file.
Type: Grant
Filed: January 17, 2012
Date of Patent: February 24, 2015
Assignee: International Business Machines Corporation
Inventor: Kartika Garg
-
Patent number: 8965836
Abstract: Some embodiments provide a system for managing multidimensional data storage (MDDS) cubes. The system includes a content pack manager for processing a content pack that specifies (1) a set of multidimensional data storage (MDDS) cube definitions for a set of MDDS cubes, (2) a set of data sources that includes data for each of the MDDS cubes in the set of MDDS cubes, and (3) a content map that includes a set of relationships between data for the set of MDDS cubes and the set of data sources. The system includes an MDDS cube manager for provisioning, based on the content pack, the set of MDDS cubes to a set of MDDS clients for viewing the set of MDDS cubes.
Type: Grant
Filed: October 8, 2011
Date of Patent: February 24, 2015
Assignee: Bay Dynamics
Inventors: Ryan G. Stolte, Firas S. Rifai
-
Patent number: 8965877
Abstract: A system and methods for automatically assigning classification codes to a business based on information about the business collected from the Internet are provided in which data extracted from trawling the Internet is compared to a node structure based on a taxonomy of a selected business classification code system.
Type: Grant
Filed: March 14, 2013
Date of Patent: February 24, 2015
Assignee: Glenbrook Networks
Inventors: Julia Komissarchik, Edward Komissarchik
-
Patent number: 8966456
Abstract: A class definition syntax that leverages the array and math-oriented nature of a dynamically typed array-based programming language is discussed. Some embodiments of the present invention provide a mechanism for using meta-data to define a class implemented in the dynamically typed array-based programming language. The meta-data provide an easy way to provide class definitions and allow a user to access the class definitions from objects. The meta-data also support extension of class definitions without introducing backward incompatibility.
Type: Grant
Filed: July 17, 2006
Date of Patent: February 24, 2015
Assignee: The MathWorks, Inc.
Inventor: David A. Foti
-
Patent number: 8959123
Abstract: Generally described, the present invention is directed to a network operating system that provides more effective ways of leveraging the connectivity of computer networks. In one embodiment, an XML virtual machine is implemented that accepts high-level application code written in an XML programming language as input. Functionality is provided to interpret or translate the application code written in an XML programming language into code that is suitable for execution across computer platforms. Moreover, the XML virtual machine supports the Model View Controller (MVC) design paradigm that facilitates true data abstraction from applications to a common data model. Multi-instance applications, with each instance potentially supporting multiple views, may be created and executed.
Type: Grant
Filed: September 29, 2008
Date of Patent: February 17, 2015
Assignee: Xcerion Aktiebolag
Inventors: Daniel Arthursson, Marcus Bristav
-
Publication number: 20150046390
Abstract: In one embodiment, a method receives a document including metadata in a serial format and parses the document to determine components in the metadata, a nested structure of the components, and an order of the components in the data. The metadata is stored in a table format where columns of the table define the nested structure of the components and the order of the components in the data. A query is received for the data. The method determines components from the metadata stored in the table format from the columns that satisfy the query where the components are determined based on the nested structure of the determined components and the order of the determined components from information in the columns of the table. A result for the query is output including the determined components based on the nested structure of the determined components and the order of the determined components.
Type: Application
Filed: September 5, 2013
Publication date: February 12, 2015
Applicant: SAP AG
Inventors: Enping Tu, Yanlun Ma, Hung-Liang Yang, Yung-Yin Chen
-
Publication number: 20150046389
Abstract: Handling extract-transform-load (ETL) job mismatches as “exceptions.” Exception handling may include the following steps: (i) determining a mismatch while running an extract-transform-load job with the mismatch being a mismatch of at least one of the following types: design time information mismatch, and/or operational metadata mismatch; and (ii) responsive to determining the mismatch, handling the mismatch as an exception.
Type: Application
Filed: August 6, 2013
Publication date: February 12, 2015
Applicant: International Business Machines Corporation
Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Ajay Sood
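To make the idea of treating a mismatch as an exception concrete, here is a minimal sketch. It covers only a design-time column mismatch and is not drawn from the application itself: the exception class, the function names, and the choice to reject the offending row and continue are all assumptions for illustration.

```python
class EtlMismatchError(Exception):
    """Raised when a row disagrees with what the job design expects."""

    def __init__(self, kind, detail):
        super().__init__(f"{kind} mismatch: {detail}")
        self.kind = kind
        self.detail = detail


def check_columns(expected, actual):
    missing = sorted(set(expected) - set(actual))
    if missing:
        raise EtlMismatchError("design-time", f"missing columns {missing}")


def run_job(expected_columns, rows):
    """Load rows that match the design; set aside rows that raise a mismatch."""
    loaded, rejected = [], []
    for row in rows:
        try:
            check_columns(expected_columns, row.keys())
            loaded.append(row)
        except EtlMismatchError as exc:
            # The mismatch is handled like any other exception instead of
            # aborting the whole job.
            rejected.append((row, exc.kind))
    return loaded, rejected


loaded, rejected = run_job(["id", "name"], [{"id": 1, "name": "a"}, {"id": 2}])
```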
-
Patent number: 8954377
Abstract: A system, a method, and a computer readable storage medium for pre-processing data collected from one or more data sources to summarize the data more accurately. During the pre-processing, multiple raw data are summarized into a pre-processed datum. By using the pre-processed data entries, more accurate trend data may be generated. Alternatively, data entries are indexed and selectively retrieved based on indices. Decimation of data points is performed based on the indices without retrieving all the data sets from the database, reducing the data access time for returning a query result. Additional data sets may also be retrieved efficiently from the database using the indices.
Type: Grant
Filed: May 25, 2012
Date of Patent: February 10, 2015
Assignee: Moog Inc.
Inventors: Martin A. Turon, Rahul Kapur
-
Patent number: 8954375
Abstract: A method and system for developing data integration applications with reusable semantic types to represent and process application data. Methods include creating schemas to describe external data, creating semantic types to describe internal data, mapping schemas to semantic types, developing dataflows that configure input and output operations using schemas, mappings, and semantic types and all other transformation operations and functions based solely on semantic types, and executing dataflows defined in this manner.
Type: Grant
Filed: October 13, 2011
Date of Patent: February 10, 2015
Assignee: QlikTech International AB
Inventors: William V. Kehoe, Jerry L. Callen, Steven R. Frechette
-
Patent number: 8954378
Abstract: A name of one or more entity classes of the data model may be refined to conform to a naming convention. A semantic meaning of each of the names and one or more attributes of each entity class may be determined. It may be determined that the name of a first entity class is semantically similar to the name of a second entity class based on a semantic distance between the semantic meaning of the names, where a substantial similarity may be determined between the first entity class and the second entity class by comparing the semantic meaning of the one or more attributes of the first entity class to the semantic meaning of the one or more attributes of the second entity class. The data model may be normalized based on the substantial similarity.
Type: Grant
Filed: August 28, 2012
Date of Patent: February 10, 2015
Assignee: SAP AG
Inventors: Gunther Stuhec, Florian Gessner, Jens Lemcke
-
Patent number: 8954376
Abstract: Techniques are disclosed for identifying transcoding tables in an Extract-Transform-Load (ETL) process, by identifying, by operation of one or more computer processors, records passing through an operator configured to replace values in the records with values from at least one table linked to the operator before being sent to an output table, wherein the operator specifies an operation for extracting, transforming, or loading data stored in one or more source systems into storage by a target system, and evaluating at least a first table linked to the operator to determine whether the first table is a transcoding table by assigning a score to the first table, wherein the score is indicative of the likelihood that the first table is a transcoding table, wherein a transcoding table is used to harmonize values from a plurality of tables in the one or more source systems to a table in the target.
Type: Grant
Filed: March 8, 2012
Date of Patent: February 10, 2015
Assignee: International Business Machines Corporation
Inventors: Dan J. Mandelstein, Ivan M. Milman, Martin Oberhofer, Sushain Pandit
-
Patent number: 8949204
Abstract: Described herein are methods, systems, apparatuses and products for efficient development of a rule-based system. An aspect provides a method including accessing data records; converting said data records to an intermediate form; utilizing intermediate forms to compute similarity scores for said data records; and selecting as an example to be provided for rule making at least one record of said data records having a maximum dissimilarity score indicative of dissimilarity to already considered examples.
Type: Grant
Filed: August 29, 2012
Date of Patent: February 3, 2015
Assignee: International Business Machines Corporation
Inventors: Snigdha Chaturvedi, Tanveer Afzal Faruquie, L. Venkata Subramaniam
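The selection step in this abstract can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the intermediate form is taken to be a set of lowercase tokens and the similarity score is taken to be Jaccard similarity, neither of which is specified by the abstract, and the function names are hypothetical.

```python
from typing import FrozenSet, List, Optional


def to_intermediate(record: str) -> FrozenSet[str]:
    """Assumed intermediate form: the set of lowercase tokens in the record."""
    return frozenset(record.lower().split())


def jaccard(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Assumed similarity score: size of intersection over size of union."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def pick_next_example(records: List[str], considered: List[str]) -> Optional[str]:
    """Return the unconsidered record least similar to the considered examples."""
    seen = [to_intermediate(r) for r in considered]

    def dissimilarity(record: str) -> float:
        form = to_intermediate(record)
        if not seen:
            return 1.0
        # Distance to the closest already-considered example.
        return 1.0 - max(jaccard(form, s) for s in seen)

    candidates = [r for r in records if r not in considered]
    if not candidates:
        return None
    return max(candidates, key=dissimilarity)


records = [
    "12 Main Street Springfield",
    "14 Main Street Springfield",
    "PO Box 99 Shelbyville",
]
next_example = pick_next_example(records, considered=[records[0]])
```

Here `next_example` is the post office box record, because it shares no tokens with the example already considered, while the second street address differs from it in only one token.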
-
Patent number: 8949175
Abstract: A generic approach for automatically ingesting data into an HDFS (Hadoop Distributed File System) based data warehouse includes a datahub server, a generic pipelined data loading framework, and a meta-data model that, together, address data loading efficiency, data source heterogeneities, and data warehouse schema evolvement. The loading efficiency is achieved via the MapReduce scale-out solution. The meta-data model is comprised of configuration files and a catalog. The configuration file is set up per ingestion task. The catalog manages the data warehouse schema. When a scheduled data loading task is executed, the configuration files and the catalog collaboratively drive the datahub server to load the heterogeneous data to their destination schemas automatically.
Type: Grant
Filed: May 8, 2012
Date of Patent: February 3, 2015
Assignee: Turn Inc.
Inventors: Mingxi Wu, Songting Chen
-
Patent number: 8943222
Abstract: A system, method, and computer readable medium for employing data obtained from an electronic network, such as the Internet, as an emulation of sensor data. A virtual sensor bridge mechanism may be interfaced to a data source, such as the Internet, to receive Web data or remote sensor data. The virtual sensor bridge may convert the received data into a format usable by a logic controller, thereby emulating sensor data and enabling the logic controller to treat the networked data source as a sensor. Furthermore, the virtual sensor bridge may interface directly with one or more sensors and may serve as an adapter or data relay between a sensor and the logic controller.
Type: Grant
Filed: August 30, 2011
Date of Patent: January 27, 2015
Assignee: ioBridge, Inc.
Inventors: Peter Hans Scharler, Robert Stanley Mawrey, Jason Thomas Winters
-
Patent number: 8943472
Abstract: A system and method for reading and writing datasets (SAS datasets) written in a proprietary runtime macro language (SAS, owned by the SAS Institute, Inc.) directly, without going through a system that uses the SAS proprietary runtime macro language. The system employs a JAVA Database Connectivity (JDBC) application program interface (API) driver usable by JDBC enabled JAVA applications. The driver extracts from the SAS dataset information from a header page that enables identification of information on metadata pages which in turn are used by the driver to present to the JAVA application a view of the data as a database table, from which the JAVA application can request reading and writing of the SAS dataset.
Type: Grant
Filed: May 27, 2013
Date of Patent: January 27, 2015
Assignee: Dulles Research LLC
Inventor: Pavel Ganelin
-
Publication number: 20150026115
Abstract: A computer software implemented method for transforming a first extract transform load (ETL) job having at least some unload transform load (UTL) portions. The method includes the following steps: (i) decomposing the first ETL job into an intermediate set of one or more jobs; and (ii) for each job of the intermediate set, transforming the job into a transactionally equivalent job to yield a final set of one or more jobs. The decomposing is performed so that each job of the intermediate jobs set is a Simple UTL job. The transforming is performed so that each job of the final set includes no UTL portions.
Type: Application
Filed: July 19, 2013
Publication date: January 22, 2015
Inventors: Alexander Gorelik, Sriram K. Padmanabhan, James D. Spyker
-
Publication number: 20150026116
Abstract: Embodiments of the present invention relate to a method, program product, and system for managing the percentage of unpopulated cells in a multidimensional data structure during the servicing of multidimensional analytical queries. In an embodiment, a multidimensional data structure is stored in a memory location of a computing device, wherein the multidimensional data structure includes a dimension that is a subset of a plurality of dimensions included in a relational database. The computing device determines a frequency of utilization of a dimension to service a multidimensional query request. The computing device combines the dimension with the multidimensional data structure responsive to the frequency of utilization.
Type: Application
Filed: September 17, 2014
Publication date: January 22, 2015
Inventors: Vijay Bharti, Jeremiah Joseph, Hemant K. Sivaswamy, Kamalpreet Sapna, Rohit Jain
-
Publication number: 20150026117
Abstract: A data warehouse system comprising a warehouse database and a data warehouse manager communicatively coupled to the database. The warehouse manager is to store data to and retrieve data from the warehouse database and perform metadata reconciliation to update metadata of objects in the data warehouse to reflect a change in metadata of a first object in the warehouse by identifying a first set of objects impacted by the change to the metadata of the first object, identifying a second set of objects on which the first object depends, identifying at least one task that has to be completed to propagate the impact of the change in the metadata of the first object to objects in the first and second sets of objects, performing the task, and dynamically computing additional tasks based on whether or not definitions of objects in the first or second sets of objects change upon completion of the at least one task.
Type: Application
Filed: October 6, 2014
Publication date: January 22, 2015
Inventor: Winnie Tak Yu Wan
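The two object sets named in this abstract can be pictured as traversals of a dependency graph. The Python sketch below assumes that "impacted" means every object that depends on the changed object, directly or transitively; the abstract does not say this, and the object names, the `depends_on` mapping, and the helper functions are hypothetical.

```python
from collections import deque


def reachable(start, edges):
    """Return every node reachable from start by following edges (breadth-first)."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in edges.get(node, ()):
            if nxt != start and nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen


def invert(edges):
    """Turn 'A depends on B' edges into 'B is used by A' edges."""
    inverted = {}
    for source, targets in edges.items():
        for target in targets:
            inverted.setdefault(target, []).append(source)
    return inverted


# Hypothetical warehouse objects: each key depends on the objects in its list.
depends_on = {
    "report": ["sales_view"],
    "sales_view": ["sales_fact"],
    "sales_fact": ["stg_sales"],
}

changed = "sales_fact"
first_set = reachable(changed, invert(depends_on))  # objects impacted by the change
second_set = reachable(changed, depends_on)         # objects the changed one depends on
```

With this graph, `first_set` contains `sales_view` and `report`, and `second_set` contains `stg_sales`. The task computation and re-evaluation steps in the abstract would then operate over these two sets.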