Data Extraction, Transformation, and Loading (ETL) Patents (Class 707/602)
  • Patent number: 10114565
    Abstract: Methods and systems for managing, storing, and serving data within a virtualized environment are described. In some embodiments, a data management system may manage the extraction and storage of virtual machine snapshots, provide near instantaneous restoration of a virtual machine or one or more files located on the virtual machine, and enable secondary workloads to directly use the data management system as a primary storage target to read or modify past versions of data. The data management system may allow a virtual machine snapshot of a virtual machine stored within the system to be directly mounted to enable substantially instantaneous virtual machine recovery of the virtual machine.
    Type: Grant
    Filed: February 20, 2015
    Date of Patent: October 30, 2018
    Assignee: RUBRIK, INC.
    Inventors: Arvind Nithrakashyap, Kenny To, Jiangbin Luo, Sharvanath Pathak, Jonathan Derryberry
  • Patent number: 10108914
    Abstract: Morphing object types in enterprise content management systems is described. A text properties file is created that maps properties associated with an object type to properties associated with a new object type. A database associated with an enterprise content management system is queried to retrieve objects based on the object type. The objects are stored in a staging database. The objects are deleted from the database. Property tables are created in the database based on the new object type. The objects are migrated from the staging database to the newly created property tables based on the text properties file.
    Type: Grant
    Filed: May 31, 2017
    Date of Patent: October 23, 2018
    Assignee: Open Text Corporation
    Inventors: Michael T. Mohen, Christopher Dyde
  • Patent number: 10102274
    Abstract: A corpus of texts relating to a domain of knowledge may be searched by determining noun-pair proximity scores measuring associations between pairs of nouns that appear in the corpus and that are semantically related to the domain of knowledge. When a search term is received, the noun-pair proximity scores may be used (at least in part) to identify one or more related nouns that are strongly associated with the search term within the corpus. One or more texts may be selected from the corpus, texts in which the search term and the related nouns appear near each other in one or more places. The selected texts may be categorized and/or clustered based on the related nouns before being returned for presentation as search results.
    Type: Grant
    Filed: March 17, 2014
    Date of Patent: October 16, 2018
    Assignee: NLPCore LLC
    Inventor: Varun Mittal
  • Patent number: 10102253
    Abstract: Elements of a database object are stored among a plurality of different storage regions, each storage region being associated with a first value range indicating upper and lower range values for element values within that storage region. An index is generated and maintained for one or more second value ranges that include range values that comprise subsets of the first value ranges. A query is processed that includes a query value and one or more storage regions to be read is determined by using one of the index and the first value ranges based on the query value relative to a second value range.
    Type: Grant
    Filed: March 11, 2015
    Date of Patent: October 16, 2018
    Assignee: International Business Machines Corporation
    Inventor: Garth A. Dickie
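The range-based pruning described in the entry above can be sketched in a few lines. This is an illustrative sketch only, not the patented method: it covers just the per-region lower and upper values (the "first value ranges") and omits the secondary index over narrower ranges. The names `Region` and `regions_to_read` are hypothetical.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Region:
    """A storage region holding some element values (hypothetical type)."""
    name: str
    values: List[int]

    @property
    def low(self) -> int:
        # Lower range value for this region.
        return min(self.values)

    @property
    def high(self) -> int:
        # Upper range value for this region.
        return max(self.values)


def regions_to_read(regions: List[Region], query_value: int) -> List[Region]:
    """Keep only regions whose [low, high] range could contain query_value."""
    return [r for r in regions if r.low <= query_value <= r.high]


regions = [
    Region("r0", [1, 5, 9]),
    Region("r1", [10, 14, 19]),
    Region("r2", [8, 12, 30]),
]
hits = [r.name for r in regions_to_read(regions, 12)]
print(hits)
```

Regions whose range excludes the query value are skipped without reading their contents. A region that passes the range check may still not contain the value, so it has to be scanned.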
  • Patent number: 10095727
    Abstract: A data unification device for unifying data objects includes a retrieving entity, a first providing entity, a second providing entity, and an assembling entity. The retrieving entity is configured to retrieve unstructured data objects from a number of unstructured knowledge bases and structured data objects from a number of structured knowledge bases. The first providing entity is configured to provide a number of syntactic frames for each retrieved data object, wherein each syntactic frame includes a syntactic concept, a syntactic concept-instance, and a syntactic concept-instance relationship. The second providing entity is configured to provide a number of semantic frames for each retrieved data object, wherein each semantic frame includes a semantic concept, a semantic concept-instance, and a semantic concept-instance relationship. The assembling entity is configured to assemble, for each retrieved data object, the provided syntactic frames and the provided semantic frames to one semantic object.
    Type: Grant
    Filed: February 27, 2014
    Date of Patent: October 9, 2018
    Assignee: Siemens Aktiengesellschaft
    Inventor: Ulli Waltinger
  • Patent number: 10096064
    Abstract: The present invention provides software and a method and system of efficient source document data entry and data association. More particularly, the present invention relates to a software module which receives source documents and recognizes or extracts information from the documents or associated files for use in populating fields of related or derivative documents or screens to facilitate accurate transfer of data. The invention also allows for ease in confirming the accuracy of the extracted or imported data by comparison with the source document either directly by a person or through automated or semi-automated procedures.
    Type: Grant
    Filed: November 6, 2009
    Date of Patent: October 9, 2018
    Assignee: Thomson Reuters Global Resources Unlimited Company
    Inventors: Brian D. Vroom, Jonathan Baron, Fred Leicher
  • Patent number: 10091279
    Abstract: Scientific instruments can be network-enabled by adding a wireless communication capability to the computers associated with those scientific instruments. Through this wireless communication capability, the scientific data acquired by a scientific instrument and metadata about that scientific data can be wirelessly transferred from the instrument-associated computer to a data hub. By way of example, a wireless personal area network (PAN) can be established between the instrument-associated computer and the data hub. From the data hub, the scientific data can be further communicated to remote servers via another network connection. Furthermore, in another example embodiment, the wireless communication capability between the instrument-associated computer and the data hub can be leveraged as a conduit for passing commands from the data hub or other devices in communication with the data hub to the instrument-associated computer for controlling the operation of the scientific instrument.
    Type: Grant
    Filed: May 27, 2015
    Date of Patent: October 2, 2018
    Assignee: FLOWJO, LLC
    Inventor: Michael D. Stadnisky
  • Patent number: 10078667
    Abstract: Embodiments include method, computer program products and apparatuses for normalizing non-numeric features of files and corresponding apparatus. Aspects include segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens and comparing the tokens in the at least one pair of positive instances to obtain matching tokens. Aspects also include calculating weights of their matching the file, for the matching tokens, and storing the tokens and their weights in a token base.
    Type: Grant
    Filed: December 13, 2015
    Date of Patent: September 18, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Chang Sheng Li, Fan Jing Meng, Edith Helen Stern, Han Wang, Jing Min Xu, Lin Yang, Xuejun Zhuo
  • Patent number: 10078666
    Abstract: Embodiments include method, computer program products and apparatuses for normalizing non-numeric features of files and corresponding apparatus. Aspects include segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens and comparing the tokens in the at least one pair of positive instances to obtain matching tokens. Aspects also include calculating weights of their matching the file, for the matching tokens, and storing the tokens and their weights in a token base.
    Type: Grant
    Filed: November 5, 2015
    Date of Patent: September 18, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Chang Sheng Li, Fan Jing Meng, Edith Helen Stern, Han Wang, Jing Min Xu, Lin Yang, Xuejun Zhuo
  • Patent number: 10067993
    Abstract: Handling extract-transform-load (ETL) job mismatches as “exceptions.” Exception handling may include the following steps: (i) determining a mismatch while running an extract-transform-load job with the mismatch being a mismatch of at least one of the following types: design time information mismatch, and/or operational metadata mismatch; and (ii) responsive to determining the mismatch, handling the mismatch as an exception.
    Type: Grant
    Filed: August 11, 2016
    Date of Patent: September 4, 2018
    Assignee: International Business Machines Corporation
    Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Ajay Sood
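Treating a mismatch as an exception, as the entry above describes, might look like the following minimal sketch. It covers only a design-time style check (expected columns versus the columns actually present). `EtlMismatchError`, `check_design_time`, and `run_job` are hypothetical names for illustration and are not taken from the patent.

```python
class EtlMismatchError(Exception):
    """Raised when a row does not match what the job was designed for."""

    def __init__(self, kind, detail):
        super().__init__(f"{kind}: {detail}")
        self.kind = kind
        self.detail = detail


def check_design_time(expected_columns, actual_columns):
    """Raise if any column the job design expects is absent."""
    missing = sorted(set(expected_columns) - set(actual_columns))
    if missing:
        raise EtlMismatchError("design_time", f"missing columns {missing}")


def run_job(expected_columns, rows):
    """Load matching rows; route mismatching rows to a reject list."""
    loaded, rejected = [], []
    for row in rows:
        try:
            check_design_time(expected_columns, row.keys())
            loaded.append(row)
        except EtlMismatchError as exc:
            # The mismatch is handled as an exception instead of failing the job.
            rejected.append((row, exc.kind))
    return loaded, rejected


loaded, rejected = run_job(["id", "name"], [{"id": 1, "name": "a"}, {"id": 2}])
print(loaded)
print(rejected)
```

Routing bad rows to a reject list is one possible handling policy. Aborting the job or substituting defaults would fit in the same `except` branch.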
  • Patent number: 10067962
    Abstract: Although many techniques provide ways to answer queries very quickly by either avoiding joins and/or aggregation or avoiding expression computations in order to apply filtering predicates, these techniques have significant potential overhead for maintenance, and require a great deal of storage space. The present invention discloses the use of synopsis or zone tables to expressions to provide powerful filtering capabilities to base tables across joins, and aggregations on join results, thus allowing for highly effective region or stride filtering and ordering aggregate ranges to select the most likely candidate tuples for joining to produce top-N or bottom-N result sets for queries.
    Type: Grant
    Filed: June 30, 2017
    Date of Patent: September 4, 2018
    Assignee: International Business Machines Corporation
    Inventors: Ian R. Finlay, Abhinav Goyal, Robin D. Grosman, Calisto P. Zuzarte
  • Patent number: 10067926
    Abstract: An image processing system for identifying table captions may include a processor and a non-transitory computer-readable storage medium comprising one or more instructions configured to cause the processor to perform image processing of a scanned document to identify boxes of an electronic fillable form that corresponds to the scanned document. The system creates and stores a data structure that includes a box list. The system searches the box list to identify an empty box, uses an adjacency list to determine whether the empty box has adjacent boxes, and in response to determining that the empty box has one or more adjacent boxes, creates a table box list that includes the empty box and one or more other boxes from the table to which the empty box belongs. The system uses the adjacency list to identify a row caption and a column caption associated with the empty box.
    Type: Grant
    Filed: December 21, 2015
    Date of Patent: September 4, 2018
    Assignee: Xerox Corporation
    Inventors: Martin S. Maltz, Safwan R. Wshah
  • Patent number: 10067961
    Abstract: Although many techniques provide ways to answer queries very quickly by either avoiding joins and/or aggregation or avoiding expression computations in order to apply filtering predicates, these techniques have significant potential overhead for maintenance, and require a great deal of storage space. The present invention discloses the use of synopsis or zone tables to expressions to provide powerful filtering capabilities to base tables across joins, and aggregations on join results, thus allowing for highly effective region or stride filtering and ordering aggregate ranges to select the most likely candidate tuples for joining to produce top-N or bottom-N result sets for queries.
    Type: Grant
    Filed: September 29, 2015
    Date of Patent: September 4, 2018
    Assignee: International Business Machines Corporation
    Inventors: Ian R. Finlay, Abhinav Goyal, Robin D. Grosman, Calisto P. Zuzarte
  • Patent number: 10055206
    Abstract: A building management system (BMS) includes building equipment configured to provide raw data samples of a data point and a data collector configured to collect the raw data samples from the building equipment and generate a raw data timeseries including a plurality of the raw data samples. The BMS includes a timeseries service configured to generate one or more optimized data timeseries from the raw data timeseries and a timeseries database configured to store the raw data timeseries and the one or more optimized data timeseries. The BMS includes a dashboard layout generator configured to identify each timeseries associated with the data point and to generate a framework-agnostic dashboard layout description capable of being rendered by multiple different rendering frameworks to visualize each of the identified timeseries.
    Type: Grant
    Filed: June 14, 2016
    Date of Patent: August 21, 2018
    Assignee: Johnson Controls Technology Company
    Inventors: Youngchoon Park, Vijaya S. Chennupati, Gerald A. Asp, Peter A. Craig
  • Patent number: 10055114
    Abstract: A building energy management system includes building equipment, one or more data platform services, a timeseries database, and an energy management application. The building equipment operates to monitor and control a variable and provide raw data samples of a data point associated with the variable. The timeseries database stores a plurality of timeseries associated with the data point. The plurality of timeseries include a timeseries of the raw data samples and the one or more optimized data timeseries generated by the data platform services based on the raw data timeseries. The energy management application generates an ad hoc dashboard including a widget and associates the widget with the data point. The widget displays a graphical visualization of the plurality of timeseries associated with the data point and includes interactive user interface options for switching between the plurality of timeseries associated with the data point.
    Type: Grant
    Filed: January 17, 2017
    Date of Patent: August 21, 2018
    Assignee: Johnson Controls Technology Company
    Inventors: Barkha Shah, Vivek Narain, Vipul Devre, Youngchoon Park, Gerald A. Asp, Peter A. Craig, Vijaya S. Chennupati
  • Patent number: 10042903
    Abstract: A method for testing an extract, transform, load job (ETL job) is provided. The method may include receiving an uploaded ETL job. The method may also include executing a functionality test of the uploaded ETL job using a test bucket. The method may further include determining the executed functionality test produces a failure result. The method may also include creating at least one test sub-job corresponding to at least one stage within the uploaded ETL job. The method may further include generating a plurality of test data for each of the at least one created test sub-job. The method may also include executing the at least one created test sub-job using the plurality of generated test data. The method may further include creating a report based on a plurality of results associated with the at least one executed test sub-job.
    Type: Grant
    Filed: September 2, 2015
    Date of Patent: August 7, 2018
    Assignee: International Business Machines Corporation
    Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Sumit Kumar, Rajesh Phillips
  • Patent number: 10042936
    Abstract: Methods, systems, and apparatus include computer programs encoded on a computer-readable storage medium for determining keywords. A method includes determining keyword(s) from a set of event information. The set can include plural, similar events so as to enable an effective analysis of a group of events as opposed to an analysis of a single discrete event. The set of events can be formed by filtering a corpus of events by a category. The corpus of events can include, for each event, information such as a search query, a web page associated with the query or visited by a user device, a location associated with a user device, or a search result response. Top ranked terms can be determined based on term frequency. Keyword(s) can be determined for each top ranked term. The keyword(s) can be used to select content for delivery to user devices associated with the category.
    Type: Grant
    Filed: July 11, 2014
    Date of Patent: August 7, 2018
    Assignee: Google LLC
    Inventor: Yifang Liu
  • Patent number: 10037361
    Abstract: A dataset of a plurality of transactions is received, the plurality of transactions including at least a first item and a second item. An absence-based association rule correlating an absence of the first item with an inclusion of the second item within the plurality of transactions is calculated.
    Type: Grant
    Filed: July 7, 2015
    Date of Patent: July 31, 2018
    Assignee: SAP SE
    Inventor: Ran Bittmann
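As a rough illustration of the entry above, an absence-based rule of the form "first item absent, therefore second item present" can be scored with the usual support and confidence measures. The abstract does not say how the rule is calculated, so these measures are an assumption here, and `absence_rule` is a hypothetical helper.

```python
def absence_rule(transactions, absent_item, present_item):
    """Score the rule: absent_item NOT in transaction => present_item in it.

    Returns (support, confidence), using the standard association-rule
    definitions as an assumed scoring scheme.
    """
    without = [t for t in transactions if absent_item not in t]
    if not without:
        return 0.0, 0.0
    both = [t for t in without if present_item in t]
    support = len(both) / len(transactions)   # share of all transactions
    confidence = len(both) / len(without)     # share of those lacking the item
    return support, confidence


transactions = [
    {"coffee", "milk"},
    {"tea", "milk"},
    {"tea"},
    {"coffee"},
]
support, confidence = absence_rule(transactions, "coffee", "tea")
print(support, confidence)
```

In this toy dataset every transaction without coffee contains tea, so the rule holds with full confidence over half of the transactions.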
  • Patent number: 10019472
    Abstract: Systems and methods for querying a distributed dwarf cube are disclosed. A query for retrieving data from a distributed dwarf cube is received. The distributed dwarf cube is built from the data. The data comprises cube values. The distributed dwarf cube is built by processing the data to generate indexes for the data. The cube values in one or more dimensions are sorted based on a cardinality of the cube values. The data is partitioned into data blocks to build the distributed dwarf cube from each data block based upon the cardinality of the cube values. The distributed dwarf cube comprises one or more ranges defined for the cube values. The one or more ranges of the cube values are checked based upon the query. Using the cube values, a list is created. The list of the cube values is transmitted from the distributed dwarf cube corresponding to the query.
    Type: Grant
    Filed: August 14, 2014
    Date of Patent: July 10, 2018
    Assignee: INTELLICUS TECHNOLOGIES PVT. LTD.
    Inventors: Ankit Khandelwal, Kapil Ghodawat, Sajal Rastogi, Saurabh Gupta
  • Patent number: 9996562
    Abstract: A method of migrating data from one or more source databases to one or more target databases may include generating a pre-migration analysis for a plurality of objects stored in the one or more source databases, and generating a plurality of migration scripts that transfer the plurality of objects from the one or more source databases to the one or more target databases. The method may also include generating a migration plan that defines an execution order for the plurality of migration scripts, and migrating the plurality of objects from the one or more source databases to one or more target databases according to the migration plan. The method may further include validating the plurality of objects on the one or more target databases.
    Type: Grant
    Filed: July 9, 2013
    Date of Patent: June 12, 2018
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Antony Higginson, John Masterson, Sean Fitzpatrick, Peter Robertshaw, Elmar Spiegelberg, Stephan Buhne, Michael Weick, Nick Balch, Florin Popescu
  • Patent number: 9984173
    Abstract: A method, system, and computer program product for automated value analysis in legacy data are provided in the illustrative embodiments. A first number of occurrences of a first value in a first field is counted in the legacy data. A first identifier associated with the first field, the first value, and the first count are recorded as a first entry in a value data structure, the value data structure comprising a set of entries. The value data structure is analyzed to determine an aberration between the first count and a second count, wherein the second count corresponds to a second number of occurrences of a second value in the first field. The aberration at the first field of the legacy data is reported in a report.
    Type: Grant
    Filed: February 24, 2014
    Date of Patent: May 29, 2018
    Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: James E. Bostick, John M. Ganci, Jr., Arvind Singh, David S. Wenk
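The counting step in the entry above lends itself to a short sketch. The abstract does not define what counts as an "aberration", so the threshold below (a value occurring less than one tenth as often as the most common value in the same field) is purely an assumption, and `value_counts` and `rare_values` are hypothetical names.

```python
from collections import Counter


def value_counts(records, field):
    """Count occurrences of each value seen in one field of the data."""
    return Counter(r[field] for r in records if field in r)


def rare_values(counts, ratio=0.1):
    """Flag values whose count is below ratio * the most common count.

    The ratio-based rule is an assumed stand-in for aberration detection.
    """
    if not counts:
        return []
    top = max(counts.values())
    return sorted(v for v, c in counts.items() if c < top * ratio)


records = [{"state": "NY"}] * 20 + [{"state": "CA"}] * 15 + [{"state": "N.Y."}]
counts = value_counts(records, "state")
flagged = rare_values(counts)
print(counts)
print(flagged)
```

A value such as `N.Y.` that appears once next to twenty occurrences of `NY` is the kind of outlier a report on legacy data would surface for review.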
  • Patent number: 9984425
    Abstract: Disclosed are some examples of systems, apparatus, methods, and computer program products related to automatically causing a following relationship to be established in an enterprise social networking system between a user and one or more customer relationship management (CRM) records. In some implementations, a workflow is configurable at least in part by one or more selections defining one or more conditions. Responsive to determining that one or more properties associated with a first CRM record satisfies the one or more conditions, a user is caused to follow the first CRM record. The following enables or causes updates associated with the first CRM record to be shared in a feed of the enterprise social networking system, where the feed can be provided to the user's device. Information based on the following of the first CRM record by the user can be displayed at the device.
    Type: Grant
    Filed: December 20, 2016
    Date of Patent: May 29, 2018
    Assignee: salesforce.com, inc.
    Inventor: Christopher Anthony Kemp
  • Patent number: 9984101
    Abstract: Techniques supporting database migration of schemas enforced by applications without downtime are described. In examples, a method of migrating databases is described to comply with schemas enforced by one or more applications without downtime such that the data remains accessible to the one or more applications. A transformation is initiated, by a schema migration module implemented by one or more computing devices, of a first database in compliance with a first schema into a second database in compliance with a second schema. The first and second schemas have mechanisms for storage and retrieval of data that is modeled using at least one technique other than a relational model.
    Type: Grant
    Filed: February 19, 2015
    Date of Patent: May 29, 2018
    Assignee: ADOBE SYSTEMS INCORPORATED
    Inventor: Nils Meder
  • Patent number: 9977798
    Abstract: A method for migrating data between tables. The method includes receiving a Structured Query Language (SQL) statement, where the SQL statement identifies a source table and a destination table, and the source table contains source data divided into columns with a set of column names. A database management system identifies a set of instructions for manipulating the source data. The database management system generates an interim SQL statement, based on the SQL statement. The database management system executes the interim SQL statement, which copies the manipulated source data into a set of columns of the destination table.
    Type: Grant
    Filed: July 20, 2015
    Date of Patent: May 22, 2018
    Assignee: International Business Machines Corporation
    Inventors: Karla Bester, Allan T. Chandler, Mark A. Shewell, Stephen J. Yates
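One way to picture the interim statement in the entry above is an `INSERT ... SELECT` generated from a column mapping. The sketch below uses Python's standard `sqlite3` module with an in-memory database. The table names, the mapping format, and `build_interim_sql` are hypothetical, and the patent's actual statement syntax is not shown in the abstract.

```python
import sqlite3


def build_interim_sql(source, dest, column_map):
    """Build an INSERT ... SELECT from {dest_column: source_expression}.

    Identifiers are interpolated directly, so this is only suitable for
    trusted, hard-coded names as in this illustration.
    """
    dest_cols = ", ".join(column_map)
    exprs = ", ".join(column_map.values())
    return f"INSERT INTO {dest} ({dest_cols}) SELECT {exprs} FROM {source}"


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE src (first TEXT, last TEXT, cents INTEGER)")
conn.execute("CREATE TABLE dst (full_name TEXT, dollars REAL)")
conn.executemany(
    "INSERT INTO src VALUES (?, ?, ?)",
    [("Ada", "Lovelace", 1250), ("Alan", "Turing", 300)],
)

# Instructions for manipulating the source data, expressed as SQL expressions.
sql = build_interim_sql(
    "src",
    "dst",
    {"full_name": "first || ' ' || last", "dollars": "cents / 100.0"},
)
conn.execute(sql)
rows = conn.execute("SELECT full_name, dollars FROM dst ORDER BY full_name").fetchall()
print(sql)
print(rows)
```

Because the generated statement runs inside the database, the manipulated data is copied in a single pass without being pulled into the client.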
  • Patent number: 9971850
    Abstract: Described herein are hash table structures and systems and methods for creating and storing data in the hash table structures. Hash tables and input buffers associated with same are configured in several arrangements, including multi-level hash tables and nested multi-level hash tables. The hash table structures described herein are well suited for being configured in the memory blocks of field-programmable gate arrays.
    Type: Grant
    Filed: December 29, 2015
    Date of Patent: May 15, 2018
    Assignee: International Business Machines Corporation
    Inventor: Takanori Ueda
  • Patent number: 9965516
    Abstract: A method and apparatus for outer-joined and/or cross-joined table elimination for duplicate-insignificant queries is provided. A query block of a query specifies a join between a first table and a second table. The join is one of an outer join and a cross join. A first determination is made that the query block contains no references to the second table to be processed after the join. A second determination is made that the query block is duplicate-insignificant. In response to making the first determination and the second determination, the query block is transformed into a transformed query, where the second table is eliminated from a corresponding query block in the transformed query.
    Type: Grant
    Filed: May 21, 2014
    Date of Patent: May 8, 2018
    Assignee: Oracle International Corporation
    Inventors: Rafi Ahmed, Gautam Kumar Parai
  • Patent number: 9959193
    Abstract: According to an embodiment of the present invention, an artifact is received, and unstructured content of the artifact is parsed and analyzed to identify data for one or more of missing structured content of the artifact and inaccurate structured content of the artifact. The identified data is then added to the artifact. Embodiments of the present invention can be used, for example, to provide data for missing and inaccurate structured content in artifacts of Application Lifecycle Management (ALM) frameworks, and improve accuracy of structured information that is used to run queries and create reports.
    Type: Grant
    Filed: September 25, 2015
    Date of Patent: May 1, 2018
    Assignee: International Business Machines Corporation
    Inventors: Muhtar B. Akbulut, Mario A. Maldari, David D. Taieb
  • Patent number: 9959336
    Abstract: Compiling test cases. The method may include uploading at least one extract, transform, load job (ETL job) and a plurality of test cases. The method may also include identifying a plurality of design features associated with the at least one uploaded ETL job. The method may further include identifying at least one feature change within the plurality of identified design features. The method may also include determining a plurality of related test cases within the plurality of uploaded test cases corresponding to the at least one identified feature change. The method may further include creating a test bucket based on a grouping of the determined plurality of related test cases. The method may also include adding at least one of a plurality of corresponding ETL jobs and a plurality of corresponding ETL job stages associated with the at least one identified feature change to the created test bucket.
    Type: Grant
    Filed: September 22, 2016
    Date of Patent: May 1, 2018
    Assignee: International Business Machines Corporation
    Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Sumit Kumar, Rajesh Phillips
  • Patent number: 9952962
    Abstract: According to an embodiment of the present invention, an artifact is received, and unstructured content of the artifact is parsed and analyzed to identify data for one or more of missing structured content of the artifact and inaccurate structured content of the artifact. The identified data is then added to the artifact. Embodiments of the present invention can be used, for example, to provide data for missing and inaccurate structured content in artifacts of Application Lifecycle Management (ALM) frameworks, and improve accuracy of structured information that is used to run queries and create reports.
    Type: Grant
    Filed: March 26, 2015
    Date of Patent: April 24, 2018
    Assignee: International Business Machines Corporation
    Inventors: Muhtar B. Akbulut, Mario A. Maldari, David D. Taieb
  • Patent number: 9934258
    Abstract: Techniques for dynamically generating database views and class objects for allowing access to domain data stored in a repository are provided. A computer-implemented method, a system, and/or a machine-readable medium storing instructions executable by one or more processors may include generating a database view and a class object using metadata included in a domain model, and retrieving domain data from a repository using the database view and the class object. For example, a method may include obtaining a domain model from a repository, the domain model including metadata corresponding to a set of domain data stored in the repository, generating a database view of a subset of the set of domain data using the metadata, generating a class object for the subset using the metadata, generating mapping information, and retrieving the subset of domain data from the repository using the database view, the class object, and the mapping information.
    Type: Grant
    Filed: March 31, 2014
    Date of Patent: April 3, 2018
    Assignee: ORACLE INTERNATIONAL CORPORATION
    Inventors: Dennis Floyd Fuglsang, Joshua Jeffrey Spiegel
  • Patent number: 9922101
    Abstract: Example embodiments of the present invention relate to a method, a system, and a computer program product for federated service requests of a logical data store comprising a plurality of heterogeneous data stores. The method includes providing a data abstraction services layer as a control plane for federated service requests of a logical data store, comprising a plurality of heterogeneous data stores, by an application operating on a processing device, receiving a service request at the data abstraction services layer from the application operating on the processing device directed to the logical data store, and returning results of the service request from the data abstraction services layer to the application operating on the processing device.
    Type: Grant
    Filed: June 28, 2013
    Date of Patent: March 20, 2018
    Assignee: EMC Corporation
    Inventors: David Stephen Reiner, Nihar K. Nanda, John D. Hushon, Jr., David R. Brown, Jagannathan Ramnarayanan
  • Patent number: 9922103
    Abstract: According to one embodiment, a method of copying a dataset associated with a first extract, transform, and load (ETL) job in a first data integration system to a second data integration system comprises copying executable code associated with the first ETL job from the first to the second system. Operating system software, integration system software, and file system structure are substantially identical between the first and second systems. The method further comprises executing the second ETL job to read the dataset from the first data integration system and write the dataset to the second data integration system. The second ETL job is associated with configuration parameters specifying storage resources in the first system associated with the dataset and destination parameters specifying storage resources in the second system. The method further comprises copying metadata generated by the second ETL job from the first to the second data integration system.
    Type: Grant
    Filed: October 21, 2014
    Date of Patent: March 20, 2018
    Assignee: Bank of America Corporation
    Inventors: Jason E. Martens, Brad E. Romano, Sachin M. Nerurkar, Shashi Tanikella
  • Patent number: 9922072
    Abstract: Extract, transform, and load application (ETL) complexity management framework systems and methods are described herein. The present disclosure describes systems and methods that reduce the complexity in managing ETL flow and correcting errant data that is subsequently identified. One or more methods include defining an ETL job definition, defining a data asset definition, defining a data asset dependency definition, receiving an ETL flow to provide execution of one or more ETL flow steps, providing retrieval of data from a source data asset, applying a data control to the source asset data, and producing an ETL job registration, a data asset status, a latest asset available date, a data asset consumer identifier, and a target data asset based on at least one of the ETL job definition, the data asset definition, the data dependency definition, and the source asset data.
    Type: Grant
    Filed: April 21, 2016
    Date of Patent: March 20, 2018
    Assignee: United Services Automobile Association (USAA)
    Inventors: Larry W. Clark, Jason P. Hendry, Mark Steen
  • Patent number: 9910891
    Abstract: Staging data record changes from a faster storage medium to a slower storage medium using data query rewriting is provided. In response to receiving a data query corresponding to a particular data record, it is determined whether the data query is one of a transactional data query or an analytical data query. In response to determining that the data query is a transactional data query, the transactional data query is rewritten to apply transactional delta changes to the particular data record on a storage-class memory of a computer. In response to determining that the data query is an analytical data query, the analytical data query is rewritten to select and reconcile each data record corresponding to the particular data record stored on the storage-class memory with the particular data record stored on a persistent data storage device of the computer.
    Type: Grant
    Filed: September 25, 2013
    Date of Patent: March 6, 2018
    Assignee: International Business Machines Corporation
    Inventors: Bishwaranjan Bhattacharjee, Mustafa Canim, Mohammad S. Hamedani, Kenneth A. Ross
  • Patent number: 9904706
    Abstract: Staging data record changes from a faster storage medium to a slower storage medium using data query rewriting is provided. In response to receiving a data query corresponding to a particular data record, it is determined whether the data query is one of a transactional data query or an analytical data query. In response to determining that the data query is a transactional data query, the transactional data query is rewritten to apply transactional delta changes to the particular data record on a storage-class memory of a computer. In response to determining that the data query is an analytical data query, the analytical data query is rewritten to select and reconcile each data record corresponding to the particular data record stored on the storage-class memory with the particular data record stored on a persistent data storage device of the computer.
    Type: Grant
    Filed: September 6, 2013
    Date of Patent: February 27, 2018
    Assignee: International Business Machines Corporation
    Inventors: Bishwaranjan Bhattacharjee, Mustafa Canim, Mohammad S. Hamedani, Kenneth A. Ross
  • Patent number: 9898522
    Abstract: Techniques are described for managing aggregation of data in a distributed manner, such as for a particular client based on specified configuration information. The described techniques may include storing aggregated data values for an OLAP cube or other data structure in a distributed manner, such as in some situations in a distributed hash table. The aggregated data values to be stored may be generated in various manners, such as by performing multi-stage data manipulation operations—for example, a map-reduce architecture may be used, with a first stage involving the use of one or more specified map functions to be performed, and with at least a second stage involving the use of one or more specified reduce functions to be performed.
    Type: Grant
    Filed: December 16, 2014
    Date of Patent: February 20, 2018
    Assignee: AMAZON TECHNOLOGIES, INC.
    Inventors: Richard J. Cole, Alan D. Mock
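    Illustrative sketch: The two-stage aggregation described in this abstract can be pictured roughly as follows. This is not taken from the patent. The record layout, the function names (`map_stage`, `reduce_stage`, `emit_cells`), and the plain dictionary standing in for a distributed hash table are all assumptions made for illustration.

```python
from collections import defaultdict
from functools import reduce


def map_stage(records, map_fn):
    """First stage: apply map_fn to each record and collect values per key."""
    grouped = defaultdict(list)
    for record in records:
        for key, value in map_fn(record):
            grouped[key].append(value)
    return grouped


def reduce_stage(grouped, reduce_fn):
    """Second stage: fold each key's values into one aggregated value."""
    return {key: reduce(reduce_fn, values) for key, values in grouped.items()}


def emit_cells(record):
    """Hypothetical map function: emit each sale under two cube cells."""
    yield (record["region"], record["year"]), record["amount"]
    yield (record["region"], "ALL"), record["amount"]


# Hypothetical input records.
records = [
    {"region": "EU", "year": 2014, "amount": 10},
    {"region": "EU", "year": 2015, "amount": 5},
    {"region": "US", "year": 2014, "amount": 7},
]

# A plain dict stands in for the distributed hash table (assumption).
cube = reduce_stage(map_stage(records, emit_cells), lambda a, b: a + b)
```

    In a distributed setting each key would presumably be hashed to a node that owns it; the sketch keeps everything in one process to show only the map and reduce stages.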
  • Patent number: 9892179
    Abstract: A method is provided for managing queries in business intelligence platforms. The method includes receiving, by a processor, a first query requesting data from at least one business intelligence content, the first query having a first format. The method further includes determining, by the processor, at least one requirement to complete the first query. The method further includes converting, by the processor, the first query to a second query configured to be used by an Extract, Transform, and Load (ETL) program, in response to determining that the at least one requirement exceeds a threshold, the second query having a second format.
    Type: Grant
    Filed: May 21, 2014
    Date of Patent: February 13, 2018
    Assignee: International Business Machines Corporation
    Inventors: Arun Balasubramanyan, Yuntao Liu, Pamela K. Prinz, Mary E. Rudden, John A. Tobasco
  • Patent number: 9886474
    Abstract: A multidimensional object, such as a language-integrated query (LINQ) object, can be generated based upon a grouping set operator and zero or more key selectors. The object can be utilized to query over a non-relational data source. The object can include a sequence of functions corresponding to each grouping related to the grouping set operator in order to provide multidimensional grouping representation. Furthermore, the object includes nested structures for optimized execution of the grouping set operator.
    Type: Grant
    Filed: November 22, 2011
    Date of Patent: February 6, 2018
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bart De Smet, Henricus Johannes Maria Meijer, John Wesley Dyer
  • Patent number: 9836505
    Abstract: A system and method for processing a database query is described. The method can, in response to detection that a database query involves a star or snowflake join operation, determine a selectivity ratio for each of a plurality of dimension tables. The selectivity ratio having a lower value can correspond to a more restrictive dimension table. Thereafter, a table ordering can be created beginning with a fact table and continuing with each of the dimension tables in ascending order of their corresponding selectivity ratios. Then a query plan involving join operations between successive tables in the table ordering can be created.
    Type: Grant
    Filed: March 13, 2014
    Date of Patent: December 5, 2017
    Assignee: SYBASE, INC.
    Inventors: Xun Cheng, Peter Schneider
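    Illustrative sketch: The table ordering described in this abstract could look roughly like the code below. It is a sketch under stated assumptions, not the patented method. The selectivity ratio is assumed to be the fraction of a dimension table's rows that survive its filter, the plan is assumed to be left-deep, and all table names and row counts are hypothetical.

```python
def selectivity_ratio(rows_after_filter, total_rows):
    """Assumed definition: fraction of dimension rows that pass the predicates."""
    return rows_after_filter / total_rows


def order_tables(fact_table, dimension_stats):
    """Fact table first, then dimensions in ascending selectivity ratio."""
    ranked = sorted(
        dimension_stats,
        key=lambda name: selectivity_ratio(*dimension_stats[name]),
    )
    return [fact_table] + ranked


def join_plan(ordering):
    """Left-deep plan (assumption): join the running result with the next table."""
    plan = ordering[0]
    for table in ordering[1:]:
        plan = ("JOIN", plan, table)
    return plan


# Hypothetical statistics: name -> (rows after filter, total rows).
dimension_stats = {
    "date": (30, 3650),
    "store": (50, 100),
    "product": (10, 10000),
}

ordering = order_tables("sales", dimension_stats)
plan = join_plan(ordering)
```

    A lower ratio means a more restrictive dimension, so joining it earlier shrinks the intermediate result sooner.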
  • Patent number: 9836530
    Abstract: In one example of the disclosure, data indicative of a word or phrase communicated during a meeting including a plurality of participants is obtained. For each participant, records electronically accessible to the participant are identified, and each record is associated with a tier from a hierarchy of record-relevancy tiers. A set of explanations for the communication and associated scores is identified, including for each participant, beginning with a most relevant tier, searching the records accessible to the participant tier by tier until an explanation is identified, and assigning a score to the explanation according to the tier associated with the record in which the explanation is found. A preferred explanation for the communication is determined based upon the scores, and a display of the preferred explanation is caused.
    Type: Grant
    Filed: December 16, 2013
    Date of Patent: December 5, 2017
    Assignee: ENTIT SOFTWARE LLC
    Inventors: Tomer Priel, Ira Cohen, Eli Mordechai
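    Illustrative sketch: The tier-by-tier search described in this abstract might be pictured as follows. None of this comes from the patent. Records are modeled as small glossaries (dictionaries from term to explanation), the score `1 / (tier index + 1)` is an assumed scoring rule, and summing scores across participants is an assumed way of choosing the preferred explanation.

```python
from collections import defaultdict


def find_explanation(term, tiers):
    """Search one participant's tiers, most relevant first.

    Returns (explanation, tier_index) for the first hit, or None.
    """
    for tier_index, records in enumerate(tiers):
        for record in records:
            if term in record:
                return record[term], tier_index
    return None


def preferred_explanation(term, participants):
    """Score each found explanation by tier and return the highest scorer."""
    scores = defaultdict(float)
    for tiers in participants.values():
        found = find_explanation(term, tiers)
        if found:
            explanation, tier_index = found
            scores[explanation] += 1.0 / (tier_index + 1)  # assumed scoring rule
    return max(scores, key=scores.get) if scores else None


# Hypothetical participants, each with two relevancy tiers of records.
participants = {
    "alice": [[{"ETL": "extract, transform, load"}], []],
    "bob": [[], [{"ETL": "estimated time of landing"}]],
    "carol": [[{"KPI": "key performance indicator"}],
              [{"ETL": "extract, transform, load"}]],
}

best = preferred_explanation("ETL", participants)
```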
  • Patent number: 9810539
    Abstract: A method is provided for partitioning probe data into a plurality of subsets, map matching a portion of the subsets and establishing a route based on the map-matched subsets of probe data. A method is provided that includes: receiving probe data points from a plurality of probes, where an uncertainty of the measurement of the probe data is a first distance; partitioning the probe data from the plurality of probes into subsets of probe data, where the subsets are created by ensuring each subset has probe data points that are at least two times the first distance apart from one another; performing map-matching on each subset of probe data; and establishing a route for each subset based on the map-matching of each subset of probe data. Establishing a route for each subset may include determining a route according to at least one of the shortest, fastest, or simplest routes.
    Type: Grant
    Filed: March 16, 2016
    Date of Patent: November 7, 2017
    Assignee: HERE Global B.V.
    Inventor: Dongwook Jang
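    Illustrative sketch: One simple way to form subsets whose points are at least two times the measurement uncertainty apart is a greedy pass, sketched below. The greedy strategy, the planar (x, y) coordinates, and the function name `partition_probes` are assumptions; the abstract states only the spacing constraint, not how the subsets are built.

```python
import math


def partition_probes(points, uncertainty):
    """Greedily place each point in the first subset where it is at least
    2 * uncertainty away from every point already in that subset."""
    min_gap = 2 * uncertainty
    subsets = []
    for point in points:
        for subset in subsets:
            if all(math.dist(point, other) >= min_gap for other in subset):
                subset.append(point)
                break
        else:
            # No existing subset can take this point; start a new one.
            subsets.append([point])
    return subsets


# Hypothetical probe points in metres, with a 10 m measurement uncertainty.
points = [(0, 0), (5, 0), (25, 0), (30, 0)]
subsets = partition_probes(points, uncertainty=10)
```

    Each resulting subset could then be map-matched and routed independently, as the abstract describes.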
  • Patent number: 9800484
    Abstract: An approach for managing and optimizing available resources utilized in a networked computing environment (e.g., a cloud computing environment) is provided. In one aspect, a plurality of resources in a plurality of storage pools within the networked computing environment are monitored. Each storage pool is running at least one application workload using a corresponding subset of the plurality of resources. The storage pools are ranked based on the application workloads. In response to an over-utilized storage pool and/or an under-utilized storage pool, the plurality of resources are dynamically reconfigured.
    Type: Grant
    Filed: March 10, 2014
    Date of Patent: October 24, 2017
    Assignee: International Business Machines Corporation
    Inventors: Sandip Agarwala, Venkateswarlu Basyam, Kavita Chavda, Dean Hildebrand, Ramani R. Routray
  • Patent number: 9792150
    Abstract: Various systems and methods for detecting a site change. For example, one method involves detecting migration of a virtual machine. The method also involves determining whether the virtual machine moved from one site to another site during the migration. The method also involves updating location information, where the location information is updated in response to determining whether the virtual machine moved from one site to another.
    Type: Grant
    Filed: June 28, 2013
    Date of Patent: October 17, 2017
    Assignee: Veritas Technologies LLC
    Inventors: Amit Haridas Rangari, Udipta Das
  • Patent number: 9779154
    Abstract: A synchronization system for transform databases and a method thereof are provided. The system includes a database transformer and a data adapter. The database transformer is used to transform tables from a source database to a target database. The data adapter includes a statement parsing unit, a storage unit and a synchronization control unit. The statement parsing unit receives a query statement and parses a query table associated with the query statement. The storage unit stores the query statement temporarily. The synchronization control unit determines a query mechanism and receives a transformation state about the query table from the database transformer. Based on the query mechanism and the transformation state, the synchronization control unit performs the query statement in the source database, the target database, or both of them.
    Type: Grant
    Filed: March 20, 2015
    Date of Patent: October 3, 2017
    Assignee: National Tsing Hua University
    Inventors: Ying-Ti Liao, Jia-Zheng Zhou, Yeh-Ching Chung
  • Patent number: 9773003
    Abstract: An analytical database server and a method for enabling investigative data analytics have been disclosed. The database server comprises a data loader that receives data from a plurality of data sources, and loads the received source data tables onto a Hadoop Distributed File System (HDFS). A processor processes the source data tables loaded onto the HDFS and assigns a unique turf row (TR) identifier to each of the records present in the source data tables. The source data tables and the corresponding metadata are organized into a first data table, a second data table and a third data table. The first table comprises a record for each of the received source data tables. The second data table stores, in the form of an inverted list, the data originally contained in the received source data tables. The third data table stores the unidirectional relationships between the source data tables.
    Type: Grant
    Filed: February 2, 2015
    Date of Patent: September 26, 2017
    Assignee: XURMO TECHNOLOGIES PVT. LTD.
    Inventors: Sridhar Gopalakrishnan, Davis Verghese, Saurabh Santosh
  • Patent number: 9767103
    Abstract: A method and system electronically pull, using an extension of a launched program, accounting data from a launched accounting software application. The pulled data is converted, by the extension, into a format compatible with a second accounting software application. The converted data is written into the launched application associated with the extension. The written data is converted, by another extension, into a format compatible for importing into another accounting software application. The format compatible data is pushed into the second accounting software application by the extension.
    Type: Grant
    Filed: November 3, 2014
    Date of Patent: September 19, 2017
    Inventor: Craig Hurlbut
  • Patent number: 9740831
    Abstract: A clinical data management system (1) has databases (20), processors in servers (2-4) which are programmed to process clinical data and communicate with user interfaces and external systems interfaces, and at least one database. The system imports source data from disparate clinical site sources into staging databases at refresh intervals, maintains data models, and maps data from the staging databases into the data models, and feeds data from the data models into data delivery databases. There is a uniform refresh frequency for the staging databases. The system output is regularly updated data for clinical site performance, quality and risk metrics to a clinical study team. The data mapper servers identify each of a plurality of source data stages, and transform data from each stage to one or more data models according to one or more mapsets, each mapset defining a transformation.
    Type: Grant
    Filed: April 2, 2013
    Date of Patent: August 22, 2017
    Assignee: ICON CLINICAL RESEARCH LIMITED
    Inventors: Colin Thomas Burns, Gareth Allan Milborrow, Paul Crean, Michael Grossman
  • Patent number: 9740718
    Abstract: Methods, computer systems, and stored instructions are described herein for densely grouping dimensional data and/or aggregating data using a data structure, such as one that is constructed based on dimensional data. When smaller tables are joined with a larger table, a server may analyze the smaller tables first to determine actual value combinations that occur in the smaller tables, and these actual value combinations are used to more efficiently process the larger table. A dense data structure may be generated by processing dimensional data before processing data from the fact table. The dense data structure may be generated by compressing ranges of values that are possible in dimensions into a range of values that actually occurs in the dimensions. The compressed range of values may be represented by dense set identifiers rather than the actual compressed range of values.
    Type: Grant
    Filed: September 20, 2013
    Date of Patent: August 22, 2017
    Assignee: Oracle International Corporation
    Inventors: Albert A. Hopeman, IV, Martin Roth, Ekrem S. C. Soylemez
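    Illustrative sketch: The idea of scanning the dimension first and replacing its actual value combinations with dense identifiers can be pictured as below. This is a simplified sketch, not the patented implementation. The table layouts, column names, and both function names are hypothetical.

```python
def build_dense_ids(dimension_rows, key_column, group_columns):
    """Scan the small dimension table first and give each grouping-value
    combination that actually occurs a dense id (0, 1, 2, ...)."""
    combo_to_id = {}
    key_to_id = {}
    for row in dimension_rows:
        combo = tuple(row[c] for c in group_columns)
        dense_id = combo_to_id.setdefault(combo, len(combo_to_id))
        key_to_id[row[key_column]] = dense_id
    return combo_to_id, key_to_id


def aggregate_fact(fact_rows, foreign_key, measure, key_to_id, n_groups):
    """Sum the measure into a compact array indexed by dense id."""
    totals = [0] * n_groups
    for row in fact_rows:
        dense_id = key_to_id.get(row[foreign_key])
        if dense_id is not None:  # skip fact rows with no matching dimension row
            totals[dense_id] += row[measure]
    return totals


# Hypothetical dimension and fact tables.
stores = [
    {"store_id": 101, "country": "DE"},
    {"store_id": 205, "country": "FR"},
    {"store_id": 990, "country": "DE"},
]
sales = [
    {"store_id": 101, "amount": 4},
    {"store_id": 990, "amount": 6},
    {"store_id": 205, "amount": 1},
    {"store_id": 777, "amount": 9},
]

combo_to_id, key_to_id = build_dense_ids(stores, "store_id", ["country"])
totals = aggregate_fact(sales, "store_id", "amount", key_to_id, len(combo_to_id))
```

    The sparse key range 101 to 990 collapses to two dense identifiers, so the fact-table pass needs only a two-element array.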
  • Patent number: 9740724
    Abstract: Systematically enforcing generation of partition key values for records in a data management system such that for any given partitioned table, all records relating to a common business object share a common partitioning key. Partitioning and/or clustering of these records can then be done based on these specially generated partitioning key values, with entities dependent on the same first-class entity being placed in the same partition and/or cluster. This may provide considerable performance gains in at least some circumstances.
    Type: Grant
    Filed: April 29, 2014
    Date of Patent: August 22, 2017
    Assignee: International Business Machines Corporation
    Inventors: Yongli An, Mohammad Khatibi
  • Patent number: 9734236
    Abstract: A method and system for leveraging content are provided. The method includes receiving data associated with a subscriber and registering the subscriber with an ECM computing system. Devices belonging to the subscriber are connected to the ECM computing system and metadata associated with content retrieved from the devices is generated. The content in the devices is classified into formal content and informal content. Multiple searches for additional content are monitored and multifaceted search results associated with the formal content and the informal content are generated and presented to the subscriber. The subscriber has an option to request informal content on additional end user devices from respective end users based on metadata presented by search results.
    Type: Grant
    Filed: June 8, 2016
    Date of Patent: August 15, 2017
    Assignee: International Business Machines Corporation
    Inventor: Rajesh Radhakrishnan