Data Extraction, Transformation, and Loading (ETL) Patents (Class 707/602)
-
Patent number: 10114565
Abstract: Methods and systems for managing, storing, and serving data within a virtualized environment are described. In some embodiments, a data management system may manage the extraction and storage of virtual machine snapshots, provide near instantaneous restoration of a virtual machine or one or more files located on the virtual machine, and enable secondary workloads to directly use the data management system as a primary storage target to read or modify past versions of data. The data management system may allow a virtual machine snapshot of a virtual machine stored within the system to be directly mounted to enable substantially instantaneous virtual machine recovery of the virtual machine.
Type: Grant
Filed: February 20, 2015
Date of Patent: October 30, 2018
Assignee: RUBRIK, INC.
Inventors: Arvind Nithrakashyap, Kenny To, Jiangbin Luo, Sharvanath Pathak, Jonathan Derryberry
-
Patent number: 10108914
Abstract: Morphing object types in enterprise content management systems is described. A text properties file is created that maps properties associated with an object type to properties associated with a new object type. A database associated with an enterprise content management system is queried to retrieve objects based on the object type. The objects are stored in a staging database. The objects are deleted from the database. Property tables are created in the database based on the new object type. The objects are migrated from the staging database to the newly created property tables based on the text properties file.
Type: Grant
Filed: May 31, 2017
Date of Patent: October 23, 2018
Assignee: Open Text Corporation
Inventors: Michael T. Mohen, Christopher Dyde
-
Patent number: 10102274
Abstract: A corpus of texts relating to a domain of knowledge may be searched by determining noun-pair proximity scores measuring associations between pairs of nouns that appear in the corpus and that are semantically related to the domain of knowledge. When a search term is received, the noun-pair proximity scores may be used (at least in part) to identify one or more related nouns that are strongly associated with the search term within the corpus. One or more texts may be selected from the corpus, texts in which the search term and the related nouns appear near each other in one or more places. The selected texts may be categorized and/or clustered based on the related nouns before being returned for presentation as search results.
Type: Grant
Filed: March 17, 2014
Date of Patent: October 16, 2018
Assignee: NLPCore LLC
Inventor: Varun Mittal
-
Patent number: 10102253
Abstract: Elements of a database object are stored among a plurality of different storage regions, each storage region being associated with a first value range indicating upper and lower range values for element values within that storage region. An index is generated and maintained for one or more second value ranges that include range values that comprise subsets of the first value ranges. A query is processed that includes a query value and one or more storage regions to be read is determined by using one of the index and the first value ranges based on the query value relative to a second value range.
Type: Grant
Filed: March 11, 2015
Date of Patent: October 16, 2018
Assignee: International Business Machines Corporation
Inventor: Garth A. Dickie
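The range-based region selection in this abstract can be illustrated with a minimal sketch. It models only the per-region lower and upper range values (the "first value range") and skips the finer-grained index over second value ranges. The names `Region` and `regions_to_read` are hypothetical and do not come from the patent.

```python
from dataclasses import dataclass


@dataclass
class Region:
    """A storage region holding element values (hypothetical structure)."""
    name: str
    values: list

    @property
    def lo(self):
        # Lower range value for the elements stored in this region.
        return min(self.values)

    @property
    def hi(self):
        # Upper range value for the elements stored in this region.
        return max(self.values)


def regions_to_read(regions, query_value):
    """Return names of regions whose value range could contain query_value."""
    return [r.name for r in regions if r.lo <= query_value <= r.hi]


regions = [
    Region("r0", [1, 5, 9]),
    Region("r1", [8, 12, 20]),
    Region("r2", [30, 41]),
]
```

Regions whose range excludes the query value are never read, which is the benefit of keeping range metadata per region. Overlapping ranges (as with `r0` and `r1` above) still require reading both regions.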
-
Patent number: 10095727
Abstract: A data unification device for unifying data objects includes a retrieving entity, a first providing entity, a second providing entity, and an assembling entity. The retrieving entity is configured to retrieve unstructured data objects from a number of unstructured knowledge bases and structured data objects from a number of structured knowledge bases. The first providing entity is configured to provide a number of syntactic frames for each retrieved data object, wherein each syntactic frame includes a syntactic concept, a syntactic concept-instance, and a syntactic concept-instance relationship. The second providing entity is configured to provide a number of semantic frames for each retrieved data object, wherein each semantic frame includes a semantic concept, a semantic concept-instance, and a semantic concept-instance relationship. The assembling entity is configured to assemble, for each retrieved data object, the provided syntactic frames and the provided semantic frames to one semantic object.
Type: Grant
Filed: February 27, 2014
Date of Patent: October 9, 2018
Assignee: Siemens Aktiengesellschaft
Inventor: Ulli Waltinger
-
Patent number: 10096064
Abstract: The present invention provides software and a method and system of efficient source document data entry and data association. More particularly, the present invention relates to a software module which receives source documents and recognizes or extracts information from the documents or associated files for use in populating fields of related or derivative documents or screens to facilitate accurate transfer of data. The invention also allows for ease in confirming the accuracy of the extracted or imported data by comparison with the source document either directly by a person or through automated or semi-automated procedures.
Type: Grant
Filed: November 6, 2009
Date of Patent: October 9, 2018
Assignee: Thomson Reuters Global Resources Unlimited Company
Inventors: Brian D. Vroom, Jonathan Baron, Fred Leicher
-
Patent number: 10091279
Abstract: Scientific instruments can be network-enabled by adding a wireless communication capability to the computers associated with those scientific instruments. Through this wireless communication capability, the scientific data acquired by a scientific instrument and metadata about that scientific data can be wirelessly transferred from the instrument-associated computer to a data hub. By way of example, a wireless personal area network (PAN) can be established between the instrument-associated computer and the data hub. From the data hub, the scientific data can be further communicated to remote servers via another network connection. Furthermore, in another example embodiment, the wireless communication capability between the instrument-associated computer and the data hub can be leveraged as a conduit for passing commands from the data hub or other devices in communication with the data hub to the instrument-associated computer for controlling the operation of the scientific instrument.
Type: Grant
Filed: May 27, 2015
Date of Patent: October 2, 2018
Assignee: FLOWJO, LLC
Inventor: Michael D. Stadnisky
-
Patent number: 10078667
Abstract: Embodiments include method, computer program products and apparatuses for normalizing non-numeric features of files and corresponding apparatus. Aspects include segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens and comparing the tokens in the at least one pair of positive instances to obtain matching tokens. Aspects also include calculating weights of their matching the file, for the matching tokens, and storing the tokens and their weights in a token base.
Type: Grant
Filed: December 13, 2015
Date of Patent: September 18, 2018
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Chang Sheng Li, Fan Jing Meng, Edith Helen Stern, Han Wang, Jing Min Xu, Lin Yang, Xuejun Zhuo
-
Patent number: 10078666
Abstract: Embodiments include method, computer program products and apparatuses for normalizing non-numeric features of files and corresponding apparatus. Aspects include segmenting at least one pair of positive instances of a non-numeric feature of a file into a number of tokens and comparing the tokens in the at least one pair of positive instances to obtain matching tokens. Aspects also include calculating weights of their matching the file, for the matching tokens, and storing the tokens and their weights in a token base.
Type: Grant
Filed: November 5, 2015
Date of Patent: September 18, 2018
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: Chang Sheng Li, Fan Jing Meng, Edith Helen Stern, Han Wang, Jing Min Xu, Lin Yang, Xuejun Zhuo
-
Patent number: 10067993
Abstract: Handling extract-transform-load (ETL) job mismatches as “exceptions.” Exception handling may include the following steps: (i) determining a mismatch while running an extract-transform-load job with the mismatch being a mismatch of at least one of the following types: design time information mismatch, and/or operational metadata mismatch; and (ii) responsive to determining the mismatch, handling the mismatch as an exception.
Type: Grant
Filed: August 11, 2016
Date of Patent: September 4, 2018
Assignee: International Business Machines Corporation
Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Ajay Sood
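As a rough illustration of treating an ETL mismatch as an exception, the sketch below raises a typed error when a row lacks a column the job design expects, and the job runner handles that error instead of crashing. All names here (`EtlMismatchError`, `check_design_time`, `run_job`) are hypothetical, and only the design-time case is modeled. The patent does not specify this implementation.

```python
class EtlMismatchError(Exception):
    """Raised when runtime data disagrees with what the job design expects."""

    def __init__(self, kind, detail):
        super().__init__(f"{kind}: {detail}")
        self.kind = kind
        self.detail = detail


def check_design_time(expected_columns, actual_columns):
    # Design-time information mismatch: a column the job expects is absent.
    missing = sorted(set(expected_columns) - set(actual_columns))
    if missing:
        raise EtlMismatchError("design_time", f"missing columns {missing}")


def run_job(expected_columns, rows):
    """Run a toy job; a mismatch is caught and reported as an exception."""
    try:
        for row in rows:
            check_design_time(expected_columns, row.keys())
        return {"status": "ok", "rows": len(rows)}
    except EtlMismatchError as err:
        return {"status": "failed", "kind": err.kind, "detail": err.detail}
```

Calling `run_job(["id", "name"], [{"id": 1}])` returns a failure report whose `kind` is `"design_time"`, while a well-formed input returns a status of `"ok"`.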
-
Patent number: 10067962
Abstract: Although many techniques provide ways to answer queries very quickly by either avoiding joins and/or aggregation or avoiding expression computations in order to apply filtering predicates, these techniques have significant potential overhead for maintenance, and require a great deal of storage space. The present invention discloses the use of synopsis or zone tables to expressions to provide powerful filtering capabilities to base tables across joins, and aggregations on join results, thus allowing for highly effective region or stride filtering and ordering aggregate ranges to select the most likely candidate tuples for joining to produce top-N or bottom-N result sets for queries.
Type: Grant
Filed: June 30, 2017
Date of Patent: September 4, 2018
Assignee: International Business Machines Corporation
Inventors: Ian R. Finlay, Abhinav Goyal, Robin D. Grosman, Calisto P. Zuzarte
-
Patent number: 10067926
Abstract: An image processing system for identifying table captions may include a processor and a non-transitory computer-readable storage medium comprising one or more instructions configured to cause the processor to perform image processing of a scanned document to identify boxes of an electronic fillable form that corresponds to the scanned document. The system creates and stores a data structure that includes a box list. The system searches the box list to identify an empty box, uses an adjacency list to determine whether the empty box has adjacent boxes, and in response to determining that the empty box has one or more adjacent boxes, creates a table box list that includes the empty box and one or more other boxes from the table to which the empty box belongs. The system uses the adjacency list to identify a row caption and a column caption associated with the empty box.
Type: Grant
Filed: December 21, 2015
Date of Patent: September 4, 2018
Assignee: Xerox Corporation
Inventors: Martin S. Maltz, Safwan R. Wshah
-
Patent number: 10067961
Abstract: Although many techniques provide ways to answer queries very quickly by either avoiding joins and/or aggregation or avoiding expression computations in order to apply filtering predicates, these techniques have significant potential overhead for maintenance, and require a great deal of storage space. The present invention discloses the use of synopsis or zone tables to expressions to provide powerful filtering capabilities to base tables across joins, and aggregations on join results, thus allowing for highly effective region or stride filtering and ordering aggregate ranges to select the most likely candidate tuples for joining to produce top-N or bottom-N result sets for queries.
Type: Grant
Filed: September 29, 2015
Date of Patent: September 4, 2018
Assignee: International Business Machines Corporation
Inventors: Ian R. Finlay, Abhinav Goyal, Robin D. Grosman, Calisto P. Zuzarte
-
Patent number: 10055206
Abstract: A building management system (BMS) includes building equipment configured to provide raw data samples of a data point and a data collector configured to collect the raw data samples from the building equipment and generate a raw data timeseries including a plurality of the raw data samples. The BMS includes a timeseries service configured to generate one or more optimized data timeseries from the raw data timeseries and a timeseries database configured to store the raw data timeseries and the one or more optimized data timeseries. The BMS includes a dashboard layout generator configured to identify each timeseries associated with the data point and to generate a framework-agnostic dashboard layout description capable of being rendered by multiple different rendering frameworks to visualize each of the identified timeseries.
Type: Grant
Filed: June 14, 2016
Date of Patent: August 21, 2018
Assignee: Johnson Controls Technology Company
Inventors: Youngchoon Park, Vijaya S. Chennupati, Gerald A. Asp, Peter A. Craig
-
Patent number: 10055114
Abstract: A building energy management system includes building equipment, one or more data platform services, a timeseries database, and an energy management application. The building equipment operate to monitor and control a variable and provide raw data samples of a data point associated with the variable. The timeseries database stores a plurality of timeseries associated with the data point. The plurality of timeseries include a timeseries of the raw data samples and the one or more optimized data timeseries generated by the data platform services based on the raw data timeseries. The energy management application generates an ad hoc dashboard including a widget and associates the widget with the data point. The widget displays a graphical visualization of the plurality of timeseries associated with the data point and includes interactive user interface options for switching between the plurality of timeseries associated with the data point.
Type: Grant
Filed: January 17, 2017
Date of Patent: August 21, 2018
Assignee: Johnson Controls Technology Company
Inventors: Barkha Shah, Vivek Narain, Vipul Devre, Youngchoon Park, Gerald A. Asp, Peter A. Craig, Vijaya S. Chennupati
-
Patent number: 10042903
Abstract: A method for testing an extract, transform, load job (ETL job) is provided. The method may include receiving an uploaded ETL job. The method may also include executing a functionality test of the uploaded ETL job using a test bucket. The method may further include determining the executed functionality test produces a failure result. The method may also include creating at least one test sub-job corresponding to at least one stage within the uploaded ETL job. The method may further include generating a plurality of test data for each of the at least one created test sub-job. The method may also include executing the at least one created test sub-job using the plurality of generated test data. The method may further include creating a report based on a plurality of results associated with the at least one executed test sub-job.
Type: Grant
Filed: September 2, 2015
Date of Patent: August 7, 2018
Assignee: International Business Machines Corporation
Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Sumit Kumar, Rajesh Phillips
-
Patent number: 10042936
Abstract: Methods, systems, and apparatus include computer programs encoded on a computer-readable storage medium for determining keywords. A method includes determining keyword(s) from a set of event information. The set can include plural, similar events so as to enable an effective analysis of a group of events as opposed to an analysis of a single discrete event. The set of events can be formed by filtering a corpus of events by a category. The corpus of events can include, for each event, information such as a search query, a web page associated with the query or visited by a user device, a location associated with a user device, or a search result response. Top ranked terms can be determined based on term frequency. Keyword(s) can be determined for each top ranked term. The keyword(s) can be used to select content for delivery to user devices associated with the category.
Type: Grant
Filed: July 11, 2014
Date of Patent: August 7, 2018
Assignee: Google LLC
Inventor: Yifang Liu
-
Patent number: 10037361
Abstract: A dataset of a plurality of transactions is received, the plurality of transactions including at least a first item and a second item. An absence-based association rule correlating an absence of the first item with an inclusion of the second item within the plurality of transactions is calculated.
Type: Grant
Filed: July 7, 2015
Date of Patent: July 31, 2018
Assignee: SAP SE
Inventor: Ran Bittmann
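One way to picture an absence-based association rule is to score the rule "first item absent implies second item present" with the usual support and confidence measures. The sketch below does that on a toy dataset. The function name `absence_rule` and the choice of support and confidence as the scoring measures are assumptions for illustration, not details taken from the patent.

```python
def absence_rule(transactions, absent_item, present_item):
    """Score the rule: absent_item missing from a transaction => present_item in it.

    Returns (support, confidence), where support is the share of all
    transactions matching the rule and confidence is the share of
    transactions lacking absent_item that contain present_item.
    """
    total = len(transactions)
    without = [t for t in transactions if absent_item not in t]
    matching = [t for t in without if present_item in t]
    support = len(matching) / total if total else 0.0
    confidence = len(matching) / len(without) if without else 0.0
    return support, confidence


transactions = [
    {"bread", "butter"},
    {"jam"},
    {"jam", "milk"},
    {"bread", "jam"},
    {"milk"},
]
support, confidence = absence_rule(transactions, "bread", "jam")
```

In this toy dataset, three of the five transactions lack bread and two of those contain jam, so the rule has support 2/5 and confidence 2/3.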
-
Patent number: 10019472
Abstract: Systems and methods for querying a distributed dwarf cube are disclosed. A query for retrieving data from a distributed dwarf cube is received. The distributed dwarf cube is built of the data. The data comprises cube values. The distributed dwarf cube is built by processing the data to generate indexes for the data. The cube values in one or more dimensions are sorted based on a cardinality of the cube values. The data is partitioned into data blocks to build distributed dwarf cube from each data block based upon the cardinality of the cube values. The distributed dwarf cube comprises one or more ranges defined for the cube values. The one or more ranges of the cube values are checked based upon the query. Using the cube values, a list is created. The list of the cube values is transmitted from the distributed dwarf cube corresponding to the query.
Type: Grant
Filed: August 14, 2014
Date of Patent: July 10, 2018
Assignee: INTELLICUS TECHNOLOGIES PVT. LTD.
Inventors: Ankit Khandelwal, Kapil Ghodawat, Sajal Rastogi, Saurabh Gupta
-
Patent number: 9996562
Abstract: A method of migrating data from one or more source databases to one or more target databases may include generating a pre-migration analysis for a plurality of objects stored in the one or more source databases, and generating a plurality of migration scripts that transfer the plurality of objects from the one or more source databases to the one or more target databases. The method may also include generating a migration plan that defines an execution order for the plurality of migration scripts, and migrating the plurality of objects from the one or more source databases to one or more target databases according to the migration plan. The method may further include validating the plurality of objects on the one or more target databases.
Type: Grant
Filed: July 9, 2013
Date of Patent: June 12, 2018
Assignee: ORACLE INTERNATIONAL CORPORATION
Inventors: Antony Higginson, John Masterson, Sean Fitzpatrick, Peter Robertshaw, Elmar Spiegelberg, Stephan Buhne, Michael Weick, Nick Balch, Florin Popescu
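A migration plan that defines an execution order can be sketched as a topological sort over script dependencies. This is an assumption about how such an order might be derived, since the abstract does not say. The script names and the `migration_plan` helper are hypothetical. `graphlib` is part of the Python standard library from version 3.9.

```python
from graphlib import TopologicalSorter


def migration_plan(dependencies):
    """Order migration scripts so each runs after the scripts it depends on.

    dependencies maps a script name to the set of scripts that must run first.
    """
    return list(TopologicalSorter(dependencies).static_order())


# Hypothetical scripts: the index needs its table, and orders reference customers.
dependencies = {
    "orders_table": {"customers_table"},
    "orders_index": {"orders_table"},
    "customers_table": set(),
}
plan = migration_plan(dependencies)
```

Because the dependencies here form a single chain, the only valid plan is `customers_table`, then `orders_table`, then `orders_index`. A dependency cycle makes `static_order()` raise `graphlib.CycleError`, which a real planner would have to report.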
-
Patent number: 9984173
Abstract: A method, system, and computer program product for automated value analysis in legacy data are provided in the illustrative embodiments. A first number of occurrences of a first value in a first field is counted in the legacy data. A first identifier associated with the first field, the first value, and the first count are recorded as a first entry in a value data structure, the value data structure comprising a set of entries. The value data structure is analyzed to determine an aberration between the first count and a second count, wherein the second count corresponds to a second number of occurrences of a second value in the first field. The aberration at the first field of the legacy data is reported in a report.
Type: Grant
Filed: February 24, 2014
Date of Patent: May 29, 2018
Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors: James E. Bostick, John M. Ganci, Jr., Arvind Singh, David S. Wenk
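The counting and comparison steps in this abstract can be sketched as follows: count how often each value occurs in each field, then flag values whose count is far below the most frequent value in the same field. The `ratio` threshold and the function names are hypothetical, and the abstract does not define how an aberration is measured.

```python
from collections import Counter


def value_counts(records):
    """Build the value data structure: field -> Counter of value occurrences."""
    counts = {}
    for record in records:
        for field, value in record.items():
            counts.setdefault(field, Counter())[value] += 1
    return counts


def aberrations(counts, ratio=0.1):
    """Report (field, value, count) where count is below ratio * the top count."""
    report = []
    for field, counter in counts.items():
        top = max(counter.values())
        for value, count in counter.items():
            if count < ratio * top:
                report.append((field, value, count))
    return report


# Twenty consistent records plus one with a variant spelling of the same state.
records = [{"state": "NY"}] * 20 + [{"state": "N.Y."}]
report = aberrations(value_counts(records))
```

Here `"N.Y."` occurs once against twenty occurrences of `"NY"`, so it is the only entry reported. A rare value is not necessarily wrong, so a report like this is a prompt for review rather than an automatic correction.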
-
Patent number: 9984425
Abstract: Disclosed are some examples of systems, apparatus, methods, and computer program products related to automatically causing a following relationship to be established in an enterprise social networking system between a user and one or more customer relationship management (CRM) records. In some implementations, a workflow is configurable at least in part by one or more selections defining one or more conditions. Responsive to determining that one or more properties associated with a first CRM record satisfies the one or more conditions, a user is caused to follow the first CRM record. The following enables or causes updates associated with the first CRM record to be shared in a feed of the enterprise social networking system, where the feed can be provided to the user's device. Information based on the following of the first CRM record by the user can be displayed at the device.
Type: Grant
Filed: December 20, 2016
Date of Patent: May 29, 2018
Assignee: salesforce.com, inc.
Inventor: Christopher Anthony Kemp
-
Patent number: 9984101
Abstract: Techniques supporting database migration of schemas enforced by applications without downtime are described. In examples, a method of migrating databases is described to comply with schemas enforced by one or more applications without downtime such that the data remains accessible to the one or more applications. A transformation is initiated, by a schema migration module implemented by one or more computing devices, of a first database in compliance with a first schema into a second database in compliance with a second schema. The first and second schemas have mechanisms for storage and retrieval of data that is modeled using at least one technique other than a relational model.
Type: Grant
Filed: February 19, 2015
Date of Patent: May 29, 2018
Assignee: ADOBE SYSTEMS INCORPORATED
Inventor: Nils Meder
-
Patent number: 9977798
Abstract: A method for migrating data between tables. The method includes, receiving a Structured Query Language (SQL) statement, where the SQL statement identifies a source table and a destination table, and the source table contains source data divided into columns with a set of column names. A database management system identifies a set of instructions for manipulating the source data. The database management system generates an interim SQL statement, based on the SQL statement. The database management system executes the interim SQL statement, which copies the manipulated source data into a set of columns of the destination table.
Type: Grant
Filed: July 20, 2015
Date of Patent: May 22, 2018
Assignee: International Business Machines Corporation
Inventors: Karla Bester, Allan T. Chandler, Mark A. Shewell, Stephen J. Yates
-
Patent number: 9971850
Abstract: Described herein are hash table structures and system and methods for creating and storing data in the hash table structures. Hash tables and input buffers associated with same are configured in several arrangements, including multi-level hash tables and nested multi-level hash tables. The hash table structures described herein are well suited for being configured in the memory blocks of field-programmable gate arrays.
Type: Grant
Filed: December 29, 2015
Date of Patent: May 15, 2018
Assignee: International Business Machines Corporation
Inventor: Takanori Ueda
-
Patent number: 9965516
Abstract: A method and apparatus for outer-joined and/or cross-joined table elimination for duplicate-insignificant queries is provided. A query block of a query specifies a join between a first table and a second table. The join is one of an outer join and a cross join. A first determination is made that the query block contains no references to the second table to be processed after the join. A second determination is made that the query block is duplicate-insignificant. In response to making the first determination and the second determination, the query block is transformed into a transformed query, where the second table is eliminated from a corresponding query block in the transformed query.
Type: Grant
Filed: May 21, 2014
Date of Patent: May 8, 2018
Assignee: Oracle International Corporation
Inventors: Rafi Ahmed, Gautam Kumar Parai
-
Patent number: 9959193
Abstract: According to an embodiment of the present invention, an artifact is received, and unstructured content of the artifact is parsed and analyzed to identify data for one or more of missing structured content of the artifact and inaccurate structured content of the artifact. The identified data is then added to the artifact. Embodiments of the present invention can be used, for example, to provide data for missing and inaccurate structured content in artifacts of Application Lifecycle Management (ALM) frameworks, and improve accuracy of structured information that is used to run queries and create reports.
Type: Grant
Filed: September 25, 2015
Date of Patent: May 1, 2018
Assignee: International Business Machines Corporation
Inventors: Muhtar B. Akbulut, Mario A. Maldari, David D. Taieb
-
Patent number: 9959336
Abstract: Compiling test cases. The method may include uploading at least one extract, transform, load job (ETL job) and a plurality of test cases. The method may also include identifying a plurality of design features associated with the at least one uploaded ETL job. The method may further include identifying at least one feature change within the plurality of identified design features. The method may also include determining a plurality of related test cases within the plurality of uploaded test cases corresponding to the at least one identified feature change. The method may further include creating a test bucket based on a grouping of the determined plurality of related test cases. The method may also include adding at least one of a plurality of corresponding ETL jobs and a plurality of corresponding ETL job stages associated with the at least one identified feature change to the created test bucket.
Type: Grant
Filed: September 22, 2016
Date of Patent: May 1, 2018
Assignee: International Business Machines Corporation
Inventors: Raghavendra R. Dhayapule, Ritesh K. Gupta, Sumit Kumar, Rajesh Phillips
-
Patent number: 9952962
Abstract: According to an embodiment of the present invention, an artifact is received, and unstructured content of the artifact is parsed and analyzed to identify data for one or more of missing structured content of the artifact and inaccurate structured content of the artifact. The identified data is then added to the artifact. Embodiments of the present invention can be used, for example, to provide data for missing and inaccurate structured content in artifacts of Application Lifecycle Management (ALM) frameworks, and improve accuracy of structured information that is used to run queries and create reports.
Type: Grant
Filed: March 26, 2015
Date of Patent: April 24, 2018
Assignee: International Business Machines Corporation
Inventors: Muhtar B. Akbulut, Mario A. Maldari, David D. Taieb
-
Patent number: 9934258
Abstract: Techniques for dynamically generating database views and class objects for allowing access to domain data stored in a repository are provided. A computer-implemented method, a system, and/or a machine-readable medium storing instructions executable by one or more processors may include generating a database view and a class object using metadata included in a domain model, and retrieving domain data from a repository using the database view and the class object. For example, a method may include obtaining a domain model from a repository, the domain model including metadata corresponding to a set of domain data stored in the repository, generating a database view of a subset of the set of domain data using the metadata, generating a class object for the subset using the metadata, generating mapping information, and retrieving the subset of domain data from the repository using the database view, the class object, and the mapping information.
Type: Grant
Filed: March 31, 2014
Date of Patent: April 3, 2018
Assignee: ORACLE INTERNATIONAL CORPORATION
Inventors: Dennis Floyd Fuglsang, Joshua Jeffrey Spiegel
-
Patent number: 9922101
Abstract: Example embodiments of the present invention relate to a method, a system, and a computer program product for federated service requests of a logical data store comprising a plurality of heterogeneous data stores. The method includes providing a data abstraction services layer as a control plane for federated service requests of a logical data store, comprising a plurality of heterogeneous data stores, by an application operating on a processing device, receiving a service request at the data abstraction services layer from the application operating on the processing device directed to the logical data store, and returning results of the service request from the data abstraction services layer to the application operating on the processing device.
Type: Grant
Filed: June 28, 2013
Date of Patent: March 20, 2018
Assignee: EMC Corporation
Inventors: David Stephen Reiner, Nihar K. Nanda, John D. Hushon, Jr., David R. Brown, Jagannathan Ramnarayanan
-
Patent number: 9922103
Abstract: According to one embodiment, a method of copying a dataset associated with a first extract, transform, and load (ETL) job in a first data integration system to a second data integration system comprises copying executable code associated with the first ETL job from the first to the second system. Operating system software, integration system software, and file system structure are substantially identical between the first and second systems. The method further comprises executing the second ETL job to read the dataset from the first data integration system and write the dataset to the second data integration system. The second ETL job is associated with configuration parameters specifying storage resources in the first system associated with the dataset and destination parameters specifying storage resources in the second system. The method further comprises copying metadata generated by the second ETL job from the first to the second data integration system.
Type: Grant
Filed: October 21, 2014
Date of Patent: March 20, 2018
Assignee: Bank of America Corporation
Inventors: Jason E. Martens, Brad E. Romano, Sachin M. Nerurkar, Shashi Tanikella
-
Patent number: 9922072
Abstract: Extract, transform, and load application (ETL) complexity management framework systems and methods are described herein. The present disclosure describes systems and methods that reduce the complexity in managing ETL flow and correcting errant data that is subsequently identified. One or more methods include defining an ETL job definition, defining a data asset definition, defining a data asset dependency definition, receiving an ETL flow to provide execution of one or more ETL flow steps, providing retrieval of data from a source data asset, applying a data control to the source asset data, and producing an ETL job registration, a data asset status, a latest asset available date, a data asset consumer identifier, and a target data asset based on at least one of the ETL job definition, the data asset definition, the data dependency definition, and the source asset data.
Type: Grant
Filed: April 21, 2016
Date of Patent: March 20, 2018
Assignee: United Services Automobile Association (USAA)
Inventors: Larry W. Clark, Jason P. Hendry, Mark Steen
-
Patent number: 9910891
Abstract: Staging data record changes from a faster storage medium to a slower storage medium using data query rewriting is provided. In response to receiving a data query corresponding to a particular data record, it is determined whether the data query is one of a transactional data query or an analytical data query. In response to determining that the data query is a transactional data query, the transactional data query is rewritten to apply transactional delta changes to the particular data record on a storage-class memory of a computer. In response to determining that the data query is an analytical data query, the analytical data query is rewritten to select and reconcile each data record corresponding to the particular data record stored on the storage-class memory with the particular data record stored on a persistent data storage device of the computer.
Type: Grant
Filed: September 25, 2013
Date of Patent: March 6, 2018
Assignee: International Business Machines Corporation
Inventors: Bishwaranjan Bhattacharjee, Mustafa Canim, Mohammad S. Hamedani, Kenneth A. Ross
-
Patent number: 9904706
Abstract: Staging data record changes from a faster storage medium to a slower storage medium using data query rewriting is provided. In response to receiving a data query corresponding to a particular data record, it is determined whether the data query is one of a transactional data query or an analytical data query. In response to determining that the data query is a transactional data query, the transactional data query is rewritten to apply transactional delta changes to the particular data record on a storage-class memory of a computer. In response to determining that the data query is an analytical data query, the analytical data query is rewritten to select and reconcile each data record corresponding to the particular data record stored on the storage-class memory with the particular data record stored on a persistent data storage device of the computer.
Type: Grant
Filed: September 6, 2013
Date of Patent: February 27, 2018
Assignee: International Business Machines Corporation
Inventors: Bishwaranjan Bhattacharjee, Mustafa Canim, Mohammad S. Hamedani, Kenneth A. Ross
-
Patent number: 9898522
Abstract: Techniques are described for managing aggregation of data in a distributed manner, such as for a particular client based on specified configuration information. The described techniques may include storing aggregated data values for an OLAP cube or other data structure in a distributed manner, such as in some situations in a distributed hash table. The aggregated data values to be stored may be generated in various manners, such as by performing multi-stage data manipulation operations; for example, a map-reduce architecture may be used, with a first stage involving the use of one or more specified map functions to be performed, and with at least a second stage involving the use of one or more specified reduce functions to be performed.
Type: Grant
Filed: December 16, 2014
Date of Patent: February 20, 2018
Assignee: AMAZON TECHNOLOGIES, INC.
Inventors: Richard J. Cole, Alan D. Mock
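The two-stage aggregation described in this abstract can be illustrated with a minimal single-process sketch. It is not the patented implementation: the function names `map_stage` and `reduce_stage`, the sample map function, and the record fields are all hypothetical, and a plain Python dict stands in for the distributed hash table mentioned in the abstract.

```python
from collections import defaultdict

def map_stage(records, map_fn):
    """First stage: run the map function on each record, grouping emitted values by key."""
    grouped = defaultdict(list)
    for record in records:
        for key, value in map_fn(record):
            grouped[key].append(value)
    return grouped

def reduce_stage(grouped, reduce_fn):
    """Second stage: collapse each key's values into one aggregated value."""
    # The resulting dict is a local stand-in for a distributed hash table.
    return {key: reduce_fn(values) for key, values in grouped.items()}

def sales_by_region_and_day(record):
    """Hypothetical map function: emit one (cube cell, measure) pair per record."""
    yield (record["region"], record["day"]), record["amount"]

records = [
    {"region": "EU", "day": "2014-12-01", "amount": 10},
    {"region": "EU", "day": "2014-12-01", "amount": 20},
    {"region": "US", "day": "2014-12-01", "amount": 5},
]
cube_cells = reduce_stage(map_stage(records, sales_by_region_and_day), sum)
```

Each key in `cube_cells` corresponds to one cell of a small two-dimensional cube (region by day); swapping `sum` for `max` or `len` changes the aggregate without touching the map stage.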
-
Patent number: 9892179
Abstract: A method is provided for managing queries in business intelligence platforms. The method includes receiving, by a processor, a first query requesting data from at least one business intelligence content, the first query having a first format. The method further includes determining, by the processor, at least one requirement to complete the first query. The method further includes converting, by the processor, the first query to a second query configured to be used by an Extract, Transform, and Load (ETL) program, in response to determining that the at least one requirement exceeds a threshold, the second query having a second format.
Type: Grant
Filed: May 21, 2014
Date of Patent: February 13, 2018
Assignee: International Business Machines Corporation
Inventors: Arun Balasubramanyan, Yuntao Liu, Pamela K. Prinz, Mary E. Rudden, John A. Tobasco
-
Patent number: 9886474
Abstract: A multidimensional object, such as a language-integrated query (LINQ) object, can be generated based upon a grouping set operator and zero or more key selectors. The object can be utilized to query over a non-relational data source. The object can include a sequence of functions corresponding to each grouping related to the grouping set operator in order to provide multidimensional grouping representation. Furthermore, the object includes nested structures for optimized execution of the grouping set operator.
Type: Grant
Filed: November 22, 2011
Date of Patent: February 6, 2018
Assignee: Microsoft Technology Licensing, LLC
Inventors: Bart De Smet, Henricus Johannes Maria Meijer, John Wesley Dyer
-
Patent number: 9836505
Abstract: A system and method for processing a database query is described. The method can, in response to detection that a database query involves a star or snowflake join operation, determine a selectivity ratio for each of a plurality of dimension tables. The selectivity ratio having a lower value can correspond to a more restrictive dimension table. Thereafter, a table ordering can be created beginning with a fact table and continuing with each of the dimension tables in ascending order of their corresponding selectivity ratios. Then a query plan involving join operations between successive tables in the table ordering can be created.
Type: Grant
Filed: March 13, 2014
Date of Patent: December 5, 2017
Assignee: SYBASE, INC.
Inventors: Xun Cheng, Peter Schneider
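The table ordering described in this abstract can be sketched as follows. This is an illustration under stated assumptions rather than the patented method: the abstract does not define how the selectivity ratio is computed, so the sketch assumes it is the fraction of a dimension table's rows that survive the query's filters. The class `DimensionTable` and the functions `order_tables` and `join_plan` are hypothetical names.

```python
from dataclasses import dataclass

@dataclass
class DimensionTable:
    name: str
    total_rows: int
    rows_after_filter: int

    @property
    def selectivity_ratio(self) -> float:
        # Assumed definition: surviving rows / total rows (lower = more restrictive).
        if self.total_rows <= 0:
            raise ValueError("total_rows must be positive")
        return self.rows_after_filter / self.total_rows

def order_tables(fact_table, dimensions):
    """Fact table first, then dimensions in ascending order of selectivity ratio."""
    ranked = sorted(dimensions, key=lambda d: d.selectivity_ratio)
    return [fact_table] + [d.name for d in ranked]

def join_plan(ordering):
    """Pair each table with its successor to represent the sequence of joins."""
    return list(zip(ordering, ordering[1:]))

dimensions = [
    DimensionTable("date", total_rows=365, rows_after_filter=30),
    DimensionTable("store", total_rows=100, rows_after_filter=50),
    DimensionTable("product", total_rows=1000, rows_after_filter=10),
]
ordering = order_tables("sales", dimensions)
plan = join_plan(ordering)
```

Joining the most restrictive dimension first shrinks the intermediate result early, which is the intuition behind ordering by ascending ratio.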
-
Patent number: 9836530
Abstract: In one example of the disclosure, data indicative of a word or phrase communicated during a meeting including a plurality of participants is obtained. For each participant, records electronically accessible to the participant are identified, and each record is associated with a tier from a hierarchy of record-relevancy tiers. A set of explanations for the communication and associated scores is identified, including for each participant, beginning with a most relevant tier, searching the records accessible to the participant tier by tier until an explanation is identified, and assigning a score to the explanation according to the tier associated with the record in which the explanation is found. A preferred explanation for the communication is determined based upon the scores, and a display of the preferred explanation is caused.
Type: Grant
Filed: December 16, 2013
Date of Patent: December 5, 2017
Assignee: ENTIT SOFTWARE LLC
Inventors: Tomer Priel, Ira Cohen, Eli Mordechai
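The tier-by-tier search in this abstract can be sketched roughly as below. The abstract does not say how scores are assigned or combined, so this sketch assumes a score equal to the number of tiers minus the tier index, summed across participants. Records are modelled as plain dicts mapping a term to its explanation; all names are hypothetical.

```python
def explain_for_participant(term, tiers):
    """Search one participant's records, most relevant tier first.

    `tiers` is a list of tiers, each a list of records (dict: term -> explanation).
    Returns (explanation, score) for the first hit, or None if nothing matches.
    """
    for tier_index, tier_records in enumerate(tiers):
        for record in tier_records:
            if term in record:
                # Assumed scoring: hits in more relevant tiers score higher.
                return record[term], len(tiers) - tier_index
    return None

def preferred_explanation(term, participants):
    """Sum scores per explanation across participants and return the top one."""
    totals = {}
    for tiers in participants.values():
        found = explain_for_participant(term, tiers)
        if found is not None:
            explanation, score = found
            totals[explanation] = totals.get(explanation, 0) + score
    if not totals:
        return None
    return max(totals, key=totals.get)

participants = {
    "alice": [[{"ETL": "extract, transform, load"}], [{"ETL": "estimated time left"}]],
    "bob": [[], [{"ETL": "estimated time left"}]],
    "carol": [[{"ETL": "extract, transform, load"}], []],
}
best = preferred_explanation("ETL", participants)
```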
-
Patent number: 9810539
Abstract: A method is provided for partitioning probe data into a plurality of subsets, map matching a portion of the subsets and establishing a route based on the map-matched subsets of probe data. A method is provided that includes: receiving probe data points from a plurality of probes, where an uncertainty of the measurement of the probe data is a first distance; partitioning the probe data from the plurality of probes into subsets of probe data, where the subsets are created by ensuring each subset has probe data points that are at least two times the first distance apart from one another; performing map-matching on each subset of probe data; and establishing a route for each subset based on the map-matching of each subset of probe data. Establishing a route for each subset may include determining a route according to at least one of the shortest, fastest, or simplest routes.
Type: Grant
Filed: March 16, 2016
Date of Patent: November 7, 2017
Assignee: HERE Global B.V.
Inventor: Dongwook Jang
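The partitioning constraint in this abstract (points in a subset at least twice the measurement uncertainty apart) can be sketched with a simple greedy pass. The greedy strategy is an assumption, since the abstract does not say how subsets are built, and the sketch treats probe points as planar coordinates in metres rather than latitude and longitude. The function name `partition_probe_points` is hypothetical; map-matching and routing are not covered.

```python
import math

def partition_probe_points(points, uncertainty):
    """Greedily place each point in the first subset where it keeps the minimum gap.

    Every pair of points within a subset ends up at least 2 * uncertainty apart.
    """
    min_gap = 2 * uncertainty
    subsets = []
    for point in points:
        for subset in subsets:
            if all(math.dist(point, other) >= min_gap for other in subset):
                subset.append(point)
                break
        else:
            # No existing subset can take this point, so start a new one.
            subsets.append([point])
    return subsets

points = [(0, 0), (5, 0), (25, 0), (30, 0)]
subsets = partition_probe_points(points, uncertainty=10)
```

With an uncertainty of 10, points 5 apart cannot share a subset, so the four points split into two subsets of two.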
-
Patent number: 9800484
Abstract: An approach for managing and optimizing available resources utilized in a networked computing environment (e.g., a cloud computing environment) is provided. In one aspect, a plurality of resources in a plurality of storage pools within the networked computing environment are monitored. Each storage pool is running at least one application workload using a corresponding subset of the plurality of resources. The storage pools are ranked based on the application workloads. In response to an over-utilized storage pool and/or an under-utilized storage pool, the plurality of resources are dynamically reconfigured.
Type: Grant
Filed: March 10, 2014
Date of Patent: October 24, 2017
Assignee: International Business Machines Corporation
Inventors: Sandip Agarwala, Venkateswarlu Basyam, Kavita Chavda, Dean Hildebrand, Ramani R. Routray
-
Patent number: 9792150
Abstract: Various systems and methods for detecting a site change. For example, one method involves detecting migration of a virtual machine. The method also involves determining whether the virtual machine moved from one site to another site during the migration. The method also involves updating location information, where the location information is updated in response to determining whether the virtual machine moved from one site to another.
Type: Grant
Filed: June 28, 2013
Date of Patent: October 17, 2017
Assignee: Veritas Technologies LLC
Inventors: Amit Haridas Rangari, Udipta Das
-
Patent number: 9779154
Abstract: A synchronization system for transform databases and a method thereof are provided. The system includes a database transformer and a data adapter. The database transformer is used to transform tables from a source database to a target database. The data adapter includes a statement parsing unit, a storage unit and a synchronization control unit. The statement parsing unit receives a query statement and parses a query table associated with the query statement. The storage unit stores the query statement temporarily. The synchronization control unit determines a query mechanism and receives a transformation state about the query table from the database transformer. Based on the query mechanism and the transformation state, the synchronization control unit performs the query statement in the source database, the target database, or both of them.
Type: Grant
Filed: March 20, 2015
Date of Patent: October 3, 2017
Assignee: National Tsing Hua University
Inventors: Ying-Ti Liao, Jia-Zheng Zhou, Yeh-Ching Chung
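The routing decision made by the synchronization control unit can be sketched as a small lookup. The abstract only says the statement runs in the source database, the target database, or both, depending on the query mechanism and the transformation state. The concrete states, the read/write split, and the routing rule below are all assumptions made for illustration, and every name is hypothetical.

```python
from enum import Enum

class TransformationState(Enum):
    NOT_STARTED = "not_started"
    IN_PROGRESS = "in_progress"
    DONE = "done"

class QueryMechanism(Enum):
    READ = "read"
    WRITE = "write"

def choose_databases(mechanism, state):
    """Return which databases should execute a statement against a given table.

    Assumed rule: before transformation only the source holds the table; after
    it only the target is used; while it is in progress, writes go to both so
    the copies stay in step and reads stay on the source.
    """
    if state is TransformationState.NOT_STARTED:
        return ["source"]
    if state is TransformationState.DONE:
        return ["target"]
    if mechanism is QueryMechanism.WRITE:
        return ["source", "target"]
    return ["source"]
```

For example, `choose_databases(QueryMechanism.WRITE, TransformationState.IN_PROGRESS)` yields both databases under this assumed rule.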
-
Patent number: 9773003
Abstract: An analytical database server and a method for enabling investigative data analytics have been disclosed. The database server comprises a data loader that receives data from a plurality of data sources, and loads the received source data tables onto a Hadoop Distributed File System (HDFS). A processor processes the source data tables loaded onto the HDFS and assigns a unique turf row (TR) identifier to each of the records present in the source data tables. The source data tables and the corresponding metadata are organized into a first data table, a second data table and a third data table. The first table comprises a record for each of the received source data tables. The second data table stores, in the form of an inverted list, the data originally contained in the received source data tables. The third data table stores the unidirectional relationships between the source data tables.
Type: Grant
Filed: February 2, 2015
Date of Patent: September 26, 2017
Assignee: XURMO TECHNOLOGIES PVT. LTD.
Inventors: Sridhar Gopalakrishnan, Davis Verghese, Saurabh Santosh
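The first two tables in this abstract (a catalog of source tables and an inverted list keyed by value) can be sketched in memory as below. This is a simplified illustration, not the patented layout: it runs locally rather than on HDFS, omits the third (relationships) table, and assumes the inverted list maps each (column, value) pair to the set of TR identifiers that contain it. The function name `load_tables` is hypothetical.

```python
from collections import defaultdict
from itertools import count

def load_tables(source_tables):
    """Assign a unique TR identifier to every record and build two lookup tables.

    Returns (catalog, inverted): `catalog` has one entry per source table and
    `inverted` maps (column, value) -> set of TR identifiers.
    """
    tr_ids = count(1)
    catalog = []
    inverted = defaultdict(set)
    for table_name, rows in source_tables.items():
        assigned = []
        for row in rows:
            tr = next(tr_ids)
            assigned.append(tr)
            for column, value in row.items():
                inverted[(column, value)].add(tr)
        catalog.append({"table": table_name, "tr_ids": assigned})
    return catalog, dict(inverted)

source_tables = {
    "people": [{"name": "Ann", "city": "Pune"}, {"name": "Bo", "city": "Pune"}],
    "orders": [{"name": "Ann", "item": "pen"}],
}
catalog, inverted = load_tables(source_tables)
```

Looking up `("name", "Ann")` in `inverted` returns records from both source tables, which is the kind of cross-table lookup an inverted list makes cheap.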
-
Patent number: 9767103
Abstract: A method and system electronically pull, using an extension of a launched program, accounting data from a launched accounting software application. The pulled data is converted, by the extension, into a format compatible with a second accounting software application. The converted data is written into the launched application associated with the extension. The written data is converted, by another extension, into a format compatible for importing into another accounting software application. The format compatible data is pushed into the second accounting software application by the extension.
Type: Grant
Filed: November 3, 2014
Date of Patent: September 19, 2017
Inventor: Craig Hurlbut
-
Patent number: 9740831
Abstract: A clinical data management system (1) has databases (20), processors in servers (2-4) which are programmed to process clinical data and communicate with user interfaces and external systems interfaces, and at least one database. The system imports source data from disparate clinical site sources into staging databases at refresh intervals, maintains data models, and maps data from the staging databases into the data models, and feeds data from the data models into data delivery databases. There is a uniform refresh frequency for the staging databases. The system output is regularly updated data for clinical site performance, quality and risk metrics to a clinical study team. The data mapper servers identify each of a plurality of source data stages, and transform data from each stage to one or more data models according to one or more mapsets, each mapset defining a transformation.
Type: Grant
Filed: April 2, 2013
Date of Patent: August 22, 2017
Assignee: ICON CLINICAL RESEARCH LIMITED
Inventors: Colin Thomas Burns, Gareth Allan Milborrow, Paul Crean, Michael Grossman
-
Patent number: 9740718
Abstract: Methods, computer systems, and stored instructions are described herein for densely grouping dimensional data and/or aggregating data using a data structure, such as one that is constructed based on dimensional data. When smaller tables are joined with a larger table, a server may analyze the smaller tables first to determine actual value combinations that occur in the smaller tables, and these actual value combinations are used to more efficiently process the larger table. A dense data structure may be generated by processing dimensional data before processing data from the fact table. The dense data structure may be generated by compressing ranges of values that are possible in dimensions into a range of values that actually occurs in the dimensions. The compressed range of values may be represented by dense set identifiers rather than the actual compressed range of values.
Type: Grant
Filed: September 20, 2013
Date of Patent: August 22, 2017
Assignee: Oracle International Corporation
Inventors: Albert A. Hopeman, IV, Martin Roth, Ekrem S. C. Soylemez
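The idea of scanning the dimension table first and replacing group values with dense identifiers can be sketched as follows. This is a toy in-memory illustration, not Oracle's implementation: it handles a single dimension, uses list positions as the dense set identifiers, and all function and column names are hypothetical.

```python
def build_dense_grouping(dimension_rows, key_column, group_column):
    """Scan the (small) dimension table and assign a dense id to each group value seen.

    Returns (labels, key_to_dense): `labels[i]` is the group value for dense id i,
    and `key_to_dense` maps each join key to its group's dense id.
    """
    labels = []
    id_of_group = {}
    key_to_dense = {}
    for row in dimension_rows:
        group = row[group_column]
        if group not in id_of_group:
            id_of_group[group] = len(labels)
            labels.append(group)
        key_to_dense[row[key_column]] = id_of_group[group]
    return labels, key_to_dense

def aggregate_fact(fact_rows, key_column, measure_column, labels, key_to_dense):
    """Scan the (large) fact table once, accumulating into a list indexed by dense id."""
    totals = [0] * len(labels)
    for row in fact_rows:
        dense_id = key_to_dense.get(row[key_column])
        if dense_id is not None:  # fact rows with no matching dimension row are skipped
            totals[dense_id] += row[measure_column]
    return dict(zip(labels, totals))

products = [
    {"product_id": 1, "category": "toys"},
    {"product_id": 2, "category": "books"},
    {"product_id": 3, "category": "toys"},
]
sales = [
    {"product_id": 1, "amount": 10},
    {"product_id": 2, "amount": 5},
    {"product_id": 3, "amount": 7},
    {"product_id": 4, "amount": 100},
]
labels, key_to_dense = build_dense_grouping(products, "product_id", "category")
totals = aggregate_fact(sales, "product_id", "amount", labels, key_to_dense)
```

Because only group values that actually occur receive an identifier, the accumulator is exactly as large as the number of real groups, regardless of how wide the possible value range is.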
-
Patent number: 9740724
Abstract: Systematically enforcing generation of partition key values for records in a data management system such that for any given partitioned table, all records relating to a common business object share a common partitioning key. Partitioning and/or clustering of these records can then be done based on these specially generated partitioning key values, with entities dependent on the same first-class entity being placed in the same partition and/or cluster. This may provide considerable performance gains in at least some circumstances.
Type: Grant
Filed: April 29, 2014
Date of Patent: August 22, 2017
Assignee: International Business Machines Corporation
Inventors: Yongli An, Mohammad Khatibi
-
Patent number: 9734236
Abstract: A method and system for leveraging content is provided. The method includes receiving data associated with a subscriber and registering the subscriber with an ECM computing system. Devices belonging to the subscriber are connected to the ECM computing system and metadata associated with content retrieved from the devices is generated. The content in the devices is classified into formal content and informal content. Multiple searches for additional content are monitored and multifaceted search results associated with the formal content and the informal content are generated and presented to the subscriber. The subscriber has an option to request informal content on additional end user devices from respective end users based on metadata presented by search results.
Type: Grant
Filed: June 8, 2016
Date of Patent: August 15, 2017
Assignee: International Business Machines Corporation
Inventor: Rajesh Radhakrishnan