Abstract: Systems and methods for ingesting and enhancing data in a distributed processing framework. The system includes at least a data ingestion system configured to access data or datasets from one or more data sources. The data is accessed via the data ingestion system and includes metadata defining a plurality of attributes. The attributes are identified in the metadata, via the data ingestion system, and may be applied to the data or dataset for enhancing the data or dataset. Application of the attributes to the data results in enhancements that may include joining the data, enriching the data, or other enhancements accomplished via manipulation of the data via the data ingestion system.
Type:
Grant
Filed:
May 21, 2019
Date of Patent:
October 25, 2022
Assignee:
JPMORGAN CHASE BANK, N.A.
Inventors:
Matthew Yazdi, Manish Doshi, Fazil Syed, Smitha Vijayendra, Grant Poladian, Yue Lin
Abstract: A method and/or system for managing a database that stores space-time context objects is provided. The system receives a query range in a multi-dimensional space. The system maps the query range into a set of fragments of a space-filling curve that fills the multi-dimensional space in all dimensions of the multi-dimensional space. The system uses each mapped fragment in the set of mapped fragments as a key to query the database for space-time context objects that are mapped to the space-filling curve. The system queries the database by identifying one or more context objects that intersect the mapped fragment at the space-filling curve.
Type:
Grant
Filed:
December 1, 2017
Date of Patent:
August 30, 2022
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors:
Ning Duan, Chun Yang Ma, Makoto Tanibayashi, Zhi Hu Wang, Shoichiro Watanabe, Nan Xia, Xin Zhang, Jun Zhu
Abstract: A system and computer implemented method for execution of aggregation expressions on a distributed non-relational database system is provided. According to one aspect, an aggregation operation may be provided that permits more complex operations using separate collections. For instance, it may be desirable to create a report from one collection using information grouped according to information stored in another collection. Such a capability may be provided within other conventional database systems, however, in a non-relational database system such as NoSQL, the system is not capable of performing server-side joins, such a capability may not be performed without denormalizing the attributes into each object that references it, or by performing application-level joins which is not efficient and leads to unnecessarily complex code within the application that interfaces with the NoSQL database system.
Type:
Grant
Filed:
May 25, 2017
Date of Patent:
August 2, 2022
Assignee:
MongoDB, Inc.
Inventors:
Eliot Horowitz, David Lenox Storch, Charles William Swanson
Abstract: A system, method and apparatus for storing metadata in a metadata store in a robust and efficient manner including receiving a request from a client to perform a data transaction, updating a key-value pair in a metadata store based on the request, entering the data transaction in a transaction log, updating a read cache with the key-value pair, and replicating the last transaction log entry in at least one other storage node in the metadata store.
Type:
Grant
Filed:
September 12, 2017
Date of Patent:
August 2, 2022
Assignee:
Western Digital Technologies, Inc.
Inventors:
Frederik Jacqueline Luc De Schrijver, Joris Custers, Carl Rene D'Halluin
Abstract: Methods, computer program products, and systems are presented. The method computer program products, and systems can include, for instance: receiving, by a production database, a query statement for processing by the production database; the production database in performing runtime execution processing of the query statement identifying one or more query statement value of the query statement; the production database in performing runtime execution processing of the query statement establishing, using a predictive model, an access path based on the identified one or more query statement value; and the production database using the established access path in performing runtime execution processing of the query statement.
Type:
Grant
Filed:
May 15, 2018
Date of Patent:
November 16, 2021
Assignee:
INTERNATIONAL BUSINESS MACHINES CORPORATION
Inventors:
Xiaobo Wang, Shuo Li, Ke Wei Wei, Heng Liu
Abstract: An apparatus includes processing circuitry configured to execute instructions that, when executed, cause the apparatus to initialize a mixture model having a number of clusters including categorical data, iteratively update cluster assignments, evaluate cluster quality based on categorical density of the clusters, and prune clusters that have low categorical density, and determine an optimal mixture model based on the pruned clusters.
Type:
Grant
Filed:
April 2, 2018
Date of Patent:
November 17, 2020
Assignee:
The Johns Hopkins University
Inventors:
Cetin Savkli, Jeffrey S. Lin, Philip B. Graff
Abstract: Objects across multiple database manipulation language (DML) redo log records are cached, during continuous replay of redo log records on a secondary system, for the same table partition to enable reuse of such cached objects. Later, these cached objects can be reused as they are accessed during the sequential processing of DML redo records for a specific table partition.
Type:
Grant
Filed:
March 7, 2017
Date of Patent:
September 8, 2020
Assignee:
SAP SE
Inventors:
Sarika Iyer, Vivek Kandiyanallur, Martin Heidel, Rolando Blanco, Stephan Kottler, Carsten Thiel, Peter Steinemann, Jörn Schmidt, Colin Florendo, Michael Muehle, Chaitanya Gottipati
Abstract: Consensus methods, systems, and apparatus, including computer programs encoded on computer storage media, are provided. One of the methods is to be implemented on a blockchain maintained by a number (N) of nodes. One of the nodes acts as a primary node and the other (N?1) nodes act as backup nodes. The method includes: detecting a change in a current height H of the blockchain; determining the primary node based on a parameter P and the number of nodes N, wherein the parameter P is proportional to the current height H of the blockchain; in response to determining that the one of the N nodes is the primary node, executing a PBFT normal operation protocol as the primary node; and in response to determining that the one of the N nodes is not the primary node, executing the PBFT normal operation protocol as one of the backup nodes.