Patents by Inventor Frederick Ryan Johnson

Frederick Ryan Johnson has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Generating Minor Compactions to Capture Aggregated Actions for Commit Ranges to Data Files

Publication number: 20250363096

Abstract: A data processing service uses minor compactions for committing transactions to a data table. The service may receive requests to commit transactions to a data table and write metadata for the transactions to log files, and generate a checkpoint file aggregating the transactions described in the log files to compute a data table state at a first time. The service may receive requests to commit a set of transactions and write metadata for the set of transactions to a set of log files. The service may determine that a number of log files in the set of log files reaches a threshold commit number, generate a minor compaction file aggregating the set of transactions, and generate a second checkpoint file aggregating the data table state at the first time with information from the minor compaction file to compute the data table state at a second time.
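The flow in this abstract can be illustrated with a minimal sketch. It is not the patented implementation: the class and function names, the file names, the threshold of 3, and the choice to model the table state as a set of live data files are all assumptions made for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Commit:
    """Metadata for one committed transaction (one log file). Hypothetical shape."""
    version: int
    added: frozenset = frozenset()
    removed: frozenset = frozenset()


@dataclass(frozen=True)
class MinorCompaction:
    """Net actions for the inclusive commit range [start, end]."""
    start: int
    end: int
    added: frozenset
    removed: frozenset


def compact(commits):
    """Fold consecutive commits into their net add/remove sets."""
    added, removed = set(), set()
    for c in commits:
        # Removing a file that was added earlier in the range cancels out.
        removed |= c.removed - added
        added = (added - c.removed) | c.added
    return MinorCompaction(commits[0].version, commits[-1].version,
                           frozenset(added), frozenset(removed))


def apply_compaction(state, compaction):
    """Combine an earlier table state with a minor compaction."""
    return (state - compaction.removed) | compaction.added


THRESHOLD_COMMITS = 3  # assumed value; the abstract gives no number

# Table state at the first time, as recorded by the first checkpoint.
first_checkpoint = frozenset({"a.parquet", "b.parquet"})
log = [
    Commit(1, added=frozenset({"c.parquet"})),
    Commit(2, removed=frozenset({"a.parquet"})),
    Commit(3, added=frozenset({"d.parquet"}), removed=frozenset({"c.parquet"})),
]

if len(log) >= THRESHOLD_COMMITS:
    minor = compact(log)
    # Table state at the second time, as recorded by the second checkpoint.
    second_checkpoint = apply_compaction(first_checkpoint, minor)
```

In this sketch the second checkpoint is computed from the first checkpoint plus one compaction file, rather than by replaying the three log files individually.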

Type: Application

Filed: August 4, 2025

Publication date: November 27, 2025

Inventors: Frederick Ryan Johnson, Prakhar Jain

Efficient superblock data structures for database systems

Patent number: 12475127

Abstract: Efficient superblock data structures are implemented for a data set. A superblock data structure, a log to commit changes to a superblock data structure, and an auxiliary data structure may be updated when an update to a data set is performed. The update may update respective location headers in the auxiliary data structure and the superblock data structure.

Type: Grant

Filed: September 26, 2024

Date of Patent: November 18, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Aditya Subrahmanyan, Gokul Soundararajan, Sriram Subramanian, Venu Gopal Nayar, Frederick Ryan Johnson, Naga Raju Bhanoori, Hebatalla Mohamed Mohamed Aly Eldakiky

Generating minor compactions to capture aggregated actions for commit ranges to data files

Patent number: 12405943

Abstract: A data processing service uses minor compactions for committing transactions to a data table. The service may receive requests to commit transactions to a data table and write metadata for the transactions to log files, and generate a checkpoint file aggregating the transactions described in the log files to compute a data table state at a first time. The service may receive requests to commit a set of transactions and write metadata for the set of transactions to a set of log files. The service may determine that a number of log files in the set of log files reaches a threshold commit number, generate a minor compaction file aggregating the set of transactions, and generate a second checkpoint file aggregating the data table state at the first time with information from the minor compaction file to compute the data table state at a second time.

Type: Grant

Filed: January 17, 2024

Date of Patent: September 2, 2025

Assignee: Databricks, Inc.

Inventors: Frederick Ryan Johnson, Prakhar Jain

Data file clustering with KD-classifier trees

Patent number: 12405920

Abstract: A data processing service generates a data classifier tree for managing data files of a data table. The data classifier tree may be configured as a KD-classifier tree and includes a plurality of nodes and edges. A node of the data classifier tree may represent a splitting condition with respect to key-values for a respective key. A node of the data classifier tree may be associated with one or more data files assigned to the node. The data files assigned to the node each include a subset of records having key-values that satisfy the conditions represented by the node and parent nodes of the node. The data processing service may efficiently cluster the data in the data table while reducing the number of data files that are rewritten when data is modified or added to the data table.
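One way to picture the structure described here is the sketch below. It is an illustration under assumptions, not the claimed method: the `Node` layout, the per-file min/max ranges, the rule that a file straddling a split stays at that node, and the names `assign` and `candidates` are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Node:
    key: Optional[str] = None        # key the node splits on; None for a leaf
    split: Optional[float] = None    # records with key-value < split go left
    left: Optional["Node"] = None
    right: Optional["Node"] = None
    files: list = field(default_factory=list)  # data files assigned to this node


def assign(node, name, ranges):
    """Attach a file to the deepest node whose conditions all its records satisfy.

    `ranges` maps each key to the (min, max) key-values found in the file.
    """
    while node.key is not None:
        lo, hi = ranges[node.key]
        if hi < node.split:
            node = node.left
        elif lo >= node.split:
            node = node.right
        else:
            break  # the file straddles this split, so it stays here
    node.files.append(name)
    return node


def candidates(node, query):
    """List files that may hold records in `query` (key -> inclusive (min, max))."""
    if node is None:
        return []
    out = list(node.files)
    if node.key is None:
        return out
    lo, hi = query.get(node.key, (float("-inf"), float("inf")))
    if lo < node.split:
        out += candidates(node.left, query)
    if hi >= node.split:
        out += candidates(node.right, query)
    return out


# Root splits on x at 10; its left child splits on y at 5.
root = Node("x", 10, Node("y", 5, Node(), Node()), Node())
assign(root, "f1", {"x": (0, 4), "y": (0, 3)})
assign(root, "f2", {"x": (2, 8), "y": (1, 9)})
assign(root, "f3", {"x": (12, 20), "y": (0, 9)})
assign(root, "f4", {"x": (5, 15), "y": (0, 1)})

right_side = candidates(root, {"x": (11, 30)})
```

In this sketch a new file is attached to a node without touching files already in the tree, which is one plausible reading of how fewer files would need rewriting when data is added.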

Type: Grant

Filed: July 5, 2023

Date of Patent: September 2, 2025

Assignee: Databricks, Inc.

Inventors: Prakhar Jain, Frederick Ryan Johnson, Terry Kim, Vijayan Prabhakaran, Bart Samwel

GENERATING MINOR COMPACTIONS TO CAPTURE AGGREGATED ACTIONS FOR COMMIT RANGES TO DATA FILES

Publication number: 20250231930

Abstract: A data processing service uses minor compactions for committing transactions to a data table. The service may receive requests to commit transactions to a data table and write metadata for the transactions to log files, and generate a checkpoint file aggregating the transactions described in the log files to compute a data table state at a first time. The service may receive requests to commit a set of transactions and write metadata for the set of transactions to a set of log files. The service may determine that a number of log files in the set of log files reaches a threshold commit number, generate a minor compaction file aggregating the set of transactions, and generate a second checkpoint file aggregating the data table state at the first time with information from the minor compaction file to compute the data table state at a second time.

Type: Application

Filed: January 17, 2024

Publication date: July 17, 2025

Inventors: Frederick Ryan Johnson, Prakhar Jain

Data file clustering with KD-epsilon trees

Patent number: 12332862

Abstract: A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.
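The buffering idea in this abstract can be sketched roughly as follows. This is a simplified illustration, not the patented design: the `Leaf` and `Parent` classes, the buffer capacity of 2, the flush-on-overflow policy, and the `rewrites` counter are assumptions introduced for the example.

```python
from dataclasses import dataclass, field


@dataclass
class Leaf:
    """Stands in for one data file of the table."""
    records: list = field(default_factory=list)
    rewrites: int = 0  # how many times this data file was rewritten


@dataclass
class Parent:
    """Stands in for a file holding a change buffer and child pointers."""
    key: str
    split: float
    left: object
    right: object
    buffer: list = field(default_factory=list)


BUFFER_CAPACITY = 2  # assumed value; the abstract gives no number


def write(node, batch):
    """Apply a batch of new records at `node`, buffering at parent nodes."""
    if not batch:
        return
    if isinstance(node, Leaf):
        node.records.extend(batch)
        node.rewrites += 1  # one rewrite per flushed batch, not per record
        return
    node.buffer.extend(batch)
    if len(node.buffer) > BUFFER_CAPACITY:
        # Push buffered changes down to the children they belong to.
        pending, node.buffer = node.buffer, []
        write(node.left, [r for r in pending if r[node.key] < node.split])
        write(node.right, [r for r in pending if r[node.key] >= node.split])


def scan(node):
    """Return every visible record: buffered changes plus data file contents."""
    if isinstance(node, Leaf):
        return list(node.records)
    return node.buffer + scan(node.left) + scan(node.right)


root = Parent("x", 10, Leaf(), Leaf())
for value in (1, 12, 3, 4):
    write(root, [{"x": value}])

visible = sorted(r["x"] for r in scan(root))
```

Here four single-record writes cause each leaf to be rewritten only once, because the parent buffer absorbs changes until it overflows.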

Type: Grant

Filed: July 6, 2023

Date of Patent: June 17, 2025

Assignee: Databricks, Inc.

Inventors: Prakhar Jain, Frederick Ryan Johnson, Bart Samwel

DATA FILE CLUSTERING WITH KD-CLASSIFIER TREES

Publication number: 20250013606

Abstract: A data processing service generates a data classifier tree for managing data files of a data table. The data classifier tree may be configured as a KD-classifier tree and includes a plurality of nodes and edges. A node of the data classifier tree may represent a splitting condition with respect to key-values for a respective key. A node of the data classifier tree may be associated with one or more data files assigned to the node. The data files assigned to the node each include a subset of records having key-values that satisfy the conditions represented by the node and parent nodes of the node. The data processing service may efficiently cluster the data in the data table while reducing the number of data files that are rewritten when data is modified or added to the data table.

Type: Application

Filed: July 5, 2023

Publication date: January 9, 2025

Inventors: Prakhar Jain, Frederick Ryan Johnson, Terry Kim, Vijayan Prabhakaran, Bart Samwel

DATA FILE CLUSTERING WITH KD-EPSILON TREES

Publication number: 20250013619

Abstract: A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.

Type: Application

Filed: July 6, 2023

Publication date: January 9, 2025

Inventors: Prakhar Jain, Frederick Ryan Johnson, Bart Samwel

Data ingestion using data file clustering with KD-epsilon trees

Patent number: 12072863

Abstract: A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.

Type: Grant

Filed: July 5, 2023

Date of Patent: August 27, 2024

Assignee: Databricks, Inc.

Inventors: Prakhar Jain, Frederick Ryan Johnson, Bart Samwel

Burst performance of database queries according to query size

Patent number: 12013856

Abstract: Burst performance of a database query may be determined according to a size of the database query. A query to a database may be received. A size may be determined for the query. If the size is less than a size threshold assigned to a first query engine, then the query may be performed at the first query engine. If the size is greater than or equal to the size threshold assigned to the first query engine, then the query may be performed at a second query engine.
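The routing rule stated in this abstract can be written out in a few lines. The sketch below is only an illustration: the `Query` class, the `route` function, the engine labels, and the idea of measuring size as an estimated number of bytes scanned are assumptions, since the abstract does not say how size is determined.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Query:
    sql: str
    size: int  # hypothetical size measure, e.g. estimated bytes scanned


def route(query, size_threshold):
    """Choose an engine using the comparison described in the abstract."""
    if query.size < size_threshold:
        return "first"   # below the first engine's threshold
    return "second"      # greater than or equal to the threshold


SIZE_THRESHOLD = 1_000_000  # assumed value assigned to the first engine

small = route(Query("SELECT 1", 10), SIZE_THRESHOLD)
boundary = route(Query("SELECT * FROM t", 1_000_000), SIZE_THRESHOLD)
```

A query whose size equals the threshold goes to the second engine, matching the "greater than or equal to" wording.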

Type: Grant

Filed: August 13, 2018

Date of Patent: June 18, 2024

Assignee: Amazon Technologies, Inc.

Inventors: Bhaven Avalani, Andrew Edward Caldwell, Naresh Chainani, Martin Grund, Anurag Windlass Gupta, Frederick Ryan Johnson, Ippokratis Pandis, Michail Petropoulos, Srividhya Srinivasan

Burst Performance of Database Queries According to Query Size

Publication number: 20200050694

Abstract: Burst performance of a database query may be determined according to a size of the database query. A query to a database may be received. A size may be determined for the query. If the size is less than a size threshold assigned to a first query engine, then the query may be performed at the first query engine. If the size is greater than or equal to the size threshold assigned to the first query engine, then the query may be performed at a second query engine.

Type: Application

Filed: August 13, 2018

Publication date: February 13, 2020

Applicant: Amazon Technologies, Inc.

Inventors: Bhaven Avalani, Andrew Edward Caldwell, Naresh Chainani, Martin Grund, Anurag Windlass Gupta, Frederick Ryan Johnson, Ippokratis Pandis, Michail Petropoulos, Srividhya Srinivasan