Patents by Inventor Frederick Ryan Johnson
Frederick Ryan Johnson has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20250363096Abstract: A data processing service uses minor compactions for committing transactions to a data table. The service may receive requests to commit transactions to a data table and write metadata for the transactions to log files, and generate a checkpoint file aggregating the transactions described in the log files to compute a data table state at a first time. The service may receive requests to commit a set of transactions and write metadata for the set of transactions to a set of log files. The service may determine that a number of log files in the set of log files reaches a threshold commit number, generate a minor compaction file aggregating the set of transactions, and generate a second checkpoint file aggregating the data table state at the first time with information from the minor compaction file to compute the data table state at a second time.Type: ApplicationFiled: August 4, 2025Publication date: November 27, 2025Inventors: Frederick Ryan Johnson, Prakhar Jain
-
Patent number: 12475127Abstract: Efficient superblock data structures are implemented for a data set. A superblock data structure, a log to commit changes to a superblock data structure, and an auxiliary data structure may be updated when an update to a data set is performed. The update may update respective location headers in the auxiliary data structure and the superblock data structure.Type: GrantFiled: September 26, 2024Date of Patent: November 18, 2025Assignee: Amazon Technologies, Inc.Inventors: Aditya Subrahmanyan, Gokul Soundararajan, Sriram Subramanian, Venu Gopal Nayar, Frederick Ryan Johnson, Naga Raju Bhanoori, Hebatalla Mohamed Mohamed Aly Eldakiky
-
Patent number: 12405943Abstract: A data processing service uses minor compactions for committing transactions to a data table. The service may receive requests to commit transactions to a data table and write metadata for the transactions to log files, and generate a checkpoint file aggregating the transactions described in the log files to compute a data table state at a first time. The service may receive requests to commit a set of transactions and write metadata for the set of transactions to a set of log files. The service may determine that a number of log files in the set of log files reaches a threshold commit number, generate a minor compaction file aggregating the set of transactions, and generate a second checkpoint file aggregating the data table state at the first time with information from the minor compaction file to compute the data table state at a second time.Type: GrantFiled: January 17, 2024Date of Patent: September 2, 2025Assignee: Databricks, Inc.Inventors: Frederick Ryan Johnson, Prakhar Jain
-
Patent number: 12405920Abstract: A data processing service generates a data classifier tree for managing data files of a data table. The data classifier tree may be configured as a KD-classifier tree and includes a plurality of nodes and edges. A node of the data classifier tree may represent a splitting condition with respect to key-values for a respective key. A node of the data classifier tree may be associated with one or more data files assigned to the node. The data files assigned to the node each include a subset of records having key-values that satisfy the conditions represented by the node and parent nodes of the node. The data processing service may efficiently cluster the data in the data table while reducing the number of data files that are rewritten when data is modified or added to the data table.Type: GrantFiled: July 5, 2023Date of Patent: September 2, 2025Assignee: Databricks, Inc.Inventors: Prakhar Jain, Frederick Ryan Johnson, Terry Kim, Vijayan Prabhakaran, Bart Samwel
-
Publication number: 20250231930Abstract: A data processing service uses minor compactions for committing transactions to a data table. The service may receive requests to commit transactions to a data table and write metadata for the transactions to log files, and generate a checkpoint file aggregating the transactions described in the log files to compute a data table state at a first time. The service may receive requests to commit a set of transactions and write metadata for the set of transactions to a set of log files. The service may determine that a number of log files in the set of log files reaches a threshold commit number, generate a minor compaction file aggregating the set of transactions, and generate a second checkpoint file aggregating the data table state at the first time with information from the minor compaction file to compute the data table state at a second time.Type: ApplicationFiled: January 17, 2024Publication date: July 17, 2025Inventors: Frederick Ryan Johnson, Prakhar Jain
-
Patent number: 12332862Abstract: A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.Type: GrantFiled: July 6, 2023Date of Patent: June 17, 2025Assignee: Databricks, Inc.Inventors: Prakhar Jain, Frederick Ryan Johnson, Bart Samwel
-
Publication number: 20250013606Abstract: A data processing service generates a data classifier tree for managing data files of a data table. The data classifier tree may be configured as a KD-classifier tree and includes a plurality of nodes and edges. A node of the data classifier tree may represent a splitting condition with respect to key-values for a respective key. A node of the data classifier tree may be associated with one or more data files assigned to the node. The data files assigned to the node each include a subset of records having key-values that satisfy the conditions represented by the node and parent nodes of the node. The data processing service may efficiently cluster the data in the data table while reducing the number of data files that are rewritten when data is modified or added to the data table.Type: ApplicationFiled: July 5, 2023Publication date: January 9, 2025Inventors: Prakhar Jain, Frederick Ryan Johnson, Terry Kim, Vijayan Prabhakaran, Bart Samwel
-
Publication number: 20250013619Abstract: A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.Type: ApplicationFiled: July 6, 2023Publication date: January 9, 2025Inventors: Prakhar Jain, Frederick Ryan Johnson, Bart Samwel
-
Patent number: 12072863Abstract: A data tree for managing data files of a data table and performing one or more transaction operations to the data table is described. The data tree is configured as a KD-epsilon tree and includes a plurality of nodes and edges. A node of the data tree may represent a splitting condition with respect to key-values for a respective key. A leaf node of the data tree may correspond to a data file for a data table that includes a subset of records having key-values that satisfy the condition for the node and conditions associated with parent nodes of the node. A parent node may correspond to a file including a buffer that stores changes to data files reachable by this parent node, and also includes dedicated storage to pointers of the child nodes. By using the data tree, the data processing system may efficiently cluster the data in the data table while reducing the number of data files that are rewritten.Type: GrantFiled: July 5, 2023Date of Patent: August 27, 2024Assignee: Databricks, Inc.Inventors: Prakhar Jain, Frederick Ryan Johnson, Bart Samwel
-
Patent number: 12013856Abstract: Burst performance of a database query may be determined according to a size of the database query. A query to a database may be received. A size may be determined for the query. If the size is less than a size threshold assigned to a first query engine, then the query may be performed at the first query engine. If the size is greater than or equal to the size threshold assigned to the first query engine, then the query may be performed at a second query engine.Type: GrantFiled: August 13, 2018Date of Patent: June 18, 2024Assignee: Amazon Technologies, Inc.Inventors: Bhaven Avalani, Andrew Edward Caldwell, Naresh Chainani, Martin Grund, Anurag Windlass Gupta, Frederick Ryan Johnson, Ippokratis Pandis, Michail Petropoulos, Srividhya Srinivasan
-
Publication number: 20200050694Abstract: Burst performance of a database query may be determined according to a size of the database query. A query to a database may be received. A size may be determined for the query. If the size is less than a size threshold assigned to a first query engine, then the query may be performed at the first query engine. If the size is greater than or equal to the size threshold assigned to the first query engine, then the query may be performed at a second query engine.Type: ApplicationFiled: August 13, 2018Publication date: February 13, 2020Applicant: Amazon Technologies, Inc.Inventors: Bhaven Avalani, Andrew Edward Caldwell, Naresh Chainani, Martin Grund, Anurag Windlass Gupta, Frederick Ryan Johnson, Ippokratis Pandis, Michail Petropoulos, Srividhya Srinivasan