Patents by Inventor ANURAG WINDLASS GUPTA
ANURAG WINDLASS GUPTA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11615083
Abstract: Storage level query processing may be implemented for processing database queries. Nodes that can access a database may perform parallel processing for at least a portion of a database query. An indication may be received that specifies parallel processing for the database query. The nodes can then be caused to perform the portion of the query as part of providing a result in response to the database query instead of a node, such as a query engine node, that received the database query.
Type: Grant
Filed: March 12, 2018
Date of Patent: March 28, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Gopi Krishna Attaluri, Dhruva Ranjan Chakrabarit, Volodymyr Verovkin, Kamal Kant Gupta, Shriram Sridharan, Aakash Shah, Aleksandr Valerevich Feinberg, Yuri Volobuev, Tengiz Kharatishvili, Saileshwar Krishnamurthy, Anurag Windlass Gupta, Murali Brahmadesam, Namrata Bapat, Alexandre Olegovich Verbitski, Jeffrey Davis, Debanjan Saha
-
Publication number: 20230092908
Abstract: A database system may include a database service and a separate distributed storage service. The database service (or a database engine head node thereof) may be responsible for query parsing, optimization, and execution, transactionality, and consistency, while the storage service may be responsible for generating data pages from redo log records and for durability of those data pages. For example, in response to a write request directed to a particular data page, the database engine head node may generate a redo log record and send it, but not the data page, to a storage service node. The storage service node may store the redo log record and return a write acknowledgement to the database service prior to applying the redo log record. The server node may apply the redo log record and other redo log records to a previously stored version of the data page to create a current version.
Type: Application
Filed: November 11, 2022
Publication date: March 23, 2023
Applicant: Amazon Technologies, Inc.
Inventors: Anurag Windlass Gupta, Neal Fachan, Samuel James McKelvie, Laurion Darrell Burchall, Christopher Richard Newcombe, Pradeep Jnana Madhavarapu, Benjamin Tobler, James McClellan Corey
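The write path in the abstract above (store the redo log record, acknowledge, apply later) can be illustrated with a minimal sketch. The class names, the `lsn` ordering field, and the fixed page size are assumptions made for illustration and are not taken from the patent text.

```python
from dataclasses import dataclass, field


@dataclass
class RedoRecord:
    page_id: int
    lsn: int       # log sequence number; ordering by LSN is an assumption of this sketch
    offset: int    # byte offset inside the page
    data: bytes    # bytes to write at that offset


@dataclass
class StorageNode:
    # page_id -> last materialized version of the page
    pages: dict = field(default_factory=dict)
    # page_id -> redo records stored but not yet applied
    pending: dict = field(default_factory=dict)

    def write(self, record: RedoRecord) -> str:
        # Store the log record and acknowledge before applying it.
        self.pending.setdefault(record.page_id, []).append(record)
        return "ack"

    def read_page(self, page_id: int, page_size: int = 16) -> bytes:
        # Apply stored redo records to the previous page version on demand.
        page = self.pages.setdefault(page_id, bytearray(page_size))
        for rec in sorted(self.pending.pop(page_id, []), key=lambda r: r.lsn):
            page[rec.offset:rec.offset + len(rec.data)] = rec.data
        return bytes(page)
```

Only the log record crosses from the head node to the storage node here; the page itself is built on the storage side the first time it is read.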
-
Patent number: 11567972
Abstract: A tree-based format may be implemented for data stored in a data store. A table may be maintained across one or multiple storage nodes in storage slabs. Storage slabs may be mapped to different nodes of a tree. Each node of the tree may be assigned a different range of distribution scheme values which identify what portions of the table are stored in the storage slab. Storage slabs mapped to child nodes in the tree may be assigned portions of the range of distribution scheme values assigned to a parent. Storage nodes may be added or removed for storing the table. Storage slabs may be moved from one storage node to another in order to accommodate the addition or removal of storage nodes.
Type: Grant
Filed: June 30, 2016
Date of Patent: January 31, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Anurag Windlass Gupta, Andrew Edward Caldwell, Stavros Harizopoulos, Michail Petropoulos, Ramakrishna Kotla, John Benjamin Tobler
-
Patent number: 11553046
Abstract: A service provider network implements seamless scaling via proxy replay of session state. Upon a trigger, such as a determination to scale a server, a scaled server may be spun up and an identifier of the scaled server provided to a first (existing) server. The first server sends the identification of the second server, and session state information for each of the connections between the first server and the request router, to the request router. For each of the connections, the request router establishes a new connection between the request router and the second (scaled) server, and replays the session state information for the connection to the second server. The request router then routes traffic between each existing client connection (e.g., the same existing client connection which carried traffic delivered to the first server) and the corresponding new connection to the second server.
Type: Grant
Filed: September 27, 2017
Date of Patent: January 10, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Alexey Kuznetsov, Sandor Loren Maurice, Alok Nandan Nikhil, Joseph Andrew Tucek, Alexandre Olegovich Verbitski, Lawrence Webley, Tengiz Kharatishvili, Bohan Liu, Mushahid Alam, Kamal Kant Gupta, Debanjan Saha, Anurag Windlass Gupta, Saileshwar Krishnamurthy
-
Patent number: 11500931
Abstract: Using a graph representation of join history may be performed to distribute database data. Join history may be collected, captured, or tracked which describes the history of join operations between columns of different tables in a database. A graph representation of the join history may be generated. The graph representation may indicate a likelihood of different joins that may be performed between the columns of the tables of a database. An evaluation of the join history may be performed to identify columns for tables in the database to distribute the data of the tables amongst multiple storage locations according to the identified columns.
Type: Grant
Filed: June 1, 2018
Date of Patent: November 15, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Panagiotis Parchas, Christos Faloutsos, Anurag Windlass Gupta, Stavros Harizopoulos, Michail Petropoulos
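As a rough illustration of the idea in the abstract above, join history can be folded into a weighted graph whose edges count how often two columns were joined. The abstract does not say how the graph is evaluated, so the selection rule below (pick, per table, the column with the highest total join count) is an assumed heuristic, and the function names are hypothetical.

```python
from collections import Counter


def build_join_graph(join_history):
    """Count joins per undirected edge between (table, column) pairs."""
    graph = Counter()
    for left, right in join_history:
        edge = tuple(sorted((left, right)))  # treat A-B and B-A as the same edge
        graph[edge] += 1
    return graph


def pick_distribution_columns(graph):
    """Assumed heuristic: per table, choose the most frequently joined column."""
    weight = Counter()
    for (a, b), count in graph.items():
        weight[a] += count
        weight[b] += count
    best = {}
    for (table, column), w in weight.items():
        if table not in best or w > best[table][1]:
            best[table] = (column, w)
    return {table: column for table, (column, _) in best.items()}
```

Rows of each table could then be placed by hashing the chosen column, so that frequently joined rows tend to land in the same storage location.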
-
Patent number: 11500852
Abstract: A database system may include a database service and a separate distributed storage service. The database service (or a database engine head node thereof) may be responsible for query parsing, optimization, and execution, transactionality, and consistency, while the storage service may be responsible for generating data pages from redo log records and for durability of those data pages. For example, in response to a write request directed to a particular data page, the database engine head node may generate a redo log record and send it, but not the data page, to a storage service node. The storage service node may store the redo log record and return a write acknowledgement to the database service prior to applying the redo log record. The server node may apply the redo log record and other redo log records to a previously stored version of the data page to create a current version.
Type: Grant
Filed: June 26, 2020
Date of Patent: November 15, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Anurag Windlass Gupta, Neal Fachan, Samuel James McKelvie, Laurion Darrell Burchall, Christopher Richard Newcombe, Pradeep Jnana Madhavarapu, Benjamin Tobler, James McClellan Corey
-
Publication number: 20220358106
Abstract: Proxy-based scaling may be performed for databases. A proxy may be implemented for a database that can establish a connection between the proxy and a database engine to perform database queries received from a client at the proxy. A scaling event may be detected for the database responsive to which the proxy may establish a connection with a new database engine which may, in some embodiments, have different capabilities or resources that address the features or criteria that triggered the scaling event. Session state may be copied from the database engine to the new database engine so that the new database engine may be able to provide access to the database on behalf of requests received from the client through the proxy.
Type: Application
Filed: July 22, 2022
Publication date: November 10, 2022
Applicant: Amazon Technologies, Inc.
Inventors: Saileshwar Krishnamurthy, Alexey Kuznetsov, Mushahid Alam, Anurag Windlass Gupta, Kamal Kant Gupta, Tengiz Kharatishvili, Bohan Liu, Sandor Loren Maurice, Alok Nandan Nikhil, Nicholas Piers O'Shea, Debanjan Saha, Yoni Shalom, Jagdeep Singh Sidhu, Hongbo Song, Alexandre Olegovich Verbitski, Lawrence Douglas Webley, Siyuan Xing, Yuhui Yuan, Yibo Zhu
-
Patent number: 11494339
Abstract: Data to be stored in a data block for a columnar database table may be compressed according to a multi-level compression scheme. Data to be stored in the data block may be received. The data may be compressed according to a column-specific compression technique to produce compressed data. The compressed data may then be compressed according to a second compression technique different than the column-specific compression technique to produce multi-level compressed data. The multi-level compressed data may be stored in the data block. When reading from the data block, multi-level compressed data may be decompressed according to the column-specific compression technique and the default compression technique applied to the data.
Type: Grant
Filed: July 6, 2018
Date of Patent: November 8, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Stefano Stefani, Anurag Windlass Gupta
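The two-level scheme in the abstract above can be sketched as follows. The abstract does not name the techniques, so run-length encoding as the column-specific level and zlib (DEFLATE) as the second level are assumptions chosen for illustration, as are the function names.

```python
import json
import zlib


def rle_encode(values):
    """Column-specific level (assumed): run-length encode repeated values."""
    runs = []
    for v in values:
        if runs and runs[-1][0] == v:
            runs[-1][1] += 1
        else:
            runs.append([v, 1])
    return runs


def rle_decode(runs):
    out = []
    for value, count in runs:
        out.extend([value] * count)
    return out


def compress_block(values):
    """Apply the column-specific level first, then the general-purpose level."""
    first_level = json.dumps(rle_encode(values)).encode("utf-8")
    return zlib.compress(first_level)


def decompress_block(block):
    """Undo the levels in reverse order when reading the block."""
    runs = json.loads(zlib.decompress(block).decode("utf-8"))
    return rle_decode(runs)
```

For a column such as `["us"] * 1000 + ["de"] * 500`, the first level reduces the data to two runs before the second level compresses the serialized result.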
-
Patent number: 11481408
Abstract: Extract, Transform, Load (ETL) processing may be initiated by detected events. A trigger event may be associated with an ETL process that applies one or more transformations to a source data object. The trigger event may be detected for the ETL process and evaluated with respect to one or more execution conditions for the ETL process. If the execution conditions for the ETL process are satisfied, then the ETL process may be executed. At least some of the source data object may be obtained, the one or more transformations of the ETL process may be applied, and one or more transformed data objects may be stored.
Type: Grant
Filed: December 20, 2016
Date of Patent: October 25, 2022
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
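The control flow in the abstract above (detect a trigger event, check execution conditions, then extract, transform, and store) can be sketched in a few lines. The function name, the callable-based conditions, and the list used as a sink are hypothetical choices for this sketch.

```python
def run_etl_on_event(event, conditions, transformations, source, sink):
    """Run the ETL process only if every execution condition accepts the event."""
    if not all(condition(event) for condition in conditions):
        return False                      # conditions not satisfied: do nothing
    records = source(event)               # obtain (part of) the source data object
    for transform in transformations:     # apply each transformation in order
        records = [transform(r) for r in records]
    sink.extend(records)                  # store the transformed data objects
    return True
```

A call such as `run_etl_on_event({"type": "object_created"}, [lambda e: e["type"] == "object_created"], [str.upper], lambda e: ["a", "b"], sink)` would append `"A"` and `"B"` to `sink`, while an event of another type would leave it untouched.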
-
Patent number: 11475038
Abstract: A distributed data warehouse system maintains data blocks on behalf of clients, and stores primary and secondary copies of data blocks on different disks or nodes in a cluster. The data warehouse system may back up data blocks in a key-value backup storage system. In response to a query targeting a data block previously stored in the cluster, the data warehouse system may determine whether a consistent, uncorrupted copy of the data block is available in the cluster (e.g., by applying a consistency check). If not (e.g., if a disk or node failed), the data warehouse system may automatically initiate an operation to restore the data block from the backup storage system, using a unique identifier of the data block to access a backup copy. The target data may be returned in a query response prior to restoring primary and secondary copies of the data block in the cluster.
Type: Grant
Filed: February 9, 2018
Date of Patent: October 18, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Deepak Agarwal, Anurag Windlass Gupta, Jakub Kulesza
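The read path in the abstract above can be illustrated with a small sketch in which dictionaries stand in for the primary copy, the secondary copy, and the key-value backup store. Using SHA-256 as the consistency check and returning the list of copies still to be repaired are assumptions of this sketch.

```python
import hashlib


def checksum(data: bytes) -> str:
    return hashlib.sha256(data).hexdigest()


def read_block(block_id, expected_sum, primary, secondary, backup):
    """Return (data, copies_to_restore) for one data block."""
    for store in (primary, secondary):
        data = store.get(block_id)
        if data is not None and checksum(data) == expected_sum:
            return data, []               # a consistent cluster copy exists
    # No good copy in the cluster: fetch the backup by the block's unique ID.
    data = backup[block_id]
    # The caller can answer the query now and repair the copies afterwards.
    return data, [primary, secondary]


def restore(block_id, data, stores):
    """Rewrite the block into each cluster copy that failed the check."""
    for store in stores:
        store[block_id] = data
```

Separating `read_block` from `restore` mirrors the ordering in the abstract: the query response does not wait for the primary and secondary copies to be rebuilt.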
-
Publication number: 20220279049
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
Type: Application
Filed: March 14, 2022
Publication date: September 1, 2022
Applicant: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Supratik Chakraborty, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Patent number: 11429630
Abstract: Tiered storage may be implemented for processing data. Data processors may maintain some of a data set, including user data and metadata describing the user data, locally. The data set is also maintained in a data store remote to the data processor. When processing requests are received, a determination is made as to whether the local portions of the data set can execute the processing request or one or more additional portions of the data set are needed from the remote data store. If additional portions of the data set are needed, then a request may be sent to the data store for the additional portions. Once received, the data processor may execute the processing request utilizing the additional portions. Portions of the data set maintained locally at the data processor may be selected and flushed from local storage to the remote data store.
Type: Grant
Filed: May 8, 2020
Date of Patent: August 30, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Anurag Windlass Gupta, Andrew Edward Caldwell
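A minimal sketch of the tiered arrangement described above: a data processor keeps some portions locally, fetches missing portions from a remote store when a request needs them, and flushes portions back when local storage is over capacity. The class name, the dictionary standing in for the remote store, and the oldest-first flush policy are assumptions; the abstract does not specify how portions are selected for flushing.

```python
class DataProcessor:
    def __init__(self, remote, capacity=2):
        self.remote = remote        # dict standing in for the remote data store
        self.local = {}             # locally held portions, in insertion order
        self.capacity = capacity    # maximum number of local portions

    def process(self, keys, fn):
        # Determine which requested portions are not available locally.
        missing = [k for k in keys if k not in self.local]
        for k in missing:
            self.local[k] = self.remote[k]   # request the additional portions
        result = fn([self.local[k] for k in keys])
        self._flush()
        return result

    def _flush(self):
        # Assumed policy: flush the oldest local portions first.
        while len(self.local) > self.capacity:
            oldest = next(iter(self.local))
            self.remote[oldest] = self.local.pop(oldest)
```

With `remote = {"a": 1, "b": 2, "c": 3}` and a capacity of 2, processing `["a", "b"]` and then `["c"]` leaves portions `"b"` and `"c"` local and writes `"a"` back to the remote store.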
-
Patent number: 11422982
Abstract: A stateful cluster may implement scaling of the stateful cluster while maintaining access to the state of the stateful cluster. A scaling event for a stateful cluster may be detected, and in response the stateful cluster may be adjusted to include a different number of nodes. The state of the cluster may then be logically distributed among the different number of nodes according to a monotone distribution scheme. The adjusted node may then service access requests according to the monotone distribution scheme. Prior to making the adjusted storage cluster available for servicing access requests, the nodes from the original cluster may still service access requests for state.
Type: Grant
Filed: February 22, 2019
Date of Patent: August 23, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Jakub Kulesza, Srividhya Srinivasan, Deepak Agarwal, Anurag Windlass Gupta
-
Patent number: 11423041
Abstract: History for data objects may be maintained to detect data events. An indication of an Extract, Transform, Load (ETL) process applied to one or more source data objects to generate one or more transformed data objects may be received. History for the source data objects may be updated to include the transformed data objects and the ETL process that generated the transformed data objects. An evaluation of the update may be performed to determine whether an event associated with the data lineage is triggered. If the event is triggered, a notification of the event may be sent to one or more subscribers for the event.
Type: Grant
Filed: January 24, 2020
Date of Patent: August 23, 2022
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
USING SPECIFIED PERFORMANCE ATTRIBUTES TO CONFIGURE MACHINE LEARNING PIPELINE STAGES FOR AN ETL JOB
Publication number: 20220261413
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
Type: Application
Filed: March 4, 2022
Publication date: August 18, 2022
Applicant: Amazon Technologies, Inc.
Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
-
Patent number: 11397711
Abstract: Proxy-based scaling may be performed for databases. A proxy may be implemented for a database that can establish a connection between the proxy and a database engine to perform database queries received from a client at the proxy. A scaling event may be detected for the database responsive to which the proxy may establish a connection with a new database engine which may, in some embodiments, have different capabilities or resources that address the features or criteria that triggered the scaling event. Session state may be copied from the database engine to the new database engine so that the new database engine may be able to provide access to the database on behalf of requests received from the client through the proxy.
Type: Grant
Filed: May 4, 2018
Date of Patent: July 26, 2022
Assignee: Amazon Technologies, Inc.
Inventors: Saileshwar Krishnamurthy, Alexey Kuznetsov, Mushahid Alam, Anurag Windlass Gupta, Kamal Kant Gupta, Tengiz Kharatishvili, Bohan Liu, Sandor Loren Maurice, Alok Nandan Nikhil, Nicholas Piers O'Shea, Debanjan Saha, Yoni Shalom, Jagdeep Singh Sidhu, Hongbo Song, Alexandre Olegovich Verbitski, Lawrence Douglas Webley, Siyuan Xing, Yuhui Yuan, Yibo Zhu
-
Publication number: 20220156289
Abstract: A multi-column index is generated based on an interleaving of data bits for selectivity for efficient processing of data in a relational database system. Two or more columns may be identified for inclusion in the multi-column index for a relational database table. Based, at least in part, on the interleaving of data bits for selectivity from the identified columns, a multi-column index is generated for the relational database table that provides a respective index value for each entry in the relational database table. The entries of the relational database table may then be stored according to the index values of the multi-column index.
Type: Application
Filed: September 24, 2021
Publication date: May 19, 2022
Applicant: Amazon Technologies, Inc.
Inventor: Anurag Windlass Gupta
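Bit interleaving across columns can be illustrated with a plain Z-order (Morton) key. This is a simplified stand-in: the abstract describes interleaving bits "for selectivity", and how bits are chosen or ordered for selectivity is not stated, so the fixed alternation below and the function names are assumptions.

```python
def interleave_bits(a: int, b: int, bits: int = 16) -> int:
    """Build one index value by alternating the bits of two column values."""
    key = 0
    for i in range(bits):
        key |= ((a >> i) & 1) << (2 * i + 1)   # bits of column a at odd positions
        key |= ((b >> i) & 1) << (2 * i)       # bits of column b at even positions
    return key


def sort_by_index(rows, bits: int = 16):
    """Order table entries by the interleaved index of their first two columns."""
    return sorted(rows, key=lambda r: interleave_bits(r[0], r[1], bits))
```

Because neither column dominates the sort order, range predicates on either column tend to touch fewer stored blocks than with a sort on a single leading column.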
-
Publication number: 20220114064
Abstract: Online restore operations for a database engine can be performed. A request to restore a database to a previous state can be received. Previously stored content of the database, such as a snapshot stored prior to the previous state, can be identified along with log records describing changes to be made to the content prior to the previous state. State information in a query engine can be updated based on the previously stored content and log records so that queries can be processed based on the state information at the database restored to the previous state.
Type: Application
Filed: December 22, 2021
Publication date: April 14, 2022
Applicant: Amazon Technologies, Inc.
Inventors: Anurag Windlass Gupta, Murali Brahmadesam, Changan Han, Alexandre Olegovich Verbitski, Xiaofeng Bao, Alisdair William Owens, Xiang Peng, Benjamin S. Ellis, Raman Mittal, Konstantin Dubinets, Tiffany Jianto, Venkatesh Nandakumar, Seungmin Wei
-
Publication number: 20220100774
Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
Type: Application
Filed: October 1, 2021
Publication date: March 31, 2022
Applicant: Amazon Technologies, Inc.
Inventors: Mehul A. Shah, George Steven McPherson, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta, Benjamin Albert Sowell, Bohou Li
-
Patent number: 11277494
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
Type: Grant
Filed: December 20, 2016
Date of Patent: March 15, 2022
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Supratik Chakraborty, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta