Patents by Inventor Mehul A. Shah
Mehul A. Shah has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12225092
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
Type: Grant
Filed: May 1, 2023
Date of Patent: February 11, 2025
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Supratik Chakraborty, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Patent number: 12118395
Abstract: Techniques for self-tuning an analytics system via observed execution optimization are described. Upon a need for execution resources, a resource manager can select a type of executor from multiple candidate executor types based at least in part on one or more of current execution data associated with the execution of tasks of a user application and/or historic execution data associated with one or more other applications. The current execution data may include event log data originated by the driver application based on the execution of the user application and/or metric data describing characteristics of one or more worker nodes involved with executing the user application or characteristics of one or more other executors implemented by the one or more worker nodes in executing the user application.
Type: Grant
Filed: December 9, 2020
Date of Patent: October 15, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Anthony A. Virtuoso, Santosh Chandrachood, Bijay Singh Bisht, Mehul A. Shah
-
Patent number: 12056516
Abstract: Techniques for a distributed data processing application service in a cloud provider network are described. A virtual machine bundle is obtained, the virtual machine bundle including a copy of a memory and one or more registers of a paused virtual machine executing a distributed data processing application runtime. A request to launch a virtual machine to execute a distributed data processing application runtime is received, the request to launch the virtual machine including a bundle identifier that identifies the virtual machine bundle. The virtual machine is launched based at least in part on the virtual machine bundle. The virtual machine sends a message to register the virtual machine with a cluster of virtual machines.
Type: Grant
Filed: September 30, 2021
Date of Patent: August 6, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Anthony A. Virtuoso, Eric Mills, Bijay Singh Bisht, Santosh Chandrachood, Mehul A. Shah, Wesley Blumenthal, Linchi Zhang, Rahul Pathak, Matt Bachman
-
Patent number: 11941017
Abstract: Extract, Transform, Load (ETL) processing may be initiated by detected events. A trigger event may be associated with an ETL process that applies one or more transformations to a source data object. The trigger event may be detected for the ETL process and evaluated with respect to one or more execution conditions for the ETL process. If the execution conditions for the ETL process are satisfied, then the ETL process may be executed. At least some of the source data object may be obtained, the one or more transformations of the ETL process may be applied, and one or more transformed data objects may be stored.
Type: Grant
Filed: October 21, 2022
Date of Patent: March 26, 2024
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
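The flow in this abstract (detect a trigger event, check execution conditions, then apply transformations and store the results) can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the patented method: `EtlProcess`, `handle_event`, the event fields `type` and `size_bytes`, and the in-memory `source` and `sink` lists are all hypothetical stand-ins.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Record = Dict[str, str]


@dataclass
class EtlProcess:
    """Hypothetical ETL process definition used only for this sketch."""
    trigger: str                                       # event type that starts the process
    conditions: List[Callable[[dict], bool]]           # execution conditions on the event
    transformations: List[Callable[[Record], Record]]  # applied in order to each record


def handle_event(event: dict, process: EtlProcess,
                 source: List[Record], sink: List[Record]) -> bool:
    """Run the process if the event matches its trigger and all conditions hold."""
    if event.get("type") != process.trigger:
        return False
    if not all(condition(event) for condition in process.conditions):
        return False
    for record in source:
        for transform in process.transformations:
            record = transform(record)
        sink.append(record)  # store the transformed data object
    return True


process = EtlProcess(
    trigger="object_created",
    conditions=[lambda e: e.get("size_bytes", 0) > 0],
    transformations=[lambda r: {**r, "name": r["name"].upper()}],
)
source = [{"name": "alice"}, {"name": "bob"}]
sink: List[Record] = []
ran = handle_event({"type": "object_created", "size_bytes": 10}, process, source, sink)
```

Keeping the conditions separate from the trigger means the same event type can start some processes and be ignored by others.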
-
Using specified performance attributes to configure machine learning pipeline stages for an ETL job
Patent number: 11941016
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
Type: Grant
Filed: March 4, 2022
Date of Patent: March 26, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle
-
Patent number: 11893044
Abstract: Recognizing unknown data objects may be implemented for data objects stored in a data store. Data objects that are identified as unknown may be accessed to retrieve a portion of the data object. Different representations of the data object may be generated for recognizing different data schemas. An analysis of the representations may be performed to identify a data schema for the unknown data object. The data schema may be stored in a metadata store for the unknown data object.
Type: Grant
Filed: April 10, 2020
Date of Patent: February 6, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Mehul A. Shah, George Steven McPherson, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Patent number: 11886422
Abstract: A protocol for implementing ACID transactions that provides snapshot isolation in a distributed setting that does not require synchronized clocks is described. The protocol ensures at commit time that transactions touching common objects do not commit out of order. The protocol can be used in the context of a distributed data lake built on an object store in which clients can transactionally add or remove objects from logical tables.
Type: Grant
Filed: March 29, 2021
Date of Patent: January 30, 2024
Assignee: Amazon Technologies, Inc.
Inventors: Bohou Li, Vijayan Prabhakaran, Mehul A. Shah, Benjamin Sowell, Douglas Brian Terry
-
Publication number: 20230362265
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
Type: Application
Filed: May 1, 2023
Publication date: November 9, 2023
Applicant: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Supratik Chakraborty, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Patent number: 11797558
Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
Type: Grant
Filed: October 1, 2021
Date of Patent: October 24, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Mehul A. Shah, George Steven McPherson, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta, Benjamin Albert Sowell, Bohou Li
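To make the compare-then-generate idea in this abstract concrete, here is a minimal Python sketch. It assumes schemas are flat mappings from column name to a Python built-in type name, which is a simplification chosen for illustration; `plan_transformations` and `generate_code` are hypothetical helpers and do not reflect how the patented system represents schemas or emits code.

```python
from typing import Dict, List

Schema = Dict[str, str]  # column name -> Python built-in type name (assumed format)


def plan_transformations(source: Schema, target: Schema) -> List[str]:
    """Compare the schemas and return one line of code per needed transformation."""
    steps: List[str] = []
    for column, target_type in target.items():
        if column not in source:
            steps.append(f"row[{column!r}] = None  # column missing in source")
        elif source[column] != target_type:
            steps.append(f"row[{column!r}] = {target_type}(row[{column!r}])")
    for column in source:
        if column not in target:
            steps.append(f"del row[{column!r}]")
    return steps


def generate_code(steps: List[str]) -> str:
    """Wrap the planned steps in the source text of a transform(row) function."""
    body = "\n".join(f"    {step}" for step in steps) or "    pass"
    return f"def transform(row):\n    row = dict(row)\n{body}\n    return row\n"


source_schema = {"id": "str", "price": "str", "debug": "str"}
target_schema = {"id": "int", "price": "float", "currency": "str"}
code = generate_code(plan_transformations(source_schema, target_schema))

# The generated text could be stored and edited; here it is executed directly.
namespace: dict = {}
exec(code, namespace)
result = namespace["transform"]({"id": "7", "price": "9.5", "debug": "x"})
```

Emitting source text rather than running the transformations directly is what allows the generated code to be stored and modified later, as the abstract describes.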
-
Patent number: 11714573
Abstract: Techniques for storage optimization in a distributed object store are described. A storage optimization service of a provider network monitors changes to data objects in a distributed object store that are part of a data lake and are referenced by a table index. The storage optimization service determines whether particular storage optimizations involving the data objects would be beneficial and prioritizes the ordering of these optimizations with a focus on performing impactful optimizations first, while intelligently scheduling the optimizations to avoid overutilization of available resources.
Type: Grant
Filed: March 29, 2021
Date of Patent: August 1, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Shashank Bhardwaj, Roman Gavrilov, Brian Scott Ross, Mehul A. Shah, Benjamin Sowell, Anthony A. Virtuoso, Linan Zheng
-
Patent number: 11709809
Abstract: Techniques for using tree data structures to maintain a transactionally consistent set with support for time-travel queries are described. When a transaction commits, a new version of the tree data structure is created using a copy-on-write based method such that the tree shares internal nodes with previous trees to save space. This approach may be used in the implementation of a transactional data catalog in which the files that make up a table are stored in a transactional set.
Type: Grant
Filed: March 29, 2021
Date of Patent: July 25, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Bohou Li, Vijayan Prabhakaran, Mehul A. Shah, Benjamin Sowell, Douglas Brian Terry
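The copy-on-write idea in this abstract can be illustrated with a small persistent binary search tree in Python. This is a sketch under assumptions, not the patented data structure: the tree is unbalanced, versions are numbered by commit order rather than timestamped, and `Node`, `insert`, `contains`, and `VersionedSet` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Node:
    key: str
    left: Optional["Node"] = None
    right: Optional["Node"] = None


def insert(root: Optional[Node], key: str) -> Node:
    """Return a new root containing key; only nodes on the search path are copied."""
    if root is None:
        return Node(key)
    if key < root.key:
        return Node(root.key, insert(root.left, key), root.right)
    if key > root.key:
        return Node(root.key, root.left, insert(root.right, key))
    return root  # key already present: reuse the existing subtree


def contains(root: Optional[Node], key: str) -> bool:
    while root is not None:
        if key == root.key:
            return True
        root = root.left if key < root.key else root.right
    return False


class VersionedSet:
    """Keeps one immutable root per committed transaction."""

    def __init__(self) -> None:
        self.versions: List[Optional[Node]] = [None]  # version 0 is the empty set

    def commit(self, keys: List[str]) -> int:
        """Add keys as one transaction and return the new version number."""
        root = self.versions[-1]
        for key in keys:
            root = insert(root, key)
        self.versions.append(root)
        return len(self.versions) - 1

    def contains_at(self, version: int, key: str) -> bool:
        """Time-travel query: was key in the set as of the given version?"""
        return contains(self.versions[version], key)


files = VersionedSet()
v1 = files.commit(["part-b.parquet", "part-a.parquet"])
v2 = files.commit(["part-c.parquet"])
```

After the second commit, version 2 has a new root and a new right child, while its left subtree is the same object that version 1 uses. Older versions stay readable without copying the whole tree.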
-
Patent number: 11704331
Abstract: Dynamic generation of data catalogs may be implemented for accessing data sets in different storage locations. Data sets may be accessed in order to extract portions of data. Structure recognition techniques may be applied to the extracted data in order to determine structural information for the data sets. The structural information may then be stored as part of a data catalog for the data sets. Requests to access the data catalog from different clients may be received and the requested structural data supplied so that the clients may access different data sets utilizing the supplied structural data. Data catalogs may be updated as changes to data sets are made.
Type: Grant
Filed: July 10, 2020
Date of Patent: July 18, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Andrew Edward Caldwell, Anurag Windlass Gupta, Mehul A. Shah, Prajakta Datta Damle, George Steven McPherson
-
Patent number: 11695840
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
Type: Grant
Filed: March 14, 2022
Date of Patent: July 4, 2023
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Supratik Chakraborty, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Publication number: 20230169086
Abstract: Extract, Transform, Load (ETL) processing may be initiated by detected events. A trigger event may be associated with an ETL process that applies one or more transformations to a source data object. The trigger event may be detected for the ETL process and evaluated with respect to one or more execution conditions for the ETL process. If the execution conditions for the ETL process are satisfied, then the ETL process may be executed. At least some of the source data object may be obtained, the one or more transformations of the ETL process may be applied, and one or more transformed data objects may be stored.
Type: Application
Filed: October 21, 2022
Publication date: June 1, 2023
Applicant: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Publication number: 20230099186
Abstract: Techniques for a distributed data processing application service in a cloud provider network are described. A virtual machine bundle is obtained, the virtual machine bundle including a copy of a memory and one or more registers of a paused virtual machine executing a distributed data processing application runtime. A request to launch a virtual machine to execute a distributed data processing application runtime is received, the request to launch the virtual machine including a bundle identifier that identifies the virtual machine bundle. The virtual machine is launched based at least in part on the virtual machine bundle. The virtual machine sends a message to register the virtual machine with a cluster of virtual machines.
Type: Application
Filed: September 30, 2021
Publication date: March 30, 2023
Inventors: Anthony A. Virtuoso, Eric Mills, Bijay Singh Bisht, Santosh Chandrachood, Mehul A. Shah, Wesley Blumenthal, Linchi Zhang, Rahul Pathak, Matt Bachman
-
Patent number: 11599514
Abstract: Techniques for implementing systems using transactional version sets are described. Transactional version sets, or t-sets, include a collection of elements, each having a collection of metadata. A t-set is transactional in that a sequence of updates to one or more t-sets is made within an atomic transaction. A t-set is versioned since each committed transaction that updates it produces a new timestamped version that can be accessed via time-travel queries.
Type: Grant
Filed: March 29, 2021
Date of Patent: March 7, 2023
Assignee: Amazon Technologies, Inc.
Inventors: Vinay Agrawal, Bohou Li, Vikas Malik, Tushar Poddar, Vijayan Prabhakaran, Mukesh Punhani, Mehul A. Shah, Benjamin Sowell, Douglas Brian Terry
-
Patent number: 11481408
Abstract: Extract, Transform, Load (ETL) processing may be initiated by detected events. A trigger event may be associated with an ETL process that applies one or more transformations to a source data object. The trigger event may be detected for the ETL process and evaluated with respect to one or more execution conditions for the ETL process. If the execution conditions for the ETL process are satisfied, then the ETL process may be executed. At least some of the source data object may be obtained, the one or more transformations of the ETL process may be applied, and one or more transformed data objects may be stored.
Type: Grant
Filed: December 20, 2016
Date of Patent: October 25, 2022
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Publication number: 20220279049
Abstract: Code may be dynamically routed to computing resources for execution. Code may be received for execution on behalf of a client. Execution criteria for the code may be determined and computing resources that satisfy the execution criteria may be identified. The identified computing resources may then be procured for executing the code and then the code may be routed to the procured computing resources for execution. Permissions or authorization to execute the code may be shared to ensure that computing resources executing the code have the same permissions or authorization when executing the code.
Type: Application
Filed: March 14, 2022
Publication date: September 1, 2022
Applicant: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Supratik Chakraborty, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Patent number: 11423041
Abstract: History for data objects may be maintained to detect data events. An indication of an Extract, Transform, Load (ETL) process applied to one or more source data objects to generate one or more transformed data objects may be received. History for the source data objects may be updated to include the transformed data objects and the ETL process that generated the transformed data objects. An evaluation of the update may be performed to determine whether an event associated with the data lineage is triggered. If the event is triggered, a notification of the event may be sent to one or more subscribers for the event.
Type: Grant
Filed: January 24, 2020
Date of Patent: August 23, 2022
Assignee: Amazon Technologies, Inc.
Inventors: George Steven McPherson, Mehul A. Shah, Prajakta Datta Damle, Gopinath Duddi, Anurag Windlass Gupta
-
Using specified performance attributes to configure machine learning pipeline stages for an ETL job
Publication number: 20220261413
Abstract: Specified performance attributes may be used to configure machine learning transformations for ETL jobs. Performance attributes for a machine learning pipeline that applies a model as part of a transformation for an ETL job may be used to configure a parameter in a stage of the machine learning pipeline. The configured stage may then be used when training the model. The trained machine learning pipeline may then be applied as part of a transformation operation included in an ETL job performed by the ETL system.
Type: Application
Filed: March 4, 2022
Publication date: August 18, 2022
Applicant: Amazon Technologies, Inc.
Inventors: Timothy Jones, Andrew Borthwick, Sergei Dobroshinsky, Shehzad Qureshi, Stephen Michael Ash, Pedrito Uriah Maynard-Zhang, Chethan Kommaranahalli Rudramuni, Abhishek Sharma, Juliana Saussy, Adam Lawrence Joseph Heinermann, Alaykumar Navinchandra Desai, Mehul A. Shah, Mehul Y. Shah, Anurag Windlass Gupta, Prajakta Datta Damle