Patents by Inventor ANURAG WINDLASS GUPTA

ANURAG WINDLASS GUPTA has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10089191
    Abstract: Application program data stored in system memory may be selectively persisted. An indication may be provided to an application program that an application data object or a range of application data stored in system memory may be treated as persistent. Data backup may be enabled for the application data object or range of application data in the event of a system failure, copying the application data object or range of application data from system memory to non-volatile data storage. Upon recovery from a system failure, further data backup for the application data object or the range of application data may be disabled. In some embodiments, at least some of the application data object or range of application data may be recovered for the application program to access. Data backup for the application data object or the range of application data may also be re-enabled.
    Type: Grant
    Filed: November 23, 2015
    Date of Patent: October 2, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Samuel James McKelvie, Yan Valerie Leshinsky, Ian P. Nowland, Darin Lee Frink, Anurag Windlass Gupta, Adam Douglas Morley, Christopher Nathan Watson, Michael David Marr
  • Patent number: 10089220
    Abstract: Methods and apparatus for saving state information resulting from non-idempotent operations are described. A computer system includes a system memory coupled to one or more processors. The system memory comprises at least a non-volatile portion. Elements of state information associated with an executable component that are to be stored within the non-volatile portion are identified. In response to detecting an occurrence of a particular non-idempotent operation that results in the generation of state information, selected elements of information are stored in the non-volatile portion of the system memory. In response to a request subsequent to a failure event, wherein the failure event resulted in a loss of data stored in a volatile portion of the system memory, the state information is read from the non-volatile portion.
    Type: Grant
    Filed: November 1, 2013
    Date of Patent: October 2, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Samuel James McKelvie, Anurag Windlass Gupta
  • Patent number: 10061834
    Abstract: A data store may implement incremental out-of-place updates to a dataset. A dataset may maintain data across different storage locations linked together according to an ordering schema for servicing queries. As updates to the dataset are received, the updates may be persisted but not maintained in-place. In order to update the data store and maintain the ordering schema, incremental updates to the dataset may be performed without blocking queries directed toward the dataset. The dataset may be divided into multiple data chunks that correspond to different storage locations and an updated version of the data chunk may be generated in new storage locations. The new storage locations may then replace the storage locations of the prior version of the data chunk in order to link the new storage locations to the other linked storage locations in the dataset for servicing queries.
    Type: Grant
    Filed: October 31, 2014
    Date of Patent: August 28, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Jakub Kulesza, Bharath Kumar Chelepalli, Deepak Agarwal, Anurag Windlass Gupta
  • Patent number: 10049036
    Abstract: Methods and apparatus for reliable distributed messaging are described. A computer system includes a system memory coupled to one or more processors. The system memory comprises at least a non-volatile portion. A particular location within the non-volatile portion is designated as a target location to which a sender module participating in a communication protocol is granted write permission. A receiver module participating in the communication protocol, subsequent to a failure event that results in a loss of data stored in a volatile portion of the system memory, reads a data item written by the sender program at the target location prior to the failure event. The receiver module performs an operation based on contents of the data item.
    Type: Grant
    Filed: August 20, 2017
    Date of Patent: August 14, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Samuel James McKelvie, Anurag Windlass Gupta
  • Publication number: 20180225299
    Abstract: Data may be efficiently analyzed and compressed as part of a data compression service. A data compression request may be received from a client indicating data to be compressed. An analysis of the data or metadata associated with the data may be performed. In at least some embodiments, this analysis may be a rules-based analysis. Some embodiments may employ one or more machine learning techniques to historical compression data to update the rules-based analysis. One or more compression techniques may be selected out of a plurality of compression techniques to be applied to the data. Data compression candidates may then be generated according to the selected compression techniques. In some embodiments, a compression service restriction may be enforced. One of the data compression candidates may be selected and sent in a response.
    Type: Application
    Filed: April 2, 2018
    Publication date: August 9, 2018
    Applicant: Amazon Technologies, Inc.
    Inventor: Anurag Windlass Gupta
  • Patent number: 10031813
    Abstract: A database system may maintain a plurality of log records at a distributed storage system. Each of the plurality of log records may be associated with a respective change to a data page. The plurality of log records may be transformed (e.g., cropped, prune, reduce, fused, deleted, merged, added, etc.).
    Type: Grant
    Filed: November 21, 2016
    Date of Patent: July 24, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Pradeep Jnana Madhavarapu, Neal Fachan, Anurag Windlass Gupta, Samuel James McKelvie
  • Patent number: 10019457
    Abstract: Data to be stored in a data block for a columnar database table may be compressed according to a multi-level compression scheme. Data to be stored in the data block may be received. The data may be compressed according a column-specific compression technique to produce compressed data. The compressed data may then be compressed according to a second compression technique different than the column-specific compression technique to produce multi-level compressed data. The multi-level compressed data may be stored in the data block. When reading from the data block, multi-level compressed data may be decompressed according to the column-specific compression technique and the default compression technique applied to the data.
    Type: Grant
    Filed: January 22, 2013
    Date of Patent: July 10, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Stefano Stefani, Anurag Windlass Gupta
  • Patent number: 10013440
    Abstract: Incremental, out-of-place updates may be implemented for index structures maintained for data stores. Portions of the index structure may be selected for updating, and an updated version of the portion of the index structure generated in another storage location different than a current storage location for the index structure such that the index structure may be searched in order to perform access requests. Updating the portion of the index structure may include compacting the portion of the index structure and/or merging the portion of the index structure with a sub-index structure generated from a portion of a log of index updates that may be maintained. The current portion of the index structure may then be replaced with the updated version of the current portion so that the updated version may be evaluated when searches of the index structure are performed.
    Type: Grant
    Filed: October 31, 2014
    Date of Patent: July 3, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Anurag Windlass Gupta, Jakub Kulesza
  • Publication number: 20180173774
    Abstract: History for data objects may be maintained to detect data events. An indication of an Extract, Transform, Load (ETL) process applied to one or more source data objects to generate one or more transformed data objects may be received. History for the source data objects may be updated to include the transformed data objects and the ETL process that generated the transformed data objects. An evaluation of the update may be performed to determine whether an event associated with the data lineage is triggered. If the event is triggered, a notification of the event may be sent to one or more subscribers for the event.
    Type: Application
    Filed: December 20, 2016
    Publication date: June 21, 2018
    Applicant: Amazon Technologies, Inc.
    Inventors: GEORGE STEVEN MCPHERSON, MEHUL A. SHAH, PRAJAKTA DATTA DAMLE, GOPINATH DUDDI, ANURAG WINDLASS GUPTA
  • Publication number: 20180165340
    Abstract: A distributed data warehouse system maintains data blocks on behalf of clients, and stores primary and secondary copies of data blocks on different disks or nodes in a cluster. The data warehouse system may back up data blocks in a key-value backup storage system. In response to a query targeting a data block previously stored in the cluster, the data warehouse system may determine whether a consistent, uncorrupted copy of the data block is available in the cluster (e.g., by applying a consistency check). If not (e.g., if a disk or node failed), the data warehouse system may automatically initiate an operation to restore the data block from the backup storage system, using a unique identifier of the data block to access a backup copy. The target data may be returned in a query response prior to restoring primary and secondary copies of the data block in the cluster.
    Type: Application
    Filed: February 9, 2018
    Publication date: June 14, 2018
    Applicant: Amazon Technologies, Inc.
    Inventors: Deepak Agarwal, Anurag Windlass Gupta, Jakub Kulesza
  • Publication number: 20180150548
    Abstract: Recognizing unknown data objects may be implemented for data objects stored in a data store. Data objects that are identified as unknown may be accessed to retrieve a portion of the data object. Different representations of the data object may be generated for recognizing different data schemas. An analysis of the representations may be performed to identify a data schema for the unknown data object. The data schema may be stored in a metadata store for the unknown data object.
    Type: Application
    Filed: December 20, 2016
    Publication date: May 31, 2018
    Applicant: Amazon Technologies, Inc.
    Inventors: MEHUL A. SHAH, GEORGE STEVEN MCPHERSON, PRAJAKTA DATTA DAMLE, GOPINATH DUDDI, ANURAG WINDLASS GUPTA
  • Publication number: 20180150397
    Abstract: A separate distributed buffer cache system may be implemented for a storage client of a distributed storage system. Storage I/O requests may be sent from a storage client to one or more buffer cache nodes in a distributed buffer cache system that maintain portions of an in-memory buffer cache to which the requests pertain. The distributed buffer cache system may send the write requests on to the distributed storage system to be completed, and in response to receiving acknowledgements from the storage system, sending a completion acknowledgement back to the storage client. Buffer cache nodes may update buffer cache entries for received requests such that they are not available for reads until complete at the distributed storage system. For read requests where the buffer cache entries at the buffer cache node are invalid, valid data may be obtained from the distributed storage system and sent to the storage client.
    Type: Application
    Filed: January 26, 2018
    Publication date: May 31, 2018
    Applicant: Amazon Technologies, Inc.
    Inventors: Anurag Windlass Gupta, Matthew David Allen
  • Publication number: 20180150528
    Abstract: Data transformation workflows may be generated to transform data objects. A source data schema for a data object and a target data format or target data schema for a data object may be identified. A comparison of the source data schema and the target data format or schema may be made to determine what transformations can be performed to transform the data object into the target data format or schema. Code to execute the transformation operations may then be generated. The code may be stored for subsequent modification or execution.
    Type: Application
    Filed: December 20, 2016
    Publication date: May 31, 2018
    Applicant: Amazon Technologies, Inc.
    Inventors: MEHUL A. SHAH, GEORGE STEVEN MCPHERSON, PRAJAKTA DATTA DAMLE, GOPINATH DUDDI, ANURAG WINDLASS GUPTA, BENJAMIN ALBERT SOWELL, BOHOU LI
  • Publication number: 20180150529
    Abstract: Extract, Transform, Load (ETL) processing may be initiated by detected events. A trigger event may be associated with an ETL process apply one or more transformations to a source data object. The trigger event may be detected for the ETL process and evaluated with respect to one or more execution conditions for the ETL process. If the execution conditions for the ETL process are satisfied, then the ETL process may be executed. At least some of the source data object may be obtained, the one or more transformations of the ETL process may be applied, and one or more transformed data objects may be stored.
    Type: Application
    Filed: December 20, 2016
    Publication date: May 31, 2018
    Applicant: Amazon Technologies, Inc.
    Inventors: GEORGE STEVEN MCPHERSON, MEHUL A. SHAH, PRAJAKTA DATTA DAMLE, GOPINATH DUDDI, ANURAG WINDLASS GUPTA
  • Patent number: 9946735
    Abstract: Read-only nodes of a distributed database system may implement index structure navigation using page versions. A read request may be received at a read only node of a distributed database for select data. Data pages linked together to form an index structure for data stored for the distributed database may be navigated according to versions maintained for the data pages in order to identify one or more locations to access for the select data. One or more prior versions of data pages may be selected as part navigating the index structure according to a consistent view of the distributed database associated with the read request. Change notifications may also be received at the read-only node modifying the data pages of the index structure. The index structure modifications may be applied without blocking the index structure navigation for servicing the read request.
    Type: Grant
    Filed: December 12, 2016
    Date of Patent: April 17, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Tengiz Kharatishvili, Pradeep Jnana Madhavarapu, Anurag Windlass Gupta
  • Patent number: 9934235
    Abstract: Data may be efficiently analyzed and compressed as part of a data compression service. A data compression request may be received from a client indicating data to be compressed. An analysis of the data or metadata associated with the data may be performed. In at least some embodiments, this analysis may be a rules-based analysis. Some embodiments may employ one or more machine learning techniques to historical compression data to update the rules-based analysis. One or more compression techniques may be selected out of a plurality of compression techniques to be applied to the data. Data compression candidates may then be generated according to the selected compression techniques. In some embodiments, a compression service restriction may be enforced. One of the data compression candidates may be selected and sent in a response.
    Type: Grant
    Filed: June 30, 2016
    Date of Patent: April 3, 2018
    Assignee: Amazon Technologies, Inc.
    Inventor: Anurag Windlass Gupta
  • Publication number: 20180046552
    Abstract: A log-structured data store implementing data backup may implement variable data replication. Write requests may be received at different storage nodes maintaining respective replicas of a portion of a log for data maintained in the log-structured data store. Log records indicating the write requests may be stored in the respective replicas of the log portions at the different storage nodes. The log records may be sent to a backup data store to be durability persisted as part of an archived version of the log. At some of the storage nodes, in response to determining that the log records have been durably persisted in the backup data store, storage space for the log records may be reclaimed. In other remaining storage nodes, the log records may be retained and made accessible for servicing read requests.
    Type: Application
    Filed: October 9, 2017
    Publication date: February 15, 2018
    Applicant: Amazon Technologies, Inc.
    Inventors: Pradeep Jnana Madhavarapu, Anurag Windlass Gupta, Yan Valerie Leshinsky, Samuel James McKelvie, James McClellan Corey
  • Patent number: 9892182
    Abstract: A distributed data warehouse system maintains data blocks on behalf of clients, and stores primary and secondary copies of data blocks on different disks or nodes in a cluster. The data warehouse system may back up data blocks in a key-value backup storage system. In response to a query targeting a data block previously stored in the cluster, the data warehouse system may determine whether a consistent, uncorrupted copy of the data block is available in the cluster (e.g., by applying a consistency check). If not (e.g., if a disk or node failed), the data warehouse system may automatically initiate an operation to restore the data block from the backup storage system, using a unique identifier of the data block to access a backup copy. The target data may be returned in a query response prior to restoring primary and secondary copies of the data block in the cluster.
    Type: Grant
    Filed: September 19, 2016
    Date of Patent: February 13, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Deepak Agarwal, Anurag Windlass Gupta, Jakub Kulesza
  • Patent number: 9880933
    Abstract: A separate distributed buffer cache system may be implemented for a storage client of a distributed storage system. Storage I/O requests may be sent from a storage client to one or more buffer cache nodes in a distributed buffer cache system that maintain portions of an in-memory buffer cache to which the requests pertain. The distributed buffer cache system may send the write requests on to the distributed storage system to be completed, and in response to receiving acknowledgements from the storage system, sending a completion acknowledgement back to the storage client. Buffer cache nodes may update buffer cache entries for received requests such that they are not available for reads until complete at the distributed storage system. For read requests where the buffer cache entries at the buffer cache node are invalid, valid data may be obtained from the distributed storage system and sent to the storage client.
    Type: Grant
    Filed: November 20, 2013
    Date of Patent: January 30, 2018
    Assignee: Amazon Technologies, Inc.
    Inventors: Anurag Windlass Gupta, Matthew David Allen
  • Publication number: 20180025065
    Abstract: A probabilistic data structure is generated for efficient query processing using a histogram for unsorted data in a column of a columnar database. A bucket range size is determined for multiples buckets of a histogram of a column in a columnar database table. In at least some embodiments, the histogram may be a height-balanced histogram. A probabilistic data structure is generated to indicate for which particular buckets in the histogram there is a data value stored in the data block. When an indication of a query directed to the column for select data is received, the probabilistic data structure for each of the data blocks storing data for the column may be examined to determine particular ones of the data blocks which do not need to be read in order to service the query for the select data.
    Type: Application
    Filed: September 15, 2017
    Publication date: January 25, 2018
    Applicant: Amazon Technologies, Inc.
    Inventor: ANURAG WINDLASS GUPTA