Patents by Inventor Apurv Gupta
Apurv Gupta has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11983270Abstract: An amount of data change associated with a version of a content file with respect to one or more previous versions of the content file is determined. The amount of change associated with the version of the content file is determined using a tree data structure associated with the content file that is stored on a storage cluster. One or more statistics associated with backup snapshot are provided to a server. The server is configured to determine that the amount of data change associated with the version of the content file is anomalous based in part on the one or more statistics associated with the backup snapshot. A notification that data associated with the backup snapshot is potentially infected by malicious software is received from the server. The version of the content file is indicated as being potentially infected by malicious software.Type: GrantFiled: January 22, 2021Date of Patent: May 14, 2024Assignee: Cohesity, Inc.Inventors: Prashant Gaurav, Sidharth Mishra, Karandeep Singh Chawla, Anubhav Gupta, Sudhir Srinivas, Apurv Gupta, Nagapramod Mandagere
-
Publication number: 20240135486Abstract: Described are techniques for oblique image rectification. The techniques include receiving an original image depicting an oblique view of a circular object and pre-processing the original image into an edge image. The techniques further include generating, by a machine learning model based on the edge image, a heatmap including an ellipse formed by the oblique view of the circular object. The techniques further include computing ellipse parameters describing the ellipse of the heatmap. The techniques further include performing, using the ellipse parameters, an affine transformation on the original image to generate a rectified image, where the rectified image converts the ellipse to a circle.Type: ApplicationFiled: October 23, 2022Publication date: April 25, 2024Inventors: Sebastien Gilbert, Michele Merler, Dhiraj Joshi, Apurv Gupta, Shyama Prosad Chowdhury, CHIDANSH AMITKUMAR BHATT, Nirmit V. Desai
-
Patent number: 11960363Abstract: A mechanism for storing indexing updates is described. A secondary storage system is configured to store a backup of a primary storage system. A plurality of different indexing updates of a data repository associated with the secondary storage system is received. An indexing update may include metadata and/or updates to the metadata for a data item in the backup. The indexing updates may be obtained from an indexing process. At least a portion of the indexing updates are sent for storage in an intermediate store. An indication to commit the indexing updates is received. In response, the indexing updates including the at least the portion of the indexing updates are requested to be committed. The indexing updates batched together into a batch to be committed to a metadata store.Type: GrantFiled: September 14, 2020Date of Patent: April 16, 2024Assignee: Cohesity, Inc.Inventors: Apurv Gupta, Akshat Agarwal, Manvendra Singh Tomar, Anshu Avinash
-
Patent number: 11954066Abstract: An identification of a new primary snapshot created for a primary storage system is received. A change tracking time window that is at least a portion of a period between a first capture time associated with a previous primary snapshot and a second capture time associated with the new primary snapshot is determined. Entries of a storage log of the primary storage system occurring within the change tracking time window are analyzed to coalesce changes identified in the entries of the storage log occurring within the change tracking time window into a change tracking result set. The change tracking result set is used to identify at least a portion of data changes between the previous primary snapshot and the new primary snapshot to capture in a new backup snapshot stored at a secondary storage system.Type: GrantFiled: July 21, 2021Date of Patent: April 9, 2024Assignee: Cohesity, Inc.Inventors: Amandeep Gautam, Anand Arun, Debasish Garai, Rupesh Bajaj, Himanshu Mehra, Vairavanathan Emalayan, Apurv Gupta
-
Publication number: 20240112444Abstract: Automated analog gauge reading is provided. The method comprises a computer system receiving input of an image and detecting at least one analog gauge in the image. The computer system corrects the orientation of the analog gauge in the image and detects scene text and tick labels on the analog gauge. The computer system determines a position of a pointer on the analog gauge relative to the scene text and outputs a gauge reading value based on an arithmetic progression of tick labels and angle of the pointer with respect to minimum and maximum values on the analog gauge.Type: ApplicationFiled: September 29, 2022Publication date: April 4, 2024Inventors: Michele Merler, Dhiraj Joshi, Apurv Gupta, Sebastien Gilbert, Shyama Prosad Chowdhury, Chidansh Amitkumar Bhatt, Nirmit V. Desai
-
Publication number: 20240045832Abstract: A size associated with a first content file is determined to be less than a threshold size. In response to determining that the size associated with the first content file is less than the threshold size, a combined metadata structure is updated at least in part by combining metadata of the first content file with metadata of a second content file in the combined metadata structure. A snapshot tree is updated to reference a first portion of the combined metadata structure corresponding to the first content file and to reference a second portion of the combined metadata structure corresponding to the second content file.Type: ApplicationFiled: July 21, 2023Publication date: February 8, 2024Inventors: Apurv Gupta, Anirvan Duttagupta
-
Patent number: 11886298Abstract: An identification of a new primary snapshot created for a primary storage system is received. A threshold time window based on a capture time associated with the new primary snapshot is determined. Entries of a storage log of the primary storage system occurring within the threshold time window are analyzed to identify any objects of the primary storage system that have changed during the threshold time window. Changed objects to capture in a new secondary backup stored at a secondary storage system and corresponding to the new primary snapshot are identified. Metadata of the new primary snapshot and metadata of a previous secondary backup are compared to determine for each of the objects of the primary storage system identified as having changed during the threshold time window whether a change to an object since the previous secondary backup is captured in the new primary snapshot.Type: GrantFiled: March 31, 2021Date of Patent: January 30, 2024Assignee: Cohesity, Inc.Inventors: Amandeep Gautam, Anand Arun, Debasish Garai, Rupesh Bajaj, Himanshu Mehra, Vairavanathan Emalayan, Apurv Gupta
-
Publication number: 20240004763Abstract: A backup of one or more objects is determined to be performed. Based on one or more conditions, a corresponding deduplication option among a plurality of deduplication options to utilize when backing up the one or more objects is selected. The one or more conditions at least include a condition based on a detected data change pattern. The plurality of deduplication options include a deduplication option associated with utilizing at least in part a plurality of variable-length data chunks for one or more mismatched ranges and/or one or more missing ranges associated with one of the one or more objects associated with the source system. A request to perform the backup of the one or more objects according to the corresponding selected deduplication option is provided to the source system. Backup data associated with the one or more objects is received and stored.Type: ApplicationFiled: June 29, 2022Publication date: January 4, 2024Inventors: Apurv Gupta, Mandar Suresh Naik, Zhihuan Qiu, Gurunarayanan Nagasubramanian, Anirban Mitra
-
Publication number: 20230401003Abstract: Data associated with a source system is ingested. After the data is ingested, a post-processing metadata conversion process is performed including by selecting an entry of a chunk metadata data structure and determining that a data chunk associated with the selected entry is not referenced by at least a threshold number of objects. In response to determining that the data chunk associated with the selected entry is not referenced by at least the threshold number of objects, metadata of a tree data structure node corresponding to a chunk identifier associated with the data chunk is updated to store a reference to a chunk file storing the data chunk and the selected entry is removed from the chunk metadata data structure.Type: ApplicationFiled: August 28, 2023Publication date: December 14, 2023Inventors: Zhihuan Qiu, Sachin Jain, Anubhav Gupta, Apurv Gupta, Mohit Aron
-
Publication number: 20230350763Abstract: Range information associated with one or more objects is received from a storage system. One or more missing ranges and/or one or more mismatched ranges associated with the one or more objects is determined based on the received range information. A plurality of data chunk identifiers associated with a plurality of variable-length data chunks included in the one or more determined ranges associated with the one or more objects is provided to the storage system. A response that is used to identify among the plurality of variable-length data chunks, one or more variable-length data chunks not already stored in a storage associated with the storage system is received from the storage system. Content of the identified one or more variable-length data chunks is provided to the storage system.Type: ApplicationFiled: April 27, 2022Publication date: November 2, 2023Inventors: Apurv Gupta, Mandar Suresh Naik, Zhihuan Qiu, Gurunarayanan Nagasubramanian, Anirban Mitra
-
Patent number: 11803513Abstract: A size associated with a first content file is determined to be less than a threshold size. In response to determining that the size associated with the first content file is less than the threshold size, a combined metadata structure is updated at least in part by combining metadata of the first content file with metadata of a second content file in the combined metadata structure. A snapshot tree is updated to reference a first portion of the combined metadata structure corresponding to the first content file and to reference a second portion of the combined metadata structure corresponding to the second content file.Type: GrantFiled: February 11, 2021Date of Patent: October 31, 2023Assignee: Cohesity, Inc.Inventors: Apurv Gupta, Anirvan Duttagupta
-
Patent number: 11797220Abstract: Data is ingested from a source system including by storing a plurality of data chunks in one or more chunk files and storing corresponding chunk identifiers associated with the plurality of data chunks in a first data structure. After data ingestion is complete, one or more duplicate data chunks that were stored during the data ingestion are determined and a second data structure is updated to include one or more entries corresponding to one or more determined duplicate data chunks.Type: GrantFiled: August 20, 2021Date of Patent: October 24, 2023Assignee: Cohesity, Inc.Inventors: Zhihuan Qiu, Sachin Jain, Anubhav Gupta, Apurv Gupta, Mohit Aron
-
Publication number: 20230315303Abstract: A plurality of data stripes and one or more parity stripes are generated using a plurality of data chunks stored in a write-ahead log based on an erasure coding configuration. The plurality of data stripes and the one or more parity stripes are stored on corresponding different storage devices. The plurality of data stripes and the one or more parity stripes are associated together under a data protection grouping container.Type: ApplicationFiled: March 31, 2022Publication date: October 5, 2023Inventors: Apurv Gupta, Akshat Agarwal
-
Patent number: 11741048Abstract: Embodiments presented herein disclose techniques for capturing a snapshot of a file system object (e.g., a file or a directory) that is associated with a write journal having outstanding data. A bridge process in a storage server receives a request to capture a snapshot of a file system object. The snapshot is a backup of a state of the file system object in a given point in time. Upon determining that the file system object has one or more outstanding updates recorded in a write journal, the bridge process generates a copy of the write journal. The bridge process captures the snapshot of the file system object. The bridge process also associates the copy of the write journal with the snapshot of the file system object.Type: GrantFiled: April 20, 2022Date of Patent: August 29, 2023Assignee: Cohesity, Inc.Inventor: Apurv Gupta
-
Publication number: 20230252042Abstract: Application-level data in a storage system are evaluated. For example, a backup analysis tool retrieves a backup object stored in the storage system. The backup analysis tool reconstructs an application object from the backup object. The backup analysis tool accesses the first application object according to a native application format associated with the first object.Type: ApplicationFiled: April 14, 2023Publication date: August 10, 2023Inventors: Apurv Gupta, Vipin Gupta, Johnny Chen, Sunil Moolchandani, Murali Vajapeyam
-
Publication number: 20230237020Abstract: A repository of replicated chunk files is analyzed to identify chunk files that meet at least a portion of combination criteria. Selected chunk files are associated together under a data protection grouping container. Erasure coding is applied to the data protection grouping container including by utilizing the selected chunk files as different data stripes of the erasure coding and generating one or more parity stripes based on the different data stripes.Type: ApplicationFiled: January 24, 2022Publication date: July 27, 2023Inventors: Apurv Gupta, Akshat Agarwal, Manvendra Singh Tomar, Donthula Akshith Reddy, Kushal Singh, Tarun Kumar Yadav, Mandar Suresh Naik
-
Publication number: 20230229323Abstract: A plurality of logical storage segments of storage drives of a plurality of storage nodes are identified. At least one of the storage nodes includes at least a first logical storage segment and a second logical storage segment included in the plurality of logical storage segments. A distributed and replicated data store using a portion of the plurality of logical storage segments that excludes at least the second logical storage segment is provided. An available storage capacity metric associated with the plurality of logical storage segments is determined to meet a first threshold. In response to the determination that the available storage capacity metric meets the first threshold, at least the second logical storage segment is dynamically deployed for use in providing the distributed and replicated data store in a manner that increases a storage capacity of the data store while maintaining a fault tolerance policy of the distributed and replicated data store.Type: ApplicationFiled: December 15, 2022Publication date: July 20, 2023Inventors: Venkatesh Pallipadi, Sachin Jain, Deepak Ojha, Apurv Gupta
-
Patent number: 11663236Abstract: Application-level data in a storage system are evaluated. For example, a backup analysis tool retrieves a backup object stored in the storage system. The backup analysis tool reconstructs an application object from the backup object. The backup analysis tool accesses the first application object according to a native application format associated with the first object.Type: GrantFiled: September 29, 2021Date of Patent: May 30, 2023Assignee: Cohesity, Inc.Inventors: Apurv Gupta, Vipin Gupta, Johnny Chen, Sunil Moolchandani, Murali Vajapeyam
-
Publication number: 20230123568Abstract: An indication of a storage service job to be performed is received. A task to be performed for the storage service job is determined. The task is added to a work queue. Execution of one or more tasks in the work queue that includes the task is dynamically managed. Resources are dynamically allocated to one or more virtualization containers that are assigned to execute the one or more tasks in the work queue. An identification of one or more new tasks to be performed for the storage service job is received from one of the virtualization containers executing the task. The one or more new tasks are added to the work queue.Type: ApplicationFiled: October 14, 2021Publication date: April 20, 2023Inventors: Sachin Jain, Himanshu Mehra, Sashikanth Madduri, Apurv Gupta
-
Publication number: 20230059679Abstract: Data is ingested from a source system including by storing a plurality of data chunks in one or more chunk files and storing corresponding chunk identifiers associated with the plurality of data chunks in a first data structure. After data ingestion is complete, one or more duplicate data chunks that were stored during the data ingestion are determined and a second data structure is updated to include one or more entries corresponding to one or more determined duplicate data chunks.Type: ApplicationFiled: August 20, 2021Publication date: February 23, 2023Inventors: Zhihuan Qiu, Sachin Jain, Anubhav Gupta, Apurv Gupta, Mohit Aron