Patents by Inventor Zhihuan Qiu
Zhihuan Qiu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20240119038Abstract: Maintenance is performed to determine one or more content files associated with a stored backup snapshot having a size that is greater than a threshold size. It is determined that the size of a content file of the one or more content files is greater than the threshold size. In response to determining that the size of the content file is greater than the threshold size, a new tree data structure is generated and a component file metadata structure corresponding to the content file is split into a plurality of component file metadata structures for the content file.Type: ApplicationFiled: December 19, 2023Publication date: April 11, 2024Inventors: Zhihuan Qiu, Ganesha Shanmuganathan
-
Patent number: 11947497Abstract: Data is ingested from a source system. Ingesting the data includes determining corresponding chunk identifiers for a plurality of data chunks corresponding to the ingested data and for each of the plurality of data chunks, verifying whether the corresponding chunk identifier is included in a data structure tracking identifiers of data chunks that were already stored in a storage of a storage system before the data ingestion started and storing the data chunk in a storage based on the verification. After the ingesting is completed, deduplication of the ingested data chunks stored in the storage having a same chunk identifier is performed and the data structure is updated based on the deduplication.Type: GrantFiled: August 24, 2021Date of Patent: April 2, 2024Assignee: Cohesity, Inc.Inventors: Zhihuan Qiu, Yu Liu
-
Publication number: 20240036751Abstract: A specification of content to be stored in a cloud storage is received at a client-side component. A first portion of the content is divided into a plurality of data chunks. One or more data chunks of the plurality of data chunks that are to be sent via a network to be stored in the cloud storage are identified. It is determined whether a batch size of the one or more identified data chunks does not meets a threshold size. One or more data chunks of a second portion of the content that are to be stored in the cloud storage are identified. It is determined that a size of a second batch of data chunks that includes the one or more identified data chunks of the first portion of the content and the one or more identified data chunks of the second portion of the content does not meet the threshold size. It is determined that a batch period is greater than or equal to a batch threshold period. The second batch of data chunks is written to a storage of a cloud server included in a data plane.Type: ApplicationFiled: October 13, 2023Publication date: February 1, 2024Inventors: Anubhav Gupta, Praveen Kumar Yarlagadda, Venkata Ranga Radhanikanth Guturi, Zhihuan Qiu, Sarthak Agarwal
-
Patent number: 11886412Abstract: An indication to perform a backup of file system data is received. The file system data includes a content file having a size greater than a threshold size. A plurality of component file metadata structures for the content file are generated. File metadata of the content file split across the plurality of component file metadata structures is stored. The plurality of component file metadata structures are associated with different portions of the content file. A component file metadata structure of the plurality of component file metadata structures stores file metadata corresponding to a portion of the content file. The file metadata corresponding to the portion of the content file includes one or more references to locations of data chunks associated with the portion of the content file.Type: GrantFiled: September 19, 2022Date of Patent: January 30, 2024Assignee: Cohesity, Inc.Inventors: Zhihuan Qiu, Ganesha Shanmuganathan
-
Publication number: 20240004763Abstract: A backup of one or more objects is determined to be performed. Based on one or more conditions, a corresponding deduplication option among a plurality of deduplication options to utilize when backing up the one or more objects is selected. The one or more conditions at least include a condition based on a detected data change pattern. The plurality of deduplication options include a deduplication option associated with utilizing at least in part a plurality of variable-length data chunks for one or more mismatched ranges and/or one or more missing ranges associated with one of the one or more objects associated with the source system. A request to perform the backup of the one or more objects according to the corresponding selected deduplication option is provided to the source system. Backup data associated with the one or more objects is received and stored.Type: ApplicationFiled: June 29, 2022Publication date: January 4, 2024Inventors: Apurv Gupta, Mandar Suresh Naik, Zhihuan Qiu, Gurunarayanan Nagasubramanian, Anirban Mitra
-
Publication number: 20230401003Abstract: Data associated with a source system is ingested. After the data is ingested, a post-processing metadata conversion process is performed including by selecting an entry of a chunk metadata data structure and determining that a data chunk associated with the selected entry is not referenced by at least a threshold number of objects. In response to determining that the data chunk associated with the selected entry is not referenced by at least the threshold number of objects, metadata of a tree data structure node corresponding to a chunk identifier associated with the data chunk is updated to store a reference to a chunk file storing the data chunk and the selected entry is removed from the chunk metadata data structure.Type: ApplicationFiled: August 28, 2023Publication date: December 14, 2023Inventors: Zhihuan Qiu, Sachin Jain, Anubhav Gupta, Apurv Gupta, Mohit Aron
-
Patent number: 11842060Abstract: A specification of content to be stored in a cloud storage is received at a client-side component. A portion of the content is divided into a plurality of data chunks. One or more data chunks of the plurality of data chunks that are to be sent via a network to be stored in the cloud storage are identified. It is determined whether a batch size of the one or more identified data chunks meets a threshold size. Based on the determination of whether the batch size meets the threshold size, a cloud storage destination among a plurality of different cloud storage destinations associated with different performance tiers is selected.Type: GrantFiled: August 13, 2021Date of Patent: December 12, 2023Assignee: Cohesity, Inc.Inventors: Anubhav Gupta, Praveen Kumar Yarlagadda, Venkata Ranga Radhanikanth Guturi, Zhihuan Qiu, Sarthak Agarwal
-
Publication number: 20230394010Abstract: File metadata structures of a file system are analyzed. At least one metadata element that is duplicated among the analyzed file metadata structures is identified. The at least one identified metadata element is deduplicated including by modifying at least one of the file metadata structures to reference a same instance of the identified metadata element that is referenced by another one of the file metadata structures.Type: ApplicationFiled: August 16, 2023Publication date: December 7, 2023Inventors: Anubhav Gupta, Sachin Jain, Shreyas Talele, Zhihuan Qiu
-
Publication number: 20230350763Abstract: Range information associated with one or more objects is received from a storage system. One or more missing ranges and/or one or more mismatched ranges associated with the one or more objects is determined based on the received range information. A plurality of data chunk identifiers associated with a plurality of variable-length data chunks included in the one or more determined ranges associated with the one or more objects is provided to the storage system. A response that is used to identify among the plurality of variable-length data chunks, one or more variable-length data chunks not already stored in a storage associated with the storage system is received from the storage system. Content of the identified one or more variable-length data chunks is provided to the storage system.Type: ApplicationFiled: April 27, 2022Publication date: November 2, 2023Inventors: Apurv Gupta, Mandar Suresh Naik, Zhihuan Qiu, Gurunarayanan Nagasubramanian, Anirban Mitra
-
Patent number: 11797220Abstract: Data is ingested from a source system including by storing a plurality of data chunks in one or more chunk files and storing corresponding chunk identifiers associated with the plurality of data chunks in a first data structure. After data ingestion is complete, one or more duplicate data chunks that were stored during the data ingestion are determined and a second data structure is updated to include one or more entries corresponding to one or more determined duplicate data chunks.Type: GrantFiled: August 20, 2021Date of Patent: October 24, 2023Assignee: Cohesity, Inc.Inventors: Zhihuan Qiu, Sachin Jain, Anubhav Gupta, Apurv Gupta, Mohit Aron
-
Patent number: 11782886Abstract: A virtual machine container file is analyzed to determine which portion of the virtual machine container file corresponds to a virtual machine file system metadata of the virtual machine container file. One or more differences between a first version of a virtual machine container file and a second version of the virtual machine container file are determined at least in part by traversing a snapshot structure associated with the virtual machine container file. The determined one or more differences that corresponds to the virtual machine file system metadata portion of the virtual machine container file are identified based at least in part on the analysis of the virtual machine container file.Type: GrantFiled: September 29, 2021Date of Patent: October 10, 2023Assignee: Cohesity, Inc.Inventors: Chinmaya Manjunath, Anirvan Duttagupta, Anubhav Gupta, Sidharth Mishra, Zhihuan Qiu
-
Patent number: 11775482Abstract: File metadata structures of a file system are analyzed. At least one metadata element that is duplicated among the analyzed file metadata structures is identified. The at least one identified metadata element is deduplicated including by modifying at least one of the file metadata structures to reference a same instance of the identified metadata element that is referenced by another one of the file metadata structures.Type: GrantFiled: April 21, 2020Date of Patent: October 3, 2023Assignee: Cohesity, Inc.Inventors: Anubhav Gupta, Sachin Jain, Shreyas Talele, Zhihuan Qiu
-
Publication number: 20230289262Abstract: A plurality of portions of a content file are stored. It is determined that the content file has a size that is greater than a threshold size. In response to determining that the content file has the size that is greater than the threshold size, a plurality of component file metadata structures are generated for each of the plurality of portions of the content file. A component file metadata structure of the plurality of component file metadata structures corresponds to one of the portions of the content file. Each of the plurality of component file metadata structures includes corresponding metadata that enables data chunks associated with a corresponding portion of the content file to be located.Type: ApplicationFiled: May 15, 2023Publication date: September 14, 2023Inventors: Mohit Aron, Zhihuan Qiu, Ganesha Shanmuganathan, Malini Mahalakshmi Venkatachari
-
Publication number: 20230280924Abstract: A cloud server component determines that a size of a first cloud storage element object is at least below a first threshold. In response to the first determination, a client-side component is requested to store additional data in the cloud storage element object including by having the client-side component update the first cloud storage element with an updated version that includes previously existing data of the first cloud storage element and the additional data. The first cloud storage element object is added to a set of one or more cloud storage element objects available for update. The client-side component is configured to generate an updated version of the first cloud storage element object that has a size that is greater than or equal to the first threshold.Type: ApplicationFiled: April 25, 2023Publication date: September 7, 2023Inventors: Anubhav Gupta, Praveen Kumar Yarlagadda, Venkata Ranga Radhanikanth Guturi, Zhihuan Qiu, Sarthak Agarwal
-
Patent number: 11693741Abstract: A size associated with a content file is determined to be greater than a threshold size. Contents of the content file split across a plurality of component files are stored. Metadata, for the content file, is updated to reference a plurality of component file metadata structures for the component files. A node of the metadata is configured to track different sizes of portions of the content file stored in different component files of the plurality of component files. File metadata of the content file is split across the plurality of component file metadata structures and each component file metadata structure of the plurality of component file metadata structures specifies a corresponding structure organizing data components for a corresponding portion of the content file.Type: GrantFiled: June 15, 2021Date of Patent: July 4, 2023Assignee: Cohesity, Inc.Inventors: Mohit Aron, Zhihuan Qiu, Ganesha Shanmuganathan, Malini Mahalakshmi Venkatachari
-
Patent number: 11681454Abstract: A cloud server determines that a size of a first cloud storage element object is at least below a first threshold. In response to the first determination, a client-side component is requested to store additional data in the cloud storage element object including by having the client-side component update the first cloud storage element with an updated version that includes previously existing data of the first cloud storage element and the additional data. The first cloud storage element object is added to a set of one or more cloud storage element objects available for update. The client-side component is configured to generate an updated version of the first cloud storage element object that has a size that is greater than or equal to the first threshold.Type: GrantFiled: August 13, 2021Date of Patent: June 20, 2023Assignee: Cohesity, Inc.Inventors: Anubhav Gupta, Praveen Kumar Yarlagadda, Venkata Ranga Radhanikanth Guturi, Zhihuan Qiu, Sarthak Agarwal
-
Publication number: 20230177011Abstract: A selected data chunk associated with an object is determined to be sent to a destination. A chunk compression grouping storing the selected data chunk associated with the object is identified. The identified chunk compression grouping includes a plurality of data chunks compressed together. A data content version that includes the selected data chunk associated with the object to be provided to the destination is determined from a plurality of data content versions based at least in part a metric associated with the identified chunk compression grouping.Type: ApplicationFiled: December 8, 2021Publication date: June 8, 2023Inventors: Zhihuan Qiu, Yinzen Hwang
-
Publication number: 20230062644Abstract: Data is ingested from a source system. Ingesting the data includes determining corresponding chunk identifiers for a plurality of data chunks corresponding to the ingested data and for each of the plurality of data chunks, verifying whether the corresponding chunk identifier is included in a data structure tracking identifiers of data chunks that were already stored in a storage of a storage system before the data ingestion started and storing the data chunk in a storage based on the verification. After the ingesting is completed, deduplication of the ingested data chunks stored in the storage having a same chunk identifier is performed and the data structure is updated based on the deduplication.Type: ApplicationFiled: August 24, 2021Publication date: March 2, 2023Inventors: Zhihuan Qiu, Yu Liu
-
Publication number: 20230059679Abstract: Data is ingested from a source system including by storing a plurality of data chunks in one or more chunk files and storing corresponding chunk identifiers associated with the plurality of data chunks in a first data structure. After data ingestion is complete, one or more duplicate data chunks that were stored during the data ingestion are determined and a second data structure is updated to include one or more entries corresponding to one or more determined duplicate data chunks.Type: ApplicationFiled: August 20, 2021Publication date: February 23, 2023Inventors: Zhihuan Qiu, Sachin Jain, Anubhav Gupta, Apurv Gupta, Mohit Aron
-
Publication number: 20230016271Abstract: An indication to perform a backup of file system data is received. The file system data includes a content file having a size greater than a threshold size. A plurality of component file metadata structures for the content file are generated. File metadata of the content file split across the plurality of component file metadata structures is stored. The plurality of component file metadata structures are associated with different portions of the content file. A component file metadata structure of the plurality of component file metadata structures stores file metadata corresponding to a portion of the content file. The file metadata corresponding to the portion of the content file includes one or more references to locations of data chunks associated with the portion of the content file.Type: ApplicationFiled: September 19, 2022Publication date: January 19, 2023Inventors: Zhihuan Qiu, Ganesha Shanmuganathan