Patents Assigned to Actifio, Inc.
-
Publication number: 20140344216Abstract: Described herein are systems and methods for garbage collection prediction. A temporal graph is received, the temporal graph including nodes, the nodes including hash references to objects. An accumulated difference count is updated when a node is added to the temporal graph, the accumulated difference count including a number of hash differences between a parent node and its children nodes in the temporal graph. A divested difference count is updated when a node is removed from the temporal graph, the divested difference count including a number of hash differences referenced by the removed node but not by either a parent node of the removed node or any child nodes of the removed node. The outcome of the garbage collection is predicted based on at least one of the accumulated difference count and the divested difference count.Type: ApplicationFiled: May 14, 2014Publication date: November 20, 2014Applicant: ACTIFIO, INC.Inventors: Philip J. ABERCROMBIE, Christopher A. PROVENZANO, Keith GOLDBERG
-
Patent number: 8874863Abstract: Systems and methods are provided for an asynchronous data replication system in which the remote replication reduces bandwidth requirements by copying deduplicated differences in business data from a local storage site to a remote, backup storage site, the system comprising: a local performance storage pool for storing data; a local deduplicating storage pool for storing deduplicated data, said local deduplicating storage pool further storing metadata about data objects in the system and which has metadata analysis logic for identifying and specifying differences in a data object over time; a remote performance storage pool for storing a copy of said data, available for immediate use as a backup copy of said data to provide business continuity to said data; a remote deduplicating storage pool for storing deduplicated data; and a controller for synchronizing the remote performance storage pool to have the second version of the data object using deduplicated data.Type: GrantFiled: August 1, 2012Date of Patent: October 28, 2014Assignee: Actifio, Inc.Inventors: Madhav Mutalik, Christopher A. Provenzano, Philip J. Abercrombie
-
System and method for managing deduplicated copies of data using temporal relationships among copies
Patent number: 8843489Abstract: Systems and methods are disclosed for managing deduplicated images of data objects that change over time. The method includes: organizing unique content of each data object as a plurality of content segments and storing the content segments in a data store; for each data object, creating an organized arrangement of hash structures, wherein each structure, for a subset of the hash structures, includes a hash signature for a corresponding content segment and is associated with a reference to the corresponding content segment, and for each data object, maintaining an organized arrangement of temporal structures to represent a corresponding data object over time, wherein each structure is associated with a temporal state of the data object, and wherein each temporal state is associated with the hash structures representing the content of the data object during that temporal state.Type: GrantFiled: November 16, 2010Date of Patent: September 23, 2014Assignee: Actifio, Inc.Inventor: Christopher A. Provenzano -
Patent number: 8788769Abstract: Systems and methods for backing-up data from a first storage pool to a second storage pool using difference information between time states are disclosed. The system has a data management engine for performing data management functions, including at least a back-up function to create a back-up copy of data. By executing a sequence of snapshot operations to create point-in-time images of application data on a first storage pool, each successive point-in-time image corresponding to a specific, successive time-state of the application data, a series of snapshots is created. The snapshots are then used to create difference information indicating which application data has changed and the content of the changed application data for the corresponding time state. This difference information is then sent to a second storage pool to create a back-up copy of data for the current time-state.Type: GrantFiled: November 16, 2010Date of Patent: July 22, 2014Assignee: Actifio, Inc.Inventors: Philip J. Abercrombie, Madhav Mutalik, Christopher A. Provenzano, Mark A. Roman
-
Patent number: 8688650Abstract: Systems and methods are disclosed for efficiently creating a data fingerprint to identify or characterize contents of a data object by using a selection function to select a plurality of non-contiguous regions from the data object, the selected regions each having a small number of bytes relative to the number of bytes in the data object and being distributed throughout the data object so that the selected regions comprise a sparse subset of the data of the data object yet provide a significant probability of including bytes that change if the data object were modified; and performing a hash operation on the data to produce a fingerprint based on the sparse subset of the data object. The data fingerprint thereby efficiently provides an indication of the contents of the data object, so that comparing data fingerprints can determine if the data objects are different if the corresponding fingerprints are different.Type: GrantFiled: August 1, 2012Date of Patent: April 1, 2014Assignee: Actifio, Inc.Inventors: Madhav Mutalik, Philip J. Abercrombie, Christopher A. Provenzano
-
Publication number: 20130339302Abstract: Systems and methods are provided for backing up database storage. A protection policy for database storage is activated at a remote host, wherein the database storage is not physically accessible by the computing device, such that the out-of-band storage is not in communication with the remote host through the computing device. A backup request is transmitted to the remote host comprising data indicative of a volume on the database storage to back up according to the protection policy. A response to the backup request is received comprising staging disk requirements including a required size for a staging disk. A staging disk is determined based on the response. The staging disk is presented to the remote host so that the remote host can perform a backup of the volume to the staging disk using a database backup tool provided by a vendor of the database storage.Type: ApplicationFiled: June 18, 2013Publication date: December 19, 2013Applicant: ACTIFIO, INC.Inventors: Xiangdong ZHANG, Madhav MUTALIK, Sachindra KUMAR
-
Publication number: 20130318053Abstract: Systems and methods are disclosed for forming deduplicated images of a data object that changes over time using difference information between temporal states of the data object. The method includes organizing the content of the data object for a first temporal state as a plurality of content segments and storing the content segments in a data store; creating an organized arrangement of hash structures to represent the data object in its first temporal state; receiving difference information for the data object; forming at least one hash signature for the changed content; and storing the changed content that is unique in the data store as content segments. The method also includes determining, subsequent to receiving the changed content at the deduplicating content store, whether the changed content should be stored by searching for the hash signature for the changed higher-level hash structure in the global cache of the deduplicating content store.Type: ApplicationFiled: March 19, 2013Publication date: November 28, 2013Applicant: ACTIFIO, INC.Inventors: Christopher A. PROVENZANO, Mark A. ROMAN
-
Publication number: 20130226884Abstract: Systems and methods are disclosed for using a first deduplicating store to update a second deduplicating store with information representing how data objects change over time, said method including: at a first and a second deduplicating store, for each data object, maintaining an organized arrangement of temporal structures to represent a corresponding data object over time, wherein each structure is associated with a temporal state of the data object and wherein the logical arrangement of structures is indicative of the changing temporal states of the data object; finding a temporal state that is common to and in temporal proximity to the current state of the first and second deduplicating stores; and compiling and sending a set of hash signatures for the content that has changed from the common state to the current temporal state of the first deduplicating store.Type: ApplicationFiled: April 9, 2013Publication date: August 29, 2013Applicant: ACTIFIO, INC.Inventor: Actifio, Inc.
-
Patent number: 8417674Abstract: Systems and methods are disclosed for using a first deduplicating store to update a second deduplicating store with information representing how data objects change over time, said method comprising: at a first and a second deduplicating store, for each data object, maintaining an organized arrangement of temporal structures to represent a corresponding data object over time, wherein each structure is associated with a temporal state of the data object and wherein the logical arrangement of structures is indicative of the changing temporal states of the data object; finding a temporal state that is common to and in temporal proximity to the current state of the first and second deduplicating stores; and compiling and sending a set of hash signatures for the content that has changed from the common state to the current temporal state of the first deduplicating store.Type: GrantFiled: November 16, 2010Date of Patent: April 9, 2013Assignee: Actifio, Inc.Inventor: Christopher A. Provenzano
-
Patent number: 8402004Abstract: Systems and methods are disclosed for forming deduplicated images of a data object that changes over time using difference information between temporal states of the data object. The method includes organizing the content of the data object for a first temporal state as a plurality of content segments and storing the content segments in a data store; creating an organized arrangement of hash structures to represent the data object in its first temporal state; receiving difference information for the data object; forming at least one hash signature for the changed content; and storing the changed content that is unique in the data store as content segments, whereby a deduplicated image of the data object for a second temporal state is stored without requiring reception of a complete image of the data object for the second temporal state.Type: GrantFiled: November 16, 2010Date of Patent: March 19, 2013Assignee: Actifio, Inc.Inventors: Christopher A. Provenzano, Mark A. Roman
-
Patent number: 8396905Abstract: Systems and methods are disclosed for performing garbage collection to identify content segments no longer referenced in a deduplicating storage system in which redundant mark operations in a mark-and-sweep technique are avoided. An organized arrangement of hash structures is created for each data object, wherein each structure includes a hash signature for a corresponding content segment and is associated with a reference to the corresponding content segment, and the logical organization of the arrangement represents the logical organization of the content segments as they are represented within the data object. Additionally, for each data object, temporal states are maintained over time. Garbage collection iterates over the temporal structures and, for each temporal structure, marks the garbage collection state for the associated content segments for only the content segments that have changed relative to an immediately prior temporal state of the data object.Type: GrantFiled: November 16, 2010Date of Patent: March 12, 2013Assignee: Actifio, Inc.Inventor: Christopher A. Provenzano
-
Publication number: 20130042083Abstract: Systems and methods are provided for an asynchronous data replication system in which the remote replication reduces bandwidth requirements by copying deduplicated differences in business data from a local storage site to a remote, backup storage site, the system comprising: a local performance storage pool for storing data; a local deduplicating storage pool for storing deduplicated data, said local deduplicating storage pool further storing metadata about data objects in the system and which has metadata analysis logic for identifying and specifying differences in a data object over time; a remote performance storage pool for storing a copy of said data, available for immediate use as a backup copy of said data to provide business continuity to said data; a remote deduplicating storage pool for storing deduplicated data; and a controller for synchronizing the remote performance storage pool to have the second version of the data object using deduplicated data.Type: ApplicationFiled: August 1, 2012Publication date: February 14, 2013Applicant: ACTIFIO, INC.Inventors: Madhav MUTALIK, Christopher A. PROVENZANO, Philip J. ABERCROMBIE
-
Publication number: 20130036097Abstract: Systems and methods are disclosed for efficiently creating a data fingerprint to identify or characterize contents of a data object by using a selection function to select a plurality of non-contiguous regions from the data object, the selected regions each having a small number of bytes relative to the number of bytes in the data object and being distributed throughout the data object so that the selected regions comprise a sparse subset of the data of the data object yet provide a significant probability of including bytes that change if the data object were modified; and performing a hash operation on the data to produce a fingerprint based on the sparse subset of the data object. The data fingerprint thereby efficiently provides an indication of the contents of the data object, so that comparing data fingerprints can determine if the data objects are different if the corresponding fingerprints are different.Type: ApplicationFiled: August 1, 2012Publication date: February 7, 2013Applicant: ACTIFIO, INC.Inventors: Madhav MUTALIK, Philip J. ABERCROMBIE, Christopher A. PROVENZANO
-
Publication number: 20130036091Abstract: Systems and methods are disclosed for copying a data object to a target storage pool using a hybrid of storage pools, in which at least one of the storage pools is particularly efficient at identifying data that should be used for copying the data object to the target storage pool, and at least one of the storage pools is particularly efficient at retrieving the data that should be sent to the target storage pool. The system comprises a performance storage pool for storing data and having relatively high performance for retrieving stored data; a deduplicating storage pool for storing deduplicated data and storing metadata about data objects in the system and which has relatively high performance for identifying and specifying differences in a data object over time; and a controller for causing the performance storage pool to retrieve differences and provide the data to the target storage pool.Type: ApplicationFiled: August 1, 2012Publication date: February 7, 2013Applicant: ACTIFIO, INC.Inventors: Christopher A. PROVENZANO, Philip J. ABERCROMBIE, Madhav MUTALIK
-
Publication number: 20130036098Abstract: Systems and methods are disclosed for checking the data integrity of a data object copied between storage pools by comparing data fingerprints of data objects, comprising scheduling a series of successive copy operations over time for copying a data object from a source data store to a target data store; generating a partial fingerprint of the data object at the source data store that creates a fingerprint from a subset of the data object; sending the partial fingerprint of the data object to the target data store; sending any new data contents to the target data store; and creating a partial fingerprint of the data object at the target data store and comparing it to the received partial fingerprint to determine if they differ, thereby allowing incremental verification that the copy of the data object at the target data store is the same as at the source data store.Type: ApplicationFiled: August 1, 2012Publication date: February 7, 2013Applicant: Actifio, Inc.Inventors: Madhav MUTALIK, Philip J. ABERCROMBIE, Christopher A. PROVENZANO
-
Patent number: 8299944Abstract: Systems and methods are disclosed for storing deduplicated images in which a portion of the image is stored in encoded form directly in a hash table, the method comprising: organizing unique content of each data object as a plurality of content segments and storing the content segments in a data store; receiving content to be included in the deduplicated image of the data object; determining if the received content may be encoded using a predefined non-lossy encoding technique and in which the encoded value would fit within the field for containing a hash signature; if so, placing the encoding in the field and marking the hash structure to indicate that the field contains encoded content; otherwise, generating a hash signature for the received content and placing the hash signature in the field and placing the received content in a corresponding content segment if it is unique.Type: GrantFiled: November 16, 2010Date of Patent: October 30, 2012Assignee: Actifio, Inc.Inventor: Christopher A. Provenzano
-
Publication number: 20120124105Abstract: Systems and methods are disclosed for performing garbage collection to identify content segments no longer referenced in a deduplicating storage system in which redundant mark operations in a mark-and-sweep technique are avoided. An organized arrangement of hash structures is created for each data object, wherein each structure includes a hash signature for a corresponding content segment and is associated with a reference to the corresponding content segment, and the logical organization of the arrangement represents the logical organization of the content segments as they are represented within the data object. Additionally, for each data object, temporal states are maintained over time. Garbage collection iterates over the temporal structures and, for each temporal structure, marks the garbage collection state for the associated content segments for only the content segments that have changed relative to an immediately prior temporal state of the data object.Type: ApplicationFiled: November 16, 2010Publication date: May 17, 2012Applicant: ACTIFIO, INC.Inventor: Christopher A. PROVENZANO
-
Publication number: 20120123999Abstract: Systems and methods are disclosed for a system for managing data in accordance with service level agreements (SLAs) that specify schedules on a calendar basis for performing prescribed data management functions and for reducing inter-function redundancy. This system may include a data management engine for performing data management functions, including at least a snapshot function, and at least one back-up function. The data management engine may include a service level policy engine that receives SLAs in electronic form and which controls the scheduling of the data management functions. Each electronic SLA may be associated with a corresponding application and may specify at least one service level policy, each policy specifying a source pool, a destination pool, a copy frequency, a retention period and schedule information. A collection of policies is capable of expressing a non-uniform schedule for when a given function should be performed.Type: ApplicationFiled: November 16, 2010Publication date: May 17, 2012Applicant: Actifio, Inc.Inventors: Ashutosh Ashutosh, David F. Chang, Madhav Mutalik
-
SYSTEM AND METHOD FOR MANAGING DEDUPLICATED COPIES OF DATA USING TEMPORAL RELATIONSHIPS AMONG COPIES
Publication number: 20120124046Abstract: Systems and methods are disclosed for managing deduplicated images of data objects that change over time. The method includes: organizing unique content of each data object as a plurality of content segments and storing the content segments in a data store; for each data object, creating an organized arrangement of hash structures, wherein each structure, for a subset of the hash structures, includes a hash signature for a corresponding content segment and is associated with a reference to the corresponding content segment, and for each data object, maintaining an organized arrangement of temporal structures to represent a corresponding data object over time, wherein each structure is associated with a temporal state of the data object, and wherein each temporal state is associated with the hash structures representing the content of the data object during that temporal state.Type: ApplicationFiled: November 16, 2010Publication date: May 17, 2012Applicant: ACTIFIO, INC.Inventor: Christopher A. PROVENZANO -
Publication number: 20120124306Abstract: Systems and methods for backing-up data from a first storage pool to a second storage pool using difference information between time states are disclosed. The system has a data management engine for performing data management functions, including at least a back-up function to create a back-up copy of data. By executing a sequence of snapshot operations to create point-in-time images of application data on a first storage pool, each successive point-in-time image corresponding to a specific, successive time-state of the application data, a series of snapshots is created. The snapshots are then used to create difference information indicating which application data has changed and the content of the changed application data for the corresponding time state. This difference information is then sent to a second storage pool to create a back-up copy of data for the current time-state.Type: ApplicationFiled: November 16, 2010Publication date: May 17, 2012Applicant: ACTIFIO, INC.Inventors: Philip J. ABERCROMBIE, Madhav MUTALIK, Christopher A. PROVENZANO, Mark A. ROMAN