Method for creating a virtual data copy of a volume being restored
In one embodiment of the method, first and second data volumes are created. Thereafter, a first data portion of the first data volume is overwritten with a first data portion of the second data volume. A second data portion of the first data volume is overwritten with a second data portion of the second data volume. In one embodiment, the first and second data portions of the first data volume are overwritten with the first and second data portions of the second data volume, respectively, in response to a command to restore or synchronize the data contents of the first data volume to the data contents of the second data volume. A virtual point-in-time (PIT) copy of the first data volume is created after overwriting the first data portion but before overwriting the second data portion.
The present patent application is a continuation of U.S. patent application Ser. No. 10/327,536, filed on Dec. 20, 2002 now U.S. Pat. No. 6,978,354, entitled “Method for Creating a Virtual Data Copy of a Volume Being Restored” and is incorporated by reference herein in its entirety and for all purposes.
BACKGROUND OF THE INVENTION
Many businesses rely on large-scale data processing systems for storing and processing business data.
Data processing system 10 includes a host node 12 coupled to data storage systems 14–18. The term coupled should not be limited to what is shown within
Data storage systems 14–18 include data memories 20–24, respectively. Alternatively, data memories 20–24 may be included within a single data storage system. Each of the data memories 20–24 may take form in one or more dynamic or static random access memories, one or more arrays of magnetic or optical data storage disks, or combinations thereof. Data memories 20–24 should not be limited to the foregoing hardware components; rather, data memories 20–24 may take form in any hardware, software, or combination of hardware and software in which data may be persistently stored and accessed. Data memories 20–24 may take form in a complex construction of several hardware components operating under the direction of software. The data memories may take form in mirrored hardware. It is further noted that the present invention may find use with many types of redundancy/reliability systems. For example, the present invention may be used with Redundant Array of Independent Disks (RAID) systems. Moreover, the present invention should not be limited to use in connection with the host node of a data storage network. The present invention may find use in a storage switch or in any of many distinct appliances that can be used with a data storage system.
Data memory 20 stores data of a primary data volume. The primary data volume is the working volume of data processing system 10. Data memories 22 and 24 store or may be configured to store data of separate data volumes. For purposes of explanation, data memories 22 and 24 will be described as storing data of first and second data volumes, respectively. The first and second data volumes may be point-in-time (PIT) copies of the primary data volume or modified PIT (MPIT) copies of the primary data volume. A PIT copy, as its name implies, is a copy of the primary data volume created at some point in time. The first and second data volumes can be used to back up the primary data volume. The first and second data volumes can also be used to facilitate analysis of primary volume data without modifying data of the primary data volume.
As will be more fully described below, the first and second data volumes can be virtual or real. The first data volume is virtual when some data of the first data volume is stored in memory 20 (or 24). The first data volume is real when all data of the first data volume is stored in memory 22. Likewise, the second data volume is virtual when some data of the second data volume is stored in memory 20 (or 22). The second data volume is real when all data of the second data volume is stored in memory 24. A virtual data volume can be converted to a real data volume via a background data copying process performed by host node 12. In the background copying process, for example, data of the first virtual volume is copied from memory 20 to memory 22 until all data of the first data volume is stored in memory 22.
Host node 12 may take form in a computer system (e.g., a server computer system) that processes requests from client computer systems (not shown). To respond to the requests, host node 12 may be required to process data of the primary data volume. Host node 12 generates read or write-data transactions that access memory 20 in response to receiving requests from client computer systems. Host node 12 is also capable of accessing memory 22 or 24 through read or write-data transactions.
Host node 12 includes a data storage management system (not shown) that takes form in software instructions executing on one or more processors (not shown) within host node 12. The data storage management system may include a file system and a system for managing the distribution of data of a volume across several memory devices. Volume Manager™ provided by VERITAS Software Corporation of Mountain View, Calif. is an exemplary system for managing the distribution of volume data across memory devices. Volume and disk management products from other software companies also provide a system for managing the distribution of volume data across memory devices. Hardware RAID adapter cards and RAID firmware built into computer systems likewise provide this function.
The first and second volumes can be virtual PIT or MPIT copies of the primary data volume. Host node 12 can create a first virtual volume according to the methods described in copending U.S. patent application Ser. No. 10/143,059 entitled “Method and Apparatus for Creating a Virtual Data Copy” which is incorporated herein by reference in its entirety.
When host node 12 creates the first virtual volume, host node 12 creates a pair of valid/modified (VM) maps such as maps 30 and 32 represented in
The first and second bits in each entry are designated Vn and Mn, respectively. Vn in each entry, depending on its state, indicates whether its corresponding block n in memory contains valid data. For example, when set to logical 1, V2 of VM map 30 indicates that block 2 of memory 20 contains valid primary volume data, and when set to logical 0, V2 of VM map 30 indicates that block 2 of memory 20 contains no valid primary volume data. It is noted that when Vn is set to logical 0, its corresponding memory block may contain data, but this data is not considered valid. V2 of VM map 32, when set to logical 1, indicates that block 2 of memory 22 contains valid data of the first volume (e.g., the first virtual PIT copy). V2 of VM map 32, when set to logical 0, indicates that block 2 of memory 22 does not contain valid data.
Mn in each entry, depending on its state, indicates whether data within block n of the corresponding memory has been modified. For example, when set to logical 1, M3 of VM map 30 indicates that block 3 of memory 20 contains data that was modified via a write-data transaction since creation of the first virtual volume. When set to logical 0, M3 of VM map 30 indicates that block 3 of memory 20 contains unmodified data. Likewise, M3 in VM map 32, when set to logical 1, indicates that block 3 in memory 22 contains data that was modified via a write-data transaction since creation of the first virtual volume. When set to logical 0, M3 of VM map 32 indicates that block 3 of memory 22 contains unmodified data.
When VM maps 30 and 32 are first created, each entry of VM map 32 is set to logical 0, thus indicating that memory 22 contains no valid or modified data. For purposes of explanation, it is presumed that each block of data memory 20 contains valid data of the primary volume. Accordingly, Vn of each entry in VM map 30 is initially set to logical 1. Lastly, Mn of each entry in VM maps 30 and 32 is initially set to logical 0. Host node 12 can change the state of one or more bits in any map entry using single or separate I/O operations at the memory address that stores the map entry.
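The initial state of the two VM maps can be illustrated with a minimal sketch. The names `VMEntry` and `new_vm_maps` are hypothetical and chosen only for illustration; the text does not specify how the maps are laid out in memory.

```python
from dataclasses import dataclass


@dataclass
class VMEntry:
    """One map entry for block n (hypothetical layout)."""
    valid: bool = False     # Vn: the block holds valid data
    modified: bool = False  # Mn: the block was modified since the virtual volume was created


def new_vm_maps(n_blocks):
    """Return (map_30, map_32) in the initial state described above."""
    # VM map 30 (memory 20): every block is presumed valid and none is modified.
    map_30 = [VMEntry(valid=True) for _ in range(n_blocks)]
    # VM map 32 (memory 22): no valid or modified data yet.
    map_32 = [VMEntry() for _ in range(n_blocks)]
    return map_30, map_32


map_30, map_32 = new_vm_maps(4)
```

Using one entry object per block keeps Vn and Mn together, which mirrors the description of a single map entry that can be updated with one I/O operation.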
After VM maps 30 and 32 are initiated, host node 12 may run a background process to copy data of memory 20 to memory 22 in a block by block or blocks by blocks fashion. Eventually, this background process will transform the first virtual volume into a first real volume. However, before the background copying process is started or completed, host node 12 can modify data of the primary data volume or the first virtual volume.
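A background copying pass of this kind might look like the following sketch. It assumes, as an illustration only, that blocks whose V bit in VM map 32 is already set are skipped because memory 22 already holds the volume's data for them; the function name `background_copy` and the list-based memories are hypothetical, and locking against concurrent writes is ignored.

```python
def background_copy(memory_20, memory_22, valid_32):
    """Copy every block not yet valid in memory 22 from memory 20.

    Returns True once all V bits of VM map 32 are set, i.e. the first
    volume has become a real volume.
    """
    for n in range(len(memory_20)):
        if not valid_32[n]:
            memory_22[n] = memory_20[n]  # block-by-block copy
            valid_32[n] = True           # set Vn in VM map 32
    return all(valid_32)


# Hypothetical state: block 1 was already placed in memory 22 earlier.
memory_20 = ["A", "B", "C"]
memory_22 = [None, "B-at-PIT", None]
valid_32 = [False, True, False]
is_real = background_copy(memory_20, memory_22, valid_32)
```

In this example block 1 is left untouched, so data already stored for the first volume is not overwritten by newer primary data.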
As noted above, before the background copying process begins or completes, the first virtual volume can be modified in accordance with write-data transactions generated by host node 12.
The primary data volume can be restored or synchronized to the contents of the first volume in response to host node 12 receiving a restore command. The restore method includes overwriting data of block n in memory 20 with data of block n of memory 22 for each block n of memory 20 that contains data that differs from data in block n of memory 22. U.S. patent application Ser. No. 10/254,753 entitled “Method and Apparatus for Restoring a Corrupted Data Volume”, which is incorporated herein by reference in its entirety, illustrates one method for restoring a primary data volume to the contents of a virtual volume.
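The restore rule stated above can be sketched as follows. This is an illustration under assumptions, not the method of the referenced application: blocks are compared directly rather than through map bits, and a block of memory 22 is consulted only when its V bit is set, since an invalid block means the first volume still shares that block with memory 20. The name `restore_primary` is hypothetical.

```python
def restore_primary(memory_20, memory_22, valid_32):
    """Overwrite each block of memory 20 that differs from memory 22.

    Returns the list of block numbers that were overwritten.
    """
    overwritten = []
    for n in range(len(memory_20)):
        # Assumption: only valid blocks of memory 22 can differ from memory 20.
        if valid_32[n] and memory_22[n] != memory_20[n]:
            memory_20[n] = memory_22[n]
            overwritten.append(n)
    return overwritten


memory_20 = ["A", "B-corrupt", "C"]
memory_22 = [None, "B", "C"]
valid_32 = [False, True, True]
overwritten = restore_primary(memory_20, memory_22, valid_32)
```

Only block 1 is rewritten here; block 2 is valid in memory 22 but already identical, and block 0 has no separate copy in memory 22.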
In one embodiment of the restore method, host node 12 adds a third bit Rn to each entry in VM map 30.
Often, it is desirable to sequentially create new virtual PIT copies of the primary data volume during the day. Any one of these virtual PIT copies can be used to restore the primary volume should the primary volume experience data corruption. When the primary data volume is being restored to the contents of one of the virtual PIT copies, host node 12 may start to create a new virtual PIT copy in its schedule of creating virtual PIT copies. However, before host node 12 can create the new virtual PIT copy, the process to restore the primary data volume to the contents of one of the virtual PIT copies must complete (i.e., all Rn bits of VM map 30 are set to logical 1). The operation to restore the primary data volume to the state of the virtual copy may take a substantial amount of time, thus delaying the creation of the new virtual PIT copy of the primary volume.
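The delay described above can be made concrete with a small sketch. It assumes, as an inference from the text, that Rn set to logical 1 means block n needs no further restoring, and that the restore sets one R bit per block as it proceeds. The function names are hypothetical.

```python
def restore_next_block(memory_20, memory_22, restored):
    """Restore one pending block and set its R bit.

    Returns False when no block is pending.
    """
    for n, done in enumerate(restored):
        if not done:
            memory_20[n] = memory_22[n]  # overwrite primary block n
            restored[n] = True           # set Rn in VM map 30
            return True
    return False


def can_create_new_pit_copy(restored):
    """Host node 12 must wait until every R bit is set."""
    return all(restored)


memory_20 = ["a1", "b1", "c1"]
memory_22 = ["a0", None, "c0"]    # block 1 never needed restoring
restored = [False, True, False]   # R bits of VM map 30

waits = 0
while not can_create_new_pit_copy(restored):
    restore_next_block(memory_20, memory_22, restored)
    waits += 1  # each iteration is time the new PIT copy is delayed
```

With many blocks pending, the loop above is what postpones the scheduled PIT copy on host node 12.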
SUMMARY OF THE INVENTION
Disclosed is an apparatus and method for creating a virtual data copy of a volume being restored. In one embodiment of the method, first and second data volumes are created. Thereafter, a first data portion of the first data volume is overwritten with a first data portion of the second data volume. A second data portion of the first data volume is overwritten with a second data portion of the second data volume. In one embodiment, the first and second data portions of the first data volume are overwritten with the first and second data portions of the second data volume, respectively, in response to a command to restore or synchronize the data contents of the first data volume to the data contents of the second data volume. A virtual point-in-time (PIT) copy of the first data volume is created after overwriting the first data portion but before overwriting the second data portion.
The present invention may be better understood, and its numerous objects, features, and advantages made apparent to those skilled in the art by referencing the accompanying drawings.
The use of the same reference symbols in different drawings indicates similar or identical items.
DETAILED DESCRIPTION
The present invention relates to an apparatus and method of creating a virtual PIT copy of a data volume while the data volume is being restored. In one embodiment, the method is implemented as software instructions executing on one or more microprocessors.
Unlike host node 12, host node 38 is capable of creating a new virtual PIT copy of the primary data volume while the primary data volume is being restored (or before all Rn bits of VM map 30 are set to logical 1) to the contents of the first virtual volume. Memory 24 can be configured to store data of the new virtual PIT copy. Host node 38 creates the new virtual PIT copy by creating VM map 34 shown in
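One plausible way to read a block through a virtual PIT copy created in the middle of a restore is sketched below. This is an assumption drawn from the claims, which recite copying a data portion "from the first or second volume" into the memory allocated to the virtual PIT copy; it is not a statement of how host node 38 is implemented. The sketch assumes that an R bit of 0 marks a block still awaiting restoration, that memory 22 holds valid data for every such block, and that the name `read_new_pit_block` is hypothetical.

```python
def read_new_pit_block(n, memory_20, memory_22, memory_24, valid_34, restored):
    """Return block n as seen through the new virtual PIT copy."""
    if valid_34[n]:
        return memory_24[n]  # already stored in the new copy's own memory
    if not restored[n]:
        return memory_22[n]  # restore pending: take the restore source's data
    return memory_20[n]      # restored or never changed: primary is current


memory_20 = ["a0", "b1", "c1"]    # block 0 restored, block 2 still pending
memory_22 = ["a0", None, "c0"]    # first virtual volume (restore source)
memory_24 = [None, None, None]    # memory for the new virtual PIT copy
valid_34 = [False, False, False]  # V bits of VM map 34
restored = [True, True, False]    # R bits of VM map 30

snapshot = [
    read_new_pit_block(n, memory_20, memory_22, memory_24, valid_34, restored)
    for n in range(3)
]
```

Under this reading the new copy reflects the primary volume as it will look once the restore finishes, so its creation does not have to wait for the remaining R bits.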
The new virtual PIT copy can be converted into a real PIT copy of the primary data volume via host node 38 executing a background copying process.
Before the conversion process illustrated in
In addition to the ability to modify the primary data volume before the new virtual PIT copy is converted into a real PIT copy of the primary data volume, host node 38 can modify data of the new virtual PIT copy thereby transforming the new virtual PIT copy into a virtual MPIT copy.
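A write to the new virtual PIT copy can be sketched along the lines of claim steps (m) and (n): the block is first copied into the memory allocated to the copy and then modified there. The sketch assumes that VM map 34 carries V and M bits like the other maps, that an R bit of 0 marks a block still awaiting restoration, and that blocks are byte strings; `write_new_pit_block` and its parameters are hypothetical.

```python
def write_new_pit_block(n, offset, data, memory_20, memory_22, memory_24,
                        valid_34, modified_34, restored):
    """Modify part of block n of the new virtual PIT copy."""
    if not valid_34[n]:
        # (m) copy the block from the primary or from the restore source
        source = memory_20 if restored[n] else memory_22
        memory_24[n] = bytearray(source[n])
        valid_34[n] = True
    # (n) modify the copied block in memory 24 only
    memory_24[n][offset:offset + len(data)] = data
    modified_34[n] = True


memory_20 = [b"aaaa", b"bbbb"]
memory_22 = [b"AAAA", None]
memory_24 = [None, None]
valid_34 = [False, False]
modified_34 = [False, False]
restored = [False, True]  # block 0 has not been restored yet

write_new_pit_block(0, 1, b"zz", memory_20, memory_22, memory_24,
                    valid_34, modified_34, restored)
```

Copying before modifying leaves memories 20 and 22 untouched, so the primary volume and the restore source are unaffected by changes made to the MPIT copy.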
Although the present invention has been described in connection with several embodiments, the invention is not intended to be limited to the embodiments described herein. On the contrary, it is intended to cover such alternatives, modifications, and equivalents as can be reasonably included within the scope of the invention as defined by the appended claims.
Claims
1. A method comprising:
- (a) creating a first data volume;
- (b) creating a second data volume;
- (c) overwriting a first data portion of the first data volume with a first data portion of the second data volume;
- (d) overwriting a second data portion of the first data volume with a second data portion of the second data volume;
- (e) creating a virtual point-in-time (PIT) copy of the first data volume, wherein (e) occurs after (c) but before (d);
- (m) copying a data portion from the first or second volume to memory allocated to store the virtual PIT copy;
- (n) modifying the data portion copied to the memory allocated to store the virtual PIT copy.
2. The method of claim 1 wherein (n) occurs after (c) but before (d).
3. The method of claim 2 further comprising:
- (o) copying another data portion from the first or second volume to memory allocated to store the virtual PIT copy;
- (p) modifying the other data portion copied to the memory allocated to store the virtual PIT copy, wherein (p) occurs before (m).
4. The method of claim 1 further comprising:
- (f) modifying data of a third data portion of the first data volume after (c) but before (d).
5. The method of claim 1 further comprising:
- (f) modifying data of a third data portion of the first data volume, wherein (f) occurs after (c) but before (d), and wherein (e) occurs after (f).
6. The method of claim 5 further comprising:
- (g) allocating memory to store data of the virtual PIT copy;
- (h) copying the third data portion to the memory allocated in (g) before (f).
7. The method of claim 1 wherein (c) and (d) occur in response to generation of a command to synchronize the data contents of the first data volume to the data contents of the second data volume.
8. The method of claim 1 further comprising:
- (i) modifying data of the first and second data portions of the first data volume, wherein (i) occurs before (c) and (d);
- wherein the second data volume is created as a virtual PIT copy of the first data volume in (b), and wherein (b) occurs before (i).
9. The method of claim 1 further comprising:
- (j) generating a write-data transaction for modifying data of the second data portion of the first data volume, wherein (j) occurs after (e) but before (d);
- (k) copying the first data portion of the second data volume to memory allocated to the virtual PIT copy, wherein (k) occurs after (j) but before (d).
10. A computer readable medium comprising instructions executable by a computer system, wherein the computer system implements a method in response to executing the instructions, the method comprising:
- (c) overwriting a first data portion of a first data volume with a first data portion of a second data volume;
- (d) overwriting a second data portion of the first data volume with a second data portion of the second data volume;
- (e) creating a virtual point-in-time (PIT) copy of the first data volume, wherein (e) occurs after (c) but before (d);
- (m) copying a data portion from the first or second volume to memory allocated to store the virtual PIT copy;
- (n) modifying the data portion copied to the memory allocated to store the virtual PIT copy.
11. The computer readable medium of claim 10 wherein (n) occurs after (c) but before (d).
12. The computer readable medium of claim 11 wherein the method further comprises:
- (o) copying another data portion from the first or second volume to memory allocated to store the virtual PIT copy;
- (p) modifying the other data portion copied to the memory allocated to store the virtual PIT copy, wherein (p) occurs before (m).
13. The computer readable medium of claim 10 wherein the method further comprises:
- (f) modifying data of a third data portion of the first data volume after (c) but before (d).
14. The computer readable medium of claim 10 wherein the method further comprises:
- (f) modifying data of a third data portion of the first data volume, wherein (f) occurs after (c) but before (d), and wherein (e) occurs after (f).
15. The computer readable medium of claim 10 wherein the method further comprises:
- (g) allocating memory to store data of the virtual PIT copy;
- (h) copying the third data portion to the memory allocated in (g) before (f).
16. The computer readable medium of claim 10 wherein (c) and (d) occur in response to generation of a command to synchronize the data contents of the first data volume to the data contents of the second data volume.
17. The computer readable medium of claim 10 wherein the method further comprises:
- (i) modifying data of the first and second data portions of the first data volume, wherein (i) occurs before (c) and (d);
- wherein the second data volume is created as a virtual PIT copy of the first data volume in (b), and wherein (b) occurs before (i).
18. The computer readable medium of claim 10 wherein the method further comprises:
- (j) generating a write-data transaction for modifying data of the second data portion of the first data volume, wherein (j) occurs after (e) but before (d);
- (k) copying the first data portion of the second data volume to memory allocated to the virtual PIT copy, wherein (k) occurs after (j) but before (d).
19. A method comprising:
- (a) creating a first PIT copy of a primary data volume;
- (b) modifying one or more data portions of the first PIT copy or the primary data volume after creation of the first PIT copy;
- (c) synchronizing the data contents of the primary data volume to the data contents of the first PIT copy, wherein (c) occurs after (b);
- (d) creating a virtual PIT copy of the primary data volume, wherein (d) occurs before (c) has completed.
20. The method of claim 19 further comprising:
- (e) copying a data portion from the primary volume or first PIT copy to memory allocated to store the virtual PIT copy;
- (f) modifying the data portion copied to the memory allocated to store the virtual PIT copy.
21. The method of claim 20 wherein (f) occurs before (c) has completed.
22. The method of claim 21 further comprising:
- (g) copying another data portion from the primary volume or first PIT copy to the memory allocated to store the virtual PIT copy;
- (h) modifying the other data portion copied to the memory allocated to store the virtual PIT copy, wherein (g) occurs before (f).
Type: Grant
Filed: Nov 1, 2005
Date of Patent: Aug 15, 2006
Assignee: Veritas Operating Corporation (Mountain View, CA)
Inventors: Anand A. Kekre (Pune), John A. Colgrove (Los Altos, CA), Oleg Kiselev (Palo Alto, CA)
Primary Examiner: Mano Padmanabhan
Assistant Examiner: Paul Baker
Attorney: Campbell Stephenson Ascolese LLP
Application Number: 11/264,072
International Classification: G06F 12/00 (20060101);