VOLUME REMOTE COPY BASED ON APPLICATION PRIORITY
Example implementations described herein involve systems and methods which automatically determine volumes to be replicated for disaster recovery based on the execution priority of an application which uses the volumes. Such example implementations can involve systems and methods involving creating a volume in a first storage system for each of one or more containers newly launched on one or more servers managing a container orchestrator; and establishing replication of the volume for the each of the newly launched one or more containers to a second storage system in order from highest container priority to lowest container priority.
Latest Patents:
The present disclosure is generally related to disaster recovery systems, and more specifically, to volume remote copy based on application priority.
Related ArtWith the growth of digital service business, availability of application serving the digital services have a huge impact against business. Therefore, to establish resiliency against incidents, disaster recovery is required.
Related art implementations facilitate disaster recovery (DR) and high availability (HA) in which data is replicated across storages.
In another related art implementation, there is a virtual machine (VM) deployment mechanism with DR based on user request for data redundancy. In this mechanism, users request whether DR is necessary when users deploy a VM. Based on the request, system decides volume locations which meets the requirement and configures replication between the volumes. Finally, the system decides VM locations which can access volumes from.
SUMMARYUsers would like to configure DR for as many applications permitted within the available resources in the order of importance of applications so as to improve resiliency of applications while using Informational Technology (IT) resource efficiently.
In addition, lower operational cost is desired because new applications for the digital service are frequently deployed to deliver new value to customer, and their requirements can change frequently.
However, related art implementations require users to determine which volumes should be replicated based on the order of importance between applications and reconfigure volume replication based on the determination as to whenever new applications are deployed or as the importance of each application changes.
Aspects of the present disclosure involve a method, which can involve creating a volume in a first storage system for each of one or more containers newly launched on one or more servers managing a container orchestrator; and establishing replication of the volume for the each of the newly launched one or more containers to a second storage system in order from highest container priority to lowest container priority.
Aspects of the present disclosure involve a computer program, storing instructions for executing a process, the instructions involving creating a volume in a first storage system for each of one or more containers newly launched on one or more servers managing a container orchestrator; and establishing replication of the volume for the each of the newly launched one or more containers to a second storage system in order from highest container priority to lowest container priority. The instructions can be stored on a non-transitory computer readable medium.
Aspects of the present disclosure involve a system, involving means for creating a volume in a first storage system for each of one or more containers newly launched on one or more servers managing a container orchestrator; and means for establishing replication of the volume for the each of the newly launched one or more containers to a second storage system in order from highest container priority to lowest container priority.
Aspects of the present disclosure involve an apparatus, involving a processor configured to create a volume in a first storage system for each of one or more containers newly launched on one or more servers managing a container orchestrator; and establish replication of the volume for the each of the newly launched one or more containers to a second storage system in order from highest container priority to lowest container priority.
Aspects of the present disclosure involve a method, which can include establishing replication of the volume for the each of the one or more computing units on one or more servers managing a computing unit orchestrator to a second storage system in order from highest computing unit priority to lowest computing unit priority.
Aspects of the present disclosure involve a computer program, which can include establishing replication of the volume for the each of the one or more computing units on one or more servers managing a computing unit orchestrator to a second storage system in order from highest computing unit priority to lowest computing unit priority. The computer program may be stored on a non-transitory computer readable medium to be executed by one or more processors.
Aspects of the present disclosure involve a system, which can include means for establishing replication of the volume for the each of the one or more computing units on one or more servers managing a computing unit orchestrator to a second storage system in order from highest computing unit priority to lowest computing unit priority.
Aspects of the present disclosure involve an apparatus, which can involve a processor configured to establish replication of the volume for the each of the one or more computing units on one or more servers managing a computing unit orchestrator to a second storage system in order from highest computing unit priority to lowest computing unit priority.
The following detailed description provides details of the figures and example implementations of the present application. Reference numerals and descriptions of redundant elements between figures are omitted for clarity. Terms used throughout the description are provided as examples and are not intended to be limiting. For example, the use of the term “automatic” may involve fully automatic or semi-automatic implementations involving user or administrator control over certain aspects of the implementation, depending on the desired implementation of one of ordinary skill in the art practicing implementations of the present application. Selection can be conducted by a user through a user interface or other input means, or can be implemented through a desired algorithm. Example implementations as described herein can be utilized either singularly or in combination and the functionality of the example implementations can be implemented through any means according to the desired implementations.
Example implementations involve systems and mechanism which automatically determines volumes to be replicated for DR based on an execution priority of an application which uses the volumes. This mechanism automatically configures the replication of volumes for as many applications as the amount of resources allow in order of priority of applications. When a new application is deployed or the priority of an application has changed, this mechanism determines the volumes to be replicated or the volumes to be unreplicated based on the priority of the application which uses the volumes. Based on the determination, the mechanism configures and cancels replication.
It is common for users to specify the priority of each application in order to specify which applications have priority to execute when resources such as central processing units (CPUs) and memory are insufficient. Applications that should be prioritized even when resources are insufficient are important for business, and they should be prioritized for recovery even when a disaster occurs. Example implementations described herein utilize the priority of application to determine which volumes should be replicated.
The example implementations are described herein with respect to a platform which uses container technology, because the container technology is usually used as platform for a digital service which requires development and release agility. The container is one example of the computing unit which the present disclosure can be applied. However, the present disclosure is not limited thereto, and can be applicable to other forms of computing units, such as VMs, logical partitions (LPARs), jobs, processes and threads. In these cases, the example implementations can be applied by replacing the container with the respective deployment unit.
In this system, applications are executed as containers. To orchestrate container execution, the container orchestrator 104 is running on servers 101. To manage storage used by the containers, a storage driver 105 is also running on the server 101. Instances of the container orchestrator 104 and the storage driver 105 can run on each server 101 and process their task in a distributed manner. Or, only one instance of them can run on one server and process their task depending on the desired implementation.
The example of
When users request the container orchestrator 104 to prepare volumes for containers, the storage driver 105 create new volumes in a pool of storages or attach existing volumes to the container.
When users deploy containers, users specify the priority of the container. Containers with higher priority are given higher priority to execute from the container orchestrator 104. If the computing resource is insufficient to execute all containers, containers with low priority can be stopped instead.
In example implementations, the storage driver 105 in the first data center (DC1) can retrieve storage pool capacity information of storage systems 102 in the second data center (DC2), as well as volume and container information from the second data center. Based on the retrieved information and the same type of information managed by the first data center, the storage driver 105 determines which volume to be replicated or to be unreplicated based on the priority of the container. Based on the determination, the storage drivers 105 configure replications or cancel replications.
In
Storage system 102 can involve CPU 301, I/O Interface 302, Memory 304, and Network Interface 303 configured to interact with storage drives 305. Memory 304 can manage an Operating System 310, I/O Control Program 311, Management Program 312, Storage Pool Management Table 313, and Copy Management Table 314. Further details of these programs and tables and programs are provided below.
Volume field is divided into “in own cluster” field and “in paired cluster” field. “in own cluster” field has volume name, claim name, size, and replicated field. Volume name is an identifier of the volume. Claim name is an identifier of the volume claim which is pointed from the volume specified by volume name field. This information can be omitted because the storage driver can find the claim name from volume table. Size indicates volume size. Replicated involves information which indicates whether the volume has already been replicated or not.
“in paired cluster” field maintains the pair volume of the copy relation of the volume indicated by “in own cluster” field. If the copy relation is not established, this field is filled with blank values or other values indicative of blank/null. This field can include volume name, claim name, and size. Volume Name is an identifier of volume. Claim name is an identifier of volume claim which is pointed from volume specified by volume name in volume table. This information can be omitted because the storage driver can find the claim name from volume table. Size indicates volume size.
The volume name in own cluster is an identifier of the volume that is the copy source in the first cluster. The claim name in own cluster is an identifier of the volume claim that is copy source in the first cluster. Volume name in paired cluster is an identifier of the volume which is the copy destination in the second cluster. Claim name in paired cluster is an identifier of the volume claim which is the copy destination in the second cluster.
Each entry has source volume ID, source storage ID, destination volume ID, destination storage ID and status. Source volume ID is an identifier of a volume that is the copy source. Source storage ID is an identifier of the storage system that owns the source volume. Destination volume ID is an identifier of a volume that is the copy destination. Destination storage ID is an identifier of a storage which owns the destination volume. Status is the copy status. This field indicates that the copy is ongoing, completed or suspended.
When a container is deployed, the volume used by the container has not yet been created, and it may be necessary to create it newly in the first storage. In this case, the processing illustrated in
First, a storage driver determines the DR target (S1401). An example of this processing is provided in
Next, if the new DR target volume list is empty (S1410 is yes), then the storage driver finishes this processing. If not (S1410 is no), the storage driver executes the process from S1412 to S1416 for each entry of new DR target volume list (loop from S1411 and S1417). The storage driver requests the container orchestrator on the second cluster to create a volume to be copy destination. To request it, the storage driver can send the volume claim specified by the claim name of the entry of the new DR target volume list to the container orchestrator on the second cluster (S1412). In response to the request, the container orchestrator inserts the claim to the volume claim table in the second cluster and requests the storage driver on the second cluster to create the volume (S1414). Based on the request, the storage driver requests a storage system to create the volume (S1414). The storage system then creates the volume. After creating the volume, the storage driver inserts a new entry to the volume table in the second cluster. After that, the container orchestrator returns the information of the volume (S1415). Finally, the storage driver on the first cluster establishes a copy between the created volume and volume indicated by the entry of new DR target volume (S1416).
First, the storage driver on the first cluster requests a storage pool management table, a container table, a volume table and a volume claim table from the container orchestrator on the second cluster (S1501). The container orchestrator requests the storage pool management table from the storage driver on the second cluster (S1502). In response to the request, the storage driver requests the table from the storage systems and return the retrieved table to the container orchestrator (S1503). The container orchestrator returns the container table, the volume table and the volume claim table managed by itself in addition to the obtained storage pool management table (S1504).
Next, the storage driver generates the delete target volume list and the new DR target volume list (S1505) and returns the list to the processing described in
Steps from S1601 to S1615 is a determination of DR candidate without considering capacity limitation. Steps from S1616 to S1626 is a determination of DR target volumes from the DR candidate while considering capacity limitations.
First, the storage driver initializes a predicted used capacity as current used capacity of each pool (S1601). The information was obtained in the processing of
The storage driver executes process from S1602 to S1615 for each container on the first cluster in descending order of containers' priority (loop from S1602 and S1615).
If the priority of the container is equal to or higher than the first threshold (S1603 is yes), the storage driver proceeds to S1604. The first threshold indicates that the lowest priority of containers to be the DR target. The first threshold can be specified by users. This threshold can be omitted. In this case, all containers can be DR targets.
If the container does not have volumes (S1604 is no), the storage driver skips this container. If not (S1604 is yes), the storage drivers make an entry of the DR candidate container list based on the entry of the container in container table, the volume claim table and volume table and same kind of table which are retrieved from the second cluster (S1605). The storage driver finds the entries from these tables which have relations with the container and makes an entry for the DR candidate container list. After that, the storage driver executes the process from S1606 to S1608 for each volume used by the container. If the volume has been already replicated (S1606 is yes), the storage driver sets “yes” to the replicated field of the created entry of DR candidate container list (S1607). If not (S1606 is no), the storage drivers add the capacity of the volume to the predicted total capacity (S1608). By this processing, the storage driver predicts the increase of used capacity of the storage pool when the volume is replicated. The entry is then added to the DR candidate container list (S1609).
If the priority of the container is less than the first threshold (S1603 is no), the storage driver proceeds to S1610. If the container does not have volumes (S1610 is no), the storage driver skips this container. If not (S1610 is yes), the storage driver executes the steps from S1611 to S1614 for each volume used by the container.
If the volume is replicated (S1611 is yes), the storage driver proceeds to S1612. The storage driver makes an entry of the delete target volume list from its entry in the volume claim table, the volume table entry, and the volume claim table entry of the copy destination table (S1612). The storage driver adds the entry to the delete target volume list (S1613). Then, the storage driver subtracts the capacity of the volume from predicted used capacity (S1614). By this calculation, the storage driver predicts the decrease of used capacity of the pool when replication of the volume is canceled.
If the volume is no replicated (S1611 is no), the storage driver skips this volume in order to avoid deleting the unreplicated volume.
Further, the storage driver executes steps from S1616 to S1625 for each entry of DR candidate container list in ascending order of container priority (loop from S1616 and S1625). There can be multiple containers having same priority. The example implementation to determine processing order among containers having same priority is provided as follows.
In one example implementation, the storage driver prioritizes the container having a larger total capacity of volumes which have already replicated. By this method, it is possible to reduce wasted data which has been already replicated to the destination cluster. Such an example implementation involves processing the container having a larger total capacity of replicated volumes later than the container having a smaller total capacity of replicated volumes among containers having the same priority at S1616.
In another example implementation, the storage driver prioritizes containers having a smaller total capacity of volumes which have already replicated. By this method, it is possible to reduce the number of containers for which the DR configuration is canceled. Such an example implementation involves processing the container having a smaller total capacity of replicated volumes later than the container having a larger total capacity of replicated volumes among containers having the same priority at S1616.
In another example implementation, the storage driver prioritizes the container having less total volume capacity. By this method, it is possible to configure DR for more containers. Such an example implementation involves processing the container having less total capacity of volumes later than the container having larger total capacity of volumes among containers having same priority at S1616.
If the predicted used capacity is equal to or less than the used capacity threshold (S1617 is yes), the storage driver proceeds to S1626. This threshold is commonly set in order to warn when the unused capacity is low. Since this threshold is used for avoiding capacity depletion, the used capacity should not exceed this threshold. Therefore, the storage driver tries to avoid exceeding this threshold. If the projected used capacity exceeds this threshold, the storage driver determine that volumes of low priority containers should not be replicated. The storage driver tries to increase free capacity by deleting volumes from storage in the second cluster. Then, if unreplicated volumes in the second cluster are deleted, data will be lost. Therefore, the storage driver determines the deletable volumes from volumes which are the copy destination volume of volumes used by containers in the first cluster. If predicted used capacity is larger than the used capacity threshold (S1617 is no), the storage driver process steps from S1618 to S1623 for each volume used by the container.
If the replicated field is not yes (S1618 is no), the storage driver skips the steps from S1619 to S1623. If the replicated field of the volume is yes (S1618 is yes), the storage driver proceeds to S1619. If the priority of the container is equal to or higher than the second threshold (S1619 is yes), the storage driver alert that DR is not configured against containers which have high priority due to insufficient capacity (S1620). If not (S1619 is not), the storage driver skips S1620. Next, the storage driver makes an entry of delete target volume list for the volume (S1621). The storage driver adds the entry to the delete target volume list (S1622). In addition, the storage driver subtracts the capacity of the volume from the predicted used capacity (S1623). By this calculation, the storage driver predicts the decrease of used capacity of the storage pool when replication of the volume is canceled.
After processing from S1618 to S1623 for all volumes of the container, the storage driver deletes the processing entry from the DR candidate container list (S1624).
Finally, the storage driver makes a new DR target volume list by extracting entries for which “replicated” field of DR candidate container list indicates that the volume has not replicated yet (S1626).
The example implementations described herein can be utilized for IT infrastructure for digital service businesses operated in a DevOps manner. Through such example implementations, users can easily configure disaster recovery for as many applications permitted by the resources in order of importance of the applications without special settings or frequent operations. If new applications are frequently deployed, or application requirements are frequently changed, users are not required to reconfigure the system. Thus, users can achieve resource utilization efficiency, availability and ease of use.
When a problem occurs in the first storage or the first cluster, it is possible to continue the operation of the application that had been running as a container by running the container on the second cluster using the volumes replicated from the first storage to the second storage. There are three ways to operate the container in the second cluster.
The first is the hot standby configuration. In this configuration, the standby containers are launched on the second cluster before the problem occurs on the first cluster or the first storage, but the containers on the second cluster are left in standby state without executing any process. To setup this configuration, when the storage driver establishes replication of the volumes of the containers, the storage driver requests the container orchestrator on the second cluster to launch the containers on the second cluster in standby mode. In addition, when the storage driver cancels replication of the volumes and deletes the replication destination volumes, the storage driver requests the container orchestrator on the second cluster to stop and delete the containers which use the deleted volumes. These processing can be done by users manually. After the storage driver establishes or cancels the replication, users manually deploy or delete the containers on the second cluster in standby mode. The standby container on the second cluster is activated when a failover is initiated after the problem occurs. In addition, the first container on the first cluster uses the volumes in the first storage, and the second container on the second cluster uses the volumes which are replicated from the first storage to the second storage. When there is a problem with the first cluster or the first storage, the second container on the second cluster is activated and resumes operations with using the replicated volumes. This method can be configured without requiring any special capabilities in the application and can recover in a shorter time than the cold standby configuration described below. However, the second container requires IT resource to maintain.
The second is the cold standby configuration. In this configuration, standby containers are not launched on the second cluster before a problem occurs. The standby containers are deployed and launched when a failover is initiated after the problem occurs. In addition, the first container on the first cluster uses the volumes in the first storage, and the second container on the second cluster uses the volumes which are replicated from volumes on the first storage to the second storage. Similar to the hot standby configuration, this configuration does not require any special capabilities for the application. In addition, the cost for the cold standby is smaller than the hot standby because the cold standby does not require any resources to maintain the containers on the second cluster before failover.
The third is the active-active configuration. In this configuration, containers are launched on the second cluster and perform processing before a problem occurs on the first cluster or the first storage. To setup this configuration, when storage driver establishes the replication of the volumes of the containers, the storage driver requests the container orchestrator on the second cluster to launch the containers on the second cluster. In addition, when the storage driver cancels the replication of the volumes and deletes the replication destination volumes, storage driver requests the container orchestrator on the second cluster to stop and delete the containers which use the deleted volumes. These processing can be done by users manually. After the storage driver establishes or cancels the replication, users manually deploy or delete the containers on the second cluster. In order to have this configuration, the application which runs as the container must have the capability to perform processing while sharing data among multiple containers. Further, both containers on the both clusters access volumes in the first storage before failover in this configuration. After failover, both containers on the both clusters access volumes in the second storage. To implement this, path from containers to the volumes in the first storage and path from containers to the volumes in the second storage are established with multipath tool.
To avoid the split-brain situation, only one of the containers on the first cluster and the container on the second cluster should have access to the volumes in hot standby and cold standby configuration. This is because if the both containers writes data to the volumes, data consistency may be collapsed. Further, in the an active-active configuration, only one of the volumes in the first storage and the second storage need to be accessed by the containers. This is because if both containers write data to different volume, data consistency may be collapsed. To prevent these problems, it is necessary to limit the access destination to one side, such as using the volume of the first storage before the failover and using the volume of the second storage after the failover.
To achieve this, users can conduct manual operation. For the hot standby configuration, users can manually stop or inactivate the first container on the first cluster first and activate the second container on the second cluster. For cold standby configuration, users can manually stop the first container on the first cluster first and launch the second container on the second cluster. In both cases, the first container uses only volumes in the first storage and the second container use only volumes in the second storage. For active-active configuration, users manually switch volume to be used for both containers on the first cluster and the second cluster.
However, this method does not always work well to prevent the split-brain situation because it may not be possible to manually stop or inactivate the first container on the first cluster or switch access destination of the first container at failover.
If the storages are replicating data while monitoring the health status of each storage by using quorum and heartbeat communication with technology disclosed by U.S. Ser. No. 10/114,691B2, U.S. Pat. No. 9,794,342B2 and U.S. Pat. No. 8,645,649B2, the split-brain situation can be prevented more reliably and automatically with technology disclosed in U.S. Ser. No. 11/106,371.
In the hot standby configuration, when users activate standby containers running on the second cluster, the storage driver deletes access paths between the first container and the volume in the first storage and access path between the first storage and quorum with the procedure disclosed in U.S. Ser. No. 11/106,371. By this processing, the second container on the second cluster becomes accessible to the volume in the second storage, while the first container on the first cluster becomes inaccessible to the volume in the first storage. As a result, even if the first container is running, it can stop processing and prevent the split-brain situation.
In the cold standby configuration, when users launch containers onto the second cluster, the storage driver delete access paths between the first container and the volume in the first storage and the access path between the first storage and quorum with the procedure disclosed in U.S. Ser. No. 11/106,371. By this processing, the second container on the second cluster becomes accessible to the volume in the second storage, while the first container on the first cluster become inaccessible to the volume in the first storage. As a result, even if the first container is running, it can stop processing and prevent split brain.
In active-active configuration, when users request the storage driver to switch volume to be used, storage driver delete access paths between the first container and the volume in the first storage and access path between the first storage and quorum with the procedure disclosed in U.S. Ser. No. 11/106,371. By this processing, the second container on the second cluster becomes accessible to the volume in the second storage, while the first container on the first cluster become inaccessible to the volume in the first storage. As a result, even if the first container is running, the first container becomes to access the volume in the second storage.
In a first aspect, there is a method which can involve creating a volume in a first storage system for each of one or more containers newly launched on one or more servers managing a container orchestrator; and establishing replication of the volume for the each of the newly launched one or more containers to a second storage system in order from highest container priority to lowest container priority as illustrated in
In a second aspect, there is a method as that of the first aspect, wherein for insufficient capacity existing in the second storage system for the replication of the volume for the each of the newly launched one or more containers, canceling the replication of the one or more containers from the lowest container priority as illustrated in
In a third aspect, there is a method as that of any of the above aspects, further comprising, for ones of the one or more containers having a same priority, for sufficient capacity existing in the second storage system, replicate the ones of the one or more containers with the same priority to the second storage system as illustrated in
In a fourth aspect, there is a method as that of any of the above aspect, further comprising, for insufficient capacity existing in the second storage system, prioritizing ones of the one or more containers with the same priority having larger total capacity of volumes that have already been replicated to the second storage system as illustrated in
In a fifth aspect, there is a method as that of any of the first through third aspects, further comprising, for insufficient capacity existing in the second storage system, prioritizing ones of the one or more containers with the same priority having smaller total capacity of volumes that have already been replicated to the second storage system as illustrated in
In a sixth aspect, there is a method as that of any of the first through third aspects, further comprising, for insufficient capacity existing in the second storage system, prioritizing ones of the one or more containers with the same priority having smaller total capacity as illustrated in
In a seventh aspect, there is a method as that of any of the above aspects, further comprising, for capacity of the second storage system is reached before all volumes of the newly one or more launched containers are replicated to the second storage system, determining one or more replicated volumes in the second storage system associated with previously launched containers having a lower priority than the remaining newly launched one or more containers; and deleting one or more of the one or more replicated volumes associated with previously launched containers having the lower priority than the remaining newly launched on or more containers in the second storage system as illustrated in
In an eight aspect, there is a method as that of any of the above aspects, further comprising, for a change in priority to one or more containers currently launched on the one or more host computers, determining whether replication is needed for the one or more containers with the changed priority; for a determination that replication is needed and that sufficient capacity exists in the second storage system, replicating the one or more containers with the changed priority to the second storage system; for the determination that replications is needed and that the second storage system has insufficient capacity, determining volumes in the second storage system associated with containers having a lower priority than the one or more containers with the changed priority, and deleting the determined volumes associated with containers having the lower priority as illustrated in
In a ninth aspect, there is a method as that of any of the above aspects, wherein the establishing replication of the volume for the each of the newly launched one or more containers to a second storage system in order from the highest container priority to the lowest container priority comprises only replicating volumes for the newly launched one or more containers having a priority beyond a threshold; wherein for a change in priority occurring in previously launched containers, only replicating volumes for ones of the previously launched containers having the change in priority beyond the threshold as illustrated in
In a tenth aspect, there is a method as that of any of the above aspects, wherein the method is executed by a management apparatus configured to manage the one or more servers, the first storage system, and the second storage system and store management information associated with the volume, the newly launched one or more containers, and storage capacity of the first storage system and the second storage system as illustrated in
In an eleventh aspect, there is a method as that in any of the first to ninth aspects, wherein the method is executed by the one or more servers as illustrated in
In a twelfth aspect, there is a method as that in any of that of the above aspects, further comprising providing an alert for when the second storage system has insufficient capacity for the replication of the newly launched one or more containers or for a previously launched container having a change in priority over a threshold as illustrated in
In a thirteenth aspect, there is a method as that in any of the above aspects, where in the method is stored as a computer program having instructions to execute any of the method steps therein through one or more processors. Such a computer program and instructions may be stored in a non-transitory computer readable medium.
In a fourteenth aspect, there is a method as that in any of the first through ninth aspects and twelfth aspect, in which the method is executed by a processor in an apparatus such as a server.
In any of the aspects as described herein, the container may be replaced with a computing unit in accordance with the desired implementation.
Some portions of the detailed description are presented in terms of algorithms and symbolic representations of operations within a computer. These algorithmic descriptions and symbolic representations are the means used by those skilled in the data processing arts to convey the essence of their innovations to others skilled in the art. An algorithm is a series of defined steps leading to a desired end state or result. In example implementations, the steps carried out require physical manipulations of tangible quantities for achieving a tangible result.
Unless specifically stated otherwise, as apparent from the discussion, it is appreciated that throughout the description, discussions utilizing terms such as “processing,” “computing,” “calculating,” “determining,” “displaying,” or the like, can include the actions and processes of a computer system or other information processing device that manipulates and transforms data represented as physical (electronic) quantities within the computer system's registers and memories into other data similarly represented as physical quantities within the computer system's memories or registers or other information storage, transmission or display devices.
Example implementations may also relate to an apparatus for performing the operations herein. This apparatus may be specially constructed for the required purposes, or it may include one or more general-purpose computers selectively activated or reconfigured by one or more computer programs. Such computer programs may be stored in a computer readable medium, such as a computer-readable storage medium or a computer-readable signal medium. A computer-readable storage medium may involve tangible mediums such as, but not limited to optical disks, magnetic disks, read-only memories, random access memories, solid state devices and drives, or any other types of tangible or non-transitory media suitable for storing electronic information. A computer readable signal medium may include mediums such as carrier waves. The algorithms and displays presented herein are not inherently related to any particular computer or other apparatus. Computer programs can involve pure software implementations that involve instructions that perform the operations of the desired implementation.
Various general-purpose systems may be used with programs and modules in accordance with the examples herein, or it may prove convenient to construct a more specialized apparatus to perform desired method steps. In addition, the example implementations are not described with reference to any particular programming language. It will be appreciated that a variety of programming languages may be used to implement the techniques of the example implementations as described herein. The instructions of the programming language(s) may be executed by one or more processing devices, e.g., central processing units (CPUs), processors, or controllers.
As is known in the art, the operations described above can be performed by hardware, software, or some combination of software and hardware. Various aspects of the example implementations may be implemented using circuits and logic devices (hardware), while other aspects may be implemented using instructions stored on a machine-readable medium (software), which if executed by a processor, would cause the processor to perform a method to carry out implementations of the present application. Further, some example implementations of the present application may be performed solely in hardware, whereas other example implementations may be performed solely in software. Moreover, the various functions described can be performed in a single unit, or can be spread across a number of components in any number of ways. When performed by software, the methods may be executed by a processor, such as a general-purpose computer, based on instructions stored on a computer-readable medium. If desired, the instructions can be stored on the medium in a compressed and/or encrypted format.
Moreover, other implementations of the present application will be apparent to those skilled in the art from consideration of the specification and practice of the techniques of the present application. Various aspects and/or components of the described example implementations may be used singly or in any combination. It is intended that the specification and example implementations be considered as examples only, with the true scope and spirit of the present application being indicated by the following claims.
Claims
1. A method, comprising:
- establishing replication of volume for each of a plurality of computing units on one or more servers managing a computing unit orchestrator to a second storage system in order from highest computing unit priority to lowest computing unit priority; and
- when capacity of the second storage system is reached before all volumes of the plurality of computing units are replicated to the second storage system;
- determining one or more replicated volumes in the second storage system associated with previously launched computing units having a lower priority than remaining computing units of the plurality of computing units that have yet to be replicated; and
- deleting one or more of the one or more replicated volumes in the second storage system associated with previously launched computing units having the lower priority than the remaining computing units of the plurality of computing units that have yet to be replicated.
2. The method of claim 1, wherein for insufficient capacity existing in the second storage system for the replication of volume for the each of the plurality of computing units, canceling the replication of the plurality of computing units from the lowest computing unit priority.
3. The method of claim 1, further comprising, for ones of the plurality of computing units having a same priority:
- for sufficient capacity existing in the second storage system, replicate the ones of the plurality of computing units with same priority to the second storage system.
4. The method of claim 3, further comprising:
- for insufficient capacity existing in the second storage system, prioritizing ones of the plurality of computing units with the same priority having larger total capacity of volumes that have already been replicated to the second storage system.
5. The method of claim 3, further comprising:
- for insufficient capacity existing in the second storage system, prioritizing ones of the plurality of computing units with the same priority having smaller total capacity of volumes that have already been replicated to the second storage system.
6. The method of claim 3, further comprising:
- for insufficient capacity existing in the second storage system, prioritizing ones of the plurality of computing units with the same priority having smaller total capacity.
7. (canceled)
8. The method of claim 1, further comprising, for a change in priority to one or more computing units currently launched on one or more host computers:
- determining whether replication is needed for the one or more computing units with the changed priority;
- for a determination that replication is needed and that sufficient capacity exists in the second storage system, replicating the one or more computing units with the changed priority to the second storage system;
- for the determination that replication is needed and that the second storage system has insufficient capacity, determining volumes in the second storage system associated with computing units having a lower priority than the one or more computing units with the changed priority, and deleting the determined volumes associated with computing units having the lower priority.
9. The method of claim 1, wherein the establishing replication of volume for the each of the plurality of computing units to a second storage system in order from highest computing unit priority to lowest computing unit priority comprises only replicating volumes for the plurality of computing units having a priority beyond a threshold;
- wherein for a change in priority occurring in previously launched computing units, only replicating volumes for ones of the previously launched computing units having the change in priority beyond the threshold.
10. The method of claim 1, wherein the method is executed by a management apparatus configured to manage the one or more servers, the first storage system, and the second storage system, and store management information associated with the replicated volumes, the plurality of computing units, and storage capacities of the first storage system and the second storage system.
11. The method of claim 1, wherein the method is executed by the one or more servers.
12. The method of claim 1, further comprising:
- providing an alert for when the second storage system has insufficient capacity for the replication of volumes of the plurality of computing units having higher priority over a threshold.
13. A non-transitory computer readable medium, storing instructions for executing a process, the instructions comprising:
- establishing replication of volume for each of a plurality of computing units to a second storage system in order from highest computing unit priority to lowest computing unit priority; and
- when capacity of the second storage system is reached before all volumes of the plurality of computing units are replicated to the second storage system;
- determining one or more replicated volumes in the second storage system associated with previously launched computing units having a lower priority than remaining computing units of the plurality of computing units that have yet to be replicated; and
- deleting one or more of the one or more replicated volumes in the second storage system associated with previously launched computing units having the lower priority than the remaining computing units of the plurality of computing units that have yet to be replicated.
14. A server, comprising:
- a processor, configured to:
- establish replication of volume for each of a plurality of computing units to a second storage system in order from highest computing unit priority to lowest computing unit priority; and
- when capacity of the second storage system is reached before all volumes of the plurality of computing units are replicated to the second storage system;
- determining one or more replicated volumes in the second storage system associated with previously launched computing units having a lower priority than remaining computing units of the plurality of computing units that have yet to be replicated; and deleting one or more of the one or more replicated volumes in the second storage system associated with previously launched computing units having the lower priority than the remaining computing units of the plurality of computing units that have yet to be replicated.
Type: Application
Filed: Aug 27, 2021
Publication Date: Mar 2, 2023
Applicant:
Inventors: Akiyoshi TSUCHIYA (San Jose, CA), Tomohiro KAWAGUCHI (Santa Clara, CA)
Application Number: 17/459,841