Abstract: The disclosure relates to systems, methods and computer-readable media for dynamically provisioning resources within a compute environment. The method comprises analyzing a queue of jobs to determine an availability of compute resources for each job, determining an availability of a scheduler of the compute environment to satisfy all service level agreements (SLAs) and target service levels within a current configuration of the compute resources, determining possible resource provisioning changes to improve SLA fulfillment, determining a cost of provisioning, and, if the provisioning changes improve overall SLA delivery, re-provisioning at least one compute resource.
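The final decision step of this abstract (weigh the SLA improvement against the provisioning cost, then re-provision only if there is a gain) can be sketched as follows. This is a minimal illustration, not the claimed method: the function name `pick_change`, the tuple layout, and the assumption that SLA scores and costs share one unit so they can be subtracted are all hypothetical.

```python
from typing import List, Optional, Tuple

# Hypothetical candidate layout: (name, projected SLA score, provisioning cost).
Candidate = Tuple[str, float, float]

def pick_change(current_sla: float, candidates: List[Candidate]) -> Optional[str]:
    """Return the candidate with the largest positive net gain, or None."""
    best_name: Optional[str] = None
    best_net = 0.0  # a change must beat "do nothing" to be chosen
    for name, projected_sla, cost in candidates:
        # Assumption: SLA score and cost are comparable on the same scale.
        net = (projected_sla - current_sla) - cost
        if net > best_net:
            best_name, best_net = name, net
    return best_name

choice = pick_change(0.80, [("add-node", 0.90, 0.05), ("reimage", 0.95, 0.20)])
print(choice)
```

Returning `None` corresponds to leaving the current configuration untouched when no change improves overall SLA delivery after cost.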
Abstract: Systems, methods and computer-readable media are disclosed for providing a dynamic roll-back reservation mask in a compute environment. The method of managing compute resources within a compute environment includes, based on an agreement between a compute resource provider and a customer, creating a roll-back reservation mask for compute resources which slides ahead of current time by a period of time. Within the roll-back reservation mask, the method specifies a subset of consumers and compute resource requests which can access compute resources associated with the roll-back reservation mask and, based on received data, the method dynamically modifies at least one of (1) the period of time the roll-back reservation mask slides ahead of current time and (2) the compute resources associated with the roll-back reservation mask.
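A rough sketch of a mask that slides ahead of the current time may help make this concrete. Everything here is an assumption made for illustration: the class `RollbackMask`, its fields, the choice of integer seconds for time, and the rule that requests outside the mask are unrestricted are not taken from the disclosure.

```python
from dataclasses import dataclass
from typing import Optional, Set

@dataclass
class RollbackMask:
    lead: int            # how far ahead of "now" the mask begins (seconds)
    resources: Set[str]  # resources covered by the mask
    allowed: Set[str]    # consumers permitted to reserve inside the mask

    def start(self, now: int) -> int:
        # The mask start is always relative to the current time, so it slides.
        return now + self.lead

    def may_reserve(self, consumer: str, resource: str, at: int, now: int) -> bool:
        inside = resource in self.resources and at >= self.start(now)
        # Assumption: requests that fall outside the mask are not restricted.
        return consumer in self.allowed if inside else True

    def adjust(self, lead: Optional[int] = None,
               resources: Optional[Set[str]] = None) -> None:
        # Dynamic modification of the lead period and/or the covered resources.
        if lead is not None:
            self.lead = lead
        if resources is not None:
            self.resources = set(resources)

mask = RollbackMask(lead=3600, resources={"n1"}, allowed={"alice"})
print(mask.may_reserve("bob", "n1", at=5000, now=1000))
```

Calling `adjust(lead=600)` pulls the mask closer to the present, which in this sketch blocks other consumers from a wider span of future time on the covered resources.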
Abstract: A system and method of co-allocating resources within a compute environment are disclosed. The method embodiment of the invention comprises receiving a request for a reservation for a first type of resource, analyzing constraints and guarantees associated with the first type of resource, identifying a first group of resources that meet the request for the first type of resource and storing in a first list, receiving a request for a reservation for a second type of resource, analyzing constraints and guarantees associated with the second type of resource, identifying a second group of resources that meet the request for the second type of resource and storing in a second list, calculating a co-allocation parameter between the first group of resources and the second group of resources and reserving resources according to the calculated co-allocation parameter of the first group of resources and the second group of resources. The request may also request exclusivity of the reservation.
Abstract: A system, method and computer-readable media are disclosed for providing a reservation mask for compute resources such as a cluster or a grid. The method aspect comprises identifying a need type and a group of available resources, creating a reservation mask over the identified group of resources and, if a request from a consumer matches the need type, constraining the creation of a reservation for the consumer to only use resources within the reservation mask.
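The constraint described here can be illustrated with a small sketch. The names `ReservationMask` and `candidate_resources` are hypothetical, and the handling of requests that do not match the need type (left unconstrained below) is an assumption, since the abstract does not specify it.

```python
from dataclasses import dataclass
from typing import FrozenSet, Set

@dataclass(frozen=True)
class ReservationMask:
    need_type: str             # the need type the mask was created for
    resources: FrozenSet[str]  # group of resources the mask covers

def candidate_resources(need_type: str, free: Set[str],
                        mask: ReservationMask) -> Set[str]:
    """Resources a new reservation may use for a request of the given need type."""
    if need_type == mask.need_type:
        # Matching requests are constrained to resources inside the mask.
        return free & mask.resources
    # Assumption: non-matching requests are not constrained by this mask.
    return set(free)

mask = ReservationMask("gpu", frozenset({"n1", "n2"}))
print(candidate_resources("gpu", {"n1", "n3"}, mask))
```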
Abstract: Disclosed are a system, method and computer-readable medium relating to managing resources within a compute environment having a group of nodes or computing devices. The method comprises, for each node in the compute environment: traversing a list of jobs having a fixed time relationship, wherein for each job in the list, the following steps occur: obtaining a range list of available timeframes for each job, converting each availability timeframe to a start range, shifting the resulting start range in time by a job offset, for a first job, copying the resulting start range into a node range, and for all subsequent jobs, logically AND'ing the start range with the node range. Next, the method comprises logically OR'ing the node range with a global range, generating a list of acceptable resources on which to start and the timeframe at which to start, and creating reservations according to the list of acceptable resources for the resources in the group of computing devices and associated job offsets.
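The per-node AND and cross-node OR of start ranges can be sketched as below. This is one possible reading under stated assumptions, not the patented implementation: time is modeled as integers, ranges are closed intervals, "shifting by a job offset" is taken to mean subtracting the offset so every range is expressed relative to the first job's start, and all function and variable names are hypothetical.

```python
from typing import Dict, List, Tuple

Range = Tuple[int, int]  # closed interval [start, end]

def to_start_ranges(avail: List[Range], duration: int) -> List[Range]:
    # A job of this duration can start anywhere in [s, e - duration].
    return [(s, e - duration) for s, e in avail if e - duration >= s]

def shift(ranges: List[Range], offset: int) -> List[Range]:
    # Express the range relative to the first job's start time.
    return [(s - offset, e - offset) for s, e in ranges]

def and_ranges(a: List[Range], b: List[Range]) -> List[Range]:
    out = []
    for s1, e1 in a:
        for s2, e2 in b:
            lo, hi = max(s1, s2), min(e1, e2)
            if lo <= hi:
                out.append((lo, hi))
    return sorted(out)

def or_ranges(a: List[Range], b: List[Range]) -> List[Range]:
    merged: List[Range] = []
    for s, e in sorted(a + b):
        if merged and s <= merged[-1][1]:
            merged[-1] = (merged[-1][0], max(merged[-1][1], e))
        else:
            merged.append((s, e))
    return merged

def global_start_range(jobs: List[Tuple[str, int, int]],
                       availability: Dict[str, Dict[str, List[Range]]]) -> List[Range]:
    """jobs: (name, duration, offset); availability[node][job] -> free timeframes."""
    global_range: List[Range] = []
    for per_job in availability.values():
        node_range: List[Range] = []
        for i, (name, duration, offset) in enumerate(jobs):
            start = shift(to_start_ranges(per_job[name], duration), offset)
            # First job: copy into the node range; later jobs: AND with it.
            node_range = start if i == 0 else and_ranges(node_range, start)
        global_range = or_ranges(global_range, node_range)
    return global_range

jobs = [("A", 10, 0), ("B", 10, 15)]
availability = {
    "n1": {"A": [(0, 100)], "B": [(20, 60)]},
    "n2": {"A": [(50, 80)], "B": [(0, 100)]},
}
print(global_start_range(jobs, availability))  # [(5, 35), (50, 70)]
```

In this example, node `n1` can host the job set if the first job starts between 5 and 35, and node `n2` if it starts between 50 and 70; the OR of the two gives the global range from which reservations would then be created.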
Abstract: A system, method and non-transitory computer readable storage medium are disclosed for managing workload in a data center. The method includes receiving, at a workload manager, energy consumption information associated with at least one of a first data center and a second data center that is geographically distanced from the first data center. The method further includes receiving workload information associated with at least one of the first data center and the second data center, and transmitting, as controlled by the workload manager, at least part of workload scheduled to be processed at the second data center to the first data center based on at least one of the energy consumption information and the workload information.
Abstract: A system, method and non-transitory computer readable storage medium are disclosed for managing workload in a data center. The method includes receiving, at a workload manager, energy consumption information. The method further includes receiving workload information about a second data center that is remote from a first data center, and receiving, as controlled by the workload manager, at the first data center at least a portion of workload associated with the second data center based on at least one of the energy consumption information and the workload information.
Type:
Application
Filed:
August 12, 2010
Publication date:
March 3, 2011
Applicant:
Adaptive Computing Enterprises Inc. formerly known as Cluster Resources, Inc.
Abstract: A system, method and computer-readable media are disclosed for providing a reservation mask for compute resources such as a cluster or a grid. The method aspect comprises identifying a need type and a group of available resources, creating a reservation mask over the identified group of resources and, if a request from a consumer matches the need type, constraining the creation of a reservation for the consumer to only use resources within the reservation mask.
Abstract: A system, method and non-transitory computer readable storage medium are disclosed for managing workload in a data center. The method includes receiving data related to at least one of a current state of workload in the compute environment at a current time and future workload scheduled to consume resources in the compute environment at a future time relative to the current time, wherein the compute environment comprises a plurality of nodes in which compute resources are reserved by a workload manager for consumption, and controlling a cooling system to selectively modify a temperature of at least one node in the compute environment based on the data.
Type:
Application
Filed:
August 12, 2010
Publication date:
February 10, 2011
Applicant:
Adaptive Computing Enterprises Inc. formerly known as Cluster Resources, Inc.
Abstract: A system, method and non-transitory computer-readable storage medium are disclosed for managing workload in data centers. The method includes receiving information associated with at least one of energy consumption and a temperature of nodes in each data center of a group of distributed data centers to yield received information. The method further includes analyzing workload associated with at least one second data center of the group of distributed data centers to yield an analysis of the workload, and modifying use of resources of the group of distributed data centers based on the received information and the analysis of the workload. One application can include migration of workload from one data center to another data center based on a variety of factors.
Abstract: Disclosed are a system, method and computer-readable medium relating to managing resources within a compute environment having a group of nodes or computing devices. The method comprises, for each node in the compute environment: traversing a list of jobs having a fixed time relationship, wherein for each job in the list, the following steps occur: obtaining a range list of available timeframes for each job, converting each availability timeframe to a start range, shifting the resulting start range in time by a job offset, for a first job, copying the resulting start range into a node range, and for all subsequent jobs, logically AND'ing the start range with the node range. Next, the method comprises logically OR'ing the node range with a global range, generating a list of acceptable resources on which to start and the timeframe at which to start, and creating reservations according to the list of acceptable resources for the resources in the group of computing devices and associated job offsets.
Abstract: Disclosed are systems, methods, computer readable media, and compute environments for establishing a schedule for processing a job in a distributed compute environment. The method embodiment comprises converting a topology of a compute environment to a plurality of endpoint-to-endpoint paths, based on the plurality of endpoint-to-endpoint paths, mapping each replica resource of a plurality of resources to one or more endpoints where each respective resource is available, iteratively identifying schedule costs associated with a relationship between endpoints and resources, and committing a selected schedule cost from the identified schedule costs for processing a job in the compute environment.
Abstract: Disclosed are systems, methods, computer readable media, and compute environments for establishing a schedule for processing a job in a distributed compute environment. The method embodiment comprises converting a topology of a compute environment to a plurality of endpoint-to-endpoint paths, based on the plurality of endpoint-to-endpoint paths, mapping each replica resource of a plurality of resources to one or more endpoints where each respective resource is available, iteratively identifying schedule costs associated with a relationship between endpoints and resources, and committing a selected schedule cost from the identified schedule costs for processing a job in the compute environment.
Abstract: A system and method are disclosed for dynamically reserving resources within a cluster environment. The method embodiment of the invention comprises receiving a request for resources in the cluster environment, monitoring events after receiving the request for resources and based on the monitored events, dynamically modifying at least one of the request for resources and the cluster environment.
Abstract: An on-demand compute environment comprises a plurality of nodes within an on-demand compute environment available for provisioning and a slave management module operating on a dedicated node within the on-demand compute environment, wherein upon instructions from a master management module at a local compute environment, the slave management module modifies at least one node of the plurality of nodes.
Abstract: A system and method are disclosed for dynamically reserving resources within a cluster environment. The method embodiment of the invention comprises receiving a request for resources in the cluster environment, monitoring events after receiving the request for resources and based on the monitored events, dynamically modifying at least one of the request for resources and the cluster environment.