Patents by Inventor Yaoping Ruan
Yaoping Ruan has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20180101214Abstract: For power management in a disaggregated computing system, initial electrical power levels are distributed thereby allocating a voltage and a clock speed to each one of a set of processor cores in the disaggregated computing system. The voltage and the clock speed of respective processor cores within the set of processor cores are adjusted according to a workload priority of respective workloads performed by each respective one of the processor cores, wherein the workload priority is assigned based upon a service level agreement (SLA).Type: ApplicationFiled: October 10, 2016Publication date: April 12, 2018Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ruchi MAHINDRU, John A. BIVENS, Koushik K. DAS, Min LI, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Publication number: 20180102953Abstract: For measuring component utilization in a system having a plurality of subsystems, an energy consumption of each of the plurality of subsystems is monitored whether or not each subsystem performs at least a portion of an overall computation. Respective workloads are classified based upon an energy consumption pattern associated with the monitored energy consumption of each of the plurality of subsystems.Type: ApplicationFiled: October 10, 2016Publication date: April 12, 2018Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ruchi MAHINDRU, John A. BIVENS, Koushik K. DAS, Min LI, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Publication number: 20180101415Abstract: For measuring component utilization in a computing system, a server energy utilization reading of a statistical significant number of servers out of a total number of servers located in the datacenter is obtained by measuring, at predetermined intervals, a collective energy consumed by all processing components within each server. The collective energy is measured by virtually probing thereby monitoring an energy consumption of individual ones of all the processing components to each collect an individual energy utilization reading, where the individual energy utilization reading is aggregated over a predetermined time period to collect an energy consumption pattern associated with the server utilization reading.Type: ApplicationFiled: October 10, 2016Publication date: April 12, 2018Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ruchi MAHINDRU, John A. BIVENS, Koushik K. DAS, Min LI, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Publication number: 20180101220Abstract: For power management in a disaggregated computing system, a set of initial electrical power levels are allocated to a set of processor cores according to a predicted desired workload, where the set of initial power levels aggregate to an initial collective contracted power level. Electrical power is dynamically allocated to respective processor cores within the set of processor cores to produce a capacity to execute a collective demanded workload while maintaining the electrical power to the set of processor cores to an approximately constant electrical power level within a threshold of the initial collective contracted electrical power level.Type: ApplicationFiled: October 10, 2016Publication date: April 12, 2018Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Ruchi MAHINDRU, John A. BIVENS, Koushik K. DAS, Min LI, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Patent number: 9940466Abstract: A computer-implemented agent process running on a first computer automatically intercepts a command issued from the first computer to execute on a target computer prior to invocation of the command on the target computer. A server profile built for an application running on the target computer that supports the command may be retrieved. At least based on the server profile a risk enforcement policy is dynamically constructed. Based on the risk enforcement policy, one or more computer-executable enforcement actions to perform prior to sending the command to the target computer for execution is determined. Based on executing of one or more of the computer-executable enforcement actions, the command may be transmitted to execute on the target computer or prevented from executing on the target computer.Type: GrantFiled: December 7, 2016Date of Patent: April 10, 2018Assignee: International Business Machines CorporationInventors: Constantin M. Adam, Nikolaos Anerousis, Vysakh K. Chandran, Milton H. Hernandez, Debasisha K. Padhi, Yaoping Ruan, Fabio M. Tanada, Frederick Y.-F. Wu, Sai Zeng
-
Publication number: 20180074741Abstract: A memory management service occupies a configurable portion of an overall memory system in a disaggregate compute environment. The service provides optimized data organization capabilities over the pool of real memory accessible to the system. The service enables various types of data stores to be implemented in hardware, including at a data structure level. Storage capacity conservation is enabled through the creation and management of high-performance, re-usable data structure implementations across the memory pool, and then using analytics (e.g., multi-tenant similarity and duplicate detection) to determine when data organizations should be used. The service also may re-align memory to different data structures that may be more efficient given data usage and distribution patterns. The service also advantageously manages automated backups efficiently.Type: ApplicationFiled: November 3, 2017Publication date: March 15, 2018Applicant: International Business Machines CorporationInventors: John Alan Bivens, Koushik K. Das, Min Li, Ruchi Mahindru, Harigovind V. Ramasamy, Yaoping Ruan, Valentina Salapura, Eugen Schenfeld
-
Patent number: 9916636Abstract: Server resources in a data center are disaggregated into shared server resource pools, including a graphics processing unit (GPU) pool. Servers are constructed dynamically, on-demand and based on workload requirements, by allocating from these resource pools. According to this disclosure, GPU utilization in the data center is managed proactively by assigning GPUs to workloads in a fine granularity and agile way, and de-provisioning them when no longer needed. In this manner, the approach is especially advantageous to automatically provision GPUs for data analytic workloads. The approach thus provides for a “micro-service” enabling data analytic workloads to automatically and transparently use GPU resources without providing (e.g., to the data center customer) the underlying provisioning details. Preferably, the approach dynamically determines the number and the type of GPUs to use, and then during runtime auto-scales the GPUs based on workload.Type: GrantFiled: April 8, 2016Date of Patent: March 13, 2018Assignee: International Business Machines CorporationInventors: Min Li, John Alan Bivens, Koushik K. Das, Ruchi Mahindru, Harigovind V. Ramasamy, Yaoping Ruan, Valentina Salapura, Eugen Schenfeld
-
Publication number: 20180052918Abstract: Creating a summary of a plurality of texts includes tokenizing each of a plurality of texts to obtain tokens; generating a vector space using a first set of vectors having one or more obtained feature scores equal to or larger than a predefined value; executing non-hierarchical clustering using the vector space to generate a first plurality of clusters; choosing a first representative text in each of the plurality of clusters; generating a second set of vectors from each of the arrays generated based on a number of characters included in tokens of the representative texts; executing hierarchical clustering using the second set of vectors to generate a second plurality of clusters; and in response to a determining a number of clusters included in the second plurality of clusters, determining a second representative text for each of the clusters included in the second plurality of clusters.Type: ApplicationFiled: August 22, 2016Publication date: February 22, 2018Inventors: Yu Gu, Takayuki Kushida, Hiroki Nakano, Yaoping Ruan, Yuji Sugiyama
-
Publication number: 20180007127Abstract: Server resources in a data center are disaggregated into shared server resource pools. Servers are constructed dynamically, on-demand and based on a tenant's workload requirements, by allocating from these resource pools. The system also includes a license manager that operates to manage a pool of licenses that are available to be associated with resources drawn from the server resource pools. Upon provisioning of a server entity composed of resources drawn from the server resource pools, the license manager determines a license configuration suitable for the server entity. In response to receipt of information indicating a change in a composition of the server entity (e.g., as a workload is processed), the license manager determines whether an adjustment to the license configuration is required. If so, an adjusted license configuration for the server entity is determined and tracked to the tenant. The data center thus allocates appropriate licenses to server entities as required.Type: ApplicationFiled: June 30, 2016Publication date: January 4, 2018Inventors: Valentina Salapura, John Alan Bivens, Min Li, Ruchi Mahindru, Harigovind V. Ramasamy, Yaoping Ruan, Eugen Schenfeld
-
Publication number: 20170371730Abstract: An actuator to execute on a server may be automatically selected based on risk of failure and damage to the server. Requirement specification and environment parameters may be received. A subset of actuators may be selected based on a risk threshold from an actuator catalog database storing actuator information and actuator risk metadata associated with a plurality of actuators. The actuator risk metadata may be augmented with risk information. A ranked list of the subset of actuators may be generated based on the actuator risk metadata associated with each actuator in the subset. An actuator in the ranked list may be executed on the server.Type: ApplicationFiled: June 22, 2016Publication date: December 28, 2017Inventors: Constantin M. Adam, Anuradha Bhamidipaty, Jayan Nallacherry, Debasisha K. Padhi, Yaoping Ruan, Frederick Y.-F. Wu
-
Publication number: 20170329520Abstract: Various embodiments for optimizing memory bandwidth in a disaggregated computing system, by a processor device, are provided. Respective memory devices are assigned to respective processor devices in the disaggregated computing system, the disaggregated computing system having at least a pool of the memory devices and a pool of the processor devices. An iterative learning algorithm is used to define data boundaries of a dataset for performing an analytic function on the dataset using memory bandwidth not currently committed to a primary compute task.Type: ApplicationFiled: August 15, 2016Publication date: November 16, 2017Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: John A. BIVENS, Min LI, Ruchi MAHINDRU, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Publication number: 20170329519Abstract: Various embodiments for optimizing memory bandwidth in a disaggregated computing system, by a processor device, are provided. Respective memory devices are assigned to respective processor devices in the disaggregated computing system, the disaggregated computing system having at least a pool of the memory devices and a pool of the processor devices. An analytic function is performed on data resident in the pool of the memory devices using memory bandwidth not currently committed to a primary compute task.Type: ApplicationFiled: May 16, 2016Publication date: November 16, 2017Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: John A. BIVENS, Min LI, Ruchi MAHINDRU, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Publication number: 20170331759Abstract: Various embodiments for agile component-level resource provisioning in a disaggregated cloud computing environment, by a processor device, are provided. Respective members of pools of hardware resources within the disaggregated cloud computing environment are allocated to each respective one of a plurality of tenants according to one of a plurality of service level agreement (SLA) classes. Each respective one of the plurality of SLA classes is characterized by a given response time for the allocation of the respective members of the pools of hardware resources corresponding to a requested workload by the tenant.Type: ApplicationFiled: May 16, 2016Publication date: November 16, 2017Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Min LI, John A. BIVENS, Ruchi MAHINDRU, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Publication number: 20170331763Abstract: Various embodiments for elastic resource provisioning in a disaggregated cloud computing environment, by a processor device, are provided. Respective members of pools of hardware resources within the disaggregated cloud computing environment are provisioned to a tenant according to an application-level service level agreement (SLA). Upon detecting a potential violation of the application-level SLA, additional respective members of the pools of hardware resources are provisioned on a component level to the tenant to avoid a violation of the SLA by one of a scale-up process and a scale-out process.Type: ApplicationFiled: May 16, 2016Publication date: November 16, 2017Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Min LI, John A. BIVENS, Ruchi MAHINDRU, HariGovind V. RAMASAMY, Yaoping RUAN, Valentina SALAPURA, Eugen SCHENFELD
-
Patent number: 9811281Abstract: A memory management service occupies a configurable portion of an overall memory system in a disaggregate compute environment. The service provides optimized data organization capabilities over the pool of real memory accessible to the system. The service enables various types of data stores to be implemented in hardware, including at a data structure level. Storage capacity conservation is enabled through the creation and management of high-performance, re-usable data structure implementations across the memory pool, and then using analytics (e.g., multi-tenant similarity and duplicate detection) to determine when data organizations should be used. The service also may re-align memory to different data structures that may be more efficient given data usage and distribution patterns. The service also advantageously manages automated backups efficiently.Type: GrantFiled: April 7, 2016Date of Patent: November 7, 2017Assignee: International Business Machines CorporationInventors: John Alan Bivens, Koushik K. Das, Min Li, Ruchi Mahindru, Harigovind V. Ramasamy, Yaoping Ruan, Valentina Salapura, Eugen Schenfeld
-
Publication number: 20170310607Abstract: Various embodiments for allocating resources in a disaggregated cloud computing environment, by a processor device, are provided. Respective members of a pool of hardware resources are assigned to each one of a plurality of tenants based upon a classification of the respective members of the pool of hardware resources. The respective members of the pool of hardware resources are assigned to each one of the plurality of tenants independently of a hardware enclosure in which the respective members of the pool of hardware resources are physically located.Type: ApplicationFiled: April 21, 2016Publication date: October 26, 2017Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATIONInventors: Yaoping RUAN, John A. BIVENS, Koushik K. DAS, Min LI, Ruchi MAHINDRU, HariGovind V. RAMASAMY, Valentina SALAPURA, Eugen SCHENFELD
-
Publication number: 20170293994Abstract: Server resources in a data center are disaggregated into shared server resource pools, including a graphics processing unit (GPU) pool. Servers are constructed dynamically, on-demand and based on workload requirements, by allocating from these resource pools. According to this disclosure, GPU utilization in the data center is managed proactively by assigning GPUs to workloads in a fine granularity and agile way, and de-provisioning them when no longer needed. In this manner, the approach is especially advantageous to automatically provision GPUs for data analytic workloads. The approach thus provides for a “micro-service” enabling data analytic workloads to automatically and transparently use GPU resources without providing (e.g., to the data center customer) the underlying provisioning details. Preferably, the approach dynamically determines the number and the type of GPUs to use, and then during runtime auto-scales the GPUs based on workload.Type: ApplicationFiled: April 8, 2016Publication date: October 12, 2017Inventors: Min Li, John Alan Bivens, Koushik K. Das, Ruchi Mahindru, Harigovind V. Ramasamy, Yaoping Ruan, Valentina Salapura, Eugen Schenfeld
-
Publication number: 20170295107Abstract: Server resources in a data center are disaggregated into shared server resource pools. Servers are constructed dynamically, on-demand and based on workload requirements, by allocating from these resource pools. A disaggregated compute system of this type keeps track of resources that are available in the shared server resource pools, and it manages those resources based on that information. Each server entity built is assigned with a unique server ID, and each resource that comprises a component thereof is tagged with the identifier. As a workload is processed by the server entity, its composition may change, e.g. by allocating more resources to the server entity, or by de-allocating resources from the server entity. Workload requests are associated with the unique server ID for the server entity. When a workload request is received at a resource, it matches its unique server ID to that of the request before servicing the request.Type: ApplicationFiled: April 7, 2016Publication date: October 12, 2017Inventors: Valentina Salapura, John Alan Bivens, Koushik K. Das, Min Li, Ruchi Mahindru, Harigovind V. Ramasamy, Yaoping Ruan, Eugen Schenfeld
-
Publication number: 20170293447Abstract: A memory management service occupies a configurable portion of an overall memory system in a disaggregate compute environment. The service provides optimized data organization capabilities over the pool of real memory accessible to the system. The service enables various types of data stores to be implemented in hardware, including at a data structure level. Storage capacity conservation is enabled through the creation and management of high-performance, re-usable data structure implementations across the memory pool, and then using analytics (e.g., multi-tenant similarity and duplicate detection) to determine when data organizations should be used. The service also may re-align memory to different data structures that may be more efficient given data usage and distribution patterns. The service also advantageously manages automated backups efficiently.Type: ApplicationFiled: April 7, 2016Publication date: October 12, 2017Inventors: John Alan Bivens, Koushik K. Das, Min Li, Ruchi Mahindru, Harigovind V. Ramasamy, Yaoping Ruan, Valentina Salapura, Eugen Schenfeld
-
Publication number: 20170295108Abstract: Server resources in a data center are disaggregated into shared server resource pools. Servers are constructed dynamically, on-demand and based on workload requirements and a tenant's resiliency requirements (e.g., as specified in an SLA), by allocating from these resource pools. A disaggregated compute system of this type keeps track of resources that are available in the shared server resource pools, and it manages those resources based on that information and the health of the resources. As a workload is processed by the server entity and component resources fail, the server entity composition is changed, e.g. by allocating other resources to the server entity, or by transitioning to other server entities, to ensure that a resiliency requirement is maintained.Type: ApplicationFiled: April 7, 2016Publication date: October 12, 2017Inventors: Ruchi Mahindru, John Alan Bivens, Koushik K. Das, Min Li, Harigovind V. Ramasamy, Yaoping Ruan, Valentina Salapura, Eugen Schenfeld