Patents by Inventor Si QIN

Si QIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Allocating computing resources for deferrable virtual machines

Patent number: 12333348

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting surplus capacity on a set of server nodes and determining a quantity of deferrable virtual machines (VMs) that may be scheduled over an upcoming period of time. This determination of VM quantity may be determined while minimizing risks associated with allocation failures on the set of server nodes. This disclosure described systems that facilitate features and functionality related to improving utilization of surplus resource capacity on a plurality of server nodes by implementing VMs having some flexibility in timing of deployment while also avoiding significant risk caused as a result of over-allocated storage and computing resources. In one or more embodiments, the quantity of deferrable VMs is determined and scheduled in accordance with rules of a scheduling policy.

Type: Grant

Filed: April 10, 2024

Date of Patent: June 17, 2025

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yuwen Yang, Gurpreet Virdi, Bo Qiao, Hang Dong, Karthikeyan Subramanian, Marko Lalic, Shandan Zhou, Si Qin, Thomas Moscibroda, Yunus Mohammed
DETECTING SYSTEMWIDE SERVICE ISSUES BY USING ANOMALY LOCALIZATION

Publication number: 20250086047

Abstract: Methods and systems for detecting systemwide service issues by using anomaly localization. In an example, a method includes receiving time-series monitoring data for multiple services, the time-series monitoring data including multiple dimensions and an error metric; for the monitoring data from each service, evaluating scopes within the monitoring data based on an objective function for a time-series of the error metric to identify at least one anomalous scope, each scope including at least one dimension and a value for the dimension; based on evaluating the scopes, generating a ranked list of scopes for each service based on objective function scores for the scopes; correlating the ranked lists of scopes across the multiple services to identify a cross-service anomaly; and generating an alert for the services based on the cross-service anomaly, the alert indicating at least one scope as a potential root cause for the cross-service anomaly.

Type: Application

Filed: November 27, 2024

Publication date: March 13, 2025

Applicant: Microsoft Technology Licensing, LLC

Inventors: Mohit VERMA, Julien HOACHUCK, Qingwei LIN, Pooja RANI, Namrata JAIN, Rakesh NAMINENI, Jimmy WONG, Si QIN, Yu KANG, Jeffrey Ding HE, Yingnong DANG, Jian ZHANG, Bo QIAO, Kamaljit BATH
INTERACTIVE ANALYTICS SERVICE FOR ALLOCATION FAILURE DIAGNOSIS IN CLOUD COMPUTING ENVIRONMENT

Publication number: 20250080394

Abstract: Interactive analytics are provided for resource allocation failure incidents, which may be tracked, diagnosed, summarized, and presented in near real-time for users and/or platform/service providers to understand the root cause(s) of failure incidents and actual and hypothetical, failed and successful, allocation scenarios. A capacity analyzer simulates an allocation process implemented by a resource allocation platform. The capacity analyzer may determine which resources were and/or were not eligible for allocation for a request, based on information about the resource allocation failure, resources in the region of interest, and constraints associated with the incident, and the resource allocation rules associated with the resource allocation platform. Users may quickly learn whether a request constraint, a requesting entity constraint, a capacity constraint, and/or a resource platform constraint caused a resource allocation incident.

Type: Application

Filed: August 29, 2023

Publication date: March 6, 2025

Inventors: Di WENG, Shandan ZHOU, Jue ZHANG, Bo QIAO, Si QIN, Karthikeyan SUBRAMANIAN, Thomas MOSCIBRODA
AUTOMATED RECOVERY OF STRANDED RESOURCES WITHIN A CLOUD COMPUTING ENVIRONMENT

Publication number: 20250036448

Abstract: The present application is directed to stranded resource recovery in a cloud computing environment. A resource utilization signal at each of a plurality of nodes that each hosts corresponding virtual machines (VMs) is measured. Based on each resource utilization signal, a set of candidate nodes is identified. Each candidate node comprises a stranded resource that is unutilized due to utilization of a bottleneck resource. The identification includes calculating an amount of the stranded resource at each candidate node. From a plurality of VMs hosted at the set of candidate nodes, a set of candidate VMs is identified for migration for stranded resource recovery. The identification includes calculating a score for each candidate VM based on a degree of imbalance between the stranded resource and the bottleneck resource at a candidate node hosting the candidate VM. Migration of at least one candidate VM in the set of candidate VMs is initiated.

Type: Application

Filed: November 28, 2022

Publication date: January 30, 2025

Inventors: Saurabh AGARWAL, Bo QIAO, Chao DU, Jayden CHEN, Karthikeyan SUBRAMANIAN, Nisarg SHETH, Qingwei LIN, Si QIN, Thomas MOSCIBRODA, Luke Rafael RODRIGUEZ
Detecting systemwide service issues by using anomaly localization

Patent number: 12189466

Abstract: Methods and systems for detecting systemwide service issues by using anomaly localization. In an example, a method includes receiving time-series monitoring data for multiple services, the time-series monitoring data including multiple dimensions and an error metric; for the monitoring data from each service, evaluating scopes within the monitoring data based on an objective function for a time-series of the error metric to identify at least one anomalous scope, each scope including at least one dimension and a value for the dimension; based on evaluating the scopes, generating a ranked list of scopes for each service based on objective function scores for the scopes; correlating the ranked lists of scopes across the multiple services to identify a cross-service anomaly; and generating an alert for the services based on the cross-service anomaly, the alert indicating at least one scope as a potential root cause for the cross-service anomaly.

Type: Grant

Filed: October 28, 2022

Date of Patent: January 7, 2025

Assignee: Microsoft Technology Licensing, LLC

Inventors: Mohit Verma, Julien Hoachuck, Qingwei Lin, Pooja Rani, Namrata Jain, Rakesh Namineni, Jimmy Wong, Si Qin, Yu Kang, Jeffrey Ding He, Yingnong Dang, Jian Zhang, Bo Qiao, Kamaljit Bath
Efficiency Engine In A Cloud Computing Architecture

Publication number: 20240419506

Abstract: An efficiency engine identifies container sizes for containers of a workload and allocates the containers across server clusters and nodes based on peak resource usage requirements of the containers. Runtime feedback signals are generated from monitors within the containers indicative of a quality of service and resource usage. A decision engine can identify a bin packing action to take based upon the runtime feedback signals, and a control plane can perform the identified bin packing actions to adjust bin packing based upon the runtime feedback signals. Also, adaptive adjustment can be performed based on feedback signals and using a prediction engine.

Type: Application

Filed: September 14, 2021

Publication date: December 19, 2024

Inventors: Rahul MOHANA NARAYANAMURTHY, Ye YU, Yixin FANG, Si QIN, Jie YAN, Qingwei LIN, Maosen HUANG, Tao SHEN, Xiaofeng ZHEN
Predicting expansion failures and defragmenting cluster resources

Patent number: 12112214

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting expansion failures and implementing defragmentation instructions based on the predicted expansion failures and other signals. For example, systems disclosed herein may apply a failure prediction model to determine an expansion failure prediction associated with an estimated likelihood that deployment failures will occur on a node cluster. The systems disclosed herein may further generate defragmentation instructions indicating a severity level that a defragmentation engine may execute on a cluster level to prevent expansion failures while minimizing negative customer impacts. By uniquely generating defragmentation instructions for each node cluster, a cloud computing system can minimize expansion failures, increase resource capacity, reduce costs, and provide access to reliable services to customers.

Type: Grant

Filed: July 19, 2023

Date of Patent: October 8, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shandan Zhou, Saurabh Agarwal, Karthikeyan Subramanian, Thomas Moscibroda, Paul Naveen Selvaraj, Sandeep Ramji, Sorin Iftimie, Nisarg Sheth, Wanghai Gu, Ajay Mani, Si Qin, Yong Xu, Qingwei Lin
ALLOCATING COMPUTING RESOURCES FOR DEFERRABLE VIRTUAL MACHINES

Publication number: 20240256362

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting surplus capacity on a set of server nodes and determining a quantity of deferrable virtual machines (VMs) that may be scheduled over an upcoming period of time. This determination of VM quantity may be determined while minimizing risks associated with allocation failures on the set of server nodes. This disclosure described systems that facilitate features and functionality related to improving utilization of surplus resource capacity on a plurality of server nodes by implementing VMs having some flexibility in timing of deployment while also avoiding significant risk caused as a result of over-allocated storage and computing resources. In one or more embodiments, the quantity of deferrable VMs is determined and scheduled in accordance with rules of a scheduling policy.

Type: Application

Filed: April 10, 2024

Publication date: August 1, 2024

Inventors: Yuwen YANG, Gurpreet VIRDI, Bo QIAO, Hang DONG, Karthikeyan SUBRAMANIAN, Marko LALIC, Shandan ZHOU, Si QIN, Thomas MOSCIBRODA, Yunus MOHAMMED
DETECTING SYSTEMWIDE SERVICE ISSUES BY USING ANOMALY LOCALIZATION

Publication number: 20240143433

Abstract: Methods and systems for detecting systemwide service issues by using anomaly localization. In an example, a method includes receiving time-series monitoring data for multiple services, the time-series monitoring data including multiple dimensions and an error metric; for the monitoring data from each service, evaluating scopes within the monitoring data based on an objective function for a time-series of the error metric to identify at least one anomalous scope, each scope including at least one dimension and a value for the dimension; based on evaluating the scopes, generating a ranked list of scopes for each service based on objective function scores for the scopes; correlating the ranked lists of scopes across the multiple services to identify a cross-service anomaly; and generating an alert for the services based on the cross-service anomaly, the alert indicating at least one scope as a potential root cause for the cross-service anomaly.

Type: Application

Filed: October 28, 2022

Publication date: May 2, 2024

Applicant: Microsoft Technology Licensing, LLC

Inventors: Mohit VERMA, Julien HOACHUCK, Qingwei LIN, Pooja RANI, Namrata JAIN, Rakesh NAMINENI, Jimmy WONG, Si QIN, Yu KANG, Jeffrey Ding HE, Yingnong DANG, Jian ZHANG, Bo QIAO, Kamaljit BATH
Allocating computing resources for deferrable virtual machines

Patent number: 11972301

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting surplus capacity on a set of server nodes and determining a quantity of deferrable virtual machines (VMs) that may be scheduled over an upcoming period of time. This determination of VM quantity may be determined while minimizing risks associated with allocation failures on the set of server nodes. This disclosure described systems that facilitate features and functionality related to improving utilization of surplus resource capacity on a plurality of server nodes by implementing VMs having some flexibility in timing of deployment while also avoiding significant risk caused as a result of over-allocated storage and computing resources. In one or more embodiments, the quantity of deferrable VMs is determined and scheduled in accordance with rules of a scheduling policy.

Type: Grant

Filed: April 13, 2021

Date of Patent: April 30, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yuwen Yang, Gurpreet Virdi, Bo Qiao, Hang Dong, Karthikeyan Subramanian, Marko Lalic, Shandan Zhou, Si Qin, Thomas Moscibroda, Yunus Mohammed
SYSTEM AND METHOD OF DYNAMICALLY ADJUSTING VIRTUAL MACHINES FOR A WORKLOAD

Publication number: 20230401103

Abstract: A method for dynamically adjusting a number of virtual machines for a workload, includes: receiving a probability indicator for each of a plurality of N sequential stages, where N is a natural number greater than 1, of a likelihood that a virtual machine assigned to a workload will be evicted during the N sequential stages; predicting a target number of virtual machines to configure in a current stage for a subsequent stage from among the plurality of N sequential stages based on the probability indicator, a target capacity for the workload, and a current price for maintaining a virtual machine; and configuring a number of virtual machines for the workload during the current stage based on the target number to be loaded for the workload for the subsequent stage.

Type: Application

Filed: June 9, 2022

Publication date: December 14, 2023

Inventors: Soumya RAM, Preston Tapley STEPHENSON, Alexander David FISCHER, Mahmoud SAYED, Robert Edward MINNEKER, Eli Cortex Custodio VILARINHO, Felipe VIEIRA FRUJERI, Inigo GOIRI PRESA, Sidhanth M. PANJWANI, Yandan WANG, Camille Jean COUTURIER, Jue ZHANG, Fangkai YANG, Si QIN, Qingwei LIN, Chetan BANSAL, Bowen PANG, Vivek GUPTA
PREDICTING EXPANSION FAILURES AND DEFRAGMENTING CLUSTER RESOURCES

Publication number: 20230359512

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting expansion failures and implementing defragmentation instructions based on the predicted expansion failures and other signals. For example, systems disclosed herein may apply a failure prediction model to determine an expansion failure prediction associated with an estimated likelihood that deployment failures will occur on a node cluster. The systems disclosed herein may further generate defragmentation instructions indicating a severity level that a defragmentation engine may execute on a cluster level to prevent expansion failures while minimizing negative customer impacts. By uniquely generating defragmentation instructions for each node cluster, a cloud computing system can minimize expansion failures, increase resource capacity, reduce costs, and provide access to reliable services to customers.

Type: Application

Filed: July 19, 2023

Publication date: November 9, 2023

Inventors: Shandan ZHOU, Saurabh AGARWAL, Karthikeyan SUBRAMANIAN, Thomas MOSCIBRODA, Paul Naveen SELVARAJ, Sandeep RAMJI, Sorin IFTIMIE, Nisarg SHETH, Wanghai GU, Ajay MANI, Si QIN, Yong XU, Qingwei LIN
Diagnosing and mitigating memory leak in computing nodes

Patent number: 11775407

Abstract: The present disclosure relates to systems, methods, and computer readable media for diagnosing and mitigating memory impact events, such as memory leaks, high memory usage, or other memory issues causing a host node from performing as expected on a cloud computing system. The systems described herein involve receiving locally generated memory usage data from a plurality of host nodes. The systems described herein may aggregate the memory usage data and determine a memory impact diagnosis based on a subset of the aggregated memory usage data. The systems described herein may further apply a mitigation model for mitigating the memory impact event. The systems described herein provide an end-to-end solution for diagnosing and mitigating a variety of memory issues using a dynamic and scalable system that reduces a negative impact of memory leaks and other memory issues on a cloud computing system.

Type: Grant

Filed: March 7, 2022

Date of Patent: October 3, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Cong Chen, Xinsheng Yang, Yingnong Dang, Si Qin
Predicting expansion failures and defragmenting cluster resources

Patent number: 11726836

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting expansion failures and implementing defragmentation instructions based on the predicted expansion failures and other signals. For example, systems disclosed herein may apply a failure prediction model to determine an expansion failure prediction associated with an estimated likelihood that deployment failures will occur on a node cluster. The systems disclosed herein may further generate defragmentation instructions indicating a severity level that a defragmentation engine may execute on a cluster level to prevent expansion failures while minimizing negative customer impacts. By uniquely generating defragmentation instructions for each node cluster, a cloud computing system can minimize expansion failures, increase resource capacity, reduce costs, and provide access to reliable services to customers.

Type: Grant

Filed: June 12, 2020

Date of Patent: August 15, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shandan Zhou, Saurabh Agarwal, Karthikeyan Subramanian, Thomas Moscibroda, Paul Naveen Selvaraj, Sandeep Ramji, Sorin Iftimie, Nisarg Sheth, Wanghai Gu, Ajay Mani, Si Qin, Yong Xu, Qingwei Lin
Allocating cloud resources in accordance with predicted deployment growth

Patent number: 11652720

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting deployment growth on one or more node clusters and selectively permitting deployment requests on a per cluster basis. For example, systems disclosed herein may apply tenant growth prediction system trained to output a deployment growth classification indicative of a predicted growth of deployments on a node cluster. The system disclosed herein may further utilize the deployment growth classification to determine whether a deployment request may be permitted while maintaining a sufficiently sized capacity buffer to avoid deployment failures for existing deployments previously implemented on the node cluster. By selectively permitting or denying deployments based on a variety of factors, the systems described herein can more efficiently utilize cluster resources on a per-cluster basis without causing a significant increase in deployment failures for existing customers.

Type: Grant

Filed: September 20, 2019

Date of Patent: May 16, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shandan Zhou, John Lawrence Miller, Christopher Cowdery, Thomas Moscibroda, Shanti Kemburu, Yong Xu, Si Qin, Qingwei Lin, Eli Cortez, Karthikeyan Subramanian
ALLOCATING COMPUTING RESOURCES FOR DEFERRABLE VIRTUAL MACHINES

Publication number: 20220327002

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting surplus capacity on a set of server nodes and determining a quantity of deferrable virtual machines (VMs) that may be scheduled over an upcoming period of time. This determination of VM quantity may be determined while minimizing risks associated with allocation failures on the set of server nodes. This disclosure described systems that facilitate features and functionality related to improving utilization of surplus resource capacity on a plurality of server nodes by implementing VMs having some flexibility in timing of deployment while also avoiding significant risk caused as a result of over-allocated storage and computing resources. In one or more embodiments, the quantity of deferrable VMs is determined and scheduled in accordance with rules of a scheduling policy.

Type: Application

Filed: April 13, 2021

Publication date: October 13, 2022

Inventors: Yuwen YANG, Gurpreet VIRDI, Bo QIAO, Hang DONG, Karthikeyan SUBRAMANIAN, Marko LALIC, Shandan ZHOU, Si QIN, Thomas MOSCIBRODA, Yunus MOHAMMED
DIAGNOSING AND MITIGATING MEMORY LEAK IN COMPUTING NODES

Publication number: 20220188207

Abstract: The present disclosure relates to systems, methods, and computer readable media for diagnosing and mitigating memory impact events, such as memory leaks, high memory usage, or other memory issues causing a host node from performing as expected on a cloud computing system. The systems described herein involve receiving locally generated memory usage data from a plurality of host nodes. The systems described herein may aggregate the memory usage data and determine a memory impact diagnosis based on a subset of the aggregated memory usage data. The systems described herein may further apply a mitigation model for mitigating the memory impact event. The systems described herein provide an end-to-end solution for diagnosing and mitigating a variety of memory issues using a dynamic and scalable system that reduces a negative impact of memory leaks and other memory issues on a cloud computing system.

Type: Application

Filed: March 7, 2022

Publication date: June 16, 2022

Inventors: Cong CHEN, Xinsheng YANG, Yingnong DANG, Si QIN
Diagnosing and mitigating memory leak in computing nodes

Patent number: 11269748

Abstract: The present disclosure relates to systems, methods, and computer readable media for diagnosing and mitigating memory impact events, such as memory leaks, high memory usage, or other memory issues causing a host node from performing as expected on a cloud computing system. The systems described herein involve receiving locally generated memory usage data from a plurality of host nodes. The systems described herein may aggregate the memory usage data and determine a memory impact diagnosis based on a subset of the aggregated memory usage data. The systems described herein may further apply a mitigation model for mitigating the memory impact event. The systems described herein provide an end-to-end solution for diagnosing and mitigating a variety of memory issues using a dynamic and scalable system that reduces a negative impact of memory leaks and other memory issues on a cloud computing system.

Type: Grant

Filed: April 22, 2020

Date of Patent: March 8, 2022

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Cong Chen, Xinsheng Yang, Yingnong Dang, Si Qin
Speech communication system and method for improving speech intelligibility

Patent number: 11227622

Abstract: A speech communication system for improving speech intelligibility may comprise one or more processors; and a memory storing instructions that, when executed by the one or more processors, cause the system to perform: determining a cutoff frequency based on an estimation of a spectrum of noise, wherein the cutoff frequency defines a noise dominant region of frequency; lifting a spectrum of a speech above the noise dominant region of frequency, wherein a frequency range of the spectrum of the speech increases by the cutoff frequency; and applying an adaptive filter to the speech to achieve echo cancelation, wherein the adaptive filter is controlled by a volume of the noise.

Type: Grant

Filed: December 6, 2018

Date of Patent: January 18, 2022

Assignee: Beijing DiDi Infinity Technology and Development Co., Ltd.

Inventors: Yi Zhang, Hui Song, Yongtao Sha, Si Qin
PREDICTING EXPANSION FAILURES AND DEFRAGMENTING CLUSTER RESOURCES

Publication number: 20210389894

Abstract: The present disclosure relates to systems, methods, and computer readable media for predicting expansion failures and implementing defragmentation instructions based on the predicted expansion failures and other signals. For example, systems disclosed herein may apply a failure prediction model to determine an expansion failure prediction associated with an estimated likelihood that deployment failures will occur on a node cluster. The systems disclosed herein may further generate defragmentation instructions indicating a severity level that a defragmentation engine may execute on a cluster level to prevent expansion failures while minimizing negative customer impacts. By uniquely generating defragmentation instructions for each node cluster, a cloud computing system can minimize expansion failures, increase resource capacity, reduce costs, and provide access to reliable services to customers.

Type: Application

Filed: June 12, 2020

Publication date: December 16, 2021

Inventors: Shandan ZHOU, Saurabh AGARWAL, Karthikeyan SUBRAMANIAN, Thomas MOSCIBRODA, Paul Naveen SELVARAJ, Sandeep RAMJI, Sorin IFTIMIE, Nisarg SHETH, Wanghai GU, Ajay MANI, Si QIN, Yong XU, Qingwei LIN

1 2 next