Patents by Inventor Sanjay Ramanujan

Sanjay Ramanujan has filed for patents to protect the following inventions. This listing includes pending patent applications as well as patents already granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240143414
    Abstract: The techniques disclosed herein enable systems to perform repeatable and iterative load testing and performance benchmarking for artificial intelligence models deployed in a cloud computing environment. This is achieved by utilizing load profiles and representative workloads generated based on the load profiles to evaluate an artificial intelligence model under various workload contexts. The representative workload is then executed by the artificial intelligence model utilizing available computing infrastructure. Performance metrics are extracted from the execution and analyzed to provide insight into various performance dynamics such as the relationship between latency and data throughput. In addition, load profiles and input datasets are dynamically adjusted to evaluate different scenarios and use cases enabling the system to automatically test the artificial intelligence model across diverse applications.
    Type: Application
    Filed: October 27, 2022
    Publication date: May 2, 2024
    Inventors: Sanjay RAMANUJAN, Rakesh KELKAR, Hari Krishnan SRINIVASAN, Karthik RAMAN, Hema Vishnu POLA, Sagar TANEJA, Mradul KARMODIYA
  • Patent number: 11900171
    Abstract: A cloud computing capacity management system can include a fine-grained admission control layer, a policy engine, and an enforcement layer. The fine-grained admission control layer can be configured to ingest capacity signals and create a capacity mitigation policy, based at least in part on the capacity signals, to protect available capacity of a cloud computing system for prioritized users. The capacity mitigation policy can be directed to users of the cloud computing system. The policy engine can be configured to control how the capacity mitigation policy is applied to the cloud computing system. The enforcement layer can be configured to handle incoming resource requests and to enforce resource limits based on the capacity mitigation policy as applied by the policy engine.
    Type: Grant
    Filed: February 2, 2021
    Date of Patent: February 13, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Gurpreet Virdi, Fernando Gonzalez Todisco, Karthikeyan Subramanian, Sanjay Ramanujan, Sorin Iftimie, Xing wen Wang, Thomas Moscibroda, Yunus Mohammed, Vi Lam Nguyen, Rostislav Sudakov
  • Publication number: 20230239194
    Abstract: To improve the reliability of nodes that are utilized by a cloud computing provider, information about the entire lifecycle of nodes can be collected and used to predict when nodes are likely to experience failures based at least in part on early lifecycle errors. In one aspect, a plurality of failure issues experienced by a plurality of production nodes in a cloud computing system during a pre-production phase can be identified. A subset of the plurality of failure issues can be selected based at least in part on correlation with service outages for the plurality of production nodes during a production phase. A comparison can be performed between the subset of the plurality of failure issues and a set of failure issues experienced by a pre-production node during the pre-production phase. A risk score for the pre-production node can be calculated based at least in part on the comparison.
    Type: Application
    Filed: January 26, 2023
    Publication date: July 27, 2023
    Inventors: Sanjay RAMANUJAN, Luke Rafael RODRIGUEZ, Muhammad Khizar QAZI, Aleksandr Mikhailovich GERSHAFT, Marwan Elias JUBRAN, Saurabh AGARWAL
  • Patent number: 11609811
    Abstract: An automated root-cause analysis (RCA) system may provide a fully automated platform that provides dependency and execution order modeling for tasks included in a capacity provisioning process, anomaly detection, ticket correlation, root-cause analysis, monitoring and feedback, and data visualization. The automated RCA system may continuously collect and store data for use in determining a root cause of a blockage on a capacity provisioning process. The blockage may be identified in a ticket generated by a cloud-computing system. The automated RCA system may receive the ticket and attempt to determine the root cause of the blockage based on root causes associated with previous tickets generated by the cloud-computing system. The automated RCA system may identify a true root cause, recommend repair items based on the true root cause, identify one or more responsible teams to drive a fix, and provide an estimated time for completion.
    Type: Grant
    Filed: August 27, 2020
    Date of Patent: March 21, 2023
    Inventors: Sanjay Ramanujan, Andrew Tianze Wang, Marwan Elias Jubran, Weiping Hu, Xiaoguang Fan
  • Patent number: 11582087
    Abstract: To improve the reliability of nodes that are utilized by a cloud computing provider, information about the entire lifecycle of nodes can be collected and used to predict when nodes are likely to experience failures based at least in part on early lifecycle errors. In one aspect, a plurality of failure issues experienced by a plurality of production nodes in a cloud computing system during a pre-production phase can be identified. A subset of the plurality of failure issues can be selected based at least in part on correlation with service outages for the plurality of production nodes during a production phase. A comparison can be performed between the subset of the plurality of failure issues and a set of failure issues experienced by a pre-production node during the pre-production phase. A risk score for the pre-production node can be calculated based at least in part on the comparison.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: February 14, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Sanjay Ramanujan, Luke Rafael Rodriguez, Muhammad Khizar Qazi, Aleksandr Mikhailovich Gershaft, Marwan Elias Jubran, Saurabh Agarwal
  • Patent number: 11550634
    Abstract: A method for minimizing allocation failures in a cloud computing system without overprovisioning may include determining a predicted supply for a virtual machine series in a system unit of the cloud computing system during an upcoming time period. The predicted supply may be based on a shared available current capacity and a shared available future added capacity for the virtual machine series in the system unit. The method may also include predicting an available capacity for the virtual machine series in the system unit during the upcoming time period. The predicted available capacity may be based at least in part on a predicted demand for the virtual machine series in the system unit during the upcoming time period and the predicted supply. The method may also include taking at least one mitigation action in response to determining that the predicted demand exceeds the predicted supply during the upcoming time period.
    Type: Grant
    Filed: March 8, 2019
    Date of Patent: January 10, 2023
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Saurabh Agarwal, Maitreyee Ramprasad Joshi, Vinayak Ramnath Karnataki, Neha Keshari, Gowtham Natarajan, Yash Purohit, Sanjay Ramanujan, Karthikeyan Subramanian, Ambrose Thomas Treacy, Shandan Zhou
  • Publication number: 20220245001
    Abstract: A cloud computing capacity management system can include a fine-grained admission control layer, a policy engine, and an enforcement layer. The fine-grained admission control layer can be configured to ingest capacity signals and create a capacity mitigation policy, based at least in part on the capacity signals, to protect available capacity of a cloud computing system for prioritized users. The capacity mitigation policy can be directed to users of the cloud computing system. The policy engine can be configured to control how the capacity mitigation policy is applied to the cloud computing system. The enforcement layer can be configured to handle incoming resource requests and to enforce resource limits based on the capacity mitigation policy as applied by the policy engine.
    Type: Application
    Filed: February 2, 2021
    Publication date: August 4, 2022
    Inventors: Gurpreet VIRDI, Fernando GONZALEZ TODISCO, Karthikeyan SUBRAMANIAN, Sanjay RAMANUJAN, Sorin IFTIMIE, Xing wen WANG, Thomas MOSCIBRODA, Yunus MOHAMMED, Vi Lam NGUYEN, Rostislav SUDAKOV
  • Publication number: 20220066852
    Abstract: An automated root-cause analysis (RCA) system may provide a fully automated platform that provides dependency and execution order modeling for tasks included in a capacity provisioning process, anomaly detection, ticket correlation, root-cause analysis, monitoring and feedback, and data visualization. The automated RCA system may continuously collect and store data for use in determining a root cause of a blockage on a capacity provisioning process. The blockage may be identified in a ticket generated by a cloud-computing system. The automated RCA system may receive the ticket and attempt to determine the root cause of the blockage based on root causes associated with previous tickets generated by the cloud-computing system. The automated RCA system may identify a true root cause, recommend repair items based on the true root cause, identify one or more responsible teams to drive a fix, and provide an estimated time for completion.
    Type: Application
    Filed: August 27, 2020
    Publication date: March 3, 2022
    Inventors: Sanjay RAMANUJAN, Andrew Tianze WANG, Marwan Elias JUBRAN, Weiping HU, Xiaoguang FAN
  • Publication number: 20210184916
    Abstract: To improve the reliability of nodes that are utilized by a cloud computing provider, information about the entire lifecycle of nodes can be collected and used to predict when nodes are likely to experience failures based at least in part on early lifecycle errors. In one aspect, a plurality of failure issues experienced by a plurality of production nodes in a cloud computing system during a pre-production phase can be identified. A subset of the plurality of failure issues can be selected based at least in part on correlation with service outages for the plurality of production nodes during a production phase. A comparison can be performed between the subset of the plurality of failure issues and a set of failure issues experienced by a pre-production node during the pre-production phase. A risk score for the pre-production node can be calculated based at least in part on the comparison.
    Type: Application
    Filed: December 17, 2019
    Publication date: June 17, 2021
    Inventors: Sanjay RAMANUJAN, Luke Rafael RODRIGUEZ, Muhammad Khizar QAZI, Aleksandr Mikhailovich GERSHAFT, Marwan Elias JUBRAN, Saurabh AGARWAL
  • Publication number: 20200285525
    Abstract: A method for minimizing allocation failures in a cloud computing system without overprovisioning may include determining a predicted supply for a virtual machine series in a system unit of the cloud computing system during an upcoming time period. The predicted supply may be based on a shared available current capacity and a shared available future added capacity for the virtual machine series in the system unit. The method may also include predicting an available capacity for the virtual machine series in the system unit during the upcoming time period. The predicted available capacity may be based at least in part on a predicted demand for the virtual machine series in the system unit during the upcoming time period and the predicted supply. The method may also include taking at least one mitigation action in response to determining that the predicted demand exceeds the predicted supply during the upcoming time period.
    Type: Application
    Filed: March 8, 2019
    Publication date: September 10, 2020
    Inventors: Saurabh AGARWAL, Maitreyee Ramprasad JOSHI, Vinayak Ramnath KARNATAKI, Neha KESHARI, Gowtham NATARAJAN, Yash PUROHIT, Sanjay RAMANUJAN, Karthikeyan SUBRAMANIAN, Ambrose Thomas TREACY, Shandan ZHOU
  • Patent number: 8799049
    Abstract: Various embodiments of a planning and execution process used to forecast contact volumes that will occur over the course of a season and determine resources necessary to handle the contact volumes are disclosed. The planning and execution process may be used to determine staffing resources necessary to service contact volumes across a plurality of support channels. The contact volume forecast may be continually updated over the course of the season, and different models may be used to generate the contact volume forecast at different times during the season.
    Type: Grant
    Filed: January 11, 2007
    Date of Patent: August 5, 2014
    Assignee: Intuit Inc.
    Inventors: Sanjay Ramanujan, Andrea E. Fisher
  • Patent number: 8260649
    Abstract: Various embodiments of a system and method for determining staffing requirements or other resource requirements for a plurality of contact channels are disclosed. A contact volume forecast may be performed at a beginning of a season. The contact volume forecast may predict variability in contact volume over a plurality of time periods in the season for each of the contact channels. A resource plan may be determined based on the contact volume forecast. The resource plan may indicate an amount of resources required for each contact channel for each of the plurality of time periods in the season. For at least one time period, the resource plan may specify that resources are to be shifted into a first contact channel from one or more other contact channels.
    Type: Grant
    Filed: January 11, 2007
    Date of Patent: September 4, 2012
    Assignee: Intuit Inc.
    Inventors: Sanjay Ramanujan, Andrea E. Fisher
  • Publication number: 20080172286
    Abstract: Various embodiments of a planning and execution process used to forecast contact volumes that will occur over the course of a season and determine resources necessary to handle the contact volumes are disclosed. The planning and execution process may be used to determine staffing resources necessary to service contact volumes across a plurality of support channels. The contact volume forecast may be continually updated over the course of the season, and different models may be used to generate the contact volume forecast at different times during the season.
    Type: Application
    Filed: January 11, 2007
    Publication date: July 17, 2008
    Inventors: Sanjay Ramanujan, Andrea E. Fisher
  • Publication number: 20080172275
    Abstract: Various embodiments of a system and method for determining staffing requirements or other resource requirements for a plurality of contact channels are disclosed. A contact volume forecast may be performed at a beginning of a season. The contact volume forecast may predict variability in contact volume over a plurality of time periods in the season for each of the contact channels. A resource plan may be determined based on the contact volume forecast. The resource plan may indicate an amount of resources required for each contact channel for each of the plurality of time periods in the season. For at least one time period, the resource plan may specify that resources are to be shifted into a first contact channel from one or more other contact channels.
    Type: Application
    Filed: January 11, 2007
    Publication date: July 17, 2008
    Inventors: Sanjay Ramanujan, Andrea E. Fisher