Patents Assigned to CAST AI Group, Inc.
  • Patent number: 12271284
    Abstract: A device launches a respective instance on each respective cloud service provider (CSP) of a plurality of CSPs. The device receives, from each respective instance, performance benchmark data for each CSP shape of the respective CSP on which the respective instance is launched. The device inputs the performance benchmark data from each respective instance into a model and receives, as output from the model, a determination of, for each CSP shape, group of a plurality of groups to which the CSP shape belongs. The device ranks each group based on a parameter, and provides for display to a user a recommended CSP shape based on the ranking.
    Type: Grant
    Filed: November 28, 2023
    Date of Patent: April 8, 2025
    Assignee: CAST AI Group, Inc.
    Inventors: Leonid Kuperman, Laurent Gil
  • Patent number: 12255817
    Abstract: A multi-cloud service system establishes tunnels and network overlays across multiple CSPs while meeting a criterion for a latency threshold. The system conducts a latency benchmarking evaluation across each cloud region for multiple CSPs and based on the latency bench marking evaluation results, the system may identify a group of cloud regions that satisfy a criterion such as predetermined maximum latency threshold or geographical restriction. The system may provision the group of cloud regions by provisioning a tunnel between nodes of the multiple CSPs. The system further establishes an overlay network on top of the tunnel by encapsulating packets using encapsulation end point such as VTEP (VXLAN tunnel end point) over VXLAN (Virtual Extension Local Area Network), which may help to ensure reliable transmission of packets from pod to pod. The system may inject user data into each node to initiate operations across the provisioned nodes using injected user data.
    Type: Grant
    Filed: January 27, 2023
    Date of Patent: March 18, 2025
    Assignee: CAST AI Group, Inc.
    Inventors: Saulius Mašnauskas, Rokas Bilevičius, Tadeuš Varnas, Augustinas Stirbis, Leonid Kuperman
  • Patent number: 12253927
    Abstract: A device launches a respective instance on each respective cloud service provider (CSP) of a plurality of CSPs. The device receives, from each respective instance, performance benchmark data for each CSP shape of the respective CSP on which the respective instance is launched. The device inputs the performance benchmark data from each respective instance into a model and receives, as output from the model, a determination of, for each CSP shape, group of a plurality of groups to which the CSP shape belongs. The device ranks each group based on a parameter, and provides for display to a user a recommended CSP shape based on the ranking.
    Type: Grant
    Filed: January 18, 2024
    Date of Patent: March 18, 2025
    Assignee: CAST AI Group, Inc.
    Inventors: Leonid Kuperman, Laurent Gil
  • Patent number: 12236193
    Abstract: Systems or methods for the selection of large language models (LLMs). A system receives a request from a service that hosts an application. The request is configured to be processed by an LLM to generate a response. The system applies a classification model to the request to determine the class of the request. The classification model is a language model trained to receive text and classify the text into a plurality of classes. The system selects an LLM from a plurality of candidate LLMs based in part on the determined class of the request and recommends the selected LLM to the application.
    Type: Grant
    Filed: April 19, 2024
    Date of Patent: February 25, 2025
    Assignee: CAST AI Group, Inc.
    Inventors: Leonid Kuperman, Žilvinas Urbonas, Laurynas Stasys, Kyrylo Yefimenko
  • Patent number: 12052173
    Abstract: A multi-cloud service system establishes tunnels and network overlays across multiple CSPs while meeting a criterion for a latency threshold. The system conducts a latency benchmarking evaluation across each cloud region for multiple CSPs and based on the latency bench marking evaluation results, the system may identify a group of cloud regions that satisfy a criterion such as predetermined maximum latency threshold or geographical restriction. The system may provision the group of cloud regions by provisioning a tunnel between nodes of the multiple CSPs. The system further establishes an overlay network on top of the tunnel by encapsulating packets using encapsulation end point such as VTEP (VXLAN tunnel end point) over VXLAN (Virtual Extension Local Area Network), which may help to ensure reliable transmission of packets from pod to pod. The system may inject user data into each node to initiate operations across the provisioned nodes using injected user data.
    Type: Grant
    Filed: July 20, 2021
    Date of Patent: July 30, 2024
    Assignee: CAST AI Group, Inc.
    Inventors: Saulius Ma{hacek over (s)}nauskas, Rokas Bilevi{hacek over (c)}ius, Tadeu{hacek over (s)} Varnas, Augustinas Stirbis, Leonid Kuperman
  • Patent number: 11868227
    Abstract: A device launches a respective instance on each respective cloud service provider (CSP) of a plurality of CSPs. The device receives, from each respective instance, performance benchmark data for each CSP shape of the respective CSP on which the respective instance is launched. The device inputs the performance benchmark data from each respective instance into a model and receives, as output from the model, a determination of, for each CSP shape, group of a plurality of groups to which the CSP shape belongs. The device ranks each group based on a parameter, and provides for display to a user a recommended CSP shape based on the ranking.
    Type: Grant
    Filed: May 17, 2021
    Date of Patent: January 9, 2024
    Assignee: CAST AI Group, Inc.
    Inventors: Leonid Kuperman, Laurent Gil
  • Patent number: 11595306
    Abstract: A multi-cloud service system establishes tunnels and network overlays across multiple CSPs while meeting a criterion for a latency threshold. The system conducts a latency benchmarking evaluation across each cloud region for multiple CSPs and based on the latency bench marking evaluation results, the system may identify a group of cloud regions that satisfy a criterion such as predetermined maximum latency threshold or geographical restriction. The system may provision the group of cloud regions by provisioning a tunnel between nodes of the multiple CSPs. The system further establishes an overlay network on top of the tunnel by encapsulating packets using encapsulation end point such as VTEP (VXLAN tunnel end point) over VXLAN (Virtual Extension Local Area Network), which may help to ensure reliable transmission of packets from pod to pod. The system may inject user data into each node to initiate operations across the provisioned nodes using injected user data.
    Type: Grant
    Filed: July 20, 2021
    Date of Patent: February 28, 2023
    Assignee: CAST AI Group, Inc.
    Inventors: Saulius Ma{hacek over (s)}nauskas, Rokas Bilevi{hacek over (c)}ius, Tadeu{hacek over (s)} Varnas, Augustinas Stirbis, Leonid Kuperman