Patents by Inventor Roy Kim

Roy Kim has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20260127169
    Abstract: A direct data path between a storage system and memory of one or more processing devices is enabled for a machine learning workload to be executed on the processing devices. Remote direct memory access (RDMA) is configured to transfer data between the storage system and the memory of the one or more processing devices while bypassing host memory. Data transfers for the machine learning workload are executed over the direct data path.
    Type: Application
    Filed: December 18, 2025
    Publication date: May 7, 2026
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Publication number: 20260111420
    Abstract: Log data generated by a computing system is received. An analytics operation is executed on the log data. Pattern recognition associated with the log data is applied to identify anomalous behaviors of the computing system.
    Type: Application
    Filed: December 17, 2025
    Publication date: April 23, 2026
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Publication number: 20260111154
    Abstract: Executing a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: receiving, by a graphical processing unit (‘GPU’) server, a dataset transformed by a storage system that is external to the GPU server; and executing, by the GPU server, one or more machine learning algorithms using the transformed dataset as input.
    Type: Application
    Filed: December 17, 2025
    Publication date: April 23, 2026
    Inventors: BRIAN GOLD, EMILY POTYRAJ, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Publication number: 20260072911
    Abstract: Execution of a pipeline of a client system and access patterns to one or more storage resources used by the pipeline is monitored. A bottleneck in the pipeline is identified based at least in part on the access patterns. A reconfiguration of resources is initiated, including reallocating compute resources or storage resources, to resolve the bottleneck in the pipeline.
    Type: Application
    Filed: November 20, 2025
    Publication date: March 12, 2026
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Publication number: 20260056947
    Abstract: A dataset is received from a data source. A real-time analytics operation is executed on at least a portion of the dataset prior to fully persisting the dataset. Results of the real-time analytics operation are stored.
    Type: Application
    Filed: October 29, 2025
    Publication date: February 26, 2026
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Publication number: 20260056948
    Abstract: A method is disclosed for managing transformed datasets in a compute cluster environment. The method includes identifying, based on one or more machine learning models to be executed on a compute cluster comprising a plurality of GPU servers, one or more transformations to apply to a dataset. The method further includes generating a transformed dataset based on the one or more transformations, storing the transformed dataset, receiving a request to transmit the transformed dataset to at least one GPU server of the plurality of GPU servers, and, responsive to the request, transmitting the stored transformed dataset to the at least one GPU server without re-performing the one or more transformations on the dataset.
    Type: Application
    Filed: October 31, 2025
    Publication date: February 26, 2026
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Patent number: 12517685
    Abstract: Executing a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: receiving, by a graphical processing unit (‘GPU’) server, a dataset transformed by a storage system that is external to the GPU server; and executing, by the GPU server, one or more machine learning algorithms using the transformed dataset as input.
    Type: Grant
    Filed: October 30, 2023
    Date of Patent: January 6, 2026
    Assignee: PURE STORAGE, INC.
    Inventors: Brian Gold, Emily Potyraj, Ivan Jibaja, Igor Ostrovsky, Roy Kim
  • Publication number: 20250362836
    Abstract: Generating a transformed dataset for use by a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within one or more storage systems, a transformed dataset generated by applying one or more transformations to a dataset that are identified based on one or more expected input formats of data received as input data by one or more machine learning models to be executed on one or more servers; and transmitting, from the one or more storage systems to the one or more servers without reapplying the one or more transformations on the dataset, the transformed dataset including data in the one or more expected formats of data to be received as input data by the one or more machine learning models.
    Type: Application
    Filed: July 31, 2025
    Publication date: November 27, 2025
    Inventors: BRIAN GOLD, EMILY POTYRAJ, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Publication number: 20250335437
    Abstract: Improving machine learning models in an artificial intelligence infrastructure includes: storing, within one or more storage systems of an artificial intelligence infrastructure, information describing a dataset and one or more transformations applied to the dataset resulting in a transformed dataset; and storing, within the one or more storage systems, information describing only portions of previous versions of a machine learning model that differ from a current version of the machine learning model, wherein the previous versions used the transformed dataset as input during one or more prior executions by the artificial intelligence infrastructure.
    Type: Application
    Filed: July 2, 2025
    Publication date: October 30, 2025
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Patent number: 12455705
    Abstract: Generating a transformed dataset for use by a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within one or more storage systems, a transformed dataset generated by applying one or more transformations to a dataset that are identified based on one or more expected input formats of data received as input data by one or more machine learning models to be executed on one or more servers; and transmitting, from the one or more storage systems to the one or more servers without reapplying the one or more transformations on the dataset, the transformed dataset including data in the one or more expected formats of data to be received as input data by the one or more machine learning models.
    Type: Grant
    Filed: September 12, 2023
    Date of Patent: October 28, 2025
    Assignee: PURE STORAGE, INC.
    Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
  • Patent number: 12373428
    Abstract: Improving machine learning models in an artificial intelligence infrastructure includes: storing, within one or more storage systems of an artificial intelligence infrastructure, information describing a dataset and one or more transformations applied to the dataset resulting in a transformed dataset; and storing, within the one or more storage systems, information describing only portions of previous versions of a machine learning model that differ from a current version of the machine learning model, wherein the previous versions used the transformed dataset as input during one or more prior executions by the artificial intelligence infrastructure.
    Type: Grant
    Filed: April 26, 2022
    Date of Patent: July 29, 2025
    Assignee: PURE STORAGE, INC.
    Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
  • Publication number: 20240192898
    Abstract: Executing a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: receiving, by a graphical processing unit (‘GPU’) server, a dataset transformed by a storage system that is external to the GPU server; and executing, by the GPU server, one or more machine learning algorithms using the transformed dataset as input.
    Type: Application
    Filed: October 30, 2023
    Publication date: June 13, 2024
    Inventors: BRIAN GOLD, EMILY POTYRAJ, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Patent number: 12008404
    Abstract: Executing a big data analytics pipeline in a storage system that includes compute resources and shared storage resources, including: receiving, from a data producer, a dataset; storing, within the storage system, the dataset; allocating processing resources to an analytics application; and executing the analytics application on the processing resources, including ingesting the dataset from the storage system.
    Type: Grant
    Filed: April 14, 2022
    Date of Patent: June 11, 2024
    Assignee: PURE STORAGE, INC.
    Inventors: Ivan Jibaja, Prashant Jaikumar, Stefan Dorsett, Curtis Pullen, Roy Kim
  • Publication number: 20240028266
    Abstract: Generating a transformed dataset for use by a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within one or more storage systems, a transformed dataset generated by applying one or more transformations to a dataset that are identified based on one or more expected input formats of data received as input data by one or more machine learning models to be executed on one or more servers; and transmitting, from the one or more storage systems to the one or more servers without reapplying the one or more transformations on the dataset, the transformed dataset including data in the one or more expected formats of data to be received as input data by the one or more machine learning models.
    Type: Application
    Filed: September 12, 2023
    Publication date: January 25, 2024
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Patent number: 11803338
    Abstract: Executing a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: receiving, by a graphical processing unit (‘GPU’) server, a dataset transformed by a storage system that is external to the GPU server; and executing, by the GPU server, one or more machine learning algorithms using the transformed dataset as input.
    Type: Grant
    Filed: November 30, 2021
    Date of Patent: October 31, 2023
    Assignee: PURE STORAGE, INC.
    Inventors: Brian Gold, Emily Potyraj, Ivan Jibaja, Igor Ostrovsky, Roy Kim
  • Patent number: 11768636
    Abstract: Generating a transformed dataset for use by a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within one or more storage systems, a transformed dataset generated by applying one or more transformations to a dataset that are identified based on one or more expected input formats of data received as input data by one or more machine learning models to be executed on one or more servers; and transmitting, from the one or more storage systems to the one or more servers without reapplying the one or more transformations on the dataset, the transformed dataset including data in the one or more expected formats of data to be received as input data by the one or more machine learning models.
    Type: Grant
    Filed: December 27, 2022
    Date of Patent: September 26, 2023
    Assignee: PURE STORAGE, INC.
    Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
  • Publication number: 20230126789
    Abstract: Generating a transformed dataset for use by a machine learning model in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: storing, within one or more storage systems, a transformed dataset generated by applying one or more transformations to a dataset that are identified based on one or more expected input formats of data received as input data by one or more machine learning models to be executed on one or more servers; and transmitting, from the one or more storage systems to the one or more servers without reapplying the one or more transformations on the dataset, the transformed dataset including data in the one or more expected formats of data to be received as input data by the one or more machine learning models.
    Type: Application
    Filed: December 27, 2022
    Publication date: April 27, 2023
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Patent number: 11556280
    Abstract: Data transformation caching in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, in dependence upon one or more machine learning models to be executed on the GPU servers, one or more transformations to apply to a dataset; generating, in dependence upon the one or more transformations, a transformed dataset; storing, within one or more of the storage systems, the transformed dataset; receiving a plurality of requests to transmit the transformed dataset to one or more of the GPU servers; and responsive to each request, transmitting, from the one or more storage systems to the one or more GPU servers without re-performing the one or more transformations on the dataset, the transformed dataset.
    Type: Grant
    Filed: May 29, 2020
    Date of Patent: January 17, 2023
    Assignee: PURE STORAGE, INC.
    Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim
  • Publication number: 20220253443
    Abstract: Improving machine learning models in an artificial intelligence infrastructure includes: storing, within one or more storage systems of an artificial intelligence infrastructure, information describing a dataset and one or more transformations applied to the dataset resulting in a transformed dataset; and storing, within the one or more storage systems, information describing only portions of previous versions of a machine learning model that differ from a current version of the machine learning model, wherein the previous versions used the transformed dataset as input during one or more prior executions by the artificial intelligence infrastructure.
    Type: Application
    Filed: April 26, 2022
    Publication date: August 11, 2022
    Inventors: BRIAN GOLD, EMILY WATKINS, IVAN JIBAJA, IGOR OSTROVSKY, ROY KIM
  • Patent number: 11403290
    Abstract: Ensuring reproducibility in an artificial intelligence infrastructure that includes one or more storage systems and one or more graphical processing unit (‘GPU’) servers, including: identifying, by a unified management plane, one or more transformations applied to a dataset by the artificial intelligence infrastructure, wherein applying the one or more transformations to the dataset causes the artificial intelligence infrastructure to generate a transformed dataset; storing, within the one or more storage systems, information describing the dataset, the one or more transformations applied to the dataset, and the transformed dataset; identifying, by the unified management plane, one or more machine learning models executed by the artificial intelligence infrastructure using the transformed dataset as input; and storing, within the one or more storage systems, information describing one or more machine learning models executed using the transformed dataset as input.
    Type: Grant
    Filed: July 18, 2019
    Date of Patent: August 2, 2022
    Assignee: PURE STORAGE, INC.
    Inventors: Brian Gold, Emily Watkins, Ivan Jibaja, Igor Ostrovsky, Roy Kim