Patents by Inventor Sanggyu SHIN

Sanggyu SHIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240095066
    Abstract: A method including, for each of a plurality of job nodes, corresponding to a job request of a scheduler node, distributing an application container and a sidecar container of a corresponding job node of the plurality of job nodes, and storing, by the sidecar container, information about a respective state of the application container in a memory through a communication between the sidecar container and at least one sidecar container of another job node of the plurality of job nodes.
    Type: Application
    Filed: February 23, 2023
    Publication date: March 21, 2024
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Sanggyu SHIN
  • Patent number: 11868912
    Abstract: Disclosed is a multi-device based inference method and apparatus, where the multi-device based inference method includes receiving information related to operation devices performing an operation included in a neural network and a graph corresponding to the neural network, obtaining a size of an output of the operation in a forward direction of the graph based on the information and the graph, dividing an input of the operation in a backward direction of the graph based on the information, the graph, and the size of the output, and performing an inference based on the divided input.
    Type: Grant
    Filed: September 14, 2020
    Date of Patent: January 9, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventor: Sanggyu Shin
  • Publication number: 20220237040
    Abstract: An accelerator resource management method and apparatus are disclosed. The accelerator resource management method includes receiving a task request for a neural network-related task and a resource scheduling policy for the neural network-related task, obtaining information on a current resource utilization status of an accelerator cluster comprising a plurality of accelerators, in response to the task request, and allocating an accelerator resource for performing the task based on a utility of a resource allocation that is based on the resource scheduling policy and the information.
    Type: Application
    Filed: July 12, 2021
    Publication date: July 28, 2022
    Applicants: Samsung Electronics Co., Ltd., SNU R&DB FOUNDATION
    Inventors: Sanggyu SHIN, Soojeong KIM, Byung-Gon CHUN, Kyunggeun LEE
  • Publication number: 20220083838
    Abstract: A method includes predicting, for sets of input data, an input data number of a subsequent interval of a first interval using an input data number of the first interval and an input data number of a previous interval of the first interval set in a neural network inference optimization, determining the predicted input data number to be a batch size of the subsequent interval, determining whether pipelining is to be performed in a target device based on a resource state of the target device, and applying, to the target device, an inference policy including the determined batch size and a result of the determining of whether the pipelining is to be performed.
    Type: Application
    Filed: April 29, 2021
    Publication date: March 17, 2022
    Applicant: Samsung Electronics Co., Ltd.
    Inventors: UISEOK SONG, SANGGYU SHIN
  • Publication number: 20220075645
    Abstract: An operation method includes: dividing a model to be executed in an accelerator into a plurality of stages; determining, for each of the stages, a maximum batch size processible in an on-chip memory of the accelerator; determining the determined maximum batch sizes to each be a candidate batch size to be applied to the model; and determining, to be a final batch size to be applied to the model, one of the determined candidate batch sizes that minimizes a sum of a computation cost of executing the model in the accelerator and a memory access cost.
    Type: Application
    Filed: April 2, 2021
    Publication date: March 10, 2022
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Sanggyu SHIN, Yeongsik LEE
  • Publication number: 20210248501
    Abstract: Disclosed is a multi-device based inference method and apparatus, where the multi-device based inference method includes receiving information related to operation devices performing an operation included in a neural network and a graph corresponding to the neural network, obtaining a size of an output of the operation in a forward direction of the graph based on the information and the graph, dividing an input of the operation in a backward direction of the graph based on the information, the graph, and the size of the output, and performing an inference based on the divided input.
    Type: Application
    Filed: September 14, 2020
    Publication date: August 12, 2021
    Applicant: Samsung Electronics Co., Ltd.
    Inventor: Sanggyu SHIN