Patents by Inventor Sanggyu SHIN

Sanggyu SHIN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHOD AND APPARATUS WITH SIDECAR PATTERN CHECKPOINTING

Publication number: 20240095066

Abstract: A method including, for each of a plurality of job nodes, corresponding to a job request of a scheduler node, distributing an application container and a sidecar container of a corresponding job node of the plurality of job nodes, and storing, by the sidecar container, information about a respective state of the application container in a memory through a communication between the sidecar container and at least one sidecar container of another job node of the plurality of job nodes.

Type: Application

Filed: February 23, 2023

Publication date: March 21, 2024

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Sanggyu SHIN
Multi-device based inference method and apparatus

Patent number: 11868912

Abstract: Disclosed is a multi-device based inference method and apparatus, where the multi-device based inference method includes receiving information related to operation devices performing an operation included in a neural network and a graph corresponding to the neural network, obtaining a size of an output of the operation in a forward direction of the graph based on the information and the graph, dividing an input of the operation in a backward direction of the graph based on the information, the graph, and the size of the output, and performing an inference based on the divided input.

Type: Grant

Filed: September 14, 2020

Date of Patent: January 9, 2024

Assignee: Samsung Electronics Co., Ltd.

Inventor: Sanggyu Shin
ACCELERATOR RESOURCE MANAGEMENT METHOD AND APPARATUS

Publication number: 20220237040

Abstract: An accelerator resource management method and apparatus are disclosed. The accelerator resource management method includes receiving a task request for a neural network-related task and a resource scheduling policy for the neural network-related task, obtaining information on a current resource utilization status of an accelerator cluster comprising a plurality of accelerators, in response to the task request, and allocating an accelerator resource for performing the task based on a utility of a resource allocation that is based on the resource scheduling policy and the information.

Type: Application

Filed: July 12, 2021

Publication date: July 28, 2022

Applicants: Samsung Electronics Co., Ltd., SNU R&DB FOUNDATION

Inventors: Sanggyu SHIN, Soojeong KIM, Byung-Gon CHUN, Kyunggeun LEE
METHOD AND APPARATUS WITH NEURAL NETWORK INFERENCE OPTIMIZATION IMPLEMENTATION

Publication number: 20220083838

Abstract: A method includes predicting, for sets of input data, an input data number of a subsequent interval of a first interval using an input data number of the first interval and an input data number of a previous interval of the first interval set in a neural network inference optimization, determining the predicted input data number to be a batch size of the subsequent interval, determining whether pipelining is to be performed in a target device based on a resource state of the target device, and applying, to the target device, an inference policy including the determined batch size and a result of the determining of whether the pipelining is to be performed.

Type: Application

Filed: April 29, 2021

Publication date: March 17, 2022

Applicant: Samsung Electronics Co., Ltd.

Inventors: UISEOK SONG, SANGGYU SHIN
OPERATION METHOD OF HOST PROCESSOR AND ACCELERATOR, AND ELECTRONIC DEVICE INCLUDING THE SAME

Publication number: 20220075645

Abstract: An operation method includes: dividing a model to be executed in an accelerator into a plurality of stages; determining, for each of the stages, a maximum batch size processible in an on-chip memory of the accelerator; determining the determined maximum batch sizes to each be a candidate batch size to be applied to the model; and determining, to be a final batch size to be applied to the model, one of the determined candidate batch sizes that minimizes a sum of a computation cost of executing the model in the accelerator and a memory access cost.

Type: Application

Filed: April 2, 2021

Publication date: March 10, 2022

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Sanggyu SHIN, Yeongsik LEE
MULTI-DEVICE BASED INFERENCE METHOD AND APPARATUS

Publication number: 20210248501

Abstract: Disclosed is a multi-device based inference method and apparatus, where the multi-device based inference method includes receiving information related to operation devices performing an operation included in a neural network and a graph corresponding to the neural network, obtaining a size of an output of the operation in a forward direction of the graph based on the information and the graph, dividing an input of the operation in a backward direction of the graph based on the information, the graph, and the size of the output, and performing an inference based on the divided input.

Type: Application

Filed: September 14, 2020

Publication date: August 12, 2021

Applicant: Samsung Electronics Co., Ltd.

Inventor: Sanggyu SHIN

METHOD AND APPARATUS WITH SIDECAR PATTERN CHECKPOINTING

Multi-device based inference method and apparatus

ACCELERATOR RESOURCE MANAGEMENT METHOD AND APPARATUS

METHOD AND APPARATUS WITH NEURAL NETWORK INFERENCE OPTIMIZATION IMPLEMENTATION

OPERATION METHOD OF HOST PROCESSOR AND ACCELERATOR, AND ELECTRONIC DEVICE INCLUDING THE SAME

MULTI-DEVICE BASED INFERENCE METHOD AND APPARATUS