Patents by Inventor Bernhard Egger
Bernhard Egger has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12321733
Abstract: An apparatus includes a processor configured to generate each of intermediate representation codes corresponding to each of a plurality of loop structures obtained that corresponds to a neural network computation based on an input specification file of hardware; schedule instructions included in each of the intermediate representation codes corresponding to the plurality of loop structures; select, based on latency values predicted according to scheduling results of the intermediate representation codes, any one code among the intermediate representation codes; and allocate, based on a scheduling result of the selected intermediate representation code, instructions included in the selected intermediate representation code to resources of the hardware included in the apparatus.
Type: Grant
Filed: November 1, 2022
Date of Patent: June 3, 2025
Assignees: Samsung Electronics Co., Ltd., Seoul National University R&DB Foundation
Inventors: Bernhard Egger, Hyemi Min
-
Publication number: 20240202527
Abstract: A method of processing data is performed by a computing device including processing hardware and storage hardware, the method including: converting, by the processing hardware, a neural network, stored in the storage hardware, from a first neural network format into a second neural network format; obtaining, by the processing hardware, information about hardware configured to perform a neural network operation for the neural network and obtaining partition information; dividing the neural network in the second neural network format into partitions, wherein the dividing is based on the information about the hardware and the partition information, wherein each partition includes a respective layer with an input thereto and an output thereof; optimizing each of the partitions based on a relationship between the input and the output of the corresponding layer; and converting the optimized partitions into the first neural network format.
Type: Application
Filed: July 17, 2023
Publication date: June 20, 2024
Applicants: SAMSUNG ELECTRONICS CO., LTD., Seoul National University R&DB Foundation
Inventors: Seok-Young YOON, Bernhard EGGER, Hyemi MIN, Jaume Mateu CUADRAT
-
Publication number: 20240193406
Abstract: A method and apparatus with scheduling a neural network (NN), which relate to extracting and scheduling priorities of operation sets, are provided. A scheduler may be configured to receive a loop structure corresponding to a NN model, generate a plurality of operation sets based on the loop structure, generate a priority table for the operation sets based on memory benefits of the operation sets, and schedule the operation sets based on the priority table.
Type: Application
Filed: November 3, 2023
Publication date: June 13, 2024
Applicants: SAMSUNG ELECTRONICS CO., LTD., Seoul National University R&DB Foundation
Inventors: Seok-Young YOON, Bernhard EGGER, Hyemi MIN, Jungyoon KWON, Jaume Mateu CUADRAT
-
Publication number: 20240185077
Abstract: Apparatuses and methods for drawing a quantization configuration are disclosed, where a method may include generating genes by cataloging possible combinations of a quantization precision and a calibration method for each of layers of a pre-trained neural network, determining layer sensitivity for each of the layers based on combinations corresponding to the genes, determining priorities of the genes and selecting some of the genes based on the respective priority of the genes, generating progeny genes by performing crossover on the selected genes, calculating layer sensitivity for each of the layers corresponding to a combination of the crossover, and updating one or more of the genes using the progeny genes based on a comparison of layer sensitivity of the genes and layer sensitivity of the progeny genes.
Type: Application
Filed: May 19, 2023
Publication date: June 6, 2024
Applicants: SAMSUNG ELECTRONICS CO., LTD., Seoul National University R&DB Foundation
Inventors: Seok-Young YOON, Bernhard EGGER, Daon PARK, Jungyoon KWON, Hyemi MIN
-
Publication number: 20230195439
Abstract: An apparatus includes a processor configured to generate each of intermediate representation codes corresponding to each of a plurality of loop structures obtained that corresponds to a neural network computation based on an input specification file of hardware; schedule instructions included in each of the intermediate representation codes corresponding to the plurality of loop structures; select, based on latency values predicted according to scheduling results of the intermediate representation codes, any one code among the intermediate representation codes; and allocate, based on a scheduling result of the selected intermediate representation code, instructions included in the selected intermediate representation code to resources of the hardware included in the apparatus.
Type: Application
Filed: November 1, 2022
Publication date: June 22, 2023
Applicants: SAMSUNG ELECTRONICS CO., LTD., Seoul National University R&DB Foundation
Inventors: Bernhard EGGER, Hyemi MIN
-
Patent number: 11301016
Abstract: Provided are computing devices, each including a plurality of cores, and methods of allocating power to the plurality of cores. The computing device includes: a control core group including a plurality of control cores, the control core group configured to allocate a power budget to processing cores according to an energy management policy and state information of the processing cores, and transmit the allocated power budget to at least one of a lower control core and the processing cores; and a processing core group including at least one or more of the processing cores, the processing core group configured to perform computations based on the power budget allocated by the control core group, and transmit state information of the processing cores to the control core group, the state information of the processing cores having been modified based on the computations performed.
Type: Grant
Filed: July 22, 2019
Date of Patent: April 12, 2022
Assignees: SEOUL NATIONAL UNIVERSITY R&DB FOUNDATION, SAMSUNG ELECTRONICS CO., LTD.
Inventors: Bernhard Egger, Younghyun Cho, Su-Rim Oh, Dong-hoon Yoo
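Illustrative sketch (not taken from the patent): the hierarchical budget allocation described in this abstract might look roughly like the following. The class names, the `load` field used as state information, and the proportional-to-load policy are all assumptions chosen for illustration; the abstract does not specify the energy management policy.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class ProcessingCore:
    name: str
    load: float            # hypothetical state information reported upward
    budget_w: float = 0.0  # power budget received from a control core


@dataclass
class ControlCore:
    children: list  # lower control cores and/or processing cores

    def demand(self) -> float:
        """Aggregate the state information of everything below this node."""
        return sum(node_demand(child) for child in self.children)

    def allocate(self, budget_w: float) -> None:
        """Split a budget among children (assumed policy: proportional to load)."""
        total = self.demand()
        for child in self.children:
            if total:
                share = budget_w * node_demand(child) / total
            else:
                share = budget_w / len(self.children)
            if isinstance(child, ControlCore):
                child.allocate(share)   # pass the budget down to a lower control core
            else:
                child.budget_w = share  # leaf: a processing core


def node_demand(node) -> float:
    return node.demand() if isinstance(node, ControlCore) else node.load


a = ProcessingCore("a", load=0.5)
b = ProcessingCore("b", load=0.5)
c = ProcessingCore("c", load=1.0)
root = ControlCore([ControlCore([a, b]), ControlCore([c])])
root.allocate(8.0)
```

In this toy run the two lower control cores receive equal shares because their aggregated loads are equal; a real policy could weigh temperature, priority, or other state instead.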
-
Publication number: 20210279587
Abstract: A method and an apparatus for generating a code for a neural network operation are disclosed. The method includes receiving information on hardware configured to perform a neural network operation of the neural network, generating, using a processor, a target mapping model mapping the neural network operation on processing elements available to perform the neural network operation based on the information and a structure of the neural network, and generating a code to configure the hardware to perform the neural network operation based on the target mapping model.
Type: Application
Filed: March 3, 2021
Publication date: September 9, 2021
Applicants: Samsung Electronics Co., Ltd., SNU R&DB FOUNDATION
Inventors: Bernhard EGGER, Minsu KIM, Hyemi MIN
-
Patent number: 10713095
Abstract: A method of controlling a multi-core processor includes allocating at least one core of the multi-core processor to at least one process for execution; generating a translation table with respect to the at least one process to translate a logical ID of the at least one core allocated to the at least one process to a physical ID; and controlling the at least one process based on the translation table generated with respect to the at least one process.
Type: Grant
Filed: March 27, 2017
Date of Patent: July 14, 2020
Assignee: Samsung Electronics Co., Ltd.
Inventors: Donghoon Yoo, Bernhard Egger
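Illustrative sketch (not taken from the patent): a per-process table that translates logical core IDs to physical core IDs could be modelled as below. `CoreAllocator`, its methods, and the first-free allocation order are hypothetical names and choices made only for this example.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class CoreAllocator:
    """Hypothetical allocator that keeps one translation table per process."""

    free_cores: list[int]
    tables: dict[int, dict[int, int]] = field(default_factory=dict)

    def allocate(self, pid: int, count: int) -> dict[int, int]:
        """Give `count` physical cores to process `pid` and build its table."""
        if count > len(self.free_cores):
            raise ValueError("not enough free cores")
        physical = [self.free_cores.pop(0) for _ in range(count)]
        # Each process sees logical IDs 0..count-1, whatever cores it received.
        self.tables[pid] = dict(enumerate(physical))
        return self.tables[pid]

    def translate(self, pid: int, logical_id: int) -> int:
        """Map a process-local logical core ID to the physical core ID."""
        return self.tables[pid][logical_id]


allocator = CoreAllocator(free_cores=[4, 5, 6, 7])
allocator.allocate(pid=100, count=2)
allocator.allocate(pid=200, count=2)
```

Both processes address their cores as logical IDs 0 and 1, while the table resolves them to different physical cores.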
-
Patent number: 10503557
Abstract: A multi-core computing device includes a control core group having first low-level control cores and a processing core group. The control core group allocates work groups for executing an Open Computing Language (OpenCL) kernel to the first low-level control cores and first processing cores among the processing core group. The processing core group performs processing of the work groups allocated by the control core group and outputs results of the processing. The control cores are hierarchically grouped.
Type: Grant
Filed: October 18, 2017
Date of Patent: December 10, 2019
Assignees: SAMSUNG ELECTRONICS CO., LTD., SEOUL NATIONAL UNIVERSITY R&DB FOUNDATION
Inventors: Bernhard Egger, Su-Rim Oh, Younghyun Cho, Dong-Hoon Yoo
-
Publication number: 20190339760
Abstract: Provided are computing devices, each including a plurality of cores, and methods of allocating power to the plurality of cores. The computing device includes: a control core group including a plurality of control cores, the control core group configured to allocate a power budget to processing cores according to an energy management policy and state information of the processing cores, and transmit the allocated power budget to at least one of a lower control core and the processing cores; and a processing core group including at least one or more of the processing cores, the processing core group configured to perform computations based on the power budget allocated by the control core group, and transmit state information of the processing cores to the control core group, the state information of the processing cores having been modified based on the computations performed.
Type: Application
Filed: July 22, 2019
Publication date: November 7, 2019
Applicants: Samsung Electronics Co., Ltd., SEOUL NATIONAL UNIVERSITY R&DB FOUNDATION
Inventors: Bernhard Egger, Younghyun CHO, Su-Rim Oh, Dong-hoon Yoo
-
Patent number: 10409351
Abstract: Provided are computing devices, each including a plurality of cores, and methods of allocating power to the plurality of cores. The computing device includes: a control core group including a plurality of control cores, the control core group configured to allocate a power budget to processing cores according to an energy management policy and state information of the processing cores, and transmit the allocated power budget to at least one of a lower control core and the processing cores; and a processing core group including at least one or more of the processing cores, the processing core group configured to perform computations based on the power budget allocated by the control core group, and transmit state information of the processing cores to the control core group, the state information of the processing cores having been modified based on the computations performed.
Type: Grant
Filed: October 19, 2017
Date of Patent: September 10, 2019
Assignees: Samsung Electronics Co., Ltd., Seoul National University R&DB Foundation
Inventors: Bernhard Egger, Younghyun Cho, Su-Rim Oh, Dong-hoon Yoo
-
Patent number: 10140247
Abstract: Methods and apparatuses are provided for compressing configuration data. The configuration data, which includes control data corresponding to at least one processing unit used in each of a plurality of cycles, is stored. A plurality of processing units of a reconfigurable processor is divided into a plurality of groups. The configuration data is partitioned into a plurality of pieces of sub-configuration data. Each piece of sub-configuration data corresponds to a respective one of the plurality of groups. If a plurality of adjacent cycles include identical control data, the configuration data is compressed by deleting control data of all but one of the plurality of adjacent cycles, for each sub-configuration data.
Type: Grant
Filed: September 6, 2017
Date of Patent: November 27, 2018
Assignees: Samsung Electronics Co., Ltd, Seoul National University R&DB Foundation
Inventors: Bernhard Egger, Ho-chan Lee, Yeon-bok Lee, Suk-jin Kim
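Illustrative sketch (not taken from the patent): the per-group compression described in this abstract resembles run-length encoding applied separately to each group's control data. The function names, the sample operations, and the stored repeat count are assumptions; the abstract only states that the control data of all but one of the identical adjacent cycles is deleted.

```python
def partition(config, groups):
    """Split per-cycle control data into one stream per group of processing units."""
    return [[tuple(cycle[pu] for pu in group) for cycle in config] for group in groups]


def compress(stream):
    """Keep one entry per run of identical adjacent cycles, plus a repeat count."""
    runs = []
    for control in stream:
        if runs and runs[-1][0] == control:
            runs[-1][1] += 1            # same control data as the previous cycle
        else:
            runs.append([control, 1])   # control data changed: start a new run
    return [(control, count) for control, count in runs]


# Rows are cycles, columns are processing units 0..3 (made-up control data).
config = [
    ["add", "nop", "ld", "nop"],
    ["add", "nop", "st", "nop"],
    ["add", "nop", "st", "mul"],
]
groups = [(0, 1), (2, 3)]  # hypothetical grouping of the four processing units
compressed = [compress(stream) for stream in partition(config, groups)]
```

Compressing the whole configuration at once would find no identical adjacent cycles here, whereas the first group on its own collapses to a single entry. This shows why partitioning before compression can help.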
-
Publication number: 20180246554
Abstract: Provided are computing devices, each including a plurality of cores, and methods of allocating power to the plurality of cores. The computing device includes: a control core group including a plurality of control cores, the control core group configured to allocate a power budget to processing cores according to an energy management policy and state information of the processing cores, and transmit the allocated power budget to at least one of a lower control core and the processing cores; and a processing core group including at least one or more of the processing cores, the processing core group configured to perform computations based on the power budget allocated by the control core group, and transmit state information of the processing cores to the control core group, the state information of the processing cores having been modified based on the computations performed.
Type: Application
Filed: October 19, 2017
Publication date: August 30, 2018
Applicants: Samsung Electronics Co., Ltd., Seoul National University R&DB Foundation
Inventors: Bernhard EGGER, Younghyun CHO, Su-Rim OH, Dong-hoon YOO
-
Publication number: 20180181443
Abstract: A multi-core computing device includes a control core group having first low-level control cores and a processing core group. The control core group allocates work groups for executing an Open Computing Language (OpenCL) kernel to the first low-level control cores and first processing cores among the processing core group. The processing core group performs processing of the work groups allocated by the control core group and outputs results of the processing. The control cores are hierarchically grouped.
Type: Application
Filed: October 18, 2017
Publication date: June 28, 2018
Applicant: SEOUL NATIONAL UNIVERSITY R&DB FOUNDATION
Inventors: BERNHARD EGGER, Su-Rim Oh, Younghyun Cho, Dong-Hoon Yoo
-
Patent number: 9959191
Abstract: A dynamic library profiling method and a dynamic library profiling system including writing a first break point instruction at a start address of a dynamic library function, recording a first event count value that is a process performance management unit (PMU) count when a target process executes the first break point instruction, writing a second break point instruction to a return address of the dynamic library function, and calculating a PMU count generated in a processor core while the dynamic library function is executed, by comparing the recorded first event count value with a second event count value that is a process PMU count when the target process executes the second break point instruction, wherein the process PMU count is a cumulative value of PMU counts generated in the processor core while the target process is executed.
Type: Grant
Filed: November 22, 2013
Date of Patent: May 1, 2018
Assignees: SAMSUNG ELECTRONICS CO., LTD., SEOUL NATIONAL UNIVERSITY R&DB FOUNDATION
Inventors: Min-Ju Lee, Bernhard Egger, Jae-jin Lee, Young-Lak Kim, Hong-Gyu Kim, Hong-June Kim
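Illustrative sketch (not taken from the patent): the measurement described in this abstract comes down to reading a cumulative per-process counter at two break points and taking the difference. The simulation below uses a hypothetical `SimulatedProcess` class in place of a real target process and real PMU hardware, and does not model how break points are written.

```python
class SimulatedProcess:
    """Hypothetical stand-in for a target process with a cumulative PMU count."""

    def __init__(self) -> None:
        self.pmu_count = 0

    def run(self, events: int) -> None:
        """Pretend to execute code that generates `events` PMU events."""
        self.pmu_count += events


def profile_library_call(process: SimulatedProcess, events_in_function: int) -> int:
    first = process.pmu_count        # read when the break point at the start address is hit
    process.run(events_in_function)  # the dynamic library function executes
    second = process.pmu_count       # read when the break point at the return address is hit
    return second - first            # events attributable to the function alone


process = SimulatedProcess()
process.run(120)  # events generated before the library function is called
events = profile_library_call(process, 35)
```

Because the counter is cumulative for the process, events generated before the call do not affect the result.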
-
Publication number: 20180067895
Abstract: Methods and apparatuses are provided for compressing configuration data. The configuration data, which includes control data corresponding to at least one processing unit used in each of a plurality of cycles, is stored. A plurality of processing units of a reconfigurable processor is divided into a plurality of groups. The configuration data is partitioned into a plurality of pieces of sub-configuration data. Each piece of sub-configuration data corresponds to a respective one of the plurality of groups. If a plurality of adjacent cycles include identical control data, the configuration data is compressed by deleting control data of all but one of the plurality of adjacent cycles, for each sub-configuration data.
Type: Application
Filed: September 6, 2017
Publication date: March 8, 2018
Inventors: Bernhard EGGER, Ho-chan LEE, Yeon-bok LEE, Suk-jin KIM
-
Publication number: 20170277571
Abstract: A method of controlling a multi-core processor includes allocating at least one core of the multi-core processor to at least one process for execution; generating a translation table with respect to the at least one process to translate a logical ID of the at least one core allocated to the at least one process to a physical ID; and controlling the at least one process based on the translation table generated with respect to the at least one process.
Type: Application
Filed: March 27, 2017
Publication date: September 28, 2017
Applicants: Samsung Electronics Co., Ltd., Seoul National University R&DB Foundation
Inventors: Donghoon YOO, Bernhard EGGER
-
Patent number: 9727528
Abstract: Provided is a reconfigurable processor capable of reducing the routing processing time of routing nodes by driving the routing nodes at a greater frequency than a driving frequency of the processing elements. The reconfigurable processor includes one or more processing elements configured to be driven at a first driving frequency, and one or more routing nodes configured to be provided on paths that are formed between the processing elements, and to be driven at a second driving frequency that is greater than the first driving frequency.
Type: Grant
Filed: July 7, 2011
Date of Patent: August 8, 2017
Assignee: Samsung Electronics Co., Ltd.
Inventors: Bernhard Egger, Taisong Jin, Won-Sub Kim
-
Patent number: 9411582
Abstract: An apparatus for processing an invalid operation in a prologue and/or an epilogue of a loop includes a register file including a first region for storing a data validity value indicating whether data is valid or invalid, and a second region for storing the data; and a functional unit configured to determine whether an operation is valid or invalid based on a value of a first region of each of one or more input sources received from the register file, and output a destination including a value based on the value of the first region of each of the input sources.
Type: Grant
Filed: March 15, 2013
Date of Patent: August 9, 2016
Assignees: Samsung Electronics Co., Ltd., Seoul Electronics University R&DB Foundation
Inventors: Seong-Hun Jeong, Bernhard Egger, Won-Sub Kim
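Illustrative sketch (not taken from the patent): one way to read this abstract is that every register carries a validity flag next to its data, and a functional unit marks its result valid only when all of its inputs are valid. The `Register` type, the `functional_unit` helper, and the all-inputs-valid rule are assumptions made for this example.

```python
import operator
from typing import Callable, NamedTuple


class Register(NamedTuple):
    valid: bool  # first region: data validity value
    data: int    # second region: the data itself


def functional_unit(op: Callable[..., int], *sources: Register) -> Register:
    """Run `op` only if every input is valid; otherwise emit an invalid result."""
    if all(src.valid for src in sources):
        return Register(True, op(*(src.data for src in sources)))
    return Register(False, 0)  # invalid operation, e.g. in a loop prologue


x = Register(True, 3)
y = Register(True, 4)
not_ready = Register(False, 0)  # value not yet produced by an earlier stage

product = functional_unit(operator.mul, x, y)
total = functional_unit(operator.add, not_ready, product)
```

Invalidity propagates through dependent operations, so operations in a prologue or epilogue that depend on missing values produce results marked invalid.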
-
Patent number: 9342480
Abstract: An apparatus and method for generating a very long instruction word (VLIW) command that supports predicated execution, and a VLIW processor and method for processing a VLIW are provided herein. The VLIW command includes an instruction bundle formed of a plurality of instructions to be executed in parallel and a single value indicating predicated execution, and is generated using the apparatus and method for generating a VLIW command. The VLIW processor decodes the instruction bundle and executes the instructions, which are included in the decoded instruction bundle, in parallel, according to the value indicating predicated execution.
Type: Grant
Filed: October 28, 2013
Date of Patent: May 17, 2016
Assignee: Samsung Electronics Co., Ltd.
Inventors: Bernhard Egger, Soo-jung Ryu, Dong-hoon Yoo, Il-hyun Park