Patents by Inventor Hongbo Rong

Hongbo Rong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for supporting programmatic control of a compiler for generating high-performance spatial hardware

Patent number: 11163546

Abstract: A method for designing a system on a target device includes generating an intermediate representation of the system from a functional specification of a high-level description of a system. From the high-level description of the system, one or more directives are identified that (1) transform a portion of the system with a specific technique, (2) build a spatial layout for the system by dividing the system according to functionalities, and (3) specialize the system in response to the spatial layout. The intermediate representation of the system is modified in response to the one or more directives.

Type: Grant

Filed: December 15, 2017

Date of Patent: November 2, 2021

Assignee: Intel Corporation

Inventor: Hongbo Rong
Technologies for automatic reordering of sparse matrices

Patent number: 10310826

Abstract: Technologies for automatic reordering of sparse matrices include a computing device to determine a distributivity of an expression defined in a code region of a program code. The expression is determined to be distributive if semantics of the expression are unaffected by a reordering of an input/output of the expression. The computing device performs inter-dependent array analysis on the expression to determine one or more clusters of inter-dependent arrays of the expression, wherein each array of a cluster of the one or more clusters is inter-dependent on each other array of the cluster, and performs bi-directional data flow analysis on the code region by iterative backward and forward propagation of reorderable arrays through expressions in the code region based on the one or more clusters of the inter-dependent arrays. The backward propagation is based on a backward transfer function and the forward propagation is based on a forward transfer function.

Type: Grant

Filed: November 19, 2015

Date of Patent: June 4, 2019

Assignee: Intel Corporation

Inventors: Hongbo Rong, Jongsoo Park, Todd A. Anderson
Conjugate code generation for efficient dynamic optimizations

Patent number: 10268497

Abstract: Methods and apparatus relating to conjugate code generation for efficient dynamic optimizations are described. In an embodiment, a binary code and an intermediate representation (IR) code are generated based at least partially on a source program. The binary code and the intermediate code are transmitted to a virtual machine logic. The binary code and the IR code each include a plurality of regions that are in one-to-one correspondence. Other embodiments are also claimed and described.

Type: Grant

Filed: October 24, 2013

Date of Patent: April 23, 2019

Assignee: Intel Corporation

Inventors: Hongbo Rong, Hyunchul Park, Cheng Wang, Youfeng Wu
Method and Apparatus for Supporting Programmatic Control of a Compiler for Generating High-Performance Spatial Hardware

Publication number: 20190042222

Abstract: A method for designing a system on a target device includes generating an intermediate representation of the system from a functional specification of a high-level description of a system. From the high-level description of the system, one or more directives are identified that (1) transform a portion of the system with a specific technique, (2) build a spatial layout for the system by dividing the system according to functionalities, and (3) specialize the system in response to the spatial layout. The intermediate representation of the system is modified in response to the one or more directives.

Type: Application

Filed: December 15, 2017

Publication date: February 7, 2019

Inventor: Hongbo Rong
Technologies for optimizing sparse matrix code with field-programmable gate arrays

Patent number: 9977663

Abstract: Technologies for optimizing sparse matrix code include a target computing device having a processor and a field-programmable gate array (FPGA). A compiler identifies a performance-critical loop in a sparse matrix source code and generates optimized executable code, including processor code and FPGA code. The target computing device executes the optimized executable code, using the processor for the processor code and the FPGA for the FPGA code. The processor executes a first iteration of the loop, generates reusable optimization data in response to executing the first iteration, and stores the reusable optimization data in a shared memory. The FPGA accesses the optimization data in the shared memory, executes additional iterations of the loop, and optimizes the additional iterations of the loop based on the optimization data. The optimization data may include, for example, loop-invariant data, reordered data, or alternate data storage representations. Other embodiments are described and claimed.

Type: Grant

Filed: July 1, 2016

Date of Patent: May 22, 2018

Assignee: Intel Corporation

Inventors: Hongbo Rong, Gilles A. Pokam
Technologies for persistent memory programming

Patent number: 9940229

Abstract: Technologies for persistent memory programming include a computing device having a persistent memory including one or more nonvolatile regions. The computing device may assign a virtual memory address of a target location in persistent memory to a persistent memory pointer using persistent pointer strategy, and may dereference the pointer using the same strategy. Persistent pointer strategies include off-holder, ID-in-value, optimistic rectification, and pessimistic rectification. The computing device may log changes to persistent memory during the execution of a data consistency section, and commit changes to the persistent memory when the last data consistency section ends. Data consistency sections may be grouped by log group identifier. Using type metadata stored in the nonvolatile region, the computing device may identify the type of a root object within the nonvolatile region and then recursively identify the type of all objects referenced by the root object. Other embodiments are described and claimed.

Type: Grant

Filed: September 25, 2014

Date of Patent: April 10, 2018

Assignee: Intel Corporation

Inventors: Xipeng Shen, Youfeng Wu, Cheng Wang, Hyunchul Park, Hongbo Rong
TECHNOLOGIES FOR OPTIMIZING SPARSE MATRIX CODE WITH FIELD-PROGRAMMABLE GATE ARRAYS

Publication number: 20180004496

Abstract: Technologies for optimizing sparse matrix code include a target computing device having a processor and a field-programmable gate array (FPGA). A compiler identifies a performance-critical loop in a sparse matrix source code and generates optimized executable code, including processor code and FPGA code. The target computing device executes the optimized executable code, using the processor for the processor code and the FPGA for the FPGA code. The processor executes a first iteration of the loop, generates reusable optimization data in response to executing the first iteration, and stores the reusable optimization data in a shared memory. The FPGA accesses the optimization data in the shared memory, executes additional iterations of the loop, and optimizes the additional iterations of the loop based on the optimization data. The optimization data may include, for example, loop-invariant data, reordered data, or alternate data storage representations. Other embodiments are described and claimed.

Type: Application

Filed: July 1, 2016

Publication date: January 4, 2018

Inventors: Hongbo Rong, Gilles A. Pokam
Methods, systems and apparatus to optimize sparse matrix applications

Patent number: 9720663

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed to optimize sparse matrix execution. An example disclosed apparatus includes a context former to identify a matrix function call from a matrix function library, the matrix function call associated with a sparse matrix, a pattern matcher to identify an operational pattern associated with the matrix function call, and a code generator to associate a function data structure with the matrix function call exhibiting the operational pattern, the function data structure stored external to the matrix function library, and facilitate a runtime link between the function data structure and the matrix function call.

Type: Grant

Filed: June 25, 2015

Date of Patent: August 1, 2017

Assignee: INTEL CORPORATION

Inventors: Hongbo Rong, Jong Soo Park, Mikhail Smelyanskiy, Geoff Lowney
Instruction and logic to monitor loop trip count and remove loop optimizations

Patent number: 9715388

Abstract: Logic and instruction to monitor loop trip count are disclosed. Loop trip count information of a loop may be stored in a dedicated hardware buffer. Average loop trip count of the loop may be calculated based on the stored loop trip count information. Based on the average trip count, loop optimizations may be removed from the loop. The stored loop trip count information may include an identifier identifying the loop, a total loop trip count of the loop, and an exit count of the loop.

Type: Grant

Filed: March 30, 2012

Date of Patent: July 25, 2017

Assignee: Intel Corporation

Inventors: Jaewoong Chung, Hyunchul Park, Hongbo Rong, Cheng Wang, Youfeng Wu
Technologies for low-level composable high performance computing libraries

Patent number: 9690552

Abstract: Technologies for generating composable library functions include a first computing device that includes a library compiler configured to compile a composable library and second computing device that includes an application compiler configured to compose library functions of the composable library based on a plurality of abstractions written at different levels of abstractions. For example, the abstractions may include an algorithm abstraction at a high level, a blocked-algorithm abstraction at medium level, and a region-based code abstraction at a low level. Other embodiments are described and claimed herein.

Type: Grant

Filed: December 27, 2014

Date of Patent: June 27, 2017

Assignee: Intel Corporation

Inventors: Hongbo Rong, Peng Tu, Tatiana Shpeisman, Hai Liu, Todd A. Anderson, Youfeng Wu, Paul M. Petersen, Victor W. Lee, P. G. Lowney, Arch D. Robison, Cheng Wang
TECHNOLOGIES FOR AUTOMATIC REORDERING OF SPARSE MATRICES

Publication number: 20170147301

Abstract: Technologies for automatic reordering of sparse matrices include a computing device to determine a distributivity of an expression defined in a code region of a program code. The expression is determined to be distributive if semantics of the expression are unaffected by a reordering of an input/output of the expression. The computing device performs inter-dependent array analysis on the expression to determine one or more clusters of inter-dependent arrays of the expression, wherein each array of a cluster of the one or more clusters is inter-dependent on each other array of the cluster, and performs bi-directional data flow analysis on the code region by iterative backward and forward propagation of reorderable arrays through expressions in the code region based on the one or more clusters of the inter-dependent arrays. The backward propagation is based on a backward transfer function and the forward propagation is based on a forward transfer function.

Type: Application

Filed: November 19, 2015

Publication date: May 25, 2017

Inventors: Hongbo Rong, Jongsoo Park, Todd A. Anderson
Co-designed dynamic language accelerator for a processor

Patent number: 9542211

Abstract: In an embodiment, a processor includes at least one core and a dynamic language accelerator to execute a bytecode responsive to a memory mapped input/output (MMIO) operation on a file descriptor associated with the dynamic language accelerator. The processor may block execution of native code while the dynamic language accelerator executes the bytecode. Other embodiments are described and claimed.

Type: Grant

Filed: March 26, 2014

Date of Patent: January 10, 2017

Assignee: Intel Corporation

Inventors: Cheng Wang, Youfeng Wu, Hongbo Rong, Hyunchul Park
METHODS, SYSTEMS AND APPARATUS TO OPTIMIZE SPARSE MATRIX APPLICATIONS

Publication number: 20160378442

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed to optimize sparse matrix execution. An example disclosed apparatus includes a context former to identify a matrix function call from a matrix function library, the matrix function call associated with a sparse matrix, a pattern matcher to identify an operational pattern associated with the matrix function call, and a code generator to associate a function data structure with the matrix function call exhibiting the operational pattern, the function data structure stored external to the matrix function library, and facilitate a runtime link between the function data structure and the matrix function call.

Type: Application

Filed: June 25, 2015

Publication date: December 29, 2016

Inventors: Hongbo Rong, Jong Soo Park, Mikhail Smelyanskiy, Geoff Lowney
Allocation of alias registers in a pipelined schedule

Patent number: 9495168

Abstract: In an embodiment, a system includes a processor including one or more cores and a plurality of alias registers to store memory range information associated with a plurality of operations of a loop. The memory range information references one or more memory locations within a memory. The system also includes register assignment means for assigning each of the alias registers to a corresponding operation of the loop, where the assignments are made according to a rotation schedule, and one of the alias registers is assigned to a first operation in a first iteration of the loop and to a second operation in a subsequent iteration of the loop. The system also includes the memory coupled to the processor. Other embodiments are described and claimed.

Type: Grant

Filed: May 30, 2013

Date of Patent: November 15, 2016

Assignee: Intel Corporation

Inventors: Hongbo Rong, Cheng Wang, Hyunchul Park, Youfeng Wu
TECHNOLOGIES FOR LOW-LEVEL COMPOSABLE HIGH PERFORMANCE COMPUTING LIBRARIES

Publication number: 20160188305

Abstract: Technologies for generating composable library functions include a first computing device that includes a library compiler configured to compile a composable library and second computing device that includes an application compiler configured to compose library functions of the composable library based on a plurality of abstractions written at different levels of abstractions. For example, the abstractions may include an algorithm abstraction at a high level, a blocked-algorithm abstraction at medium level, and a region-based code abstraction at a low level. Other embodiments are described and claimed herein.

Type: Application

Filed: December 27, 2014

Publication date: June 30, 2016

Inventors: Hongbo Rong, Peng Tu, Tatiana Shpeisman, Hai Liu, Todd A. Anderson, Youfeng Wu, Arthur N. Glew, Paul M. PetersEn, Victor W. Lee, P.G. Lowney, Arch D. Robinson, Cheng Wang
Software pipelining at runtime

Patent number: 9239712

Abstract: Apparatuses and methods may provide for determining a level of performance for processing one or more loops by a dynamic compiler and executing code optimizations to generate a pipelined schedule for the one or more loops that achieves the determined level of performance within a prescribed time period. In one example, a dependence graph may be established for the one or more loops, and each dependence graph may be partitioned into stages based on the level of performance.

Type: Grant

Filed: March 29, 2013

Date of Patent: January 19, 2016

Assignee: Intel Corporation

Inventors: Hongbo Rong, Hyunchul Park, Youfeng Wu
Dynamic optimization of pipelined software

Patent number: 9170792

Abstract: In an embodiment, a system includes a processor including at least one core to execute operations of a loop that includes S stages. The system also includes stage insertion means for adding a delay stage to the loop to increase a lifetime of a corresponding register associated with a first variable of the loop and to delay storage of contents of the register. The system also includes a dynamic random access memory (DRAM). Other embodiments are described and claimed.

Type: Grant

Filed: May 30, 2013

Date of Patent: October 27, 2015

Assignee: Intel Corporation

Inventors: Hyunchul Park, Hongbo Rong, Youfeng Wu
CO-DESIGNED DYNAMIC LANGUAGE ACCELERATOR FOR A PROCESSOR

Publication number: 20150277866

Abstract: In an embodiment, a processor includes at least one core and a dynamic language accelerator to execute a bytecode responsive to a memory mapped input/output (MMIO) operation on a file descriptor associated with the dynamic language accelerator. The processor may block execution of native code while the dynamic language accelerator executes the bytecode. Other embodiments are described and claimed.

Type: Application

Filed: March 26, 2014

Publication date: October 1, 2015

Inventors: Cheng Wang, Youfeng Wu, Hongbo Rong, Hyunchul Park
CONJUGATE CODE GENERATION FOR EFFICIENT DYNAMIC OPTIMIZATIONS

Publication number: 20150212836

Abstract: Methods and apparatus relating to conjugate code generation for efficient dynamic optimizations are described. In an embodiment, a binary code and an intermediate representation (IR) code are generated based at least partially on a source program. The binary code and the intermediate code are transmitted to a virtual machine logic. The binary code and the IR code each include a plurality of regions that are in one-to-one correspondence. Other embodiments are also claimed and described.

Type: Application

Filed: October 24, 2013

Publication date: July 30, 2015

Inventors: Hongbo Rong, Hyunchul Park, Cheng Wang, Youfeng Wu
TECHNOLOGIES FOR PERSISTENT MEMORY PROGRAMMING

Publication number: 20150169226

Abstract: Technologies for persistent memory programming include a computing device having a persistent memory including one or more nonvolatile regions. The computing device may assign a virtual memory address of a target location in persistent memory to a persistent memory pointer using persistent pointer strategy, and may dereference the pointer using the same strategy. Persistent pointer strategies include off-holder, ID-in-value, optimistic rectification, and pessimistic rectification. The computing device may log changes to persistent memory during the execution of a data consistency section, and commit changes to the persistent memory when the last data consistency section ends. Data consistency sections may be grouped by log group identifier. Using type metadata stored in the nonvolatile region, the computing device may identify the type of a root object within the nonvolatile region and then recursively identify the type of all objects referenced by the root object. Other embodiments are described and claimed.

Type: Application

Filed: September 25, 2014

Publication date: June 18, 2015

Inventors: Xipeng Shen, Youfeng Wu, Cheng Wang, Hyunchul Park, Hongbo Rong

1 2 next