Patents by Inventor Hongzhong Zheng

Hongzhong Zheng has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11556476
    Abstract: A method of processing in-memory commands in a high-bandwidth memory (HBM) system includes sending a function-in-HBM (FIM) instruction to the HBM by an HBM memory controller of a GPU. A logic component of the HBM receives the FIM instruction and coordinates the instruction's execution using the controller, an ALU, and an SRAM located on the logic component.
    Type: Grant
    Filed: December 14, 2020
    Date of Patent: January 17, 2023
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Mu-Tien Chang, Krishna T. Malladi, Dimin Niu, Hongzhong Zheng
  • Patent number: 11544189
    Abstract: Embodiments of the disclosure provide methods and systems for memory management. The method can include: receiving a request for allocating target node data to a memory space, wherein the memory space includes a buffer and an external memory and the target node data comprises property data and structural data and represents a target node of a graph having a plurality of nodes and edges; determining a node degree associated with the target node data; and allocating the target node data to the memory space based on the determined node degree.
    Type: Grant
    Filed: February 12, 2020
    Date of Patent: January 3, 2023
    Assignee: Alibaba Group Holding Limited
    Inventors: Jilan Lin, Shuangchen Li, Dimin Niu, Hongzhong Zheng
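The degree-based placement described in the abstract above can be sketched as follows. This is a minimal illustration, not the patent's design: the function name, dictionary-based memory spaces, and the degree threshold are all assumptions.

```python
def allocate_node(node_id, adjacency, buffer, external, degree_threshold=32):
    """Place a node's data based on its degree (number of edges):
    high-degree nodes go to the fast buffer, the rest to external memory."""
    degree = len(adjacency.get(node_id, []))
    target = buffer if degree >= degree_threshold else external
    target[node_id] = {"degree": degree}
    return "buffer" if target is buffer else "external"

# Toy graph: node 2 is high-degree, nodes 0 and 1 are low-degree.
adjacency = {0: [1, 2, 3], 1: [0], 2: list(range(40))}
buffer, external = {}, {}
placements = {n: allocate_node(n, adjacency, buffer, external) for n in adjacency}
```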
  • Publication number: 20220417324
    Abstract: Various embodiments of the present disclosure relate to a computer-implemented method, a system, and a storage medium, where a graph stored in a computing system is logically divided into subgraphs, the subgraphs are stored on different interconnected (or coupled) devices in the computing system, and nodes of the subgraphs include hub nodes connected to adjacent subgraphs. Each device stores attributes and node structure information of the hub nodes of the subgraphs into other devices, and a software or hardware prefetch engine on the device prefetches attributes and node structure information associated with a sampled node. A prefetcher on a device interfacing with the interconnected (or coupled) devices may further prefetch attributes and node structure information of nodes of the subgraphs on other devices. A traffic monitor is provided on an interface device to monitor traffic; when traffic is low, the interface device prefetches node attributes and node structure information.
    Type: Application
    Filed: June 8, 2022
    Publication date: December 29, 2022
    Inventors: Wei HAN, Shuangchen LI, Hongzhong ZHENG, Yawen ZHANG, Heng LIU, Dimin NIU
  • Publication number: 20220414030
    Abstract: A high-bandwidth memory (HBM) includes a memory and a controller. The controller receives a data write request from a processor external to the HBM and the controller stores an entry in the memory indicating at least one address of data of the data write request and generates an indication that a data bus is available for an operation during a cycle time of the data write request based on the data write request comprising sparse data or data-value similarity. Sparse data includes a predetermined percentage of data values equal to zero, and data-value similarity includes a predetermined amount of spatial value locality of the data values. Both the predetermined zero-value percentage for sparse data and the predetermined amount of spatial value locality for data-value similarity are based on a predetermined data granularity.
    Type: Application
    Filed: September 1, 2022
    Publication date: December 29, 2022
    Inventors: Krishna T. MALLADI, Dimin NIU, Hongzhong ZHENG
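The sparsity test described in the abstract above, where a zero-value percentage is evaluated at a fixed data granularity, might look roughly like this sketch. The threshold and granularity values are illustrative assumptions, not the patent's numbers.

```python
def is_sparse(values, zero_fraction=0.75, granularity=16):
    """Treat a block as sparse when every granularity-sized chunk
    contains at least zero_fraction zeros."""
    for i in range(0, len(values), granularity):
        chunk = values[i:i + granularity]
        zeros = sum(1 for v in chunk if v == 0)
        if zeros / len(chunk) < zero_fraction:
            return False
    return True
```

A controller applying a check like this could skip driving zero-heavy chunks on the data bus, freeing cycles for other operations.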
  • Patent number: 11537531
    Abstract: The disclosed embodiments relate to a computer system with a cache memory that supports tagless addressing. During operation, the system receives a request to perform a memory access, wherein the request includes a virtual address. In response to the request, the system performs an address-translation operation, which translates the virtual address into both a physical address and a cache address. Next, the system uses the physical address to access one or more levels of physically addressed cache memory, wherein accessing a given level of physically addressed cache memory involves performing a tag-checking operation based on the physical address. If the access to the one or more levels of physically addressed cache memory fails to hit on a cache line for the memory access, the system uses the cache address to directly index a cache memory, wherein directly indexing the cache memory does not involve performing a tag-checking operation and eliminates the tag storage overhead.
    Type: Grant
    Filed: December 8, 2020
    Date of Patent: December 27, 2022
    Assignee: Rambus Inc.
    Inventors: Hongzhong Zheng, Trung A. Diep
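The central idea of the tagless scheme above, a single translation step that yields both a physical address and a direct cache index, can be sketched as follows. The class, field layout, and page size are illustrative assumptions, not the patent's structures.

```python
class TaglessCache:
    """Sketch: translation returns (physical address, cache line index),
    so the final lookup directly indexes the cache with no tag compare."""
    PAGE_BITS = 12

    def __init__(self, num_lines):
        self.lines = [0] * num_lines
        self.map = {}  # virtual page -> (physical page, cache line index)

    def translate(self, vaddr):
        vpage = vaddr >> self.PAGE_BITS
        offset = vaddr & ((1 << self.PAGE_BITS) - 1)
        ppage, line = self.map[vpage]
        return (ppage << self.PAGE_BITS) | offset, line

    def read(self, vaddr):
        _, line = self.translate(vaddr)
        return self.lines[line]  # direct index: no tag check, no tag storage

cache = TaglessCache(4)
cache.map[0x1] = (0x9, 2)   # virtual page 1 -> physical page 9, line 2
cache.lines[2] = 42
```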
  • Publication number: 20220391332
    Abstract: A memory module includes at least two memory devices. Each of the memory devices performs verify operations after attempted writes to their respective memory cores. When a write is unsuccessful, each memory device stores information about the unsuccessful write in an internal write retry buffer. A write operation may have been unsuccessful for only one memory device on the memory module and not the others. When the memory module is instructed, both memory devices on the memory module can retry the unsuccessful memory write operations concurrently, even though the unsuccessful write operations were to different addresses.
    Type: Application
    Filed: June 28, 2022
    Publication date: December 8, 2022
    Inventors: Hongzhong ZHENG, Brent HAUKNESS
  • Patent number: 11513965
    Abstract: A high bandwidth memory system. In some embodiments, the system includes: a memory stack having a plurality of memory dies and eight 128-bit channels; and a logic die, the memory dies being stacked on, and connected to, the logic die; wherein the logic die may be configured to operate a first channel of the 128-bit channels in: a first mode, in which a first 64 bits operate in pseudo-channel mode, and a second 64 bits operate as two 32-bit fine-grain channels, or a second mode, in which the first 64 bits operate as two 32-bit fine-grain channels, and the second 64 bits operate as two 32-bit fine-grain channels.
    Type: Grant
    Filed: January 22, 2021
    Date of Patent: November 29, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Krishna T. Malladi, Mu-Tien Chang, Dimin Niu, Hongzhong Zheng
  • Publication number: 20220367412
    Abstract: According to one general aspect, an apparatus may include a memory circuit die configured to store a lookup table that converts first data to second data. The apparatus may also include a logic circuit die comprising combinatorial logic circuits configured to receive the second data. The apparatus may further include an optical via coupled between the memory circuit die and the logic circuit die and configured to transfer the second data between the memory circuit die and the logic circuit die.
    Type: Application
    Filed: July 25, 2022
    Publication date: November 17, 2022
    Inventors: Peng GU, Krishna MALLADI, Hongzhong ZHENG
  • Patent number: 11500781
    Abstract: A cache memory includes cache lines to store information. The stored information is associated with physical addresses that include first, second, and third distinct portions. The cache lines are indexed by the second portions of respective physical addresses associated with the stored information. The cache memory also includes one or more tables, each of which includes respective table entries that are indexed by the first portions of the respective physical addresses. The respective table entries in each of the one or more tables are to store indications of the second portions of respective physical addresses associated with the stored information.
    Type: Grant
    Filed: November 30, 2020
    Date of Patent: November 15, 2022
    Assignee: RAMBUS INC.
    Inventors: Trung Diep, Hongzhong Zheng
  • Publication number: 20220358060
    Abstract: A memory module that includes a non-volatile memory and an asynchronous memory interface to interface with a memory controller is presented. The asynchronous memory interface may use repurposed pins of a double data rate (DDR) memory channel to send asynchronous data to the memory controller. The asynchronous data may be device feedback indicating a status of the non-volatile memory.
    Type: Application
    Filed: July 25, 2022
    Publication date: November 10, 2022
    Inventors: Dimin NIU, Mu-Tien CHANG, Hongzhong ZHENG, Sun Young LIM, Indong KIM, Jangseok CHOI, Craig HANSON
  • Publication number: 20220350526
    Abstract: The presented systems enable efficient and effective network communications. In one embodiment a memory device includes a memory module, including a plurality of memory chips configured to store information; and an inter-chip network (ICN)/shared smart memory extension (SMX) memory interface controller (ICN/SMX memory interface controller) configured to interface between the memory module and an inter-chip network (ICN), wherein the ICN is configured to communicatively couple the memory device to a parallel processing unit (PPU). In one exemplary implementation, the ICN/SMX memory controller includes a plurality of package buffers, an ICN physical layer interface, a PRC/MAC interface, and a switch. The memory device may be a memory card including a memory module (e.g., DDR DIMM, etc.).
    Type: Application
    Filed: July 15, 2022
    Publication date: November 3, 2022
    Inventors: Dimin NIU, Yijin GUAN, Shengcheng WANG, Yuhao WANG, Shuangchen LI, Hongzhong ZHENG
  • Patent number: 11487676
    Abstract: A memory system includes an address mapping circuit. The address mapping circuit receives an input memory address having a first set of address bits. The address mapping circuit applies a logic function to the input memory address to generate a mapped memory address. The logic function uses at least a subset of the first set of address bits in two separate operations that respectively determine two portions of the mapped memory address.
    Type: Grant
    Filed: November 19, 2020
    Date of Patent: November 1, 2022
    Assignee: Rambus Inc.
    Inventors: Hongzhong Zheng, James Tringali
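A mapping of the kind described above, where one subset of the input address bits feeds two separate operations that each determine a portion of the mapped address, might be sketched like this. The specific bit fields, shifts, and XOR scrambling are illustrative assumptions, not the patent's logic function.

```python
def map_address(addr):
    """Remap a 32-bit address: the top 4 bits (the shared subset)
    participate in two separate operations, one producing the bank
    portion and one producing the row portion of the mapped address."""
    top4 = (addr >> 28) & 0xF                     # shared subset of bits
    bank = ((addr >> 13) & 0xF) ^ top4            # operation 1: bank portion
    row = ((addr >> 17) & 0x7FF) ^ (top4 << 7)    # operation 2: row portion
    col = addr & 0x1FFF                           # low bits pass through
    return (row << 17) | (bank << 13) | col
```

Scrambling the bank bits this way spreads strided access patterns across banks, which is a common motivation for such mappings.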
  • Publication number: 20220343146
    Abstract: This application describes a hardware accelerator, a computer system, and a method for accelerating temporal graph neural network (GNN) computations. An exemplary hardware accelerator comprises: a key-graph memory configured to store a key graph; a node classification circuit configured to: fetch the key graph from the key-graph memory; receive a current graph for performing temporal GNN computation with the key graph; and identify one or more nodes of the current graph based on a comparison between the key graph and the current graph; and a node reconstruction circuit configured to: perform spatial computations on the one or more nodes identified by the node classification circuit to obtain updated nodes; generate an updated key graph based on the key graph and the updated nodes; and store the updated key graph in the key-graph memory for processing a next graph.
    Type: Application
    Filed: April 23, 2021
    Publication date: October 27, 2022
    Inventors: Fei XUE, Yangjie ZHOU, Hongzhong ZHENG
  • Publication number: 20220342834
    Abstract: A method of transferring data between a memory controller and at least one memory module via a primary data bus having a primary data bus width is disclosed. The method includes accessing a first one of a memory device group via a corresponding data bus path in response to a threaded memory request from the memory controller. The accessing results in data groups collectively forming a first data thread transferred across a corresponding secondary data bus path. Transfer of the first data thread across the primary data bus width is carried out over a first time interval, while using less than the primary data bus's continuous transfer throughput during that interval. During the first time interval, at least one data group from a second data thread is transferred on the primary data bus.
    Type: Application
    Filed: May 13, 2022
    Publication date: October 27, 2022
    Inventors: Hongzhong Zheng, Frederick A. Ware
  • Publication number: 20220343145
    Abstract: This application describes a hardware accelerator, a computer system, and a method for accelerating Graph Neural Network (GNN) computations. The hardware accelerator comprises a matrix partitioning circuit configured to partition an adjacency matrix of an input graph for GNN computations into a plurality of sub-matrices; a sub-matrix reordering circuit configured to reorder rows and columns of the plurality of sub-matrices; a tile partitioning circuit configured to divide the plurality of sub-matrices with reordered rows and columns into a plurality of tiles based on processing granularities of one or more processors; and a tile distributing circuit configured to distribute the plurality of tiles to the one or more processors for performing the GNN computations.
    Type: Application
    Filed: April 21, 2021
    Publication date: October 27, 2022
    Inventors: Fei XUE, Yangjie ZHOU, Hongzhong ZHENG
  • Patent number: 11475102
    Abstract: An adaptive matrix multiplier. In some embodiments, the matrix multiplier includes a first multiplying unit, a second multiplying unit, a memory load circuit, and an outer buffer circuit. The first multiplying unit includes a first inner buffer circuit and a second inner buffer circuit, and the second multiplying unit includes a first inner buffer circuit and a second inner buffer circuit. The memory load circuit is configured to load data from memory, in a single burst of a burst memory access mode, into the first inner buffer circuit of the first multiplying unit and into the first inner buffer circuit of the second multiplying unit.
    Type: Grant
    Filed: May 8, 2019
    Date of Patent: October 18, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Dongyan Jiang, Dimin Niu, Hongzhong Zheng
  • Publication number: 20220318592
    Abstract: A sampler for executing a graph neural network (GNN) model is disclosed. The sampler is configured to implement random sampling for neighbor nodes around a specified node of a GNN model, and performs: obtaining a quantity of neighbor nodes around the specified node and a target number of neighbor nodes to be sampled; dividing a range into a plurality of subranges based on the target number; generating random numbers; determining a plurality of integer values within the plurality of subranges based on the random numbers; determining index values of the target number of neighbor nodes to be sampled by matching index values of the neighbor nodes and the plurality of determined integer values; and writing the determined index values into an output buffer. The sampler provided in the present disclosure can uniformly sample the neighbor nodes around the specified node.
    Type: Application
    Filed: February 22, 2022
    Publication date: October 6, 2022
    Inventors: Tianchan GUAN, Yanhong WANG, Shuangchen LI, Heng LIU, Hongzhong ZHENG
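The subrange sampling steps above can be sketched in a few lines: divide the index range into one subrange per sample, then draw one uniform random integer from each subrange. The function name and the handling of the edge case where the target exceeds the neighbor count are assumptions for illustration.

```python
import random

def sample_neighbors(neighbor_indices, target_count, rng=random):
    """Sample target_count neighbor indices by drawing one uniform
    random position from each of target_count disjoint subranges."""
    n = len(neighbor_indices)
    if target_count >= n:
        return list(neighbor_indices)
    width = n / target_count
    sampled = []
    for i in range(target_count):
        lo = int(i * width)
        hi = max(lo, int((i + 1) * width) - 1)
        pos = rng.randint(lo, hi)          # one draw per subrange
        sampled.append(neighbor_indices[pos])
    return sampled
```

Because the subranges are disjoint, the samples are distinct by construction, which gives the uniform spread over the neighbor list that the abstract describes.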
  • Publication number: 20220300426
    Abstract: According to some embodiments of the present invention, there is provided a hybrid cache memory for a processing device having a host processor, the hybrid cache memory comprising: a high bandwidth memory (HBM) configured to store host data; a non-volatile memory (NVM) physically integrated with the HBM in a same package and configured to store a copy of the host data at the HBM; and a cache controller configured to be in bi-directional communication with the host processor, and to manage data transfer between the HBM and NVM and, in response to a command received from the host processor, to manage data transfer between the hybrid cache memory and the host processor.
    Type: Application
    Filed: June 6, 2022
    Publication date: September 22, 2022
    Inventors: Krishna T. Malladi, Hongzhong Zheng
  • Patent number: 11437337
    Abstract: A chip or integrated circuit includes a layer that includes a first device and a second device. A scribe line is located between the first device and the second device and separates the first device from the second device. An electrically conductive connection traverses the scribe line and is coupled to the first device and the second device, thus connecting the first and second devices.
    Type: Grant
    Filed: April 13, 2020
    Date of Patent: September 6, 2022
    Assignee: Alibaba Group Holding Limited
    Inventors: Shuangchen Li, Wei Han, Dimin Niu, Yuhao Wang, Hongzhong Zheng
  • Patent number: 11436165
    Abstract: A high-bandwidth memory (HBM) includes a memory and a controller. The controller receives a data write request from a processor external to the HBM and the controller stores an entry in the memory indicating at least one address of data of the data write request and generates an indication that a data bus is available for an operation during a cycle time of the data write request based on the data write request comprising sparse data or data-value similarity. Sparse data includes a predetermined percentage of data values equal to zero, and data-value similarity includes a predetermined amount of spatial value locality of the data values. Both the predetermined zero-value percentage for sparse data and the predetermined amount of spatial value locality for data-value similarity are based on a predetermined data granularity.
    Type: Grant
    Filed: September 12, 2019
    Date of Patent: September 6, 2022
    Inventors: Krishna T. Malladi, Dimin Niu, Hongzhong Zheng