Patents by Inventor Dimin Niu

Dimin Niu has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11954093
    Abstract: Embodiments of the disclosure provide devices and methods for performing a top-k function. The device can include: a memory comprising a plurality of register files for storing the data elements, the plurality of register files comprising a parent register file and a first child register file associated with the parent register file, wherein the parent register file is associated with: first interface circuitry configured for reading a first parent data element from the parent register file and receiving a first child data element and a second child data element from the first child register file; and first comparison circuitry configured for updating the parent register file and the first child register file based on the first parent data element, the first child data element, and the second child data element according to a given principle.
    Type: Grant
    Filed: June 4, 2020
    Date of Patent: April 9, 2024
    Assignee: Alibaba Group Holding Limited
    Inventors: Fei Sun, Shuangchen Li, Dimin Niu, Fei Xue, Yuanwei Fang
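The parent/child register-file arrangement the abstract describes is, in software terms, a hardware heap: the comparison circuitry keeps each parent ordered with respect to its children so the extreme element is always at the root. A minimal software analogue of the top-k operation (illustrative only; the function and variable names are not from the patent):

```python
import heapq

def top_k(stream, k):
    """Keep the k largest elements seen so far using a size-k min-heap.

    The heap root (smallest of the k kept elements) plays the role of the
    parent register file: each incoming element is compared against it,
    and the heap is updated only when the newcomer is larger.
    """
    heap = []  # min-heap of at most k elements
    for x in stream:
        if len(heap) < k:
            heapq.heappush(heap, x)
        elif x > heap[0]:
            heapq.heapreplace(heap, x)  # evict the current minimum
    return sorted(heap, reverse=True)

# top_k([5, 1, 9, 3, 7, 8], 3) → [9, 8, 7]
```

The hardware version parallelizes these compare-and-swap steps across register files; the "given principle" in the abstract corresponds to the heap-ordering rule applied by the comparison circuitry.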
  • Publication number: 20240104360
    Abstract: Near memory processing systems for graph neural network processing can include a central core coupled to one or more memory units. The memory units can include one or more controllers and a plurality of memory devices. The system can be configured for offloading aggregation, combination and the like operations from the central core to the controllers of the one or more memory units. The central core can sample the graph neural network and schedule memory accesses for execution by the one or more memory units. The central core can also schedule aggregation, combination or the like operations associated with one or more memory accesses for execution by the controller. The controller can access data in accordance with the data access requests from the central core. One or more computation units of the controller can also execute the aggregation, combination or the like operations associated with one or more memory accesses.
    Type: Application
    Filed: December 2, 2020
    Publication date: March 28, 2024
    Inventors: Tianchan GUAN, Dimin NIU, Hongzhong ZHENG, Shuangchen LI
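The offloading idea can be modeled in a few lines: the memory-unit controller gathers the sampled neighbors' feature rows and reduces them near the data, so only the aggregated vector crosses back to the central core. A hedged software sketch (names are illustrative, not from the patent):

```python
def aggregate_neighbors(features, neighbor_ids):
    """Element-wise sum over sampled neighbor feature vectors.

    features: dict mapping node id -> feature vector (list of floats).
    In the patented system this reduction runs on the memory-unit
    controller's computation units rather than on the central core.
    """
    rows = [features[i] for i in neighbor_ids]   # memory accesses
    return [sum(col) for col in zip(*rows)]      # near-memory aggregation

feats = {0: [1.0, 2.0], 1: [3.0, 4.0], 2: [5.0, 6.0]}
# aggregate_neighbors(feats, [0, 2]) → [6.0, 8.0]
```

The bandwidth saving comes from returning one aggregated vector instead of every sampled neighbor's row.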
  • Patent number: 11940922
    Abstract: A method of processing in-memory commands in a high-bandwidth memory (HBM) system includes sending a function-in-HBM (FIM) instruction to the HBM by an HBM memory controller of a GPU. A logic component of the HBM receives the FIM instruction and coordinates its execution using the controller, an ALU, and an SRAM located on the logic component.
    Type: Grant
    Filed: December 14, 2022
    Date of Patent: March 26, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Mu-Tien Chang, Krishna T. Malladi, Dimin Niu, Hongzhong Zheng
  • Publication number: 20240094922
    Abstract: A data processing system includes a first server and a second server. The first server includes a first processor group, a first memory space and a first interface circuit. The second server includes a second processor group, a second memory space and a second interface circuit. The first memory space and the second memory space are allocated to the first processor group. The first processor group is configured to perform memory error detection to generate an error log corresponding to a memory error. When the memory error occurs in the second memory space, the first interface circuit is configured to send the error log to the second interface circuit, and the second processor group is configured to log the memory error according to the error log received by the second interface circuit. The data processing system is capable of realizing memory reliability architecture supporting operations across different servers.
    Type: Application
    Filed: December 12, 2022
    Publication date: March 21, 2024
    Inventors: DIMIN NIU, TIANCHAN GUAN, YIJIN GUAN, SHUANGCHEN LI, HONGZHONG ZHENG
  • Publication number: 20240095179
    Abstract: A memory management method of a data processing system is provided. The memory management method includes: creating a first memory zone and a second memory zone related to a first node of a first server, wherein the first server is located in the data processing system, and the first node includes a processor and a first memory; mapping the first memory zone to the first memory; and mapping the second memory zone to a second memory of a second server, wherein the second server is located in the data processing system, and the processor is configured to access the second memory of the second server through an interface circuit of the first server and through an interface circuit of the second server.
    Type: Application
    Filed: December 13, 2022
    Publication date: March 21, 2024
    Inventors: DIMIN NIU, YIJIN GUAN, TIANCHAN GUAN, SHUANGCHEN LI, HONGZHONG ZHENG
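The two-zone mapping in this abstract amounts to splitting a node's flat address space so that one range resolves locally and the other is routed through the servers' interface circuits to remote memory. A minimal sketch of that address routing, under assumed zone sizes (all names and constants are illustrative):

```python
LOCAL_ZONE_SIZE = 1 << 30  # assumed 1 GiB first memory zone

def route(addr):
    """Map a flat physical address to a (target, offset) pair.

    Addresses below LOCAL_ZONE_SIZE fall in the first memory zone
    (the node's own memory); the rest fall in the second zone, which
    is backed by a second server's memory via the interface circuits.
    """
    if addr < LOCAL_ZONE_SIZE:
        return ("local_memory", addr)
    return ("remote_server", addr - LOCAL_ZONE_SIZE)

# route(100)                 → ("local_memory", 100)
# route(LOCAL_ZONE_SIZE + 4) → ("remote_server", 4)
```

The point of the method is that the processor issues ordinary accesses against one contiguous space; the zone-to-backing-memory mapping decides which go over the inter-server interface.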
  • Patent number: 11934669
    Abstract: A processor includes a plurality of memory units, each of the memory units including a plurality of memory cells, wherein each of the memory units is configurable to operate as memory, as a computation unit, or as a hybrid memory-computation unit.
    Type: Grant
    Filed: July 29, 2020
    Date of Patent: March 19, 2024
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Dimin Niu, Shuangchen Li, Bob Brennan, Krishna T. Malladi, Hongzhong Zheng
  • Publication number: 20240078036
    Abstract: A memory module can include a hybrid media controller coupled to a volatile memory, a non-volatile memory, a non-volatile memory buffer and a set of memory mapped input/output (MMIO) registers. The hybrid media controller can be configured for reading and writing data to a volatile memory of a memory mapped space of a memory module. The hybrid media controller can also be configured for reading and writing bulk data to a non-volatile memory of the memory mapped space. The hybrid media controller can also be configured for reading and writing data of a random-access granularity to the non-volatile memory of the memory mapped space. The hybrid media controller can also be configured for self-indexed movement of data between the non-volatile memory and the volatile memory of the memory module.
    Type: Application
    Filed: December 24, 2020
    Publication date: March 7, 2024
    Inventors: Dimin NIU, Tianchan GUAN, Hongzhong ZHENG, Shuangchen LI
  • Publication number: 20240069755
    Abstract: The present application provides a computer system, a memory expansion device and a method for use in the computer system. The computer system includes multiple hosts and multiple memory expansion devices; the memory expansion devices correspond to the hosts in a one-to-one manner. Each host includes a CPU and a memory; each memory expansion device includes a first interface and multiple second interfaces. The first interface is configured to allow each memory expansion device to communicate with the corresponding CPU via a first coherence interconnection protocol, and the second interface is configured to allow each memory expansion device to communicate with a portion of memory expansion devices via a second coherence interconnection protocol. Any two memory expansion devices communicate with each other via at least two different paths, and the number of memory expansion devices that at least one of the two paths passes through is not more than one.
    Type: Application
    Filed: December 12, 2022
    Publication date: February 29, 2024
    Inventors: YIJIN GUAN, TIANCHAN GUAN, DIMIN NIU, HONGZHONG ZHENG
  • Publication number: 20240069754
    Abstract: The present application discloses a computing system and an associated method. The computing system includes a first host, a second host, a first memory extension device and a second memory extension device. The first host includes a first memory, and the second host includes a second memory. The first host has a plurality of first memory addresses corresponding to a plurality of memory spaces of the first memory, and a plurality of second memory addresses corresponding to a plurality of memory spaces of the second memory. The first memory extension device is coupled to the first host. The second memory extension device is coupled to the second host and the first memory extension device. The first host accesses the plurality of memory spaces of the second memory through the first memory extension device and the second memory extension device.
    Type: Application
    Filed: December 12, 2022
    Publication date: February 29, 2024
    Inventors: TIANCHAN GUAN, YIJIN GUAN, DIMIN NIU, HONGZHONG ZHENG
  • Publication number: 20240069954
    Abstract: The present application discloses a computing system and a memory sharing method for a computing system. The computing system includes a first host, a second host, a first memory extension device and a second memory extension device. The first memory extension device is coupled to the first host. The second memory extension device is coupled to the second host and the first memory extension device. The first host accesses a memory space of a memory of the second host through the first memory extension device and the second memory extension device according to a first physical address of the first host.
    Type: Application
    Filed: May 24, 2023
    Publication date: February 29, 2024
    Inventors: TIANCHAN GUAN, DIMIN NIU, YIJIN GUAN, HONGZHONG ZHENG
  • Publication number: 20240063200
    Abstract: The present disclosure relates to a hybrid bonding based integrated circuit (HBIC) device and its manufacturing method. In some embodiments, an exemplary HBIC device includes: a first die stack comprising one or more dies; and a second die stack integrated above the first die stack. The second die stack includes at least two memory dies communicatively connected to the first die stack by wire bonding.
    Type: Application
    Filed: February 6, 2020
    Publication date: February 22, 2024
    Inventors: Dimin NIU, Shuangchen LI, Tianchan GUAN, Hongzhong ZHENG
  • Publication number: 20240054096
    Abstract: The present disclosure discloses a processor. The processor is used to perform parallel computation and includes a logic die and a memory die. The logic die includes a plurality of processor cores and a plurality of networks on chip, wherein each processor core is programmable. The plurality of networks on chip are correspondingly connected to the plurality of processor cores, so that the plurality of processor cores form a two-dimensional mesh network. The memory die and the processor core are stacked vertically, wherein the memory die includes a plurality of memory tiles, and when the processor performs the parallel computation, the plurality of memory tiles do not have cache coherency; wherein, the plurality of memory tiles correspond to the plurality of processor cores in a one-to-one or one-to-many manner.
    Type: Application
    Filed: December 12, 2022
    Publication date: February 15, 2024
    Inventors: SHUANGCHEN LI, ZHE ZHANG, DIMIN NIU, HONGZHONG ZHENG
  • Publication number: 20240045975
    Abstract: The present disclosure discloses a processor and a multi-core processor. The processor includes a processor core and a memory. The processor core includes a homomorphic encryption instruction execution module and a general-purpose instruction execution module; the homomorphic encryption instruction execution module is configured to perform homomorphic encryption operation and includes a plurality of instruction set architecture extension components, wherein the plurality of instruction set architecture extension components are respectively configured to perform a sub-operation related to the homomorphic encryption; the general-purpose instruction execution module is configured to perform non-homomorphic encryption operation. The memory is vertically stacked with the processor core and is used as a cache or scratchpad memory of the processor core.
    Type: Application
    Filed: December 14, 2022
    Publication date: February 8, 2024
    Inventors: SHUANGCHEN LI, ZHE ZHANG, LINYONG HUANG, DIMIN NIU, XUANLE REN, HONGZHONG ZHENG
  • Publication number: 20240028554
    Abstract: A configurable processing unit including a core processing element and a plurality of assist processing elements can be coupled together by one or more networks. The core processing element can include a large processing logic, large non-volatile memory, input/output interfaces and multiple memory channels. The plurality of assist processing elements can each include smaller processing logic, smaller non-volatile memory and multiple memory channels. One or more bitstreams can be utilized to configure and reconfigure computation resources of the core processing element and memory management of the plurality of assist processing elements.
    Type: Application
    Filed: September 18, 2020
    Publication date: January 25, 2024
    Inventors: Dimin NIU, Tianchan GUAN, Hongzhong ZHENG, Shuangchen LI
  • Publication number: 20240020194
    Abstract: A system-in-package including a logic die and one or more memory dice can include a reliability availability serviceability (RAS) memory management unit (MMU) for memory error detection, memory error prediction and memory error handling. The RAS MMU can receive memory health information, on-die memory error information, system error information and read address information for the one or more memory dice. The RAS MMU can manage the memory blocks of the one or more memory dice based on the memory health information, on-die memory error type, system error type and read address. The RAS MMU can also further manage the memory blocks based on received on-die memory temperature information and or system temperature information.
    Type: Application
    Filed: November 4, 2020
    Publication date: January 18, 2024
    Inventors: Dimin NIU, Tianchan GUAN, Hongzhong ZHENG, Shuangchen LI
  • Publication number: 20240004954
    Abstract: This application describes a hardware acceleration design for improving SpGEMM efficiency. An exemplary method may include: obtaining a first sparse matrix and a second sparse matrix for performing SpGEMM; allocating a pair of buffers respectively pointed to by a first pointer and a second pointer; for each first row in the first sparse matrix that comprises a plurality of non-zero elements, identifying a plurality of second rows in the second sparse matrix that correspond to the plurality of non-zero elements; obtaining a plurality of intermediate lists computed based on each of the plurality of non-zero elements in the first row and one of the plurality of second rows that corresponds to the non-zero element; performing accumulation of the intermediate lists using the pair of buffers; and migrating the final merged list to a system memory as a row of an output matrix of the SpGEMM.
    Type: Application
    Filed: November 1, 2022
    Publication date: January 4, 2024
    Inventors: Zhaoyang DU, Yijin GUAN, Dimin NIU, Hongzhong ZHENG
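The row-wise flow in the abstract follows the classic Gustavson formulation of SpGEMM: each nonzero A[i][j] selects row j of B, and the resulting partial (column, value) lists are merged into one output row. A software sketch of that accumulation (a dict stands in for the hardware merge buffers; names are illustrative):

```python
def spgemm_row(a_row, B):
    """Compute one output row of SpGEMM, Gustavson-style.

    a_row: dict column -> value for one row of sparse matrix A.
    B:     dict row index -> (dict column -> value) for sparse matrix B.
    The accumulator dict plays the role of the patent's buffer pair,
    merging intermediate lists into one final list for the row.
    """
    acc = {}
    for j, a_val in a_row.items():            # nonzeros of A's row
        for k, b_val in B.get(j, {}).items():  # matching row of B
            acc[k] = acc.get(k, 0) + a_val * b_val
    return acc  # final merged list, migrated out as one output row

A_row = {0: 2, 2: 3}               # nonzeros of one row of A
B = {0: {1: 4}, 2: {1: 1, 3: 5}}   # rows of B keyed by row index
# spgemm_row(A_row, B) → {1: 11, 3: 15}
```

The hardware contribution is in how the intermediate lists are double-buffered and merged; the arithmetic per output row is exactly this accumulation.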
  • Publication number: 20240004547
    Abstract: A 3D-stacked memory device including: a base die including a plurality of switches to direct data flow and a plurality of arithmetic logic units (ALUs) to compute data; a plurality of memory dies stacked on the base die; and an interface to transfer signals to control the base die.
    Type: Application
    Filed: September 15, 2023
    Publication date: January 4, 2024
    Inventors: Mu-Tien Chang, Prasun Gera, Dimin Niu, Hongzhong Zheng
  • Publication number: 20240005127
    Abstract: This application describes systems and methods for facilitating memory access for graph neural network (GNN) processing. An example system includes a plurality of processing units, each configured to perform graph neural network (GNN) processing; and a plurality of memory extension cards, each configured to store graph data for the GNN processing, wherein: each of the plurality of processing units is communicatively coupled with three other processing units via one or more interconnects respectively; the plurality of processing units are communicatively coupled with the plurality of memory extension cards respectively; and each of the plurality of memory extension cards includes a graphic access engine circuitry configured to accelerate GNN memory access.
    Type: Application
    Filed: November 28, 2022
    Publication date: January 4, 2024
    Inventors: Yijin GUAN, Dimin NIU, Shengcheng WANG, Shuangchen LI, Hongzhong ZHENG
  • Publication number: 20240004824
    Abstract: This application describes systems and methods for facilitating memory access for graph neural network (GNN) processing. An example method includes fetching, by an access engine circuitry implemented on a circuitry board, a portion of structure data of a graph from a pinned memory in a host memory of a host via a first peripheral component interconnect express (PCIe) connection; performing node sampling using the fetched portion of the structure data of the graph to select one or more sampled nodes; fetching, by the access engine circuitry, a portion of attribute data of the graph from the pinned memory via the first PCIe connection; sending the fetched portion of the attribute data of the graph to one or more processors; and performing, by the one or more processors, GNN processing for the graph using the fetched portion of the attribute data of the graph.
    Type: Application
    Filed: November 30, 2022
    Publication date: January 4, 2024
    Inventors: Shuangchen LI, Dimin NIU, Hongzhong ZHENG, Zhe ZHANG, Yuhao WANG
  • Publication number: 20240004955
    Abstract: This application describes an accelerator, a computer system, and a method for memory optimization in sparse matrix-matrix multiplications (spGEMM). The memory optimization includes accurate memory pre-allocation for a to-be-generated output matrix of spGEMM between two sparse matrices. An exemplary method may include: sampling a plurality of first rows in the first sparse matrix; identifying, based on indices of non-zero data in the plurality of first rows, a plurality of second rows in a second sparse matrix; performing symbolic multiplication operations between the non-zero data in the plurality of first and second rows; determining an estimated compression ratio of the output matrix; determining an estimated mean row size for each row in the output matrix based on the estimated compression ratio; and allocating, according to the estimated mean row size and a total number of rows of the output matrix, a memory space in a hardware memory.
    Type: Application
    Filed: November 9, 2022
    Publication date: January 4, 2024
    Inventors: Zhaoyang DU, Yijin GUAN, Dimin NIU, Hongzhong ZHENG
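The estimation flow in this last abstract can be mimicked in software: sample a few rows of A, run symbolic multiplication (count column indices hit, with and without duplicates), derive a compression ratio, and scale to a mean output-row size for pre-allocation. A hedged sketch under assumed data layouts (this mirrors the abstract's steps, not the patented circuitry):

```python
import random

def estimate_mean_row_size(A, B, sample=4, seed=0):
    """Estimate the mean output-row size of SpGEMM by row sampling.

    A: list of rows, each a list of nonzero column indices.
    B: dict row index -> list of nonzero column indices.
    Symbolic multiplication counts index hits only; no values needed.
    """
    rng = random.Random(seed)
    rows = rng.sample(range(len(A)), min(sample, len(A)))
    raw = merged = 0
    for i in rows:
        cols = set()
        for j in A[i]:
            hits = B.get(j, ())
            raw += len(hits)       # intermediate products before merging
            cols.update(hits)      # unique output columns (symbolic)
        merged += len(cols)
    ratio = merged / raw if raw else 0.0   # estimated compression ratio
    return ratio * (raw / len(rows))       # estimated mean output row size
```

Multiplying the estimated mean row size by the total row count of the output matrix gives the memory to pre-allocate, trading a small sampling pass for avoiding costly reallocation mid-computation.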