Patents by Inventor Yuhao SHU

Yuhao SHU has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DUAL-SIX-TRANSISTOR (D6T) IN-MEMORY COMPUTING (IMC) ACCELERATOR SUPPORTING ALWAYS-LINEAR DISCHARGE AND REDUCING DIGITAL STEPS

Publication number: 20240233815

Abstract: A dual-six-transistor (D6T) in-memory computing (IMC) accelerator supporting always-linear discharge and reducing digital steps is provided. In the IMC accelerator, three effective techniques are proposed: (1) A D6T bitcell can reliably run at 0.4 V and enter a standby mode at 0.26 V, to support parallel processing of dual decoupled ports. (2) An always-linear discharge and convolution mechanism (ALDCM) not only reduces a voltage of a bit line (BL), but also keeps linear calculation throughout an entire voltage range of the BL. (3) A bypass of a bias voltage time converter (BVTC) reduces digital steps, but still keeps high energy efficiency and computing density at a low voltage. A measurement result of the IMC accelerator shows that the IMC accelerator achieves an average energy efficiency of 8918 TOPS/W (8b×8b), and an average computing density of 38.6 TOPS/mm2 (8b×8b) in a 55 nm CMOS technology.

Type: Application

Filed: October 9, 2023

Publication date: July 11, 2024

Applicant: SHANGHAITECH UNIVERSITY

Inventors: Hongtu ZHANG, Yuhao SHU, Yajun HA
ENERGY-EFFICIENT MEMORY FOR CRYOGENIC COMPUTING

Publication number: 20240233796

Abstract: An energy-efficient memory for cryogenic computing is provided. The energy-efficient memory includes a plurality of memory banks, where each of the memory banks includes a cryogenic semi-static, dual-port, boost-free gain cell (CSDB-GC) macro module, a universal address decoder, and a different address decoder. The CSDB-GC macro module includes a plurality of columns of local blocks, and each of the local blocks includes a plurality of CSDB-GC memory cells. A final measurement result of a 16 Kb CSDB-eDRAM shows that the 16 Kb CSDB-eDRAM achieves data retention time (DRT) of 16.67 seconds, which is 2.6 times longer than DRT of a state-of-the-art cryogenic eDRAM at a temperature of 4.2 K, and achieves lower refresh power (0.11 pW/Kb). In addition, the 16 Kb CSDB-eDRAM also achieves shorter access time, namely, 710 ps (1.41 GHz). Compared with the state-of-the-art work, the 16 Kb CSDB-eDRAM has a lowest dynamic power consumption overhead, namely, 49.23 uW/Kb.

Type: Application

Filed: November 9, 2023

Publication date: July 11, 2024

Applicant: SHANGHAITECH UNIVERSITY

Inventors: Yuhao SHU, Hongtu ZHANG, Yajun HA
ENERGY-EFFICIENT CRYOGENIC-IN-MEMORY-COMPUTING (CIMC) ACCELERATOR

Publication number: 20240221811

Abstract: An energy-efficient cryogenic-in-memory-computing (CIMC) accelerator includes cryogenic 3T (C3T) macros. Each of the C3T macros comprises a C3T array containing M rows×N columns of bitcells. An input signal is converted into a timing sequence signal of a corresponding pulse width by using a digital timing sequence converter array. A C3T bitcell of a corresponding row in the C3T macro is controlled to perform charging and discharging on a read bit line (RBL) of a corresponding column. A voltage on the RBL of the corresponding column is sampled by a sense amplifier configured in each C3T macro to obtain a final result. With adaptive reference voltage configuration and storage on the chip, this design can achieve fast and low-power boolean/convolutional computing.

Type: Application

Filed: August 3, 2023

Publication date: July 4, 2024

Applicant: SHANGHAITECH UNIVERSITY

Inventors: Yuhao SHU, Hongtu ZHANG, Yajun HA
DUAL-SIX-TRANSISTOR (D6T) IN-MEMORY COMPUTING (IMC) ACCELERATOR SUPPORTING ALWAYS-LINEAR DISCHARGE AND REDUCING DIGITAL STEPS

Publication number: 20240135989

Abstract: A dual-six-transistor (D6T) in-memory computing (IMC) accelerator supporting always-linear discharge and reducing digital steps is provided. In the IMC accelerator, three effective techniques are proposed: (1) A D6T bitcell can reliably run at 0.4 V and enter a standby mode at 0.26 V, to support parallel processing of dual decoupled ports. (2) An always-linear discharge and convolution mechanism (ALDCM) not only reduces a voltage of a bit line (BL), but also keeps linear calculation throughout an entire voltage range of the BL. (3) A bypass of a bias voltage time converter (BVTC) reduces digital steps, but still keeps high energy efficiency and computing density at a low voltage. A measurement result of the IMC accelerator shows that the IMC accelerator achieves an average energy efficiency of 8918 TOPS/W (8b×8b), and an average computing density of 38.6 TOPS/mm2 (8b×8b) in a 55 nm CMOS technology.

Type: Application

Filed: October 8, 2023

Publication date: April 25, 2024

Applicant: SHANGHAITECH UNIVERSITY

Inventors: Hongtu ZHANG, Yuhao SHU, Yajun HA
Enhanced dynamic random access memory (eDRAM)-based computing-in-memory (CIM) convolutional neural network (CNN) accelerator

Patent number: 11875244

Abstract: An enhanced dynamic random access memory (eDRAM)-based computing-in-memory (CIM) convolutional neural network (CNN) accelerator comprises four P2ARAM blocks, where each of the P2ARAM blocks includes a 5T1C ping-pong eDRAM bit cell array composed of 64×16 5T1C ping-pong eDRAM bit cells. In each of the P2ARAM blocks, 64×2 digital time converters convert a 4-bit activation value into different pulse widths from a row direction and input the pulse widths into the 5T1C ping-pong eDRAM bit cell array for calculation. A total of 16×2 convolution results are output in a column direction of the 5T1C ping-pong eDRAM bit cell array. The CNN accelerator uses the 5T1C ping-pong eDRAM bit cells to perform multi-bit storage and convolution in parallel. An S2M-ADC scheme is proposed to allot an area of an input sampling capacitor of an ABL to sign-numerical SAR ADC units of a C-DAC array without adding area overhead.

Type: Grant

Filed: August 5, 2022

Date of Patent: January 16, 2024

Assignee: SHANGHAITECH UNIVERSITY

Inventors: Hongtu Zhang, Yuhao Shu, Yajun Ha
High-energy-efficiency binary neural network accelerator applicable to artificial intelligence internet of things

Patent number: 11762700

Abstract: A high-energy-efficiency binary neural network accelerator applicable to artificial intelligence Internet of Things is provided. 0.3-0.6V sub/near threshold 10T1C multiplication bit units with series capacitors are configured for charge domain binary convolution. An anti-process deviation differential voltage amplification array between bit lines and DACs is configured for robust pre-amplification in 0.3V batch standardized operations. A lazy bit line reset scheme further reduces energy, and inference accuracy losses can be ignored. Therefore, a binary neural network accelerator chip based on in-memory computation achieves peak energy efficiency of 18.5 POPS/W and 6.06 POPS/W, which are respectively improved by 21× and 135× compared with previous macro and system work [9, 11].

Type: Grant

Filed: January 19, 2023

Date of Patent: September 19, 2023

Assignee: SHANGHAITECH UNIVERSITY

Inventors: Hongtu Zhang, Yuhao Shu, Yajun Ha
ENHANCED DYNAMIC RANDOM ACCESS MEMORY (EDRAM)-BASED COMPUTING-IN-MEMORY (CIM) CONVOLUTIONAL NEURAL NETWORK (CNN) ACCELERATOR

Publication number: 20230196079

Abstract: An enhanced dynamic random access memory (eDRAM)-based computing-in-memory (CIM) convolutional neural network (CNN) accelerator comprises four P2ARAM blocks, where each of the P2ARAM blocks includes a 5T1C ping-pong eDRAM bit cell array composed of 64×16 5T1C ping-pong eDRAM bit cells. In each of the P2ARAM blocks, 64×2 digital time converters convert a 4-bit activation value into different pulse widths from a row direction and input the pulse widths into the 5T1C ping-pong eDRAM bit cell array for calculation. A total of 16×2 convolution results are output in a column direction of the 5T1C ping-pong eDRAM bit cell array. The CNN accelerator uses the 5T1C ping-pong eDRAM bit cells to perform multi-bit storage and convolution in parallel. An S2M-ADC scheme is proposed to allot an area of an input sampling capacitor of an ABL to sign-numerical SAR ADC units of a C-DAC array without adding area overhead.

Type: Application

Filed: August 5, 2022

Publication date: June 22, 2023

Applicant: SHANGHAITECH UNIVERSITY

Inventors: Hongtu ZHANG, Yuhao SHU, Yajun HA

DUAL-SIX-TRANSISTOR (D6T) IN-MEMORY COMPUTING (IMC) ACCELERATOR SUPPORTING ALWAYS-LINEAR DISCHARGE AND REDUCING DIGITAL STEPS

ENERGY-EFFICIENT MEMORY FOR CRYOGENIC COMPUTING

ENERGY-EFFICIENT CRYOGENIC-IN-MEMORY-COMPUTING (CIMC) ACCELERATOR

DUAL-SIX-TRANSISTOR (D6T) IN-MEMORY COMPUTING (IMC) ACCELERATOR SUPPORTING ALWAYS-LINEAR DISCHARGE AND REDUCING DIGITAL STEPS

Enhanced dynamic random access memory (eDRAM)-based computing-in-memory (CIM) convolutional neural network (CNN) accelerator

High-energy-efficiency binary neural network accelerator applicable to artificial intelligence internet of things

ENHANCED DYNAMIC RANDOM ACCESS MEMORY (EDRAM)-BASED COMPUTING-IN-MEMORY (CIM) CONVOLUTIONAL NEURAL NETWORK (CNN) ACCELERATOR