Patents by Inventor Yuhao Wang

Yuhao Wang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

DUAL-MODAL MEMORY INTERFACE CONTROLLER

Publication number: 20220121586

Abstract: A dual-model memory interface of a computing system is provided, configurable to present memory interfaces having differently-graded bandwidth capacity to different processors of the computing system. A mode switch controller of the memory interface controller, based on at least an arbitration rule written to a configuration register, switches the memory interface controller between a narrow-band mode and a wide-band mode. In each mode, the memory interface controller disables either a plurality of narrow-band memory interfaces of the memory interface controller according to a first bus standard, or a wide-band memory interface of the memory interface controller according to a second bus standard. The memory interface controller virtualizes a plurality of system memory units of the computing system as a virtual wide-band memory unit according to the second bus standard, or virtualizes a system memory unit of the computing system as a virtual narrow-band memory unit according to the first bus standard.

Type: Application

Filed: October 16, 2020

Publication date: April 21, 2022

Applicant: Alibaba Group Holding Limited

Inventors: Yuhao Wang, Wei Han, Dimin Niu, Lide Duan, Shuangchen Li, Fei Xue, Hongzhong Zheng
Optical imaging lens assembly

Publication number: 20220099925

Abstract: An optical imaging lens assembly sequentially includes, from an object side to an image side along an optical axis, a first lens (E1), second lens (E2), third lens (E3), fourth lens (E4), fifth lens (E5) and sixth lens (E6) with refractive power. The first lens (E1) has a positive refractive power, and an image-side surface (S2) of the first lens is a concave surface. The second lens (E2) has a negative refractive power. The fifth lens (E5) has a negative refractive power, and an object-side surface (S9) of the fifth lens is a concave surface. A distance TTL from an object-side surface (S1) of the first lens (E1) to an imaging surface (S15) of the optical imaging lens assembly and a total effective focal length f of the optical imaging lens assembly satisfy TTL/f?0.85.

Type: Application

Filed: October 10, 2019

Publication date: March 31, 2022

Inventors: Jianke WENREN, Yuhao WANG, Fujian DAI, Liefeng ZHAO
MEMORY INTERCONNECTION ARCHITECTURE SYSTEMS AND METHODS

Publication number: 20220101887

Abstract: The systems and methods are configured to efficiently and effectively include processing capabilities in memory. In one embodiment, a processing in memory (PIM) chip a memory array, logic components, and an interconnection network. The memory array is configured to store information. In one exemplary implementation the memory array includes storage cells and array periphery components. The logic components can be configured to process information stored in the memory array. The interconnection network is configured to communicatively couple the logic components. The interconnection network can include interconnect wires, and a portion of the interconnect wires are located in a metal layer area that is located above the memory array.

Type: Application

Filed: September 29, 2020

Publication date: March 31, 2022

Inventors: Wei HAN, Shuangchen LI, Lide DUAN, Hongzhong ZHENG, Dimin NIU, Yuhao WANG, Xiaoxin FAN
VIDEO ENCODING TECHNIQUE UTILIZING USER GUIDED INFORMATION IN CLOUD ENVIRONMENT

Publication number: 20220078473

Abstract: The present disclosure relates to a computer-implemented method for processing video data. The method comprises receiving a user input corresponding to a first picture of the video data, generating, based on the user input, prediction information of the first picture with respect a reference picture of the video data, and encoding the first picture using the prediction information.

Type: Application

Filed: September 8, 2020

Publication date: March 10, 2022

Inventors: Yuhao WANG, Minghai QIN, Jian LOU, Yen-Kuang Chen
METHODS OF BREAKING DOWN COARSE-GRAINED TASKS FOR FINE-GRAINED TASK RE-SCHEDULING

Publication number: 20220075622

Abstract: A method of scheduling instructions in a processing system comprising a processing unit and one or more co-processors comprises dispatching a plurality of instructions from a master processor to a co-processor of the one or more co-processors, wherein each instruction of the plurality of instructions comprises one or more additional fields, wherein at least one field comprises grouping information operable to consolidate the plurality of instructions for decomposition, and wherein at least one field comprises control information. The method also comprises decomposing the plurality of instructions into a plurality of fine-grained instructions, wherein the control information comprises rules associated with decomposing the plurality of instructions into the plurality of fine-grained instructions. Further, the method comprises scheduling the plurality of fine-grained instructions to execute on the co-processor, wherein the scheduling is performed in a non-sequential order.

Type: Application

Filed: September 4, 2020

Publication date: March 10, 2022

Inventors: Fei XUE, Yuhao WANG, Fei SUN, Hongzhong ZHENG
SCALABLE SYSTEM-IN-PACKAGE ARCHITECTURES

Publication number: 20220058150

Abstract: A system-in-package architecture in accordance with aspects includes a logic die and one or more memory dice coupled together in a three-dimensional slack. The logic die can include one or more global building blocks and a plurality of local building blocks. The number of local building blocks can be scalable. The local building blocks can include a plurality of engines and memory controllers. The memory controllers can be configured to directly couple one or more of the engines to the one or more memory dice. The number and type of local building blocks, and the number and types of engines and memory controllers can be scalable.

Type: Application

Filed: August 20, 2020

Publication date: February 24, 2022

Inventors: Lide DUAN, Wei HAN, Yuhao WANG, Fei XUE, Yuanwei FANG, Hongzhong ZHENG
PROGRAMMABLE AND HIERARCHICAL CONTROL OF EXECUTION OF GEMM OPERATION ON ACCELERATOR

Publication number: 20220058237

Abstract: The present disclosure relates to a method for controlling execution of a GEMM operation on an accelerator comprising multiple computation units, a first memory device, and a second memory device. The method comprises determining an execution manner of the GEMM operation, the execution manner comprising partition information of the GEMM operation and computation unit allocation information of the partitioned GEMM operation; generating one or more instructions to compute the partitioned GEMM operation on one or more allocated computation units; and issuing the one or more instructions to at least one of a first queue and a second queue, which enables at least one of a first local controller and a second local controller to execute the one or more instructions, wherein the first local controller and the second local controller are configured to control data movement between the computation units, the first memory device, and the second memory device.

Type: Application

Filed: August 21, 2020

Publication date: February 24, 2022

Inventors: Yuhao Wang, Fei Sun, Fei Xue, Yen-Kuang Chen, Hongzhong Zheng
USING TAGGED INSTRUCTION EXTENSION TO EXPRESS DEPENDENCY FOR MEMORY-BASED ACCELERATOR INSTRUCTIONS

Publication number: 20220058024

Abstract: A method of performing out-of-order execution in a processing system comprising a processing unit and one or more accelerators comprises dispatching a plurality of coarse-grained instructions, each instruction extended to comprise one or more tags, wherein each tag comprises dependency information for the respective instruction expressed at a coarse-grained level. The method also comprises translating the plurality of coarse-grained instructions into a plurality of fine-grained instructions, wherein the dependency information is translated into dependencies expressed at a fine-grained level. Further, the method comprises resolving the dependencies at the fine-grained level and scheduling the plurality of fine-grained instructions for execution across the one or more accelerators in the processing system.

Type: Application

Filed: August 18, 2020

Publication date: February 24, 2022

Inventors: Yuanwei FANG, Fei SUN, Fei XUE, Yuejian XIE, Yuhao WANG, Yen-Kuang CHEN
VECTOR ACCELERATOR FOR ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Publication number: 20220051086

Abstract: The present disclosure provides an accelerator for processing a vector or matrix operation. The accelerator comprises a vector processing unit comprising a plurality of computation units having circuitry configured to process a vector operation in parallel; a matrix multiplication unit comprising a first matrix multiplication operator, a second matrix multiplication operator, and an accumulator, the first matrix multiplication operator and the second matrix multiplication operator having circuitry configured to process a matrix operation and the accumulator having circuitry configured to accumulate output results of the first matrix multiplication operator and the second matrix multiplication operator; and a memory storing input data for the vector operation or the matrix operation and being configured to communicate with the vector processing unit and the matrix multiplication unit.

Type: Application

Filed: July 22, 2021

Publication date: February 17, 2022

Inventors: Fei XUE, Wei HAN, Yuhao WANG, Fei SUN, Lide DUAN, Shuangchen LI, Dimin NIU, Tianchan GUAN, Linyong HUANG, Zhaoyang DU, Hongzhong ZHENG
Techniques for determining importance of encoded image components for artificial intelligence tasks

Patent number: 11170260

Abstract: A system for determining the importance of encoded image components for artificial intelligence tasks includes an image capture or storage unit, a processor and a communication interface. The processor can receive components of transformed domain image data from the one or more image capture or storage units across the communication interface. The processor can be configured to determine the relative importance of the components of the transformed domain image data for an artificial intelligence task.

Type: Grant

Filed: November 14, 2019

Date of Patent: November 9, 2021

Assignee: Alibaba Group Holding Limited

Inventors: Kai Xu, Minghai Qin, Yuhao Wang, Fei Sun, Yen-kuang Chen, Yuan Xie
SYSTOLIC ARRAY-FRIENDLY DATA PLACEMENT AND CONTROL

Publication number: 20210334142

Abstract: The present disclosure relates to an accelerator for systolic array-friendly data placement. The accelerator may include: a systolic array comprising a plurality of operation units, wherein the systolic array is configured to receive staged input data and perform operations using the staged input to generate staged output data, the staged output data comprising a number of segments; a controller configured to execute one or more instructions to generate a pattern generation signal; a data mask generator; and a memory configured to store the staged output data using the generated masks. The data mask generator may include circuitry configured to: receive the pattern generation signal from the controller, and, based on the received signal, generate a mask corresponding to each segment of the staged output data.

Type: Application

Filed: April 24, 2020

Publication date: October 28, 2021

Inventors: Yuhao Wang, Xiaoxin Fan, Dimin Niu, Chunsheng Liu, Wei Han
FREQUENCY DOMAIN NEURAL NETWORK ACCELERATOR

Publication number: 20210319289

Abstract: The present disclosure relates to systems and methods concerning a system including a host device and a convolutional neural network hardware accelerator. The hardware accelerator can be configured, at least in part by the host device, to generate activation data from spatial-domain input data and spatial-domain weight data using frequency-domain operations. The hardware accelerator can include one or more discrete Fourier transform units configured to generate a frequency-domain representation of the input data. The hardware accelerator can include a multiplication unit configured to generate a frequency-domain representation of the activation data by element-wise complex multiplication of the frequency-domain representation of the input data and a frequency-domain representation of the weight data.

Type: Application

Filed: April 13, 2020

Publication date: October 14, 2021

Inventors: Wei HAN, Xiaoxin FAN, Yuhao WANG
USING ELECTRICAL CONNECTIONS THAT TRAVERSE SCRIBE LINES TO CONNECT DEVICES ON A CHIP

Publication number: 20210320080

Abstract: A chip or integrated circuit includes a layer that includes a first device and a second device. A scribe line is located between the first device and the second device and separates the first device from the second device. An electrically conductive connection traverses the scribe line and is coupled to the first device and the second device, thus connecting the first and second devices.

Type: Application

Filed: April 13, 2020

Publication date: October 14, 2021

Inventors: Shuangchen LI, Wei HAN, Dimin NIU, Yuhao WANG, Hongzhong ZHENG
OPTICAL IMAGING SYSTEM

Publication number: 20210278637

Abstract: The present disclosure discloses an optical imaging system including, sequentially from an object side to an image side along an optical axis, a first lens having refractive power; a second lens having negative refractive power; a third lens having negative refractive power; a fourth lens having refractive power, a convex object-side surface and a concave image-side surface; and a fifth lens having refractive power. A distance TTL along the optical axis from an object-side surface of the first lens to an imaging plane of the optical imaging system and half of a diagonal length ImgH of an effective pixel area on the imaging plane of the optical imaging system satisfy: 1.0<TTL/ImgH<1.5.

Type: Application

Filed: January 5, 2021

Publication date: September 9, 2021

Inventors: Yuhao Wang, Yang Li, Lingbo He, Fujian Dai, Liefeng Zhao
Method and system for memory control

Patent number: 11068200

Abstract: Methods and systems are provided for improving memory control. A memory architecture includes a plurality of memory units and an interface. A respective memory unit of the plurality of memory units is configured with a Processing-In-Memory (PIM) architecture. The interface includes a plurality of lines. The interface is coupled between the plurality of memory units and a host. The interface is configured to receive one or more signals from a host via the plurality of lines. The respective memory unit of the plurality of memory units is coupled with a respective line of the plurality of lines, and the respective memory unit is further configured to receive a respective signal of the one or more signals via the interface so as to be individually selected by the host.

Type: Grant

Filed: November 27, 2019

Date of Patent: July 20, 2021

Assignee: Alibaba Group Holding Limited

Inventors: Dimin Niu, Lide Duan, Yuhao Wang, Xiaoxin Fan, Zhibin Xiao
METHOD AND SYSTEM FOR PROCESSING A NEURAL NETWORK

Publication number: 20210209462

Abstract: Embodiments of the disclosure provide methods and systems for processing a neural network associated with an input matrix having a first number of elements. The method can include: dividing the input matrix into a plurality of vectors, each vector having a second number of elements; grouping the plurality of vectors into a first group of vectors and a second group of vectors; and pruning the first group of vectors and the second group of vectors.

Type: Application

Filed: January 7, 2020

Publication date: July 8, 2021

Inventors: Ao REN, Tao ZHANG, Yuhao WANG, Yuan XIE
METHOD AND SYSTEM FOR MEMORY CONTROL

Publication number: 20210157516

Abstract: Methods and systems are provided for improving memory control. A memory architecture includes a plurality of memory units and an interface. A respective memory unit of the plurality of memory units is configured with a Processing-In-Memory (PIM) architecture. The interface includes a plurality of lines. The interface is coupled between the plurality of memory units and a host. The interface is configured to receive one or more signals from a host via the plurality of lines. The respective memory unit of the plurality of memory units is coupled with a respective line of the plurality of lines, and the respective memory unit is further configured to receive a respective signal of the one or more signals via the interface so as to be individually selected by the host.

Type: Application

Filed: November 27, 2019

Publication date: May 27, 2021

Inventors: Dimin Niu, Lide Duan, Yuhao Wang, Xiaoxin Fan, Zhibin Xiao
TECHNIQUES TO DYNAMICALLY GATE ENCODED IMAGE COMPONENTS FOR ARTIFICIAL INTELLIGENCE TASKS

Publication number: 20210150768

Abstract: A system for processing encoded image components for artificial intelligence tasks. The system can include one or more compute units, one or more controllers and memory. The one or more controllers can include one or more micro-op schedulers and one or more channel switches. The one or more compute units can be configured to process components of the transformed domain image data according to one or more micro-operations for an artificial intelligence task. The one or more channel switches can be configured to selectively control the transfer of the components of transformed domain image data to the one or more compute units based on one or more gating flags. The one or more channel switches can also be configured to selectively control generation of the one or more micro-operations by the one or more micro-op schedulers based on the one or more gating flags.

Type: Application

Filed: November 14, 2019

Publication date: May 20, 2021

Inventors: Kai XU, Minghai QIN, Yuhao WANG, Fei SUN, Yen-kuang CHEN, Yuan XIE
RECONSTRUCTING TRANSFORMED DOMAIN INFORMATION IN ENCODED VIDEO STREAMS

Publication number: 20210152832

Abstract: Discrete cosine transformation (DCT) information can be estimated from adjacent blocks of the same frame. DCT information can be estimated from different frames. Motion vectors can be used to track the position of objects in some frames of the video. For example, a stream of encoded frames is received; the encoded frames are entropy decoded and dequantized to produce DCT information for blocks of the frames; and DCT information for a block in a frame is determined using the DCT information produced from the entropy decoding and dequantizing for a different block.

Type: Application

Filed: November 14, 2019

Publication date: May 20, 2021

Inventors: Minghai QIN, Yen-kuang CHEN, Kai XU, Yuhao WANG, Fei SUN, Yuan XIE
TECHNIQUES FOR DETERMINING IMPORTANCE OF ENCODED IMAGE COMPONENTS FOR ARTIFICIAL INTELLIGENCE TASKS

Publication number: 20210150265

Abstract: A system for determining the importance of encoded image components for artificial intelligence tasks includes an image capture or storage unit, a processor and a communication interface. The processor can receive components of transformed domain image data from the one or more image capture or storage units across the communication interface. The processor can be configured to determine the relative importance of the components of the transformed domain image data for an artificial intelligence task.

Type: Application

Filed: November 14, 2019

Publication date: May 20, 2021

Inventors: Kai XU, Minghai QIN, Yuhao WANG, Fei SUN, Yen-kuang CHEN, Yuan XIE

prev 1 2 3 4 next