Patents by Inventor Po-An TSAI

Po-An TSAI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

COMPRESSION OF MACHINE LEARNING MODELS VIA SPARSIFICATION AND QUANTIZATION

Publication number: 20250094864

Abstract: Machine learning is a process that learns a model from a given dataset, where the model can then be used to make a prediction about new data. In order to reduce the size, computation, and latency of a machine learning model, a compression technique can be employed which includes model sparsification and quantization. To limit the extent to which the quality of the model is impacted when uniformly applying sparsification and quantization to all values of the model, the present disclosure provides for a hybrid sparsification and quantization of the model.
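Illustrative sketch (not taken from the application): the abstract does not say how the hybrid split between sparsification and quantization is made. The Python below assumes one simple possibility, in which each value in a group is either pruned to zero or quantized to a low-bit integer with a per-group scale. The function names, the group size, the number of pruned values, and the bit width are all hypothetical choices made for illustration.

```python
def quantize(value, scale, bits):
    """Round value/scale to the nearest signed integer that fits in `bits` bits."""
    qmax = 2 ** (bits - 1) - 1
    q = round(value / scale)
    return max(-qmax - 1, min(qmax, q))


def hybrid_compress(weights, group_size=4, n_prune=2, bits=4):
    """Assumed scheme: per group, zero the n_prune smallest-magnitude weights
    and quantize the rest. Returns (codes, scales) with one scale per group."""
    qmax = 2 ** (bits - 1) - 1
    codes, scales = [], []
    for start in range(0, len(weights), group_size):
        group = weights[start:start + group_size]
        # Positions of the smallest-magnitude weights in this group.
        order = sorted(range(len(group)), key=lambda i: abs(group[i]))
        pruned = set(order[:n_prune])
        max_abs = max(abs(w) for w in group)
        scale = max_abs / qmax if max_abs > 0 else 1.0
        scales.append(scale)
        for i, w in enumerate(group):
            codes.append(0 if i in pruned else quantize(w, scale, bits))
    return codes, scales


def decompress(codes, scales, group_size=4):
    """Map integer codes back to approximate real-valued weights."""
    return [c * scales[i // group_size] for i, c in enumerate(codes)]
```

Storing one scale per group rather than one per tensor is a design choice of this sketch; it keeps a single large weight from coarsening the quantization grid of every other group.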

Type: Application

Filed: March 12, 2024

Publication date: March 20, 2025

Inventors: Po-An Tsai, Geonhwa Jeong, Jeffrey Michael Pool

METHOD AND APPARATUS FOR DIRECT CONVOLUTION CALCULATION

Publication number: 20250060938

Abstract: Systems and methods for efficient convolution based on matrix multiply and add (MMA) are described. An example processor having a plurality of processing lanes is configured to perform convolution of a matrix of activation elements and a filter matrix in accordance with a configurable series of instructions including a plurality of MMA instructions and shift instructions while reusing activation elements already loaded to the datapath or associated memory over a plurality of MMA operations. Associated methods are also described.
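Illustrative sketch (not taken from the application): the general idea of expressing a convolution as a series of multiply-accumulate steps over a shifted, already-loaded activation buffer can be shown in one dimension. The Python below is a software analogy only; it assumes a 1D valid (unpadded) convolution, and the function names and data layout are hypothetical. It says nothing about the processor's actual instructions or lanes.

```python
def matmul_add(acc, a, b):
    """acc += a @ b for list-of-lists matrices (stands in for one MMA step)."""
    for i in range(len(a)):
        for j in range(len(b[0])):
            acc[i][j] += sum(a[i][k] * b[k][j] for k in range(len(b)))


def conv1d_mma_shift(activations, filter_taps):
    """activations: [positions][c_in]; filter_taps: [taps][c_in][c_out].
    Returns the valid outputs as [positions - taps + 1][c_out]."""
    taps = len(filter_taps)
    out_len = len(activations) - taps + 1
    c_out = len(filter_taps[0][0])
    acc = [[0] * c_out for _ in range(out_len)]
    window = list(activations)  # loaded once, afterwards only shifted
    for tap in filter_taps:
        # Multiply the current window by this tap's weight matrix and accumulate.
        matmul_add(acc, window[:out_len], tap)
        # Shift by one position so the next tap sees the next neighbours.
        window = window[1:]
    return acc
```

Each activation is read from the original input once; every later use comes from the shifted buffer, which is the reuse the sketch is meant to illustrate.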

Type: Application

Filed: August 14, 2023

Publication date: February 20, 2025

Inventors: Jack CHOQUETTE, Po-An TSAI, Alexander L. MINKIN, Manan PATEL, Neal Clayton CRAGO, Daniel STIFFLER, Kefeng DUAN, Yu-Jung CHEN, Jing LI, Qian WANG, Ronny KRASHINSKY, Jun YANG, Feng XIE

Support device and base thereof

Patent number: 12196406

Abstract: A base is configured for a bracket. The base includes a hollow body, a plurality of supporting branches, and an illuminating module. The hollow body is connected to the bracket and has a bottom part and a first sidewall. The bottom part has an open hole. The first sidewall has a transparent structure. The plurality of supporting branches is disposed around the hollow body to lift the hollow body. The illuminating module is disposed in the hollow body and includes a sleeve and a base plate. The sleeve has a second sidewall, a first end, and a second end opposite to the first end. The second sidewall has an opening. The position of the opening corresponds to the transparent structure. The base plate is disposed on the first end. The base plate is provided with a light source. The light source projects light beams toward the second end.

Type: Grant

Filed: December 23, 2022

Date of Patent: January 14, 2025

Assignee: ASUSTEK COMPUTER INC.

Inventors: Kai Chieh Hsu, Chih-Wei Chuang, Yaw-Huei Chiou, Peng Chao Wang, Po-An Tsai, Hao-Chun Lai

GENERATING SPARSE NEURAL NETWORKS

Publication number: 20240152407

Abstract: Apparatuses, systems, and techniques to determine a configuration based at least in part on data stored by at least one data structure of a workload at runtime, and transform the workload into a sparse workload based at least in part on the configuration. In at least one embodiment, one or more sparse workloads (e.g., one or more sparse neural networks) are generated based at least in part on, for example, one or more workloads (e.g., one or more neural networks).

Type: Application

Filed: July 17, 2023

Publication date: May 9, 2024

Inventors: Geonhwa Jeong, Po-An Tsai, Jeffrey Michael Pool

SUPPORT DEVICE AND BASE THEREOF

Publication number: 20240027061

Abstract: A base is configured for a bracket. The base includes a hollow body, a plurality of supporting branches, and an illuminating module. The hollow body is connected to the bracket and has a bottom part and a first sidewall. The bottom part has an open hole. The first sidewall has a transparent structure. The plurality of supporting branches is disposed around the hollow body to lift the hollow body. The illuminating module is disposed in the hollow body and includes a sleeve and a base plate. The sleeve has a second sidewall, a first end, and a second end opposite to the first end. The second sidewall has an opening. The position of the opening corresponds to the transparent structure. The base plate is disposed on the first end. The base plate is provided with a light source. The light source projects light beams toward the second end.

Type: Application

Filed: December 23, 2022

Publication date: January 25, 2024

Inventors: Kai Chieh HSU, Chih-Wei CHUANG, Yaw-Huei CHIOU, Peng Chao WANG, Po-An TSAI, Hao-Chun LAI

TRENCH-GATE FIELD EFFECT TRANSISTOR

Publication number: 20230411470

Abstract: A trench-gate field effect transistor includes a plurality of trenches, a plurality of gate electrode units, and a plurality of source electrode units. Each of the trenches has a first trench region, a second trench region having a width less than that of the first trench region, and a neck trench region extending between the first trench region and the second trench region. Each of the gate electrode units includes a pair of first gate electrode portions disposed in the first trench region, a pair of second gate electrode portions disposed in the neck trench region, and a third gate electrode portion disposed in the second trench region. Each of the source electrode units includes a first source electrode portion disposed between a pair of the first gate electrode portions, and a second source electrode portion connected to the first source electrode portion.

Type: Application

Filed: May 19, 2023

Publication date: December 21, 2023

Applicant: FORCE MOS TECHNOLOGY CO., LTD.

Inventors: Kao-Way TU, Yuan-Shun CHANG, Po-An TSAI, Huan-Chung WENG

PRUNING AND ACCELERATING NEURAL NETWORKS WITH HIERARCHICAL FINE-GRAINED STRUCTURED SPARSITY

Publication number: 20230062503

Abstract: Hierarchical structured sparse parameter pruning and processing improves runtime performance and energy efficiency of neural networks. In contrast with conventional (non-structured) pruning which allows for any distribution of the non-zero values within a matrix that achieves the desired sparsity degree (e.g., 50%) and is consequently difficult to accelerate, structured hierarchical sparsity requires each multi-element unit at the coarsest granularity of the hierarchy to be pruned to the desired sparsity degree. The global desired sparsity degree is a function of the per-level sparsity degrees. Distribution of non-zero values within each multi-element unit is constrained according to the per-level sparsity degree at the particular level of the hierarchy. Each level of the hierarchy may be associated with a hardware (e.g., logic or circuit) structure that can be enabled or disabled according to the per-level sparsity.
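Illustrative sketch (not taken from the application): a two-level version of the hierarchy described in the abstract can be written as two nested N:M pruning passes. The Python below assumes a fine level that keeps 2 of every 4 values and a coarse level that keeps 1 of every 2 four-value blocks; these ratios, the magnitude-based scoring, and the function names are hypothetical. In this sketch the overall fraction of values kept is the product of the per-level fractions, which is one way the global sparsity can be a function of the per-level sparsity degrees.

```python
def keep_top_n(scores, n, m):
    """Boolean mask keeping the n highest scores in each consecutive run of m."""
    mask = [False] * len(scores)
    for start in range(0, len(scores), m):
        idx = range(start, min(start + m, len(scores)))
        for i in sorted(idx, key=lambda i: scores[i], reverse=True)[:n]:
            mask[i] = True
    return mask


def hierarchical_prune(values, fine=(2, 4), coarse=(1, 2)):
    """Apply N:M pruning to values, then N:M pruning to blocks of values."""
    n0, m0 = fine
    n1, m1 = coarse
    # Fine level: keep the n0 largest-magnitude values in each run of m0.
    fine_mask = keep_top_n([abs(v) for v in values], n0, m0)
    kept = [v if k else 0 for v, k in zip(values, fine_mask)]
    # Coarse level: score each block of m0 values by its surviving magnitude,
    # then keep the n1 best blocks in each run of m1 blocks.
    block_scores = [sum(abs(v) for v in kept[s:s + m0])
                    for s in range(0, len(kept), m0)]
    block_mask = keep_top_n(block_scores, n1, m1)
    return [v if block_mask[i // m0] else 0 for i, v in enumerate(kept)]
```

With the default ratios, 2/4 of the values survive the fine level and 1/2 of the blocks survive the coarse level, so 1/4 of the values remain overall.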

Type: Application

Filed: February 28, 2022

Publication date: March 2, 2023

Inventors: Yannan Wu, Po-An Tsai, Saurav Muralidharan, Joel Springer Emer

FLEXIBLE ACCELERATOR FOR A TENSOR WORKLOAD

Publication number: 20220083314

Abstract: Accelerators are generally utilized to provide high performance and energy efficiency for tensor algorithms. Currently, an accelerator will be specifically designed around the fundamental properties of the tensor algorithm and shape it supports, and thus will exhibit sub-optimal performance when used for other tensor algorithms and shapes. The present disclosure provides a flexible accelerator for tensor workloads. The flexible accelerator can be a flexible tensor accelerator or a FPGA having a dynamically configurable inter-PE network supporting different tensor shapes and different tensor algorithms including at least a GEMM algorithm, a 2D CNN algorithm, and a 3D CNN algorithm, and/or having a flexible DPU in which a dot product length of its dot product sub-units is configurable based on a target compute throughput that is less than or equal to a maximum throughput of the flexible DPU.

Type: Application

Filed: June 9, 2021

Publication date: March 17, 2022

Inventors: Po An Tsai, Neal Crago, Angshuman Parashar, Joel Springer Emer, Stephen William Keckler

FLEXIBLE ACCELERATOR FOR A TENSOR WORKLOAD

Publication number: 20220083500

Abstract: Accelerators are generally utilized to provide high performance and energy efficiency for tensor algorithms. Currently, an accelerator will be specifically designed around the fundamental properties of the tensor algorithm and shape it supports, and thus will exhibit sub-optimal performance when used for other tensor algorithms and shapes. The present disclosure provides a flexible accelerator for tensor workloads. The flexible accelerator can be a flexible tensor accelerator or a FPGA having a dynamically configurable inter-PE network supporting different tensor shapes and different tensor algorithms including at least a GEMM algorithm, a 2D CNN algorithm, and a 3D CNN algorithm, and/or having a flexible DPU in which a dot product length of its dot product sub-units is configurable based on a target compute throughput.

Type: Application

Filed: June 9, 2021

Publication date: March 17, 2022

Inventors: Po An Tsai, Neal Crago, Angshuman Parashar, Joel Springer Emer, Stephen William Keckler

Resource based virtual computing instance scheduling

Patent number: 10956227

Abstract: Examples provide two-tiered scheduling within a cluster. A coarse-grained analysis is performed on a candidate set of hosts to select a host for a virtual computing instance based on optimization of at least one resource. A host is selected based on the analysis results. The identified virtual computing instance is placed on the selected host. A fine-grained analysis is performed on a set of communication graphs for a plurality of virtual computing instances to generate a set of penalty scores. A set of communicating virtual computing instances are selected based on the set of penalty scores. A first virtual computing instance from a first host is relocated to a second host to minimize a distance between the first virtual computing instance and a second virtual computing instance. Relocating the first virtual computing instance reduces at least one penalty score for the set of communicating virtual computing instances.
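Illustrative sketch (not taken from the patent): the two tiers in the abstract can be mimicked with a coarse placement step and a fine relocation step. The Python below assumes a single scalar resource per host, a penalty equal to traffic multiplied by host distance, and a relocation rule that moves the first instance of the worst pair toward its peer. All names, data structures, and the penalty formula are hypothetical.

```python
def place(vm_demand, free):
    """Coarse tier: pick the candidate host with the most free capacity that fits."""
    fitting = [h for h, cap in free.items() if cap >= vm_demand]
    if not fitting:
        return None
    return max(fitting, key=lambda h: free[h])


def penalty(pair, placement, distance, traffic):
    """Assumed penalty: traffic between two instances times their host distance."""
    a, b = pair
    return traffic[pair] * distance[placement[a]][placement[b]]


def relocate_worst_pair(placement, free, demand, distance, traffic):
    """Fine tier: move the first instance of the highest-penalty pair to the
    host with room for it that is closest to its peer. Returns (vm, host) or None."""
    if not traffic:
        return None
    vm, peer = max(traffic, key=lambda p: penalty(p, placement, distance, traffic))
    current, target = placement[vm], placement[peer]
    candidates = [h for h in free if h != current and free[h] >= demand[vm]]
    if not candidates:
        return None
    best = min(candidates, key=lambda h: distance[h][target])
    if distance[best][target] >= distance[current][target]:
        return None  # no move would bring the pair closer
    # Update capacity bookkeeping and record the new placement.
    free[current] += demand[vm]
    free[best] -= demand[vm]
    placement[vm] = best
    return vm, best
```

The coarse step looks only at capacity, while the fine step looks only at communication; keeping the two separate mirrors the split the abstract describes.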

Type: Grant

Filed: February 11, 2019

Date of Patent: March 23, 2021

Assignee: VMware, Inc.

Inventors: Po-An Tsai, Sahan Gamage, Rean Griffith

Shielded gate MOSFET and fabricating method thereof

Patent number: 10700175

Abstract: A fabricating method of a shielded gate MOSFET is provided, including the steps of forming a semiconductor substrate having a trench, forming a sacrifice oxide layer in the trench, the sacrifice oxide layer covering a side wall of the trench, forming a source polycrystalline silicon region in the trench, forming an insulation oxide layer above the source polycrystalline silicon region to have the source polycrystalline silicon region fully enclosed by the sacrifice oxide layer and the insulation oxide layer, depositing polycrystalline silicon into the trench and carrying out a back etching to control a thickness of the insulation oxide layer above the source polycrystalline silicon region, forming a gate oxide layer in the trench, the gate oxide layer covering the side wall of the trench, forming a gate polycrystalline silicon region in the trench, and forming a body layer and a heavily doped region around the trench in an ion implantation manner.

Type: Grant

Filed: January 10, 2019

Date of Patent: June 30, 2020

Assignee: Force MOS Technology Co., Ltd.

Inventors: Kao-Way Tu, Po-An Tsai, Huan-Chung Weng

Shielded Gate MOSFET and Fabricating Method Thereof

Publication number: 20200105890

Abstract: A fabricating method of a shielded gate MOSFET is provided, including steps of: forming a semiconductor substrate having a trench; forming a sacrifice oxide layer in the trench, the sacrifice oxide layer covering a side wall of the trench; forming a source polycrystalline silicon region in the trench; forming an insulation oxide layer above the source polycrystalline silicon region to have the source polycrystalline silicon region fully enclosed by the sacrifice oxide layer and the insulation oxide layer; depositing polycrystalline silicon into the trench and carrying out a back etching to control a thickness of the insulation oxide layer above the source polycrystalline silicon region; forming a gate oxide layer in the trench, the gate oxide layer covering the side wall of the trench; forming a gate polycrystalline silicon region in the trench; and forming a body layer and a heavily doped region around the trench in an ion implantation manner.

Type: Application

Filed: January 10, 2019

Publication date: April 2, 2020

Inventors: Kao-Way Tu, Po-An Tsai, Huan-Chung Weng

Projecting device and electronic device having a projecting device

Patent number: 10401717

Abstract: A projecting device is provided. The projecting device is adapted to assembling with an electronic device. The projecting device comprises a main body, a light emitting portion, a rotating portion, and an adjusting portion. The main body includes a first opening and a second opening. The light emitting portion is disposed in the main body, and transmits a light through the first opening. The rotating portion is disposed in the main body and connected with the light emitting portion. The adjusting portion is disposed in the second opening and connected with the rotating portion. The light emitting portion drives the rotating portion to rotate through the adjusting portion to adjust the angle of the light.

Type: Grant

Filed: August 1, 2018

Date of Patent: September 3, 2019

Assignee: ASUSTEK COMPUTER INC.

Inventor: Po-An Tsai

RESOURCE BASED VIRTUAL COMPUTING INSTANCE SCHEDULING

Publication number: 20190188050

Abstract: Examples provide two-tiered scheduling within a cluster. A coarse-grained analysis is performed on a candidate set of hosts to select a host for a virtual computing instance based on optimization of at least one resource. A host is selected based on the analysis results. The identified virtual computing instance is placed on the selected host. A fine-grained analysis is performed on a set of communication graphs for a plurality of virtual computing instances to generate a set of penalty scores. A set of communicating virtual computing instances are selected based on the set of penalty scores. A first virtual computing instance from a first host is relocated to a second host to minimize a distance between the first virtual computing instance and a second virtual computing instance. Relocating the first virtual computing instance reduces at least one penalty score for the set of communicating virtual computing instances.

Type: Application

Filed: February 11, 2019

Publication date: June 20, 2019

Inventors: Po-An Tsai, Sahan Gamage, Rean Griffith

Resource based virtual computing instance scheduling

Patent number: 10241840

Abstract: Examples provide two-tiered scheduling within a cluster. A coarse-grained analysis is performed on a candidate set of hosts to select a host for a virtual computing instance based on optimization of at least one resource. A host is selected based on the analysis results. The identified virtual computing instance is placed on the selected host. A fine-grained analysis is performed on a set of communication graphs for a plurality of virtual computing instances to generate a set of penalty scores. A set of communicating virtual computing instances are selected based on the set of penalty scores. A first virtual computing instance from a first host is relocated to a second host to minimize a distance between the first virtual computing instance and a second virtual computing instance. Relocating the first virtual computing instance reduces at least one penalty score for the set of communicating virtual computing instances.

Type: Grant

Filed: September 30, 2016

Date of Patent: March 26, 2019

Assignee: VMware, Inc.

Inventors: Po-An Tsai, Sahan Gamage, Rean Griffith

PROJECTING DEVICE AND ELECTRONIC DEVICE HAVING THE SAME

Publication number: 20190041730

Abstract: A projecting device is provided. The projecting device is adapted to assembling with an electronic device. The projecting device comprises a main body, a light emitting portion, a rotating portion, and an adjusting portion. The main body includes a first opening and a second opening. The light emitting portion is disposed in the main body, and transmits a light through the first opening. The rotating portion is disposed in the main body and connected with the light emitting portion. The adjusting portion is disposed in the second opening and connected with the rotating portion. The light emitting portion drives the rotating portion to rotate through the adjusting portion to adjust the angle of the light.

Type: Application

Filed: August 1, 2018

Publication date: February 7, 2019

Inventor: Po-An TSAI

RESOURCE BASED VIRTUAL COMPUTING INSTANCE SCHEDULING

Publication number: 20180095776

Abstract: Examples provide two-tiered scheduling within a cluster. A coarse-grained analysis is performed on a candidate set of hosts to select a host for a virtual computing instance based on optimization of at least one resource. A host is selected based on the analysis results. The identified virtual computing instance is placed on the selected host. A fine-grained analysis is performed on a set of communication graphs for a plurality of virtual computing instances to generate a set of penalty scores. A set of communicating virtual computing instances are selected based on the set of penalty scores. A first virtual computing instance from a first host is relocated to a second host to minimize a distance between the first virtual computing instance and a second virtual computing instance. Relocating the first virtual computing instance reduces at least one penalty score for the set of communicating virtual computing instances.

Type: Application

Filed: September 30, 2016

Publication date: April 5, 2018

Inventors: Po-An Tsai, Sahan Gamage, Rean Griffith

Waist Adhering Material for Paper Diaper

Publication number: 20130046268

Abstract: A low-cost waist adhering material for paper diaper for mass production and easy application comprises a replacement material. The replacement material is composed of a non-textile fabric layer disposed at a top end for connecting with hooks of a paper diaper, and a molded positioning layer disposed underneath the non-textile fabric layer and on a surface of the paper diaper for positioning the non-textile fabric layer. The non-textile fabric layer has at least one through hole area. The through hole area is composed of a plurality of through holes spaced at intervals penetrating through the non-textile fabric layer.

Type: Application

Filed: July 29, 2012

Publication date: February 21, 2013

Inventors: Po-An Tsai, Charng-Ching Ou, Hsu-Feng Shih

Leakage Proof Base Material for Paper Diaper

Publication number: 20120323199

Abstract: A leakage proof base material for paper diaper is provided which is more convenient to use, suitable to be mass produced, more skin-friendly and with a lower cost. The leakage proof base material for paper diaper comprises a leakage proof base material which is composed of a leakage proof membrane as a top layer and a non-textile fabric layer as a bottom layer, and surfaces of the leakage proof membrane and the non-textile fabric layer are combined together partially or entirely. At least one through hole area is disposed on the non-textile fabric layer, the through hole area is composed of a plurality of through holes which are penetrated through the non-textile fabric layer and are spaced at intervals.

Type: Application

Filed: February 8, 2012

Publication date: December 20, 2012

Inventors: Po-An Tsai, Charng-Ching Ou, Hsu-Feng Shih

DIAPER WITH IMPROVED CONJUGATION STRUCTURE

Publication number: 20120095431

Abstract: A diaper having an improved conjugated structure that is convenient, low-cost and easy for mass production is disclosed. The diaper includes a side wing having a magic hook located thereon and a rear thin sheet having an anti-leaking layer and a non-textile fabrics layer conjugated entirely or partially. The conjugation strength between the magic hook and the rear thin sheet is about 100 to 700 g/inch at 180 degrees, and the shear stress at 180 degrees is over 1000 g/inch. With the conjugation between the magic hook and the rear thin sheet, the user can randomly attach the magic hook to the surface of the non-textile fabrics layer to adjust the tightness of the diaper.

Type: Application

Filed: June 20, 2011

Publication date: April 19, 2012

Inventors: Po-An TSAI, Charng-Ching OU, Hsu-Feng SHIH