Patents by Inventor Shu-Hsuan Chou

Shu-Hsuan Chou has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10921876
    Abstract: Power and performance of a multi-core system is managed dynamically by adjusting power table indices at runtime. Runtime statistics is measured, when an application is executed on a first core of a first type at a first operating point (OPP) in a first time period, and on a second core of a second core type at a second OPP in a second time period. A controller estimates, based on the runtime statistics, a first pair of indices associated with a first OPP for the first core and a second pair of indices associated with a second OPP for the second core. During runtime, the controller incorporates the first pair of indices and the second pair of indices into power table indices; and determines, from the power table indices, selected indices associated with a selected OPP of a core of a selected core type for executing the application.
    Type: Grant
    Filed: April 26, 2018
    Date of Patent: February 16, 2021
    Assignee: MediaTek Inc.
    Inventors: Jih-Ming Hsu, Tai-Hua Lu, Pei-Yu Huang, Chien-Yuan Lai, Shu-Hsuan Chou, I-Cheng Cheng, Yun-Ching Li, Ming Hsien Lee
  • Publication number: 20190332157
    Abstract: Power and performance of a multi-core system is managed dynamically by adjusting power table indices at runtime. Runtime statistics is measured, when an application is executed on a first core of a first type at a first operating point (OPP) in a first time period, and on a second core of a second core type at a second OPP in a second time period. A controller estimates, based on the runtime statistics, a first pair of indices associated with a first OPP for the first core and a second pair of indices associated with a second OPP for the second core. During runtime, the controller incorporates the first pair of indices and the second pair of indices into power table indices; and determines, from the power table indices, selected indices associated with a selected OPP of a core of a selected core type for executing the application.
    Type: Application
    Filed: April 26, 2018
    Publication date: October 31, 2019
    Inventors: Jih-Ming Hsu, Tai-Hua Lu, Pei-Yu Huang, Chien-Yuan Lai, Shu-Hsuan Chou, I-Cheng Cheng, Yun-Ching Li, Ming Hsien Lee
  • Publication number: 20140215284
    Abstract: A dynamic scaling processor device and processing method thereof, having a timing decoder, a multi-cycle controller, a correction flip-flop. The timing decoder is provided with a plurality of cycles therein, to receive a plurality of instructions, to select corresponding cycles as its predetermined cycles based on type of each instruction, and output the predetermined cycles and its corresponding instructions to the multi-cycle controller. The multi-cycle controller computes results of the instructions based on the predetermined cycles or a single cycle, and outputs them to the correction flip-flop. The error detection flip-flop utilizes a first clock signal and a stalled second clock signal, to sample a same result, and correct the results when outcomes of samplings are different.
    Type: Application
    Filed: August 6, 2013
    Publication date: July 31, 2014
    Applicant: National Chung Cheng University
    Inventors: Tien-Fu CHEN, Shu-Hsuan CHOU, Po-Hao WANG, Yung-Hui YU
  • Publication number: 20110307741
    Abstract: A non-intrusive debugging framework for parallel software based on a super multi-core framework is composed of a plurality of core clusters. Each of the core clusters includes a plurality of core processors and a debug node. Each of the core processors includes a DCP. The DCPs and the debug node are interconnected via at least one channel to constitute a communication network inside each of the core clusters. The core clusters are interconnected via a ring network. In this way, the memory inside each of the debug nodes constitutes a non-uniform debug memory space for debugging without affecting execution of the parallel program, such that it is applicable to current diversified dynamic debugging methods under the super multi-core system.
    Type: Application
    Filed: October 14, 2010
    Publication date: December 15, 2011
    Applicant: NATIONAL CHUNG CHENG UNIVERSITY
    Inventors: Tien-Fu Chen, Che-Neng Wen, Shu-Hsuan Chou, Yen-Lan Hsu
  • Patent number: 7987313
    Abstract: A hierarchical ring architecture is constructed with on-chip networks. The on-chip network includes two type-0 ring nodes and two type-1 ring nodes. Multiple data transfer is provided in parallel between multiple processor cores or multiple functional units and register banks with a dynamic configuration. A low control complexity, an optimized local bandwidth, an optimized remote node path, a low routing complexity, and a simplified circuit is thus obtained.
    Type: Grant
    Filed: February 11, 2008
    Date of Patent: July 26, 2011
    Assignee: National Chung Cheng University
    Inventors: Shu-Hsuan Chou, Ming-Ku Chang, Yi-Chao Chan, Tien-Fu Chen
  • Patent number: 7917793
    Abstract: The present invention uses a swing structure to avoid using a clock period at a non-efficient execution time. The execution time is precisely controlled to enhance a performance of a processor using a low voltage. Thus, synchronization problems in a chip under different environments are solved for high reliability.
    Type: Grant
    Filed: February 11, 2008
    Date of Patent: March 29, 2011
    Assignee: National Chung Cheng University
    Inventors: Shu-Hsuan Chou, Yi-Chao Chan, Ming-Ku Chang, Tien-Fu Chen
  • Publication number: 20100287400
    Abstract: The present invention uses a swing structure to avoid using a clock period at a non-efficient execution time. The execution time is precisely controlled to enhance a performance of a processor using a low voltage. Thus, synchronization problems in a chip under different environments are solved for high reliability.
    Type: Application
    Filed: February 11, 2008
    Publication date: November 11, 2010
    Applicant: National Chung Cheng University
    Inventors: Shu-Hsuan Chou, Yi-Chao Chan, Ming-Ku Chang, Tien-Fu Chen
  • Publication number: 20100287326
    Abstract: A hierarchical ring architecture is constructed with on-chip networks. The on-chip network includes two type-0 ring nodes and two type-1 ring nodes. The present invention provides multiple data transfer in parallel between multiple processor cores or multiple function units and register banks with dynamic configuration. The present invention thus obtains a low control complexity, an optimized local bandwidth, an optimized remote node path, a low routing complexity and a simplified circuit.
    Type: Application
    Filed: February 11, 2008
    Publication date: November 11, 2010
    Applicant: National Chung Cheng University
    Inventors: Shu-Hsuan Chou, Ming-Ku Chang, Yi-Chao Chan, Tien-Fu Chen
  • Publication number: 20080046689
    Abstract: A cooperative multithreading architecture includes an instruction cache, capable of providing a micro-VLIW instruction; a first cluster, connects to the instruction cache to fetch the micro-VLIW instruction; and a second cluster, connects to the instruction cache to fetch the micro-VLIW instruction and capable of execution acceleration. The second cluster includes a second front-end module, connects to the instruction cache and capable of requesting and dispatching the micro-VLIW instruction; a helper dynamic scheduler, connects to the second front-end module and capable of dispatching the micro-VLIW instruction; a non-shared data path, connects to the second front-end module and capable of providing a wider data path; and a shared data path, connected to the helper dynamic scheduler and capable of assisting a control part of the non-shared data path. The first cluster and the second cluster carry out execution of the respective micro-instructions in parallel.
    Type: Application
    Filed: August 21, 2006
    Publication date: February 21, 2008
    Inventors: Tien-Fu Chen, Shu-Hsuan Chou, Chieh-Jen Cheng, Zhi-Heng Kang
  • Publication number: 20070234015
    Abstract: An apparatus and method of providing flexible load and store for multimedia applications are provided by the present invention, which comprising a register file, a load and store unit, a memory, a selective maskable permutable and collector load module (SMPCKM), and a control unit. The load and store unit includes a selective permutable and scatter store module (SPSSM), which can perform selective, permutable, and scatter store operation. Driving control signals by the control unit to control the operation state. With the present invention, permuting data could be efficient. The source data could be permuted arbitrarily with different operation modes according to the load and store characteristic, and then stored the source data to destination location. Moreover, the use of the load and store unit can reduce burden of performing permutable operation which needs extra instructions, such that performance can be enhanced.
    Type: Application
    Filed: March 6, 2007
    Publication date: October 4, 2007
    Inventors: Tien-Fu Chen, Chih-Heng Kang, Shu-Hsuan Chou