Patents by Inventor Ken Lueh

Ken Lueh has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10699362
    Abstract: Embodiments provide support for divergent control flow in heterogeneous compute operations on a fused execution unit. On embodiment provides for a processing apparatus comprising a fused execution unit including multiple graphics execution units having a common instruction pointer; logic to serialize divergent function calls by the fused execution unit, the logic configured to compare a call target of execution channels within the fused execution unit and create multiple groups of channels, each group of channels associated with a single call target; and wherein the fused execution unit is to execute a first group of channels via a first execution unit and a second group of channels via a second execution unit.
    Type: Grant
    Filed: June 23, 2016
    Date of Patent: June 30, 2020
    Assignee: INTEL CORPORATION
    Inventors: Pratik J. Ashar, Guei-Yuan Ken Lueh, Kaiyu Chen, Subramaniam Maiyuran, Brent A. Schwartz, Darin M. Starkey
  • Publication number: 20170372446
    Abstract: Embodiments provide support for divergent control flow in heterogeneous compute operations on a fused execution unit. On embodiment provides for a processing apparatus comprising a fused execution unit including multiple graphics execution units having a common instruction pointer; logic to serialize divergent function calls by the fused execution unit, the logic configured to compare a call target of execution channels within the fused execution unit and create multiple groups of channels, each group of channels associated with a single call target; and wherein the fused execution unit is to execute a first group of channels via a first execution unit and a second group of channels via a second execution unit.
    Type: Application
    Filed: June 23, 2016
    Publication date: December 28, 2017
    Applicant: Intel Corporation
    Inventors: Pratik J. Ashar, Guei-Yuan Ken Lueh, Kaiyu Chen, Subramaniam Maiyuran, Brent A. Schwartz, Darin M. Starkey
  • Patent number: 9262166
    Abstract: Various embodiments are directed to a heterogeneous processor architecture comprised of a CPU and a GPU on the same processor die. The heterogeneous processor architecture may optimize source code in a GPU compiler using vector strip mining to reduce instructions of arbitrary vector lengths into GPU supported vector lengths and loop peeling. It may be first determined that the source code is eligible for optimization if more than one machine code instruction of compiled source code under-utilizes GPU instruction bandwidth limitations. The initial vector strip mining results may be discarded and the first iteration of the inner loop body may be peeled out of the loop. The type of operands in the source code may be lowered and the peeled out inner loop body of source code may be vector strip mined again to obtain optimized source code.
    Type: Grant
    Filed: November 30, 2011
    Date of Patent: February 16, 2016
    Assignee: INTEL CORPORATION
    Inventors: Xiaozhu Kang, Biju George, Ken Lueh
  • Publication number: 20130297919
    Abstract: Various embodiments are directed to a heterogeneous processor architecture comprised of a CPU and a GPU on the same processor die. The heterogeneous processor architecture may optimize source code in a GPU compiler using vector strip mining to reduce instructions of arbitrary vector lengths into GPU supported vector lengths and loop peeling. It may be first determined that the source code is eligible for optimization if more than one machine code instruction of compiled source code under-utilizes GPU instruction bandwidth limitations. The initial vector strip mining results may be discarded and the first iteration of the inner loop body may be peeled out of the loop. The type of operands in the source code may be lowered and the peeled out inner loop body of source code may be vector strip mined again to obtain optimized source code.
    Type: Application
    Filed: November 30, 2011
    Publication date: November 7, 2013
    Inventors: Xiaozhu Kang, Biju George, Ken Lueh
  • Patent number: 7953158
    Abstract: A method including providing a stream of content to a processor, transforming kernels within the stream of content through affine modeling, transforming the affine modeled kernels, stream contracting kernel processes, and stream blocking the kernel processes.
    Type: Grant
    Filed: June 30, 2005
    Date of Patent: May 31, 2011
    Assignee: Intel Corporation
    Inventors: Shih-wei Liao, Zhaohui Du, Gansha Wu, Ken Lueh, Zhiwei Ying, Jinzhan Peng
  • Publication number: 20070003161
    Abstract: A method including providing a stream of content to a processor, transforming kernels within the stream of content through affine modeling, transforming the affine modeled kernels, stream contracting kernel processes, and stream blocking the kernel processes.
    Type: Application
    Filed: June 30, 2005
    Publication date: January 4, 2007
    Inventors: Shih-wei Liao, Zhaohui Du, Gansha Wu, Ken Lueh, Zhiwei Ying, Jinzhan Peng
  • Publication number: 20060288338
    Abstract: Translating a virtual machine instruction into a value which when logically combined with a base value yields an address of interpretation code to perform the virtual machine instruction.
    Type: Application
    Filed: June 15, 2005
    Publication date: December 21, 2006
    Inventors: Jinzhan Peng, Ken Lueh, Gansha Wu