Patents by Inventor Yen-Kuang Chen

Yen-Kuang Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8463837
    Abstract: A method and apparatus for performing bi-linear interpolation and motion compensation including multiply-add operations and byte shuffle operations on packed data in a processor. In one embodiment, two or more lines of 2n+1 content byte elements may be shuffled to generate a first and second packed data respectively including at least a first and a second 4n byte elements including 2n?1 duplicated elements. A third packed data including sums of products is generated from the first packed data and packed byte coefficients by a multiply-add instruction. A fourth packed data including sums of products is generated from the second packed data and elements and packed byte coefficients by another multiply-add instruction. Corresponding sums of products of the third and fourth packed data are then summed, and may be rounded and averaged.
    Type: Grant
    Filed: October 17, 2003
    Date of Patent: June 11, 2013
    Assignee: Intel Corporation
    Inventors: Yen-Kuang Chen, Minerva M. Yeung
  • Patent number: 8463820
    Abstract: In some embodiments, the invention involves utilizing a tree merge sort in a platform to minimize cache reads/writes when sorting large amounts of data. An embodiment uses blocks of pre-sorted data residing in “leaf nodes” residing in memory storage. A pre-sorted block of data from each leaf node is read from memory and stored in faster cache memory. A tree merge sort is performed on the nodes that are cache resident until a block of data migrates to a root node. Sorted blocks reaching the root node are written to memory storage in an output list until all pre-sorted data blocks have been moved to cache and merged upward to the root. The completed output list in memory storage is a list of the fully sorted data. Other embodiments are described and claimed.
    Type: Grant
    Filed: May 26, 2009
    Date of Patent: June 11, 2013
    Assignee: Intel Corporation
    Inventors: Jatin Chhugani, Sanjeev Kumar, Anthony-Trung D. Nguyen, Yen-Kuang Chen, Victor W. Lee, William Macy
  • Patent number: 8417891
    Abstract: Embodiments of shared cache memories for multi-core processors are presented. In one embodiment, a cache memory comprises a group of sampling cache sets and a controller to determine a number of misses that occur in the group of sampling cache sets. The controller is operable to determine a victim cache line for a cache set based at least in part on the number of misses.
    Type: Grant
    Filed: December 15, 2008
    Date of Patent: April 9, 2013
    Assignee: Intel Corporation
    Inventors: Wenlong Li, Yu Chen, Changkyu Kim, Christopher J. Hughes, Yen-Kuang Chen
  • Publication number: 20120290799
    Abstract: Methods and apparatus relating to gather or scatter operations in a multi-level cache are described. In some embodiments, a logic may determine whether to perform gather or scatter operations at a first memory or a second memory, based in part on a relative performance of performing the gather or scatter operations at the first memory and the second memory. Other embodiments are also described and claimed.
    Type: Application
    Filed: July 24, 2012
    Publication date: November 15, 2012
    Inventors: Christopher J. Hughes, Yen-Kuang Chen, Changkyu Kim, Daehyun Kim, Victor W. Lee, Anthony-Trung D. Nguyen, Nadathur Rajagopalan Satish
  • Publication number: 20120224630
    Abstract: In an encoding process, video data are represented as a bitstream of a quantized base layer and at least two enhancement layers, with each picture in each layer identified by a start code. The base layer, plus a number of enhancement layers capable of being transmitted by the communication channel's bandwidth, are transmitted on the communication channel.
    Type: Application
    Filed: May 16, 2012
    Publication date: September 6, 2012
    Inventors: Wen-Hsiao Peng, Yen-Kuang Chen
  • Patent number: 8230172
    Abstract: Methods and apparatus relating to gather or scatter operations in a multi-level cache are described. In some embodiments, a logic may determine whether to perform gather or scatter operations at a first memory or a second memory, based in part on a relative performance of performing the gather or scatter operations at the first memory and the second memory. Other embodiments are also described and claimed.
    Type: Grant
    Filed: December 3, 2009
    Date of Patent: July 24, 2012
    Assignee: Intel Corporation
    Inventors: Christopher J. Hughes, Yen-Kuang Chen, Changkyu Kim, Daehyun Kim, Victor W. Lee, Anthony-Trung D. Nguyen, Nadathur Rajagopalan Satish
  • Publication number: 20120166761
    Abstract: A processing core implemented on a semiconductor chip is described having first execution unit logic circuitry that includes first comparison circuitry to compare each element in a first input vector against every element of a second input vector. The processing core also has second execution logic circuitry that includes second comparison circuitry to compare a first input value against every data element of an input vector.
    Type: Application
    Filed: December 22, 2010
    Publication date: June 28, 2012
    Inventors: Christopher J. Hughes, Mark J. Charney, Yen-Kuang Chen, Jesus Corbal, Andrew T. Forsyth, Milind B. Girkar, Jonathan C. Hall, Hideki Ido, Robert Valentine, Jeffrey Wiedemeier
  • Publication number: 20120159130
    Abstract: A system and method are configured to detect conflicts when converting scalar processes to parallel processes (“SIMDifying”). Conflicts may be detected for an unordered single index, an ordered single index and/or ordered pairs of indices. Conflicts may be further detected for read-after-write dependencies. Conflict detection is configured to identify operations (i.e., iterations) in a sequence of iterations that may not be done in parallel.
    Type: Application
    Filed: December 21, 2010
    Publication date: June 21, 2012
    Inventors: Mikhail Smelyanskiy, Yen-Kuang Chen, Daehyun Kim, Christopher J. Hughes, Victor W. Lee
  • Patent number: 8199809
    Abstract: In an encoding process, video data are represented as a bitstream of a quantized base layer and at least two enhancement layers, with each picture in each layer identified by a start code. The base layer, plus a number of enhancement layers capable of being transmitted by the communication channel's bandwidth, are transmitted on the communication channel.
    Type: Grant
    Filed: October 14, 2008
    Date of Patent: June 12, 2012
    Assignee: Intel Corporation
    Inventors: Wen-Hsiao Peng, Yen-Kuang Chen
  • Patent number: 8171223
    Abstract: A directory of a private cache hierarchy is provided to maintain coherency between data stored in the cache hierarchy, where the directory is to enable concurrent cache-to-cache transfer of data to two private caches from another private cache. This directory can be implemented in a system having a multi-core processor. Other embodiments are described.
    Type: Grant
    Filed: December 3, 2008
    Date of Patent: May 1, 2012
    Assignee: Intel Corporation
    Inventors: Christopher J. Hughes, Changkyu Kim, Yen-Kuang Chen
  • Patent number: 8151012
    Abstract: Methods, apparatuses and systems to decrease the energy consumption of a memory chip while increasing its effect bandwidth during the execution of any workload. Methods, apparatuses and systems may allow a memory chip utilize a plurality of virtual row buffers to respond to requests for data included in a memory array block. Methods, apparatuses and systems may further eliminate or reduce the cost associated with transferring unnecessary data from a memory array block to row buffers by altering the data transfer size between a memory array block and a row buffer.
    Type: Grant
    Filed: September 25, 2009
    Date of Patent: April 3, 2012
    Assignee: Intel Corporation
    Inventors: Changkyu Kim, Albert Lin, Christopher J. Hughes, Anthony-Trung D. Nguyen, Yen-Kuang Chen, Zeshan A. Chishti, Bryan K. Casper
  • Publication number: 20120042121
    Abstract: A scatter/gather technique optimizes unstructured streaming memory accesses, providing off-chip bandwidth efficiency by accessing only useful data at a fine granularity, and off-loading memory access overhead by supporting address calculation, data shuffling, and format conversion.
    Type: Application
    Filed: October 24, 2011
    Publication date: February 16, 2012
    Inventors: Daehyun Kim, Christopher J. Hughes, Yen-Kuang Chen, Partha Kundu
  • Patent number: 8074026
    Abstract: A scatter/gather technique optimizes unstructured streaming memory accesses, providing off-chip bandwidth efficiency by accessing only useful data at a fine granularity, and off-loading memory access overhead by supporting address calculation, data shuffling, and format conversion.
    Type: Grant
    Filed: May 10, 2006
    Date of Patent: December 6, 2011
    Assignee: Intel Corporation
    Inventors: Daehyun Kim, Christopher J. Hughes, Yen-Kuang Chen, Partha Kundu
  • Patent number: 8028129
    Abstract: In one embodiment, the present invention includes a method for determining if a state of data is indicative of a first class of data, re-classifying the data from a second class to the first class based on the determination, and moving the data to a first portion of a shared cache associated with a first requester unit based on the re-classification. Other embodiments are described and claimed.
    Type: Grant
    Filed: July 2, 2009
    Date of Patent: September 27, 2011
    Assignee: Intel Corporation
    Inventors: Christopher J. Hughes, Yen-Kuang Chen
  • Publication number: 20110148896
    Abstract: A region or group of pixels may be textured as a unit, using a range specifier and one or more anchor pixels to define the group. In some embodiments, processing grouped pixels improves efficiency.
    Type: Application
    Filed: December 23, 2009
    Publication date: June 23, 2011
    Inventors: Victor W. Lee, Ganesh S. Dasika, Mikhail Smelyanskiy, Jose Gonzalez, Changkyu Kim, Jatin Chhugani, Yen-Kuang Chen, Julio Gago, Santiago Galan, Victor Moya Del Barrio
  • Publication number: 20110145184
    Abstract: Methods and systems to translate input labels of arcs of a network, corresponding to a sequence of states of the network, to a list of output grammar elements of the arcs, corresponding to a sequence of grammar elements. The network may include a plurality of speech recognition models combined with a weighted finite state machine transducer (WFST). Traversal may include active arc traversal, and may include active arc propagation. Arcs may be processed in parallel, including arcs originating from multiple source states and directed to a common destination state. Self-loops associated with states may be modeled within outgoing arcs of the states, which may reduce synchronization operations. Tasks may be ordered with respect to cache-data locality to associate tasks with processing threads based at least in part on whether another task associated with a corresponding data object was previously assigned to the thread.
    Type: Application
    Filed: December 14, 2009
    Publication date: June 16, 2011
    Inventors: Kisun You, Christopher J. Hughes, Yen-Kuang Chen
  • Publication number: 20110138128
    Abstract: A technique to track shared information in a multi-core processor or multi-processor system. In one embodiment, core identification information (“core IDs”) are used to track shared information among multiple cores in a multi-core processor or multiple processors in a multi-processor system.
    Type: Application
    Filed: December 7, 2009
    Publication date: June 9, 2011
    Inventors: Yen-Kuang Chen, Christopher J. Hughes, Changkyn Kim
  • Publication number: 20110138122
    Abstract: Methods and apparatus relating to gather or scatter operations in a multi-level cache are described. In some embodiments, a logic may determine whether to perform gather or scatter operations at a first memory or a second memory, based in part on a relative performance of performing the gather or scatter operations at the first memory and the second memory. Other embodiments are also described and claimed.
    Type: Application
    Filed: December 3, 2009
    Publication date: June 9, 2011
    Inventors: Christopher J. Hughes, Yen-Kuang Chen, Changkyu Kim, Daehyun Kim, Victor W. Lee, Anthony-Trung D. Nguyen, Nadathur Rajagopalan Satish
  • Publication number: 20110134137
    Abstract: A texture unit may be used utilized to perform general purpose mathematical computations such as dot products. This enables some general purpose computations and operations to be offloaded from a central processing unit to the texture unit. The texture unit may use linear interpolators in order to perform the dot product calculations.
    Type: Application
    Filed: December 8, 2009
    Publication date: June 9, 2011
    Inventors: Victor W. Lee, Mikhail Smelyanskiy, Ganesh S. Dasika, Jose Gonzalez, Jatin Chhugani, Yen-Kuang Chen, Changkyu Kim, Julio Gago, Santiago Galan, Victor Moya Del Barrio
  • Publication number: 20110078340
    Abstract: Methods, apparatuses and systems to decrease the energy consumption of a memory chip while increasing its effect bandwidth during the execution of any workload. Methods, apparatuses and systems may allow a memory chip utilize a plurality of virtual row buffers to respond to requests for data included in a memory array block. Methods, apparatuses and systems may further eliminate or reduce the cost associated with transferring unnecessary data from a memory array block to row buffers by altering the data transfer size between a memory array block and a row buffer.
    Type: Application
    Filed: September 25, 2009
    Publication date: March 31, 2011
    Inventors: CHANGKYU KIM, Albert Lin, Christopher J. Hughes, Anthony-Trung D. Nguyen, Yen-Kuang Chen, Zeshan A. Chishti, Bryan K. Casper