Patents by Inventor Yen-Kuang Chen

Yen-Kuang Chen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for efficient bi-linear interpolation and motion compensation

Patent number: 8463837

Abstract: A method and apparatus for performing bi-linear interpolation and motion compensation including multiply-add operations and byte shuffle operations on packed data in a processor. In one embodiment, two or more lines of 2n+1 content byte elements may be shuffled to generate a first and second packed data respectively including at least a first and a second 4n byte elements including 2n-1 duplicated elements. A third packed data including sums of products is generated from the first packed data and packed byte coefficients by a multiply-add instruction. A fourth packed data including sums of products is generated from the second packed data and elements and packed byte coefficients by another multiply-add instruction. Corresponding sums of products of the third and fourth packed data are then summed, and may be rounded and averaged.

Type: Grant

Filed: October 17, 2003

Date of Patent: June 11, 2013

Assignee: Intel Corporation

Inventors: Yen-Kuang Chen, Minerva M. Yeung
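
The arithmetic described in the abstract above (sums of products per line, then a sum across lines with rounding) can be illustrated with a scalar sketch. This is not the patented packed-data implementation: the function name `bilinear_row_pair`, the fixed-point `shift` parameter, and the eighth-pixel coefficient scheme are assumptions made for illustration only.

```python
def bilinear_row_pair(row0, row1, fx, fy, shift=3):
    """Bilinear interpolation between two adjacent pixel rows.

    fx and fy are fractional offsets in units of 1 / (1 << shift).
    All names and the fixed-point format are illustrative assumptions.
    """
    one = 1 << shift
    # Four small integer coefficients, one per neighbouring pixel.
    c00 = (one - fx) * (one - fy)
    c01 = fx * (one - fy)
    c10 = (one - fx) * fy
    c11 = fx * fy
    rounding = 1 << (2 * shift - 1)
    out = []
    for x in range(len(row0) - 1):
        # Sum of products for the upper row (one multiply-add in a SIMD version).
        s0 = row0[x] * c00 + row0[x + 1] * c01
        # Sum of products for the lower row (a second multiply-add).
        s1 = row1[x] * c10 + row1[x + 1] * c11
        # Add the corresponding sums, round, and scale back down.
        out.append((s0 + s1 + rounding) >> (2 * shift))
    return out


result = bilinear_row_pair([10, 20, 30], [30, 40, 50], fx=4, fy=4)
```

With `fx=4` and `fy=4` (a half-pixel offset in both directions when `shift=3`), each output is the rounded-down average of a 2x2 neighbourhood.
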
System and method for memory bandwidth friendly sorting on multi-core architectures

Patent number: 8463820

Abstract: In some embodiments, the invention involves utilizing a tree merge sort in a platform to minimize cache reads/writes when sorting large amounts of data. An embodiment uses blocks of pre-sorted data residing in “leaf nodes” residing in memory storage. A pre-sorted block of data from each leaf node is read from memory and stored in faster cache memory. A tree merge sort is performed on the nodes that are cache resident until a block of data migrates to a root node. Sorted blocks reaching the root node are written to memory storage in an output list until all pre-sorted data blocks have been moved to cache and merged upward to the root. The completed output list in memory storage is a list of the fully sorted data. Other embodiments are described and claimed.

Type: Grant

Filed: May 26, 2009

Date of Patent: June 11, 2013

Assignee: Intel Corporation

Inventors: Jatin Chhugani, Sanjeev Kumar, Anthony-Trung D. Nguyen, Yen-Kuang Chen, Victor W. Lee, William Macy
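
The tree merge described in the abstract above can be sketched as a binary tree of streaming merge nodes whose leaves are pre-sorted blocks. This is only a minimal illustration of the merge structure: it does not model cache residency or block sizes, and the names `merge_node`, `build_tree`, and `tree_merge_sort` are hypothetical.

```python
def merge_node(left, right):
    """Lazily merge two sorted iterators, yielding items in order."""
    done = object()  # sentinel marking an exhausted child
    a = next(left, done)
    b = next(right, done)
    while a is not done and b is not done:
        if a <= b:
            yield a
            a = next(left, done)
        else:
            yield b
            b = next(right, done)
    while a is not done:
        yield a
        a = next(left, done)
    while b is not done:
        yield b
        b = next(right, done)


def build_tree(leaves):
    """Pair nodes level by level until a single root iterator remains."""
    nodes = [iter(leaf) for leaf in leaves]
    if not nodes:
        return iter(())
    while len(nodes) > 1:
        paired = []
        for i in range(0, len(nodes) - 1, 2):
            paired.append(merge_node(nodes[i], nodes[i + 1]))
        if len(nodes) % 2:
            paired.append(nodes[-1])  # odd node moves up unchanged
        nodes = paired
    return nodes[0]


def tree_merge_sort(leaves):
    """Drain the root node into the output list."""
    return list(build_tree(leaves))


merged = tree_merge_sort([[1, 4, 9], [2, 3, 10], [0, 5]])
```

Because each node pulls from its children only when the root asks for the next item, data flows upward incrementally rather than being merged one full level at a time.
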
Shared cache memories for multi-core processors

Patent number: 8417891

Abstract: Embodiments of shared cache memories for multi-core processors are presented. In one embodiment, a cache memory comprises a group of sampling cache sets and a controller to determine a number of misses that occur in the group of sampling cache sets. The controller is operable to determine a victim cache line for a cache set based at least in part on the number of misses.

Type: Grant

Filed: December 15, 2008

Date of Patent: April 9, 2013

Assignee: Intel Corporation

Inventors: Wenlong Li, Yu Chen, Changkyu Kim, Christopher J. Hughes, Yen-Kuang Chen
GATHER AND SCATTER OPERATIONS IN MULTI-LEVEL MEMORY HIERARCHY

Publication number: 20120290799

Abstract: Methods and apparatus relating to gather or scatter operations in a multi-level cache are described. In some embodiments, a logic may determine whether to perform gather or scatter operations at a first memory or a second memory, based in part on a relative performance of performing the gather or scatter operations at the first memory and the second memory. Other embodiments are also described and claimed.

Type: Application

Filed: July 24, 2012

Publication date: November 15, 2012

Inventors: Christopher J. Hughes, Yen-Kuang Chen, Changkyu Kim, Daehyun Kim, Victor W. Lee, Anthony-Trung D. Nguyen, Nadathur Rajagopalan Satish
METHOD AND APPARATUS FOR PROVIDING PREDICTION MODE FINE GRANULARITY SCALABILITY

Publication number: 20120224630

Abstract: In an encoding process, video data are represented as a bitstream of a quantized base layer and at least two enhancement layers, with each picture in each layer identified by a start code. The base layer, plus a number of enhancement layers capable of being transmitted by the communication channel's bandwidth, are transmitted on the communication channel.

Type: Application

Filed: May 16, 2012

Publication date: September 6, 2012

Inventors: Wen-Hsiao Peng, Yen-Kuang Chen
Gather and scatter operations in multi-level memory hierarchy

Patent number: 8230172

Abstract: Methods and apparatus relating to gather or scatter operations in a multi-level cache are described. In some embodiments, a logic may determine whether to perform gather or scatter operations at a first memory or a second memory, based in part on a relative performance of performing the gather or scatter operations at the first memory and the second memory. Other embodiments are also described and claimed.

Type: Grant

Filed: December 3, 2009

Date of Patent: July 24, 2012

Assignee: Intel Corporation

Inventors: Christopher J. Hughes, Yen-Kuang Chen, Changkyu Kim, Daehyun Kim, Victor W. Lee, Anthony-Trung D. Nguyen, Nadathur Rajagopalan Satish
VECTOR CONFLICT INSTRUCTIONS

Publication number: 20120166761

Abstract: A processing core implemented on a semiconductor chip is described having first execution unit logic circuitry that includes first comparison circuitry to compare each element in a first input vector against every element of a second input vector. The processing core also has second execution logic circuitry that includes second comparison circuitry to compare a first input value against every data element of an input vector.

Type: Application

Filed: December 22, 2010

Publication date: June 28, 2012

Inventors: Christopher J. Hughes, Mark J. Charney, Yen-Kuang Chen, Jesus Corbal, Andrew T. Forsyth, Milind B. Girkar, Jonathan C. Hall, Hideki Ido, Robert Valentine, Jeffrey Wiedemeier
MECHANISM FOR CONFLICT DETECTION USING SIMD

Publication number: 20120159130

Abstract: A system and method are configured to detect conflicts when converting scalar processes to parallel processes (“SIMDifying”). Conflicts may be detected for an unordered single index, an ordered single index and/or ordered pairs of indices. Conflicts may be further detected for read-after-write dependencies. Conflict detection is configured to identify operations (i.e., iterations) in a sequence of iterations that may not be done in parallel.

Type: Application

Filed: December 21, 2010

Publication date: June 21, 2012

Inventors: Mikhail Smelyanskiy, Yen-Kuang Chen, Daehyun Kim, Christopher J. Hughes, Victor W. Lee
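
As a rough illustration of the unordered single-index case mentioned in the abstract above, the sketch below checks each loop iteration's index against the indices of earlier iterations, then splits the iterations into groups that could each run in parallel. It is a scalar model, not the claimed SIMD mechanism, and the function names are hypothetical.

```python
def detect_conflicts(indices):
    """For each iteration i, list the earlier iterations using the same index."""
    return [
        [j for j in range(i) if indices[j] == indices[i]]
        for i in range(len(indices))
    ]


def conflict_free_groups(indices):
    """Split iterations, in order, into runs with no repeated index."""
    groups = []
    current, seen = [], set()
    for lane, idx in enumerate(indices):
        if idx in seen:
            # This iteration conflicts with one in the current group,
            # so close the group and start a new one.
            groups.append(current)
            current, seen = [], set()
        current.append(lane)
        seen.add(idx)
    if current:
        groups.append(current)
    return groups


conflicts = detect_conflicts([3, 5, 3, 7, 5])
groups = conflict_free_groups([3, 5, 3, 7, 5])
```

Here iterations 0 and 1 can run together, while iteration 2 reuses index 3 and therefore starts a second group.
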
Method and apparatus for providing prediction mode fine granularity scalability

Patent number: 8199809

Abstract: In an encoding process, video data are represented as a bitstream of a quantized base layer and at least two enhancement layers, with each picture in each layer identified by a start code. The base layer, plus a number of enhancement layers capable of being transmitted by the communication channel's bandwidth, are transmitted on the communication channel.

Type: Grant

Filed: October 14, 2008

Date of Patent: June 12, 2012

Assignee: Intel Corporation

Inventors: Wen-Hsiao Peng, Yen-Kuang Chen
Method and system to increase concurrency and control replication in a multi-core cache hierarchy

Patent number: 8171223

Abstract: A directory of a private cache hierarchy is provided to maintain coherency between data stored in the cache hierarchy, where the directory is to enable concurrent cache-to-cache transfer of data to two private caches from another private cache. This directory can be implemented in a system having a multi-core processor. Other embodiments are described.

Type: Grant

Filed: December 3, 2008

Date of Patent: May 1, 2012

Assignee: Intel Corporation

Inventors: Christopher J. Hughes, Changkyu Kim, Yen-Kuang Chen
Virtual row buffers for use with random access memory

Patent number: 8151012

Abstract: Methods, apparatuses and systems to decrease the energy consumption of a memory chip while increasing its effective bandwidth during the execution of any workload. Methods, apparatuses and systems may allow a memory chip to utilize a plurality of virtual row buffers to respond to requests for data included in a memory array block. Methods, apparatuses and systems may further eliminate or reduce the cost associated with transferring unnecessary data from a memory array block to row buffers by altering the data transfer size between a memory array block and a row buffer.

Type: Grant

Filed: September 25, 2009

Date of Patent: April 3, 2012

Assignee: Intel Corporation

Inventors: Changkyu Kim, Albert Lin, Christopher J. Hughes, Anthony-Trung D. Nguyen, Yen-Kuang Chen, Zeshan A. Chishti, Bryan K. Casper
Scatter-Gather Intelligent Memory Architecture For Unstructured Streaming Data On Multiprocessor Systems

Publication number: 20120042121

Abstract: A scatter/gather technique optimizes unstructured streaming memory accesses, providing off-chip bandwidth efficiency by accessing only useful data at a fine granularity, and off-loading memory access overhead by supporting address calculation, data shuffling, and format conversion.

Type: Application

Filed: October 24, 2011

Publication date: February 16, 2012

Inventors: Daehyun Kim, Christopher J. Hughes, Yen-Kuang Chen, Partha Kundu
Scatter-gather intelligent memory architecture for unstructured streaming data on multiprocessor systems

Patent number: 8074026

Abstract: A scatter/gather technique optimizes unstructured streaming memory accesses, providing off-chip bandwidth efficiency by accessing only useful data at a fine granularity, and off-loading memory access overhead by supporting address calculation, data shuffling, and format conversion.

Type: Grant

Filed: May 10, 2006

Date of Patent: December 6, 2011

Assignee: Intel Corporation

Inventors: Daehyun Kim, Christopher J. Hughes, Yen-Kuang Chen, Partha Kundu
Dynamically re-classifying data in a shared cache

Patent number: 8028129

Abstract: In one embodiment, the present invention includes a method for determining if a state of data is indicative of a first class of data, re-classifying the data from a second class to the first class based on the determination, and moving the data to a first portion of a shared cache associated with a first requester unit based on the re-classification. Other embodiments are described and claimed.

Type: Grant

Filed: July 2, 2009

Date of Patent: September 27, 2011

Assignee: Intel Corporation

Inventors: Christopher J. Hughes, Yen-Kuang Chen
Grouping Pixels to be Textured

Publication number: 20110148896

Abstract: A region or group of pixels may be textured as a unit, using a range specifier and one or more anchor pixels to define the group. In some embodiments, processing grouped pixels improves efficiency.

Type: Application

Filed: December 23, 2009

Publication date: June 23, 2011

Inventors: Victor W. Lee, Ganesh S. Dasika, Mikhail Smelyanskiy, Jose Gonzalez, Changkyu Kim, Jatin Chhugani, Yen-Kuang Chen, Julio Gago, Santiago Galan, Victor Moya Del Barrio
METHODS AND SYSTEMS TO TRAVERSE GRAPH-BASED NETWORKS

Publication number: 20110145184

Abstract: Methods and systems to translate input labels of arcs of a network, corresponding to a sequence of states of the network, to a list of output grammar elements of the arcs, corresponding to a sequence of grammar elements. The network may include a plurality of speech recognition models combined with a weighted finite state machine transducer (WFST). Traversal may include active arc traversal, and may include active arc propagation. Arcs may be processed in parallel, including arcs originating from multiple source states and directed to a common destination state. Self-loops associated with states may be modeled within outgoing arcs of the states, which may reduce synchronization operations. Tasks may be ordered with respect to cache-data locality to associate tasks with processing threads based at least in part on whether another task associated with a corresponding data object was previously assigned to the thread.

Type: Application

Filed: December 14, 2009

Publication date: June 16, 2011

Inventors: Kisun You, Christopher J. Hughes, Yen-Kuang Chen
Technique for tracking shared data in a multi-core processor or multi-processor system

Publication number: 20110138128

Abstract: A technique to track shared information in a multi-core processor or multi-processor system. In one embodiment, core identification information (“core IDs”) are used to track shared information among multiple cores in a multi-core processor or multiple processors in a multi-processor system.

Type: Application

Filed: December 7, 2009

Publication date: June 9, 2011

Inventors: Yen-Kuang Chen, Christopher J. Hughes, Changkyu Kim
GATHER AND SCATTER OPERATIONS IN MULTI-LEVEL MEMORY HIERARCHY

Publication number: 20110138122

Abstract: Methods and apparatus relating to gather or scatter operations in a multi-level cache are described. In some embodiments, a logic may determine whether to perform gather or scatter operations at a first memory or a second memory, based in part on a relative performance of performing the gather or scatter operations at the first memory and the second memory. Other embodiments are also described and claimed.

Type: Application

Filed: December 3, 2009

Publication date: June 9, 2011

Inventors: Christopher J. Hughes, Yen-Kuang Chen, Changkyu Kim, Daehyun Kim, Victor W. Lee, Anthony-Trung D. Nguyen, Nadathur Rajagopalan Satish
Texture Unit for General Purpose Computing

Publication number: 20110134137

Abstract: A texture unit may be utilized to perform general purpose mathematical computations such as dot products. This enables some general purpose computations and operations to be offloaded from a central processing unit to the texture unit. The texture unit may use linear interpolators in order to perform the dot product calculations.

Type: Application

Filed: December 8, 2009

Publication date: June 9, 2011

Inventors: Victor W. Lee, Mikhail Smelyanskiy, Ganesh S. Dasika, Jose Gonzalez, Jatin Chhugani, Yen-Kuang Chen, Changkyu Kim, Julio Gago, Santiago Galan, Victor Moya Del Barrio
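
The abstract above does not say how the linear interpolators are mapped to dot products. One possible mapping, shown purely as an assumption, uses the identity w0*a + w1*b = (w0 + w1) * lerp(a, b, w1 / (w0 + w1)), which holds whenever w0 + w1 is nonzero. The function names are hypothetical.

```python
import math


def lerp(a, b, t):
    """Linear interpolation between a and b."""
    return a + (b - a) * t


def dot2_via_lerp(a, b, w0, w1):
    """Two-term dot product a*w0 + b*w1 expressed through one lerp."""
    total = w0 + w1
    if total == 0:
        raise ValueError("weights must not sum to zero in this sketch")
    return total * lerp(a, b, w1 / total)


def dot_via_lerp(values, weights):
    """Dot product of even-length vectors, two terms per lerp."""
    if len(values) != len(weights) or len(values) % 2:
        raise ValueError("expected two even-length vectors of equal size")
    return sum(
        dot2_via_lerp(values[i], values[i + 1], weights[i], weights[i + 1])
        for i in range(0, len(values), 2)
    )


approx = dot_via_lerp([1.0, 2.0, 3.0, 4.0], [0.5, 0.5, 2.0, 1.0])
exact = 1.0 * 0.5 + 2.0 * 0.5 + 3.0 * 2.0 + 4.0 * 1.0
```

The two results agree up to floating-point rounding. The sketch rejects weight pairs that sum to zero, a case a real design would need to handle separately.
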
VIRTUAL ROW BUFFERS FOR USE WITH RANDOM ACCESS MEMORY

Publication number: 20110078340

Abstract: Methods, apparatuses and systems to decrease the energy consumption of a memory chip while increasing its effective bandwidth during the execution of any workload. Methods, apparatuses and systems may allow a memory chip to utilize a plurality of virtual row buffers to respond to requests for data included in a memory array block. Methods, apparatuses and systems may further eliminate or reduce the cost associated with transferring unnecessary data from a memory array block to row buffers by altering the data transfer size between a memory array block and a row buffer.

Type: Application

Filed: September 25, 2009

Publication date: March 31, 2011

Inventors: Changkyu Kim, Albert Lin, Christopher J. Hughes, Anthony-Trung D. Nguyen, Yen-Kuang Chen, Zeshan A. Chishti, Bryan K. Casper
