Patents by Inventor William C. Hasenplaugh

William C. Hasenplaugh has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Update mask for handling interaction between fills and updates

Patent number: 9251073

Abstract: A multi core processor implements a cash coherency protocol in which probe messages are address-ordered on a probe channel while responses are un-ordered on a response channel. When a first core generates a read of an address that misses in the first core's cache, a line fill is initiated. If a second core is writing the same address, the second core generates an update on the addressed ordered probe channel. The second core's update may arrive before or after the first core's line fill returns. If the update arrived before the fill returned, a mask is maintained to indicate which portions of the line were modified by the update so that the late arriving line fill only modifies portions of the line that were unaffected by the earlier-arriving update.

Type: Grant

Filed: December 31, 2012

Date of Patent: February 2, 2016

Assignee: Intel Corporation

Inventors: Simon C. Steely, William C. Hasenplaugh
Short circuit of probes in a chain

Patent number: 9201792

Abstract: A multi-core processing apparatus may provide a cache probe and data retrieval method. The method may comprise sending a memory request from a requester to a record keeping structure. The memory request may have a memory address of a memory that stores requested data. The method may further comprise determining that a local last accessor of the memory address may have a copy of the requested data up to date with the memory. The local last accessor may be within a local domain that the requester belongs to. The method may further comprise sending a cache probe to the local last accessor and retrieving a latest value of the requested data from the local last accessor to the requester.

Type: Grant

Filed: December 29, 2011

Date of Patent: December 1, 2015

Assignee: Intel Corporation

Inventors: Simon C. Steely, Jr., Samantika Subramaniam, William C. Hasenplaugh, Joel S. Emer
Retrieval of previously accessed data in a multi-core processor

Patent number: 9146871

Abstract: A multi-core processing apparatus may provide a cache probe and data retrieval method. The method may comprise sending a memory request from a requester to a record keeping structure. The memory request may have a memory address of a memory that stores requested data. The method may further comprise determining a last accessor of the memory address, sending a cache probe to the last accessor, determining the last accessor no longer has a copy of the line; and sending a request for the previously accessed version of the line. The request may bypass the tag-directories and obtain the requested data from memory.

Type: Grant

Filed: December 28, 2011

Date of Patent: September 29, 2015

Assignee: Intel Corporation

Inventors: Simon C. Steely, Jr., William C. Hasenplaugh, Joel S. Emer
Variable precision floating point multiply-add circuit

Patent number: 9104474

Abstract: Embodiments of the present invention may provide methods and circuits for energy efficient floating point multiply and/or add operations. A variable precision floating point circuit may determine the certainty of the result of a multiply-add floating point calculation in parallel with the floating-point calculation. The variable precision floating point circuit may use the certainty of the inputs in combination with information from the computation, such as, binary digits that cancel, normalization shifts, and rounding, to perform a calculation of the certainty of the result. A floating point multiplication circuit may determine whether a lowest portion of a multiplication result could affect the final result and may induce a replay of the multiplication operation when it is determined that the result could affect the final result.

Type: Grant

Filed: December 28, 2012

Date of Patent: August 11, 2015

Assignee: Intel Corporation

Inventors: Himanshu Kaul, Mark A. Anders, Sanu K. Mathew, Ram K. Krishnamurthy, William C. Hasenplaugh, Randy L. Allmon, Jonathan Enoch
Efficient support of sparse data structure access

Patent number: 9037804

Abstract: Method and apparatus to efficiently organize data in caches by storing/accessing data of varying sizes in cache lines. A value may be assigned to a field indicating the size of usable data stored in a cache line. If the field indicating the size of the usable data in the cache line indicates a size less than the maximum storage size, a value may be assigned to a field in the cache line indicating which subset of the data in the field to store data is usable data. A cache request may determine whether the size of the usable data in a cache line is equal to the maximum data storage size. If the size of the usable data in the cache line is equal to the maximum data storage size the entire stored data in the cache line may be returned.

Type: Grant

Filed: December 29, 2011

Date of Patent: May 19, 2015

Assignee: Intel Corporation

Inventors: Simon C. Steely, Jr., William C. Hasenplaugh, Joel S. Emer
HARDWARE COMPILATION AND/OR TRANSLATION WITH FAULT DETECTION AND ROLL BACK FUNCTIONALITY

Publication number: 20150046910

Abstract: Hardware compilation and/or translation with fault detection and roll back functionality are disclosed. Compilation and/or translation logic receives programs encoded in one language, and encodes the programs into a second language including instructions to support processor features not encoded into the original language encoding of the programs. In one embodiment, an execution unit executes instructions of the second language including an operation-check instruction to perform a first operation and record the first operation result for a comparison, and an operation-test instruction to perform a second operation and a fault detection operation by comparing the second operation result to the recorded first operation result.

Type: Application

Filed: October 14, 2014

Publication date: February 12, 2015

Inventors: Nicholas Cheng Hwa Chee, Tryggve Fossum, William C. Hasenplaugh
SCALABLE MULTI-LAYER 2D-MESH ROUTERS

Publication number: 20150003281

Abstract: Architectures, apparatus and systems employing scalable multi-layer 2D-mesh routers. A 2D router mesh comprises bi-direction pairs of linked paths coupled between pairs of IO interfaces and configured in a plurality of rows and columns forming a 2D mesh. Router nodes are located at the intersections of the rows and columns, and are configured to forward data units between IO inputs and outputs coupled to the mesh at its edges through use of shortest path routes defined by agents at the IO interfaces. Multiple instances of the 2D meshes may be employed to support bandwidth scaling of the router architecture. One implementation of a multi-layer 2D mesh is built using a standard tile that is tessellated to form a 2D array of standard tiles, with each 2D mesh layer offset and overlaid relative to the other 2D mesh layers. IO interfaces are then coupled to the multi-layer 2D mesh via muxes/demuxes and/or crossbar interconnects.

Type: Application

Filed: June 26, 2013

Publication date: January 1, 2015

Inventors: William C. Hasenplaugh, Tryggve Fossum, Judson S. Leonard
Hardware compilation and/or translation with fault detection and roll back functionality

Patent number: 8893094

Abstract: Hardware compilation and/or translation with fault detection and roll back functionality are disclosed. Compilation and/or translation logic receives programs encoded in one language, and encodes the programs into a second language including instructions to support processor features not encoded into the original language encoding of the programs. In one embodiment, an execution unit executes instructions of the second language including an operation-check instruction to perform a first operation and record the first operation result for a comparison, and an operation-test instruction to perform a second operation and a fault detection operation by comparing the second operation result to the recorded first operation result.

Type: Grant

Filed: December 30, 2011

Date of Patent: November 18, 2014

Assignee: Intel Corporation

Inventors: Nicholas Cheng Hwa Chee, Tryggve Fossum, William C. Hasenplaugh
RETRIEVAL OF PREVIOUSLY ACCESSED DATA IN A MULTI-CORE PROCESSOR

Publication number: 20140215162

Abstract: A multi-core processing apparatus may provide a cache probe and data retrieval method. The method may comprise sending a memory request from a requester to a record keeping structure. The memory request may have a memory address of a memory that stores requested data. The method may further comprise determining a last accessor of the memory address, sending a cache probe to the last accessor, determining the last accessor no longer has a copy of the line; and sending a request for the previously accessed version of the line. The request may bypass the tag-directories and obtain the requested data from memory.

Type: Application

Filed: December 28, 2011

Publication date: July 31, 2014

Inventors: Simon C. Steeley, JR., William C. Hasenplaugh, Joel S. Emer
HIGH BANDWIDTH FULL-BLOCK WRITE COMMANDS

Publication number: 20140201446

Abstract: A micro-architecture may provide a hardware and software of a high bandwidth write command. The micro-architecture may invoke a method to perform the high bandwidth write command. The method may comprise sending a write request from a requester to a record keeping structure. The write request may have a memory address of a memory that stores requested data. The method may further determine copies of the requested data being present in a distributed cache system outside the memory, sending invalidation requests to elements holding copies of the requested data in the distributed cache system, sending a notification to the requester to inform presence of copies of the requested data and sending a write response message after a latest value of the requested data and all invalidation acknowledgements have been received.

Type: Application

Filed: December 28, 2011

Publication date: July 17, 2014

Inventors: Simon C. Steeley, JR., William C. Hasenplaugh, Joel S. Emer, Samantika Subramaniam
UPDATE MASK FOR HANDLING INTERACTION BETWEEN FILLS AND UPDATES

Publication number: 20140189251

Abstract: A multi core processor implements a cash coherency protocol in which probe messages are address-ordered on a probe channel while responses are un-ordered on a response channel. When a first core generates a read of an address that misses in the first core's cache, a line fill is initiated. If a second core is writing the same address, the second core generates an update on the addressed ordered probe channel. The second core's update may arrive before or after the first core's line fill returns. If the update arrived before the fill returned, a mask is maintained to indicate which portions of the line were modified by the update so that the late arriving line fill only modifies portions of the line that were unaffected by the earlier-arriving update.

Type: Application

Filed: December 31, 2012

Publication date: July 3, 2014

Inventors: Simon C. Steely, JR., William C. Hasenplaugh
DISTRIBUTED POWER MANAGEMENT FOR MULTI-CORE PROCESSORS

Publication number: 20140189413

Abstract: A system and method for performing distributed power control in a processor comprising an array of cores enables each core to regulate power at least partially independently. Global power management settings are made accessible to all cores and communication between cores propagates power consumption information between nearest neighbors in the array. Each core attempts to best regulate its own power consumption in accordance with global power consumption information and/or specific instructions from a global power manager. In this manner local opportunistic load balancing may be achieved in a scalable manner suitable for a large array of cores.

Type: Application

Filed: December 31, 2012

Publication date: July 3, 2014

Inventors: William C. Hasenplaugh, Tryggve Fossum
VARIABLE PRECISION FLOATING POINT MULTIPLY-ADD CIRCUIT

Publication number: 20140188968

Abstract: Embodiments of the present invention may provide methods and circuits for energy efficient floating point multiply and/or add operations. A variable precision floating point circuit may determine the certainty of the result of a multiply-add floating point calculation in parallel with the floating-point calculation. The variable precision floating point circuit may use the certainty of the inputs in combination with information from the computation, such as, binary digits that cancel, normalization shifts, and rounding, to perform a calculation of the certainty of the result. A floating point multiplication circuit may determine whether a lowest portion of a multiplication result could affect the final result and may induce a replay of the multiplication operation when it is determined that the result could affect the final result.

Type: Application

Filed: December 28, 2012

Publication date: July 3, 2014

Inventors: Himanshu KAUL, Mark A. ANDERS, Sanu K. MATHEW, Ram K. KRISHNAMURTHY, William C. HASENPLAUGH, Randy L. ALLMON, Jonathan ENOCH
DOMAIN STATE

Publication number: 20140052920

Abstract: Method and apparatus to efficiently maintain cache coherency by reading/writing a domain state field associated with a tag entry within a cache tag directory. A value may be assigned to a domain state field of a tag entry in a cache tag directory. The cache tag directory may belong to a hierarchy of cache tag directories. Each tag entry may be associated with a cache line from a cache belonging to a first domain. The first domain may contain multiple caches. The value of the domain state field may indicate whether its associated cache line can be read or changed.

Type: Application

Filed: December 29, 2011

Publication date: February 20, 2014

Inventors: Simon C. Steely, JR., William C. Hasenplaugh, Joel S. Emer
DATA CONTROL USING LAST ACCESSOR INFORMATION

Publication number: 20140006716

Abstract: In some implementations, a shared cache structure may be provided for sharing data among a plurality of processor cores. A data structure may be associated with the shared cache structure, and may include a plurality of entries, with each entry corresponding to one of the cache lines in the shared cache. Each entry in the data structure may further include a field to identify a processor core that most recently requested the data of the cache line corresponding to the entry. When a request for a particular cache line is received, a request for the data may be sent to a particular processor core identified in the data structure as the last accessor of the data.

Type: Application

Filed: December 29, 2011

Publication date: January 2, 2014

Inventors: Simon C. Steeley, JR., William C. Hasenplaugh
SIGNATURE BASED HIT-PREDICTING CACHE

Publication number: 20140006717

Abstract: An apparatus may comprise a cache file having a plurality of cache lines and a hit predictor. The hit predictor may contain a table of counter values indexed with signatures that are associated with the plurality of cache lines. The apparatus may fill cache lines into the cache file with either low or high priority. Low priority lines may be chosen to be replaced by a replacement algorithm before high priority lines. In this way, the cache naturally may contain more high priority lines than low priority ones. This priority filling process may improve the performance of most replacement schemes including the best known schemes which are already doing better than LRU.

Type: Application

Filed: June 29, 2012

Publication date: January 2, 2014

Applicant: INTEL CORPORATION

Inventors: Simon C. STEELY, JR., William C. HASENPLAUGH, Aamer JALEEL, Joel S. EMER, Carole-Jean WU
ADDRESS RANGE PRIORITY MECHANISM

Publication number: 20130339621

Abstract: Method and apparatus to efficiently manage data in caches. Data in caches may be managed based on priorities assigned to the data. Data may be requested by a process using a virtual address of the data. The requested data may be assigned a priority by a component in a computer system called an address range priority assigner (ARP). The ARP may assign a particular priority to the requested data if the virtual address of the requested data is within a particular range of virtual addresses. The particular priority assigned may be high priority and the particular range of virtual addresses may be smaller than a cache's capacity.

Type: Application

Filed: December 23, 2011

Publication date: December 19, 2013

Inventors: Simon Steely, JR., Samantika Subramaniam, William C. Hasenplaugh
SHORT CIRCUIT OF PROBES IN A CHAIN

Publication number: 20130326147

Abstract: A multi-core processing apparatus may provide a cache probe and data retrieval method. The method may comprise sending a memory request from a requester to a record keeping structure. The memory request may have a memory address of a memory that stores requested data. The method may further comprise determining that a local last accessor of the memory address may have a copy of the requested data up to date with the memory. The local last accessor may be within a local domain that the requester belongs to. The method may further comprise sending a cache probe to the local last accessor and retrieving a latest value of the requested data from the local last accessor to the requester.

Type: Application

Filed: December 29, 2011

Publication date: December 5, 2013

Inventors: Simon C. Steely, JR., Samantika Subramaniam, William C. Hasenplaugh, Joel S. Emer
EFFICIENT SUPPORT OF SPARSE DATA STRUCTURE ACCESS

Publication number: 20130297883

Abstract: Method and apparatus to efficiently organize data in caches by storing/accessing data of varying sizes in cache lines. A value may be assigned to a field indicating the size of usable data stored in a cache line. If the field indicating the size of the usable data in the cache line indicates a size less than the maximum storage size, a value may be assigned to a field in the cache line indicating which subset of the data in the field to store data is usable data. A cache request may determine whether the size of the usable data in a cache line is equal to the maximum data storage size. If the size of the usable data in the cache line is equal to the maximum data storage size the entire stored data in the cache line may be returned.

Type: Application

Filed: December 29, 2011

Publication date: November 7, 2013

Inventors: Simon C. Steely, JR., William C. Hasenplaugh, Joel S. Emer
HARDWARE COMPILATION AND/OR TRANSLATION WITH FAULT DETECTION AND ROLL BACK FUNCTIONALITY

Publication number: 20130173893

Abstract: Hardware compilation and/or translation with fault detection and roll back functionality are disclosed. Compilation and/or translation logic receives programs encoded in one language, and encodes the programs into a second language including instructions to support processor features not encoded into the original language encoding of the programs. In one embodiment, an execution unit executes instructions of the second language including an operation-check instruction to perform a first operation and record the first operation result for a comparison, and an operation-test instruction to perform a second operation and a fault detection operation by comparing the second operation result to the recorded first operation result.

Type: Application

Filed: December 30, 2011

Publication date: July 4, 2013

Inventors: Nicholas Cheng Hwa Chee, Tryggve Fossum, William C. Hasenplaugh

prev 1 2 3 4 next