Patents Assigned to ETA SCALE AB

System and method for dynamic enforcement of store atomicity

Patent number: 11334485

Abstract: A computer system for dynamic enforcement of store atomicity includes multiple processor cores, local cache memory for each processor core, a shared memory, a separate store buffer for each processor core for executed stores that are not yet performed and a coherence mechanism. A first processor core load on a first processor core receives a value at a first time from a first processor core store in the store buffer and prevents any other first processor core load younger than the first processor core load in program order from committing until a second time when the first processor core store is performed. Between the first time and the second time any load younger in program load than the first processor core load and having an address matched by coherence invalidation or an address matched by an eviction is squashed.

Type: Grant

Filed: December 16, 2019

Date of Patent: May 17, 2022

Assignee: ETA SCALE AB

Inventors: Stefanos Kaxiras, Alberto Ros
System and method for event monitoring in cache coherence protocols without explicit invalidations

Patent number: 11237966

Abstract: Synchronization events associated with cache coherence are monitored without using invalidations. A callback-read is issued to a memory address associated with the synchronization event, which callback-read either reads the last value written in the memory address or blocks until a next write takes place in the memory address and reads a newly written value.

Type: Grant

Filed: June 28, 2019

Date of Patent: February 1, 2022

Assignee: ETA SCALE AB

Inventors: Stefanos Kaxiras, Alberto Ros
System and method for self-invalidation, self-downgrade cachecoherence protocols

Patent number: 11188464

Abstract: Methods and systems for self-invalidating cachelines in a computer system having a plurality of cores are described. A first one of the plurality of cores, requests to load a memory block from a cache memory local to the first one of the plurality of cores, which request results in a cache miss. This results in checking a read-after-write detection structure to determine if a race condition exists for the memory block. If a race condition exists for the memory block, program order is enforced by the first one of the plurality of cores at least between any older loads and any younger loads with respect to the load that detects the prior store in the first one of the plurality of cores that issued the load of the memory block and causing one or more cache lines in the local cache memory to be self-invalidated.

Type: Grant

Filed: December 11, 2019

Date of Patent: November 30, 2021

Assignee: ETA SCALE AB

Inventors: Alberto Ros, Stefanos Kaxiras
Systems and methods for invisible speculative execution

Patent number: 11163576

Abstract: A system and method for efficiently preventing visible side-effects in the memory hierarchy during speculative execution is disclosed. Hiding the side-effects of executed instructions in the whole memory hierarchy is both expensive, in terms of performance and energy, and complicated. A system and method is disclosed to hide the side-effects of speculative loads in the cache(s) until the earliest time these speculative loads become non-speculative. A refinement is disclosed where loads that hit in the L1 cache are allowed to proceed by keeping their side-effects on the L1 cache hidden until these loads become non-speculative, and all other speculative loads that miss in the cache(s) are prevented from executing until they become non-speculative. To limit the performance deterioration caused by these delayed loads, a system and method is disclosed that augments the cache(s) with a value predictor or a re-computation engine that supplies predicted or recomputed values to the loads that missed in the cache(s).

Type: Grant

Filed: March 20, 2020

Date of Patent: November 2, 2021

Assignee: ETA SCALE AB

Inventors: Christos Sakalis, Stefanos Kaxiras, Alberto Ros, Alexandra Jimborean, Magnus Själander
Systems and methods for non-speculative store coalescing and generating atomic write sets using address subsets

Patent number: 11119920

Abstract: A method for performing store buffer coalescing in a multiprocessor computer system includes forming, in a coalescing store buffer associated with a core in said multiprocessor system, an atomic group of writes; and performing each individual write in said atomic group in an order which is a function of an address in a memory system to which each of the writes in said atomic group are being written.

Type: Grant

Filed: April 18, 2019

Date of Patent: September 14, 2021

Assignee: ETA SCALE AB

Inventors: Alberto Ros, Stefanos Kaxiras
System and method for non-speculative reordering of load accesses

Patent number: 11106468

Abstract: Methods and systems for maintaining validity of a memory model in a multiple core computer system are described. A first core prevents a store instruction from being performed by another core until a condition is met which enables reordered instructions to validly execute.

Type: Grant

Filed: May 23, 2018

Date of Patent: August 31, 2021

Assignee: ETA SCALE AB

Inventors: Alberto Ros, Stefanos Kaxiras
Multi-core computer systems with private/shared cache line indicators

Patent number: 11068410

Abstract: According to embodiments described herein, the hierarchical complexity for coherence protocols associated with clustered cache architectures can be encapsulated in a simple function, i.e., that of determining when a data block is shared entirely within a cluster (i.e., a sub-tree of the hierarchy) and is private from the outside. This allows embodiments to eliminate complex recursive coherence operations that span the hierarchy and instead employ simple coherence mechanisms such as self-invalidation and write-through but which are restricted to operate where a data block is shared. Thus embodiments recognize that, in the context of clustered cache hierarchies, data can be shared entirely within one cluster but can be private (unshared) to this cluster when viewed from the perspective of other clusters. This characteristic of the data can be determined and then used to locally simplify coherence protocols.

Type: Grant

Filed: March 4, 2019

Date of Patent: July 20, 2021

Assignee: ETA SCALE AB

Inventors: Alberto Ros, Stefanos Kaxiras
System and method for self-invalidation, self-downgrade cachecoherence protocols

Patent number: 10528471

Abstract: Methods and systems for self-invalidating cachelines in a computer system having a plurality of cores are described. A first one of the plurality of cores, requests to load a memory block from a cache memory local to the first one of the plurality of cores, which request results in a cache miss. This results in checking a read-after-write detection structure to determine if a race condition exists for the memory block. If a race condition exists for the memory block, program order is enforced by the first one of the plurality of cores at least between any older loads and any younger loads with respect to the load that detects the prior store in the first one of the plurality of cores that issued the load of the memory block and causing one or more cache lines in the local cache memory to be self-invalidated.

Type: Grant

Filed: December 27, 2017

Date of Patent: January 7, 2020

Assignee: ETA SCALE AB

Inventors: Alberto Ros, Stefanos Kaxiras
System and method for event monitoring in cache coherence protocols without explicit invalidations

Patent number: 10387312

Abstract: Synchronization events associated with cache coherence are monitored without using invalidations. A callback-read is issued to a memory address associated with the synchronization event, which callback-read either reads the last value written in the memory address or blocks until a next write takes place in the memory address and reads a newly written value.

Type: Grant

Filed: January 2, 2015

Date of Patent: August 20, 2019

Assignee: ETA SCALE AB

Inventors: Stefanos Kaxiras, Alberto Ros
Systems and methods for coherence in clustered cache hierarchies

Patent number: 10324861

Abstract: According to embodiments described herein, the hierarchical complexity for coherence protocols associated with clustered cache architectures can be encapsulated in a simple function, i.e., that of determining when a data block is shared entirely within a cluster (i.e., a sub-tree of the hierarchy) and is private from the outside. This allows embodiments to eliminate complex recursive coherence operations that span the hierarchy and instead employ simple coherence mechanisms such as self-invalidation and write-through but which are restricted to operate where a data block is shared. Thus embodiments recognize that, in the context of clustered cache hierarchies, data can be shared entirely within one cluster but can be private (unshared) to this cluster when viewed from the perspective of other clusters. This characteristic of the data can be determined and then used to locally simplify coherence protocols.

Type: Grant

Filed: February 4, 2016

Date of Patent: June 18, 2019

Assignee: ETA SCALE AB

Inventors: Alberto Ros, Stefanos Kaxiras