With Shared Cache (EPO) Patents (Class 711/E12.038)
  • Publication number: 20130111121
    Abstract: In one embodiment, the present invention is directed to a processor having a plurality of cores and a cache memory coupled to the cores and including a plurality of partitions. The processor can further include a logic to dynamically vary a size of the cache memory based on a memory boundedness of a workload executed on at least one of the cores. Other embodiments are described and claimed.
    Type: Application
    Filed: October 31, 2011
    Publication date: May 2, 2013
    Inventors: Avinash N. Ananthakrishnan, Efraim Rotem, Eliezer Weissmann, Doron Rajwan, Nadav Shulman, Alon Naveh, Hisham Abu-Salah
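
A sketch for 20130111121: the abstract's resizing logic is a feedback loop that measures how memory-bound the workload is and powers cache partitions up or down. The C++ below is a minimal illustration under an assumed stall-cycle metric and assumed thresholds; none of the names come from the patent.

```cpp
// Sketch of the cache-resizing loop described in 20130111121.
// The stall-ratio metric and thresholds are assumptions, not taken
// from the patent.
#include <cstdio>

struct CacheSizeLogic {
    int active_partitions;            // partitions currently powered on
    static constexpr int kMaxPartitions = 8;

    // stall_ratio: fraction of cycles the core stalls on memory,
    // standing in for "memory boundedness of a workload".
    void adjust(double stall_ratio) {
        if (stall_ratio > 0.5 && active_partitions < kMaxPartitions)
            ++active_partitions;      // memory bound: grow the cache
        else if (stall_ratio < 0.1 && active_partitions > 1)
            --active_partitions;      // compute bound: shrink, save power
    }
};

int main() {
    CacheSizeLogic logic{4};
    for (double r : {0.7, 0.6, 0.05, 0.04}) {
        logic.adjust(r);
        std::printf("stall=%.2f -> partitions=%d\n", r, logic.active_partitions);
    }
}
```
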
  • Publication number: 20130061002
    Abstract: A cache includes a cache pipeline, a request receiver configured to receive off-chip coherency requests from an off-chip cache, and a plurality of state machines coupled to the request receiver. The cache also includes an arbiter, coupled between the plurality of state machines and the cache pipeline, that is configured to give priority to off-chip coherency requests, as well as a counter configured to count the number of coherency requests sent from the cache pipeline to a lower level cache. The cache pipeline is halted from sending coherency requests when the counter exceeds a predetermined limit.
    Type: Application
    Filed: November 7, 2012
    Publication date: March 7, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: International Business Machines Corporation
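
The throttle in 20130061002 reduces to a counter consulted by the cache pipeline. A small sketch follows, with invented names and a software counter standing in for the patent's hardware one.

```cpp
// Sketch of the coherency-request throttle in 20130061002. Names are
// invented; in the patent the counter and halt logic are hardware.
#include <cstdint>

class CoherencyThrottle {
    uint32_t outstanding_ = 0;   // requests sent to the lower level cache
    const uint32_t limit_;       // the "predetermined limit"
public:
    explicit CoherencyThrottle(uint32_t limit) : limit_(limit) {}

    // Consulted by the cache pipeline before sending a request; the
    // pipeline halts once the limit is reached.
    bool may_send() const { return outstanding_ < limit_; }

    void on_send()     { ++outstanding_; }   // request left the pipeline
    void on_complete() { --outstanding_; }   // lower level cache answered
};

int main() {
    CoherencyThrottle t(2);
    t.on_send();
    t.on_send();
    return t.may_send() ? 1 : 0;   // 0: pipeline is halted at the limit
}
```
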
  • Publication number: 20130060997
    Abstract: Various embodiments of the present invention mitigate busy time in a hierarchical store-through memory cache structure. In one embodiment, a cache directory associated with a memory cache is divided into a plurality of portions, each associated with a portion of the memory cache. Simultaneous cache lookup operations and cache write operations between the plurality of portions of the cache directory are supported. Two or more store commands are simultaneously processed in a shared cache pipeline communicatively coupled to the plurality of portions of the cache directory.
    Type: Application
    Filed: October 31, 2012
    Publication date: March 7, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventor: International Business Machines Corporation
  • Publication number: 20130054883
    Abstract: A data storage system includes at least one host device configured to initiate a data request, at least one target device configured to store data, and a serial attached SCSI (SAS) switch coupled between the at least one host device and the at least one target device. The SAS switch includes a cache memory and control programming configured to determine whether data of the data request is at least one of data stored in the cache memory of the SAS switch or data to be written in the cache memory of the SAS switch. The cache memory of the SAS switch is a shared cache that is shared across each of the at least one host device and the at least one target device.
    Type: Application
    Filed: August 26, 2011
    Publication date: February 28, 2013
    Applicant: LSI CORPORATION
    Inventor: Ankit Sihare
  • Publication number: 20130046935
    Abstract: A copy cache feature that can be shared across networked devices is provided. Content added to the copy cache through a "copy", a "like", or similar command on one device may be forwarded to a server providing cloud-based services to a user and/or to another device associated with the user, such that the content can be inserted into the same or other files on other computing devices by the user. In addition to seamless movement of copy cache content across devices, the content may be made available in a context-based and/or sortable manner.
    Type: Application
    Filed: August 18, 2011
    Publication date: February 21, 2013
    Applicant: MICROSOFT CORPORATION
    Inventor: Rajesh Ramanathan
  • Publication number: 20130042065
    Abstract: Methods and systems are presented for custom caching. Application threads define caches. The caches may be accessed through multiple index keys, which are mapped to multiple application thread-defined keys. Methods provide for each index key and each application thread-defined key to be symmetrical. The index keys are used for loading data from one or more data sources into the cache stores on behalf of the application threads. Application threads access the data from the cache store by providing references to the caches and the application-supplied keys. Some data associated with some caches may be shared from the cache store by multiple application threads. Additionally, some caches are exclusively accessed by specific application threads.
    Type: Application
    Filed: September 14, 2012
    Publication date: February 14, 2013
    Applicant: eBay Inc.
    Inventors: Christopher J. Kasten, Greg Seitz
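
The custom-caching scheme of 20130042065 hinges on the mapping between application-defined keys and the index keys used to load from data sources. A minimal sketch, assuming string keys and a single loader callback; the class and method names are invented.

```cpp
// Hypothetical sketch of 20130042065's custom caching: application
// threads address the cache with their own keys, which the cache maps
// to index keys used to load from a backing data source.
#include <functional>
#include <map>
#include <optional>
#include <string>
#include <utility>

class CustomCache {
    std::map<std::string, std::string> app_to_index_;  // app key -> index key
    std::map<std::string, std::string> store_;         // index key -> data
    std::function<std::string(const std::string&)> loader_;
public:
    explicit CustomCache(std::function<std::string(const std::string&)> loader)
        : loader_(std::move(loader)) {}

    void define(const std::string& app_key, const std::string& index_key) {
        app_to_index_[app_key] = index_key;   // symmetric 1:1 mapping
    }

    std::optional<std::string> get(const std::string& app_key) {
        auto it = app_to_index_.find(app_key);
        if (it == app_to_index_.end()) return std::nullopt;
        auto hit = store_.find(it->second);
        if (hit == store_.end())              // miss: load via the index key
            hit = store_.emplace(it->second, loader_(it->second)).first;
        return hit->second;
    }
};

int main() {
    CustomCache cache([](const std::string& k) { return "row:" + k; });
    cache.define("user-profile", "db.users.42");
    auto v = cache.get("user-profile");
    return v ? 0 : 1;
}
```
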
  • Publication number: 20130042071
    Abstract: A method, an apparatus and an article of manufacture for placing at least one object at at least one cache of a set of cooperating caching nodes with limited inter-node communication bandwidth. The method includes transmitting information from the set of cooperating caching nodes regarding object accesses to a placement computation component, determining object popularity distribution based on the object access information, and instructing the set of cooperating caching nodes of at least one object to cache, the at least one node at which each object is to be cached, and a manner in which the at least one cached object is to be shared among the at least one caching node based on the object popularity distribution and cache and object sizes such that a cumulative hit rate at the at least one cache is increased while a constraint on inter-node communication bandwidth is not violated.
    Type: Application
    Filed: August 10, 2011
    Publication date: February 14, 2013
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Malolan Chetlur, Umamaheswari C. Devi, Shivkumar Kalyanaraman
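
The placement computation in 20130042071 can be approximated by a greedy pass over objects sorted by popularity. The sketch below makes that concrete but, for brevity, drops the patent's inter-node bandwidth constraint; all names and numbers are hypothetical.

```cpp
// Hypothetical greedy sketch of the placement step in 20130042071:
// the most popular objects are pinned first, each at the node with the
// most free space. The inter-node bandwidth constraint is omitted.
#include <algorithm>
#include <cstdio>
#include <vector>

struct Object { int id; double popularity; int size; };

int main() {
    std::vector<Object> objects = {{1, 0.5, 2}, {2, 0.3, 1}, {3, 0.2, 2}};
    std::vector<int> free_space = {3, 2};   // capacity left per caching node

    std::sort(objects.begin(), objects.end(),
              [](const Object& a, const Object& b) {
                  return a.popularity > b.popularity;   // hottest first
              });

    for (const Object& o : objects) {
        auto node = std::max_element(free_space.begin(), free_space.end());
        if (*node >= o.size) {
            *node -= o.size;
            std::printf("object %d -> node %ld\n", o.id,
                        (long)(node - free_space.begin()));
        }
    }
}
```
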
  • Publication number: 20130042072
    Abstract: An electronic system, an integrated circuit and a method for display are disclosed. The electronic system contains a first device, a memory and a video/audio compression/decompression device such as a decoder/encoder. The electronic system is configured to allow the first device and the video/audio compression/decompression device to share the memory. The electronic system may be included in a computer in which case the memory is a main memory. Memory access is accomplished by one or more memory interfaces, direct coupling of the memory to a bus, or direct coupling of the first device and decoder/encoder to a bus. An arbiter selectively provides access for the first device and/or the decoder/encoder to the memory based on priority. The arbiter may be monolithically integrated into a memory interface. The decoder may be a video decoder configured to comply with the MPEG-2 standard. The memory may store predicted images obtained from a preceding image.
    Type: Application
    Filed: October 18, 2012
    Publication date: February 14, 2013
    Applicant: STMicroelectronics, Inc.
    Inventor: STMicroelectronics, Inc.
  • Publication number: 20130042070
    Abstract: A data processing system 2 includes a cache hierarchy having a plurality of local cache memories and a shared cache memory 18. State data 30, 32 stored within the shared cache memory 18 on a per cache line basis is used to control whether or not that cache line of data is stored and managed in accordance with non-inclusive operation or inclusive operation of the cache memory system. Snoop transactions are filtered on the basis of data indicating whether or not a cache line of data is unique or non-unique. A switch from non-inclusive operation to inclusive operation may be performed in dependence upon the transaction type of a received transaction requesting a cache line of data.
    Type: Application
    Filed: August 8, 2011
    Publication date: February 14, 2013
    Applicant: ARM LIMITED
    Inventors: Jamshed Jalal, Mark David Werkheiser, Brett Stanley Feero, Michael Alan Filippo
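
The per-line state in 20130042070 is compact: one indication for inclusive vs. non-inclusive handling and one for unique vs. non-unique sharing, the latter driving the snoop filter. A sketch with invented field names; the abstract does not give the encoding.

```cpp
// Sketch of the per-line state 20130042070 keeps in the shared cache.
#include <cstdint>

struct SharedLineState {
    bool inclusive;   // line managed inclusively (also present in L1s)
    bool unique;      // exactly one local cache may hold the line
};

// Snoop filter: a snoop only needs to be forwarded to the local caches
// when the line may be non-uniquely (i.e. widely) shared.
bool must_broadcast_snoop(const SharedLineState& s) {
    return !s.unique;
}

// Certain transaction types (e.g. a read-for-ownership style request)
// can switch the line to inclusive management, as the abstract suggests.
void on_request(SharedLineState& s, bool is_read_for_ownership) {
    if (is_read_for_ownership)
        s.inclusive = true;
}

int main() {
    SharedLineState s{false, true};
    on_request(s, true);
    return must_broadcast_snoop(s) ? 1 : 0;   // 0: snoop can be filtered
}
```
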
  • Publication number: 20130031311
    Abstract: There is provided is an interface apparatus including: a stream converter receiving write-addresses and write-data, storing the received data in a buffer, and sorting the stored write-data in the order of the write-addresses to output the write-data as stream-data; a cache memory storing received stream-data if a load-signal indicates that the stream-data are necessarily loaded and outputting data stored in a storage device corresponding to an input cache-address as cache-data; a controller determining whether or not data allocated with a read-address have already been loaded, outputting the load-signal instructing the loading on the cache memory if not loaded, and outputting a load-address indicating a load-completed-address of the cache memory; and at least one address converter calculating which one of the storage devices the allocated data are stored in, by using the load-address, outputting the calculated value as the cache-address to the cache memory, and outputting the cache-data as read-data.
    Type: Application
    Filed: October 4, 2012
    Publication date: January 31, 2013
    Applicant: Sony Corporation
    Inventor: Sony Corporation
  • Publication number: 20130019064
    Abstract: Embodiments of the present disclosure describe techniques and configurations to reduce power consumption using unmodified information in evicted cache lines. A method includes identifying unmodified information of a cache line stored in a cache of a processor, tracking the unmodified information using a bit vector comprising one or more bits to indicate the unmodified information of the cache line, and selectively suppressing a write operation or send operation for the unmodified information of the cache line that is evicted from the cache to an input/output (I/O) component coupled to the cache, the selective suppressing being based on the one or more bits, and the I/O component being an outer component external to the cache. Other embodiments may be described and/or claimed.
    Type: Application
    Filed: July 11, 2011
    Publication date: January 17, 2013
    Inventors: Mahesh K. Kumashikar, Ashok Jagannathan
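
The mechanism in 20130019064 amounts to a per-word "never modified" vector consulted at eviction. A sketch, assuming an 8-word line; the actual tracking granularity is not given in the abstract.

```cpp
// Sketch of the eviction path in 20130019064: a bit per word records
// which parts of the line were never modified, and those words are not
// written out when the line is evicted. Sizes are hypothetical.
#include <bitset>
#include <cstdio>

constexpr int kWordsPerLine = 8;

struct CacheLine {
    unsigned words[kWordsPerLine];
    std::bitset<kWordsPerLine> unmodified;   // 1 = word never written
};

void write_word(CacheLine& line, int i, unsigned v) {
    line.words[i] = v;
    line.unmodified.reset(i);                // word is now dirty
}

void evict(const CacheLine& line) {
    for (int i = 0; i < kWordsPerLine; ++i) {
        if (line.unmodified.test(i))
            continue;                        // suppress write: saves power
        std::printf("writeback word %d = %u\n", i, line.words[i]);
    }
}

int main() {
    CacheLine line{};
    line.unmodified.set();                   // fresh line: nothing modified
    write_word(line, 3, 42);
    evict(line);                             // only word 3 is written back
}
```
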
  • Publication number: 20130013864
    Abstract: For each access request received at a shared cache of the data processing device, a memory access pattern (MAP) monitor predicts which of the memory banks, and corresponding row buffers, would be accessed by the access request if the requesting thread were the only thread executing at the data processing device. By recording predicted accesses over time for a number of access requests, the MAP monitor develops a pattern of predicted memory accesses by executing threads. The pattern can be employed to assign resources at the shared cache, thereby managing memory more efficiently.
    Type: Application
    Filed: July 6, 2011
    Publication date: January 10, 2013
    Applicant: ADVANCED MICRO DEVICES, INC.
    Inventors: Jaewoong Chung, Shekhar Srikantaiah, Lisa Hsu
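
The MAP monitor of 20130013864 builds, per thread, a record of which memory banks the thread's requests would touch if it ran alone. A sketch under an assumed bank-interleaving function; the real address mapping and the use of row-buffer state are not specified in the abstract.

```cpp
// Hypothetical sketch of the per-thread access-pattern bookkeeping in
// 20130013864: for each miss, predict the DRAM bank and count accesses
// per bank to form the thread's pattern.
#include <cstdint>
#include <cstdio>
#include <map>

constexpr int kBanks = 8;
constexpr int kLineBits = 6;    // 64-byte lines (assumed geometry)

int predicted_bank(uint64_t addr) {
    return (addr >> kLineBits) % kBanks;    // toy bank-interleaving map
}

int main() {
    std::map<int, std::map<int, int>> pattern;   // thread -> bank -> count
    struct { int thread; uint64_t addr; } misses[] = {
        {0, 0x1000}, {0, 0x1040}, {1, 0x9000}, {0, 0x1080},
    };
    for (auto& m : misses)
        ++pattern[m.thread][predicted_bank(m.addr)];

    for (auto& [tid, banks] : pattern)
        for (auto& [bank, n] : banks)
            std::printf("thread %d: bank %d accessed %d times\n", tid, bank, n);
}
```
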
  • Publication number: 20120331232
    Abstract: An apparatus and computer program product for improving performance of a parallel computing system. A first hardware local cache controller associated with a first local cache memory device of a first processor detects an occurrence of a false sharing of a first cache line by a second processor running the program code and allows the false sharing of the first cache line by the second processor. The false sharing of the first cache line occurs upon updating a first portion of the first cache line in the first local cache memory device by the first hardware local cache controller and subsequent updating of a second portion of the first cache line in a second local cache memory device by a second hardware local cache controller.
    Type: Application
    Filed: September 5, 2012
    Publication date: December 27, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Alexandre E. Eichenberger, Alan Gara, Martin Ohmacht, Vijayalakshmi Srinivasan
  • Publication number: 20120317363
    Abstract: There is disclosed a method in which a process is initiated to handle a set of information, which includes one or more resources. In the method the set of information is examined to determine whether the set of information includes a resource stored as a shareable cache element in a memory. If the determination indicates that the set of information includes a resource stored as a shareable cache element, the shareable cache element is used as the resource of the set of information.
    Type: Application
    Filed: May 25, 2011
    Publication date: December 13, 2012
    Applicant: NOKIA CORPORATION
    Inventors: Juha Uola, Johan Wikman
  • Publication number: 20120290794
    Abstract: A method including: receiving multiple local requests to access the cache line; inserting, into an address chain, multiple entries corresponding to the multiple local requests; identifying a first entry at a head of the address chain; initiating, in response to identifying the first entry and in response to the first entry corresponding to a request to own the cache line, a traversal of the address chain; setting, during the traversal of the address chain, a state element identified in a second entry; receiving a foreign request to access the cache line; inserting, in response to setting the state element, a third entry corresponding to the foreign request into the address chain after the second entry; and relinquishing, in response to inserting the third entry after the second entry in the address chain, the cache line to a foreign thread after executing the multiple local requests.
    Type: Application
    Filed: August 29, 2011
    Publication date: November 15, 2012
    Applicant: ORACLE INTERNATIONAL CORPORATION
    Inventors: Connie Wai Mun Cheung, Madhavi Kondapaneni, Joann Yin Lam, Ramaswamy Sivaramakrishnan
  • Publication number: 20120272006
    Abstract: To facilitate dynamic lockstep support, replacement states and/or logic used to select particular cache lines for replacement with new allocations in accord with replacement algorithms or strategies may be enhanced to provide generally independent replacement contexts for use in respective lockstep and performance modes. In some cases, replacement logic that may be otherwise conventional in its selection of cache lines for new allocations in accord with a first-in, first-out (FIFO), round-robin, random, least recently used (LRU), pseudo LRU, or other replacement algorithm/strategy is at least partially replicated to provide lockstep and performance instances that respectively cover lockstep and performance partitions of a cache. In some cases, a unified instance of replacement logic may be reinitialized with appropriate states at (or coincident with) transitions between performance and lockstep modes of operation.
    Type: Application
    Filed: April 19, 2011
    Publication date: October 25, 2012
    Applicant: FREESCALE SEMICONDUCTOR, INC.
    Inventor: William C. Moyer
  • Publication number: 20120265939
    Abstract: The invention relates to a cache memory and method for controlling access to data. According to the invention, a control area which is advantageously formed separate from a data area is provided for controlling the access to data stored in the cache and to be read by applicative processes. The control area includes at least one release area with offsets and data version definition sections. Application to shared memories for client server architectures.
    Type: Application
    Filed: May 3, 2011
    Publication date: October 18, 2012
    Inventors: Virginie Amar, Luc Capanaccia, Guillaume Touffait, Sébastien Pellise, Xavier Leblanc
  • Publication number: 20120265940
    Abstract: Systems and methods for transactional processing within a clustered file system wherein user defined transactions operate on data segments of the file system data. The users are provided with an interface for using a transactional mechanism, namely services for opening, writing, and rolling back transactions. A distributed shared memory technology is utilized to facilitate efficient and coherent cache management within the clustered file system based on the granularity of data segments (rather than files).
    Type: Application
    Filed: June 22, 2012
    Publication date: October 18, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Lior ARONOVICH, Yair TOAFF, Gil PAZ, Ron ASHER
  • Publication number: 20120260041
    Abstract: Embodiments provide a method comprising receiving, at a cache associated with a central processing unit that is disposed on an integrated circuit, a request to perform a cache operation on the cache; in response to receiving and processing the request, determining that first data cached in a first cache line of the cache is to be written to a memory that is coupled to the integrated circuit; identifying a second cache line in the cache, the second cache line being complementary to the first cache line; transmitting a single memory instruction from the cache to the memory to write to the memory (i) the first data from the first cache line and (ii) second data from the second cache line; and invalidating the first data in the first cache line, without invalidating the second data in the second cache line.
    Type: Application
    Filed: April 4, 2012
    Publication date: October 11, 2012
    Inventors: Adi Habusha, Eitan Joshua, Shaul Chapman
  • Publication number: 20120254546
    Abstract: Provided are a method, system, and computer program product for using a migration cache to cache tracks during migration. In response to a migration operation, a determination is made of a first set of tracks in the source storage indicated in an extent list and of a second set of tracks in the extent. The tracks in the source storage in the first set are copied to a migration cache. The tracks in the second set are copied directly from the source storage to the destination storage without buffering in the migration cache. The tracks in the first set are copied from the migration cache to the destination storage. The migration operation is completed in response to copying the first set of tracks from the migration cache to the destination storage and copying the second set of tracks from the source storage to the destination storage.
    Type: Application
    Filed: May 2, 2012
    Publication date: October 4, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: David Montgomery, Todd Charles Sorenson
  • Patent number: 8281076
    Abstract: A storage system coupled to a host computer, including: a non-volatile medium that stores data; a disk cache that temporarily stores data stored in the non-volatile medium, where the disk cache is divided into a plurality of independent disk cache partitions; a control unit that controls an input and an output of data to and from the non-volatile medium; and a memory unit that stores information used by the control unit, including consistency control information setting respective commands permitted for each of the disk cache partitions, to guarantee consistency of the data; wherein the control unit is configured to determine whether or not to execute a requested command for a given disk cache partition, by referring to the consistency control information setting respective commands permitted for each of the disk cache partitions.
    Type: Grant
    Filed: July 9, 2010
    Date of Patent: October 2, 2012
    Assignee: Hitachi, Ltd.
    Inventors: Akiyoshi Hashimoto, Aki Tomita
  • Publication number: 20120246406
    Abstract: A processing system includes a memory and a first core configured to process applications. The first core includes a first cache. The processing system includes a mechanism configured to capture a sequence of addresses of the application that miss the first cache in the first core and to place the sequence of addresses in a storage array; and a second core configured to process at least one software algorithm. The at least one software algorithm utilizes the sequence of addresses from the storage array to generate a sequence of prefetch addresses. The second core issues prefetch requests for the sequence of the prefetch addresses to the memory to obtain prefetched data and the prefetched data is provided to the first core if requested.
    Type: Application
    Filed: June 4, 2012
    Publication date: September 27, 2012
    Applicant: INTERNATIONAL BUSINESS MACHINES CORPORATION
    Inventors: Gordon Bernard Bell, Gordon Taylor Davis, Jeffrey Haskell Derby, Anil Krishna, Srinivasan Ramani, Ken Vu, Steve Woolet
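
In 20120246406 the second core turns the first core's miss-address trace into prefetch addresses. The sketch below uses a simple stride detector as a stand-in for the unspecified "software algorithm"; all names are invented.

```cpp
// Sketch of the helper core's role in 20120246406: read the miss
// addresses the first core deposited in a storage array, detect a
// stride, and issue prefetches ahead of the stream.
#include <cstdint>
#include <cstdio>
#include <vector>

std::vector<uint64_t> generate_prefetches(const std::vector<uint64_t>& misses,
                                          int depth) {
    std::vector<uint64_t> out;
    if (misses.size() < 2) return out;
    uint64_t stride = misses.back() - misses[misses.size() - 2];
    for (int i = 1; i <= depth; ++i)
        out.push_back(misses.back() + i * stride);   // run ahead of the core
    return out;
}

int main() {
    std::vector<uint64_t> miss_trace = {0x1000, 0x1100, 0x1200};
    for (uint64_t a : generate_prefetches(miss_trace, 4))
        std::printf("prefetch 0x%llx\n", (unsigned long long)a);
}
```
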
  • Publication number: 20120239875
    Abstract: A method and apparatus for heterogeneous chip multiprocessors (CMP) via resource restriction. In one embodiment, the method includes the accessing of a resource utilization register to identify a resource utilization policy. Once accessed, a processor controller ensures that the processor core utilizes a shared resource in a manner specified by the resource utilization policy. In one embodiment, each processor core within a CMP includes an instruction issue throttle resource utilization register, an instruction fetch throttle resource utilization register and other like ways of restricting its utilization of shared resources within a minimum and maximum utilization level. In one embodiment, resource restriction provides a flexible manner for allocating current and power resources to processor cores of a CMP that can be controlled by hardware or software. Other embodiments are described and claimed.
    Type: Application
    Filed: May 29, 2012
    Publication date: September 20, 2012
    Inventors: Tryggve Fossum, George Chrysos, Todd A. Dutton
  • Publication number: 20120233393
    Abstract: In one embodiment, a processor includes a first cache and a second cache, a first core associated with the first cache and a second core associated with the second cache. The caches are of asymmetric sizes, and a scheduler can intelligently schedule threads to the cores based at least in part on awareness of this asymmetry and resulting cache performance information obtained during a training phase of at least one of the threads.
    Type: Application
    Filed: March 8, 2011
    Publication date: September 13, 2012
    Inventors: Xiaowei Jiang, Li Zhao, Ravishankar Iyer
  • Patent number: 8266387
    Abstract: Various technologies and techniques are disclosed for using transactional memory hardware to accelerate virtualization or emulation. One or more central processing units are provided with transactional memory hardware that is operable to accelerate virtualization. The transactional memory hardware has a facility to maintain private state, a facility to render memory accesses from other central processing units visible to software, and support for atomic commit of the private state. The transactional memory hardware can be used, for example, to facilitate emulation of precise exception semantics. The private state is operable to enable an emulated state to remain inconsistent with an architectural state and only synchronized on certain boundaries. An optimized sequence of instructions is executed using chunk-accurate simulation to try to achieve the same end effect.
    Type: Grant
    Filed: June 27, 2007
    Date of Patent: September 11, 2012
    Assignee: Microsoft Corporation
    Inventors: Martin Taillefer, Darek Mihocka, Bruno Silva
  • Publication number: 20120226865
    Abstract: Disclosed is a network-on-chip system including an active memory processor for mitigating the increased communication latency caused by multiple processors and memories. The network-on-chip system includes a plurality of processing elements that request to perform an active memory operation for a predetermined operation from a shared memory to reduce access latency of the shared memory, and an active memory processor connected to the processing elements through a network, storing codes for processing custom transactions in response to the active memory operation, performing an operation on addresses or data stored in a shared cache memory or the shared memory based on the codes, and transmitting the performed operation result to the processing elements.
    Type: Application
    Filed: December 9, 2009
    Publication date: September 6, 2012
    Applicant: SNU R&DB FOUNDATION
    Inventors: Ki-Young Choi, Jun-Hee Yoo, Sung-Joo Yoo, Hyun-Chul Shin
  • Patent number: 8261021
    Abstract: In order to control access requests to a cache shared between a plurality of threads, a storage unit stores a flag provided in association with each of the threads. When a thread enters execution of an atomic instruction, a defined value is written to its flag in the storage unit. When the atomic instruction completes, a different defined value is written, thereby indicating whether or not each thread is executing an atomic instruction. When an access request is issued from a certain thread, the flag values in the storage unit are referenced to judge whether a thread different from the requesting thread is executing an atomic instruction. If another thread is judged to be executing an atomic instruction, the access request is kept on standby. This makes it possible to realize the exclusive control processing necessary for atomic instructions with a simple configuration.
    Type: Grant
    Filed: December 17, 2009
    Date of Patent: September 4, 2012
    Assignee: Fujitsu Limited
    Inventor: Naohiro Kiyota
  • Publication number: 20120221795
    Abstract: A shared memory system provides an access monitoring mechanism 112 with a definition for taking clusters for motion picture attributes as pieces of cluster memory 1 and 2. When a DSP (2) 104 makes access to memory while adding attribute information about an image to the access, the access monitoring mechanism 112 outputs to a cluster memory space selector 119 control information 131 that permits making of access to the pieces of cluster memory 1 and 2. Based on the control information 131, the cluster memory space selector 119 sorts access from the DSP (2) 104 to the cluster memory 1 or 2. The same also applies to access from a GPU 105. A plurality of master processors share shared memory 110 divided into a plurality of clusters 111, thereby holding coherence of cache memory.
    Type: Application
    Filed: May 9, 2012
    Publication date: August 30, 2012
    Applicant: PANASONIC CORPORATION
    Inventors: Masahiro HOSHAKU, Yukiteru Murao, Daisuke Horigome, Masanori Okinoi
  • Publication number: 20120215984
    Abstract: A multi-core processor providing heterogeneous processor cores and a shared cache is presented.
    Type: Application
    Filed: February 27, 2012
    Publication date: August 23, 2012
    Inventors: Frank T. Hady, Mason B. Cabot, John Beck, Mark B. Rosenbluth
  • Patent number: 8244982
    Abstract: Techniques are generally described related to a multi-core processor with a plurality of processor cores and a cache memory shared by at least some of the processor cores. The multi-core processor can be configured for separately allocating a respective level of cache memory associativity to each of the processing cores.
    Type: Grant
    Filed: August 21, 2009
    Date of Patent: August 14, 2012
    Assignee: Empire Technology Development LLC
    Inventors: Andrew Wolfe, Thomas Martin Conte
  • Publication number: 20120203950
    Abstract: Methods and apparatus relating to improving address translation caching and/or input/output (I/O) cache performance in virtualized environments are described. In one embodiment, a hint provided by an endpoint device may be utilized to update information stored in an I/O cache. Such information may be utilized for implementation of a more efficient replacement policy in an embodiment. Other embodiments are also disclosed.
    Type: Application
    Filed: April 17, 2012
    Publication date: August 9, 2012
    Inventors: Mahesh Wagh, Jasmin Ajanovic
  • Publication number: 20120198172
    Abstract: A mechanism is provided in a virtual machine monitor for providing cache partitioning in virtualized environments. The mechanism assigns a virtual identification (ID) to each virtual machine in the virtualized environment. The processing core stores the virtual ID of the virtual machine in a special register. The mechanism also creates an entry for the virtual machine in a partition table. The mechanism may partition a shared cache using a vertical (way) partition and/or a horizontal partition. The entry in the partition table includes a vertical partition control and a horizontal partition control. For each cache access, the virtual machine passes the virtual ID along with the address to the shared cache. If the cache access results in a miss, the shared cache uses the partition table to select a victim cache line for replacement.
    Type: Application
    Filed: April 11, 2012
    Publication date: August 2, 2012
    Applicant: International Business Machines Corporation
    Inventors: Jiang Lin, Lixin Zhang
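
The vertical (way) partitioning of 20120198172 can be pictured as a mask per virtual ID restricting victim selection on a miss. A sketch with an invented table layout; the horizontal partition control and the real replacement policy are omitted.

```cpp
// Hypothetical sketch of 20120198172's vertical (way) partitioning:
// on a miss, the requesting virtual machine's virtual ID selects a
// partition-table entry, and the victim line is chosen only from the
// ways that entry allows.
#include <cstdint>
#include <unordered_map>

constexpr int kWays = 8;

struct PartitionEntry {
    uint8_t way_mask;   // which ways of each set this VM may victimize
};

int select_victim_way(const PartitionEntry& p) {
    for (int w = 0; w < kWays; ++w)
        if (p.way_mask & (1u << w))
            return w;   // first allowed way stands in for a real policy
    return -1;          // VM owns no ways in this cache (misconfigured)
}

int main() {
    std::unordered_map<int, PartitionEntry> partition_table = {
        {1, {0x0F}},    // virtual ID 1 owns ways 0-3
        {2, {0xF0}},    // virtual ID 2 owns ways 4-7
    };
    return select_victim_way(partition_table.at(2)) == 4 ? 0 : 1;
}
```
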
  • Publication number: 20120173819
    Abstract: Technologies are generally described herein for accelerating a cache state transfer in a multicore processor. The multicore processor may include first, second, and third tiles. The multicore processor may initiate migration of a thread executing on the first core at the first tile from the first tile to the second tile. The multicore processor may determine block addresses of blocks to be transferred from a first cache at the first tile to a second cache at the second tile, and identify that a directory at the third tile corresponds to the block addresses. The multicore processor may update the directory to reflect that the second cache shares the blocks. The multicore processor may transfer the blocks from the first cache in the first tile to the second cache in the second tile effective to complete the migration of the thread from the first tile to the second tile.
    Type: Application
    Filed: December 29, 2010
    Publication date: July 5, 2012
    Inventor: Yan Solihin
  • Publication number: 20120166729
    Abstract: A method and apparatus for controlling affinity of subcaches is disclosed. When a core compute unit evicts a line of victim data, a prioritized search for space allocation on available subcaches is executed, in order of proximity between the subcache and the compute unit. The victim data may be injected into an adjacent subcache if space is available. Otherwise, a line may be evicted from the adjacent subcache to make room for the victim data or the victim data may be sent to the next closest subcache. To retrieve data, a core compute unit sends a Tag Lookup Request message directly to the nearest subcache as well as to a cache controller, which controls routing of messages to all of the subcaches. A Tag Lookup Response message is sent back to the cache controller to indicate if the requested data is located in the nearest subcache.
    Type: Application
    Filed: December 22, 2010
    Publication date: June 28, 2012
    Applicant: ADVANCED MICRO DEVICES, INC.
    Inventor: Greggory D. Donley
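
Victim placement in 20120166729 is a prioritized search over subcaches ordered by proximity. A sketch, assuming the proximity order is precomputed per compute unit; names and the fallback choice are invented.

```cpp
// Sketch of the prioritized victim-placement search in 20120166729:
// try subcaches in order of proximity to the evicting compute unit and
// inject the victim into the first one with free space.
#include <cstdio>
#include <vector>

struct SubCache { int free_lines; };

int place_victim(std::vector<SubCache>& subs,
                 const std::vector<int>& by_proximity) {
    for (int idx : by_proximity) {          // nearest subcache first
        if (subs[idx].free_lines > 0) {
            --subs[idx].free_lines;         // inject victim line here
            return idx;
        }
    }
    return by_proximity.front();            // else evict from the nearest
}

int main() {
    std::vector<SubCache> subs = {{0}, {2}, {5}};
    std::vector<int> order = {0, 1, 2};     // proximity for this core
    std::printf("victim placed in subcache %d\n", place_victim(subs, order));
}
```
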
  • Publication number: 20120166731
    Abstract: In some embodiments, an adaptive break-even time, based on the load level of the cache, may be employed.
    Type: Application
    Filed: December 22, 2010
    Publication date: June 28, 2012
    Inventors: CHRISTIAN MACIOCCO, REN WANG, TSUNG-YUAN C. TAI
  • Publication number: 20120151146
    Abstract: Associativity of a multi-core processor cache memory to a logical partition is managed and controlled by receiving a plurality of unique logical processing partition identifiers into registration of a multi-core processor, each identifier being associated with a logical processing partition on one or more cores of the multi-core processor; responsive to a shared cache memory miss, identifying a position in a cache directory for data associated with the address, the shared cache memory being multi-way set associative; associating a new cache line entry with the data and one of the registered unique logical processing partition identifiers; modifying the cache directory to reflect the association; and caching the data at the new cache line entry, wherein said shared cache memory is effectively shared on a line-by-line basis among said plurality of logical processing partitions of said multi-core processor.
    Type: Application
    Filed: February 16, 2012
    Publication date: June 14, 2012
    Inventors: Bret Ronald Olszewski, Steven Wayne White
  • Publication number: 20120151144
    Abstract: A method and computer device for determining the cache memory configuration. The method includes allocating an amount of cache memory from a first memory level of the cache memory, and determining a read transfer time for the allocated amount of cache memory. The allocated amount of cache memory then is increased and the read transfer time for the increased allocated amount of cache memory is determined. The allocated amount of cache memory continues to be increased and the read transfer time determined for each allocated amount until all of the cache memory in all of the cache memory levels has been allocated. The cache memory configuration is determined based on the read transfer times from the allocated portions of the cache memory. The determined cache memory configuration includes the number of cache memory levels and the respective capacities of each cache memory level.
    Type: Application
    Filed: December 8, 2010
    Publication date: June 14, 2012
    Inventor: William Judge Yohn
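
The measurement loop of 20120151144 resembles the classic cache-sizing microbenchmark: time reads over progressively larger buffers and look for steps in per-access latency, which mark cache-level boundaries. A simplified user-space stand-in; a real implementation would also defeat prefetching and control allocation per level.

```cpp
// Sketch of the measurement loop in 20120151144: a jump in per-line
// read time as the buffer grows reveals a cache-level boundary.
#include <chrono>
#include <cstdio>
#include <vector>

volatile unsigned sink;   // keeps the reads from being optimized away

double read_time_per_line(size_t bytes) {
    std::vector<unsigned char> buf(bytes, 1);
    auto t0 = std::chrono::steady_clock::now();
    unsigned sum = 0;
    for (int pass = 0; pass < 16; ++pass)
        for (size_t i = 0; i < bytes; i += 64)   // one read per cache line
            sum += buf[i];
    sink = sum;
    std::chrono::duration<double, std::nano> dt =
        std::chrono::steady_clock::now() - t0;
    return dt.count() / (16.0 * bytes / 64);
}

int main() {
    for (size_t kb = 16; kb <= 32 * 1024; kb *= 2)
        std::printf("%6zu KiB: %.2f ns per line\n",
                    kb, read_time_per_line(kb * 1024));
}
```
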
  • Publication number: 20120151154
    Abstract: A latency management apparatus and method are provided. A latency management apparatus for a multiprocessor system having a plurality of processors and shared memory, in which the shared memory and each of the processors are configured to generate a delayed signal, includes: a delayed signal detector configured to detect the generated delayed signal; and one or more latency managers configured to manage an operation latency of any one of the processors upon detection of the delayed signal.
    Type: Application
    Filed: June 20, 2011
    Publication date: June 14, 2012
    Applicant: Samsung Electronics Co., Ltd.
    Inventors: Woong Seo, Soo-Jung Ryu, Moo-Kyoung Chung, HoYoung Kim, Young-Chul Cho
  • Publication number: 20120144120
    Abstract: A processing core in a multi-processing core system is configured to execute a sequence of instructions as an atomic memory transaction. Executing each instruction in the sequence comprises validating that the instruction meets a set of one or more atomicity criteria, including that executing the instruction does not require accessing shared memory. Executing the atomic memory transaction may comprise storing memory data from a source cache line into a target register, reading or modifying the memory data stored in the target register as part of executing the sequence, and storing a value from the target register to the source cache line.
    Type: Application
    Filed: December 7, 2010
    Publication date: June 7, 2012
    Inventors: Benjamin C. Serebrin, David A. Kaplan
  • Publication number: 20120144119
    Abstract: A processing core in a multi-processing core system is configured to execute a sequence of instructions as a single atomic memory transaction. The processing core validates that the sequence meets a set of one or more atomicity criteria, including that no instruction in the sequence instructs the processing core to access shared memory. After validating the sequence, the processing core executes the sequence as a single atomic memory transaction, such as by locking a source cache line that stores shared memory data, executing the validated sequence of instructions, storing a result of the sequence into the source cache line, and unlocking the source cache line.
    Type: Application
    Filed: December 7, 2010
    Publication date: June 7, 2012
    Inventors: Benjamin C. Serebrin, David A. Kaplan, Anton Chernoff
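
The validation step shared by 20120144120 and 20120144119 scans the candidate sequence against the atomicity criteria before anything is locked or executed. A sketch with an invented instruction encoding; only the no-shared-memory criterion named in the abstracts is checked.

```cpp
// Hypothetical sketch of the sequence-validation step in 20120144120
// and 20120144119: accept a sequence for atomic execution only if no
// instruction in it requires accessing shared memory.
#include <string>
#include <vector>

struct Instr {
    std::string op;
    bool touches_shared_memory;
};

bool meets_atomicity_criteria(const std::vector<Instr>& seq) {
    for (const Instr& i : seq)
        if (i.touches_shared_memory)
            return false;        // criterion from the abstracts
    return true;
}

int main() {
    std::vector<Instr> seq = {
        {"load r1, [line]",  false},   // source cache line, already local
        {"add r1, r1, 1",    false},
        {"store [line], r1", false},
    };
    return meets_atomicity_criteria(seq) ? 0 : 1;
}
```
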
  • Publication number: 20120144121
    Abstract: Associativity of a multi-core processor cache memory to a logical partition is managed and controlled by receiving a plurality of unique logical processing partition identifiers into registration of a multi-core processor, each identifier being associated with a logical processing partition on one or more cores of the multi-core processor; responsive to a shared cache memory miss, identifying a position in a cache directory for data associated with the address, the shared cache memory being multi-way set associative; associating a new cache line entry with the data and one of the registered unique logical processing partition identifiers; modifying the cache directory to reflect the association; and caching the data at the new cache line entry, wherein the shared cache memory is effectively shared on a line-by-line basis among the plurality of logical processing partitions of the multi-core processor.
    Type: Application
    Filed: February 16, 2012
    Publication date: June 7, 2012
    Inventors: Bret Ronald Olszewski, Steven Wayne White
  • Publication number: 20120137075
    Abstract: The invention relates to a multi-core processor system, in particular a single-package multi-core processor system, comprising at least two processor cores, preferably at least four processor cores, each of said at least two cores, preferably at least four processor cores, having a local LEVEL-1 cache, a tree communication structure combining the multiple LEVEL-1 caches, the tree having at least one node, preferably at least three nodes for a four processor core multi-core processor, and TAG information is associated with data managed within the tree, usable in the treatment of the data.
    Type: Application
    Filed: June 9, 2010
    Publication date: May 31, 2012
    Applicant: HYPERION CORE, INC.
    Inventor: Martin Vorbach
  • Publication number: 20120137078
    Abstract: In one embodiment, a memory controller may be configured to transmit two or more critical words (or beats) corresponding to two or more different read requests prior to returning the remaining beats of the read requests. Such an embodiment may reduce latency to the sources of the memory requests, which may be stalled awaiting the critical words. The remaining words may fill a cache block or other buffer, but may not be required by the sources as quickly as the critical words in order to support higher performance. In some embodiments, once a remaining beat of a block is transmitted, all of the remaining beats may be transmitted contiguously. In other embodiments, additional critical words may be forwarded between remaining beats of a block.
    Type: Application
    Filed: November 29, 2010
    Publication date: May 31, 2012
    Inventors: Sukalpa Biswas, Hao Chen, Brian P. Lilly
  • Publication number: 20120110268
    Abstract: The data processing apparatus according to an embodiment of the present invention includes: a first processor; a second processor; and an external RAM to/from which the first processor writes/reads data, the first processor including a cache memory for storing data used in the first processor in association with an address on the external RAM, and the data being written to the cache memory by the second processor without passing through the external RAM.
    Type: Application
    Filed: January 11, 2012
    Publication date: May 3, 2012
    Applicant: Renesas Electronics Corporation
    Inventor: Mitsunobu Tanigawa
  • Patent number: 8112587
    Abstract: A method, circuit arrangement, and design structure for prefetching data for responding to a memory request, in a shared memory computing system of the type that includes a plurality of nodes, is provided. Prefetching data comprises, receiving, in response to a first memory request by a first node, presence data for a memory region associated with the first memory request from a second node that sources data requested by the first memory request, and selectively prefetching at least one cache line from the memory region based on the received presence data. Responding to a memory request comprises tracking presence data associated with memory regions associated with cached cache lines in the first node, and, in response to a memory request by a second node, forwarding the tracked presence data for a memory region associated with the memory request to the second node.
    Type: Grant
    Filed: April 30, 2009
    Date of Patent: February 7, 2012
    Assignee: International Business Machines Corporation
    Inventors: Jason F. Cantin, Steven R. Kunkel
  • Patent number: 8108612
    Abstract: Version indicators within an existing range can be associated with a data partition in a distributed data store. A partition reconfiguration can be associated with one of multiple partitions in the data store, and a new version indicator that is outside the existing range can be assigned to the reconfigured partition. Additionally, a broadcast message can be sent to multiple nodes, which can include storage nodes and/or client nodes that are configured to communicate with storage nodes to access data in a distributed data store. The broadcast message can include updated location information for data in the data store. In addition, a response message can be sent to a requesting node of the multiple nodes in response to receiving from that node a message that requests updated location information for the data. The response message can include the requested updated location information.
    Type: Grant
    Filed: May 15, 2009
    Date of Patent: January 31, 2012
    Assignee: Microsoft Corporation
    Inventors: Lu Xun, Hua-Jun Zeng, Muralidhar Krishnaprasad, Radhakrishnan Srikanth, Ankur Agrawal, Balachandar Pavadaisamy
  • Publication number: 20120017049
    Abstract: An advanced processor comprises a plurality of multithreaded processor cores each having a data cache and instruction cache. A data switch interconnect is coupled to each of the processor cores and configured to pass information among the processor cores. A messaging network is coupled to each of the processor cores and a plurality of communication ports. In one aspect of an embodiment of the invention, the data switch interconnect is coupled to each of the processor cores by its respective data cache, and the messaging network is coupled to each of the processor cores by its respective message station. Advantages of the invention include the ability to provide high bandwidth communications between computer systems and memory in an efficient and cost-effective manner.
    Type: Application
    Filed: May 7, 2011
    Publication date: January 19, 2012
    Applicant: NETLOGIC MICROSYSTEMS, INC.
    Inventor: David T. HASS
  • Patent number: 8099557
    Abstract: In one embodiment, a system comprises a first processor, a main memory system, and a cache hierarchy coupled between the first processor and the main memory system. The cache hierarchy comprises at least a first cache. The first processor is configured to execute a first instruction, including forming an address responsive to one or more operands of the first instruction. The system is configured to push a first cache block that is hit by the first address in the first cache to a target location within the cache hierarchy or the main memory system, wherein the target location is unspecified in a definition of the first instruction within an instruction set architecture implemented by the first processor, and wherein the target location is implementation-dependent.
    Type: Grant
    Filed: February 26, 2008
    Date of Patent: January 17, 2012
    Assignee: GLOBALFOUNDRIES Inc.
    Inventors: John D. McCalpin, Patrick N. Conway
  • Publication number: 20120011322
    Abstract: According to embodiments described in the specification, a method and apparatus for managing memory in a mobile electronic device are provided. The method comprises: receiving a request to install an application; receiving at least one indication of data intended to be maintained in a shared cache; determining, based on the at least one indication, whether data corresponding to the intended data exists in the shared cache; upon a negative determination, writing the intended data to the shared cache; and repeating the receiving at least one indication, the determining and the writing for at least one additional application.
    Type: Application
    Filed: November 18, 2009
    Publication date: January 12, 2012
    Inventor: Ankur AGGARWAL
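
The install-time flow of 20120011322 is essentially write-if-absent against a shared cache. A sketch, using invented content identifiers as keys; the patent does not say how "corresponding" data is matched.

```cpp
// Sketch of 20120011322's install flow: for each piece of data a new
// application wants in the shared cache, write it only if an
// equivalent entry is not already present.
#include <map>
#include <set>
#include <string>

class SharedCache {
    std::map<std::string, std::string> entries_;   // content id -> data
public:
    // Returns true if the data had to be written (was not present).
    bool ensure(const std::string& id, const std::string& data) {
        return entries_.emplace(id, data).second;
    }
};

int main() {
    SharedCache cache;
    std::set<std::string> app1 = {"font.ttf", "lib.js"};
    std::set<std::string> app2 = {"lib.js", "theme.css"};
    int writes = 0;
    for (const auto& id : app1) writes += cache.ensure(id, "payload");
    for (const auto& id : app2) writes += cache.ensure(id, "payload");
    return writes == 3 ? 0 : 1;   // lib.js written once, shared by both apps
}
```
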
  • Publication number: 20120005431
    Abstract: A computer network with distributed shared memory, including a clustered memory cache aggregated from physical memory locations on a plurality of physically distinct computing systems. The clustered memory cache is accessible by a plurality of clients on the computer network and is configured to perform page caching of data items accessed by the clients. The network also includes a policy engine operatively coupled with the clustered memory cache, where the policy engine is configured to control where data items are cached in the clustered memory cache.
    Type: Application
    Filed: September 9, 2011
    Publication date: January 5, 2012
    Inventors: Jason P. Gross, Ranjit B. Pandit, Clive G. Cook, Thomas H. Matson