Processing Control For Data Transfer Patents (Class 712/225)
  • Publication number: 20150100765
    Abstract: In a processor, a disambiguation-free out of order load store queue method. The method includes implementing a memory resource that can be accessed by a plurality of asynchronous cores; implementing a store retirement buffer, wherein stores from a store queue have entries in the store retirement buffer in original program order; and upon dispatch of a subsequent load from a load queue, searching the store retirement buffer for address matching. The method further includes, in cases where there are a plurality of address matches, locating a correct forwarding entry by scanning the store retirement buffer for a first match; and forwarding data from the first match to the subsequent load.
    Type: Application
    Filed: December 12, 2014
    Publication date: April 9, 2015
    Inventor: Mohammad A. ABDALLAH
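    A minimal C++ sketch of the forwarding search this abstract describes, assuming the store retirement buffer can be modeled as a program-ordered array of address/data entries; the structure layout and the youngest-to-oldest scan direction are illustrative assumptions, not details taken from the patent.

```cpp
// Store-to-load forwarding out of a program-ordered store retirement buffer.
#include <cstdint>
#include <optional>
#include <vector>

struct RetiredStore {
    uint64_t address;   // byte address written by the store
    uint64_t data;      // value the store wrote
};

// Entries are kept in original program order: index 0 is the oldest store.
using StoreRetirementBuffer = std::vector<RetiredStore>;

// On dispatch of a load, search the buffer for an address match.  When
// several entries match, the "first match" found while scanning from the
// youngest entry toward the oldest is taken as the forwarding source
// (scan direction assumed for this sketch).
std::optional<uint64_t> forward_from_srb(const StoreRetirementBuffer& srb,
                                         uint64_t load_address) {
    for (auto it = srb.rbegin(); it != srb.rend(); ++it) {
        if (it->address == load_address) {
            return it->data;            // forward data from the first match
        }
    }
    return std::nullopt;                // no match: the load reads memory
}

int main() {
    StoreRetirementBuffer srb = {{0x1000, 1}, {0x2000, 2}, {0x1000, 3}};
    // The youngest matching store (value 3) is forwarded to the load.
    return forward_from_srb(srb, 0x1000).value() == 3 ? 0 : 1;
}
```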
  • Publication number: 20150100766
    Abstract: In a processor, a disambiguation-free out of order load store queue method. The method includes implementing a memory resource that can be accessed by a plurality of asynchronous cores; implementing a store retirement buffer, wherein stores from a store queue have entries in the store retirement buffer in original program order; and implementing speculative execution, wherein results of speculative execution can be saved in the store retirement/reorder buffer as a speculative state. The method further includes, upon dispatch of a subsequent load from a load queue, searching the store retirement buffer for address matching; and, in cases where there are a plurality of address matches, locating a correct forwarding entry by scanning the store retirement buffer for a first match, and forwarding data from the first match to the subsequent load. Once speculative outcomes are known, the speculative state is retired to memory.
    Type: Application
    Filed: December 12, 2014
    Publication date: April 9, 2015
    Inventor: Mohammad A. ABDALLAH
  • Patent number: 9003171
    Abstract: A system including a processor that handles a TLB miss while executing a vector read instruction is described herein. During operation, the processor performs a lookup in a TLB for addresses in active elements in the vector read instruction. The processor then determines that a TLB miss occurred for the address from an active element other than a first active element. Upon predicting that a page table walk for the vector read instruction will result in a page fault, the processor sets a bit in a corresponding bit position in an FSR. A set bit in a bit position in the FSR indicates that data in a corresponding element of the vector read instruction is invalid. The processor then immediately performs memory reads for at least one of the first active element and other active elements for which TLB misses did not occur.
    Type: Grant
    Filed: June 23, 2011
    Date of Patent: April 7, 2015
    Assignee: Apple Inc.
    Inventor: Jeffry E. Gonion
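    A hedged C++ sketch of the per-element handling this abstract describes: elements whose page-table walk is predicted to fault get a bit set in a fault status register (FSR) while reads proceed immediately for the elements that hit in the TLB. The tlb_hit, predicts_fault, and load helpers are hypothetical stand-ins invented for the sketch, not the patent's interfaces.

```cpp
// Per-element TLB handling for a vector read with a fault status register.
#include <cstdint>
#include <functional>
#include <vector>

struct VectorReadResult {
    std::vector<uint64_t> data;  // one slot per vector element
    uint64_t fsr = 0;            // bit i set => element i's data is invalid
};

VectorReadResult vector_read(
    const std::vector<uint64_t>& addrs,
    const std::vector<bool>& active,                 // predicate mask
    const std::function<bool(uint64_t)>& tlb_hit,    // assumed helper
    const std::function<bool(uint64_t)>& predicts_fault,
    const std::function<uint64_t(uint64_t)>& load) {
    VectorReadResult r;
    r.data.assign(addrs.size(), 0);
    for (size_t i = 0; i < addrs.size(); ++i) {
        if (!active[i]) continue;
        if (!tlb_hit(addrs[i]) && predicts_fault(addrs[i])) {
            r.fsr |= (uint64_t{1} << i);   // mark element i invalid
            continue;                      // do not stall the other elements
        }
        r.data[i] = load(addrs[i]);        // read proceeds immediately
    }
    return r;
}

int main() {
    auto hit = [](uint64_t a) { return a != 0x3000; };
    auto faults = [](uint64_t) { return true; };
    auto load = [](uint64_t a) { return a + 1; };
    auto r = vector_read({0x1000, 0x2000, 0x3000}, {true, true, true},
                         hit, faults, load);
    return r.fsr == 0b100 ? 0 : 1;   // element 2 marked invalid
}
```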
  • Publication number: 20150089207
    Abstract: A parallel counter accesses data generated by an application and stored within a register. The register includes different segments that include different portions of the application data. The parallel counter is configured to count the number of values within each segment that have a particular characteristic in a parallel fashion. The parallel counter may then return the individual segment counts to the application, or combine those segment counts and return a register count to the application. Advantageously, applications that rely on population count operations may be accelerated. Further, increasing the number of segments in a given register may reduce the time needed to count the values in that register, thereby providing a scalable solution to population counting. Additionally, the architecture of the parallel counter is sufficiently flexible to allow both register counting and segment counting, thereby combining two separate functionalities into just one hardware unit.
    Type: Application
    Filed: September 20, 2013
    Publication date: March 26, 2015
    Applicant: NVIDIA CORPORATION
    Inventors: Robert OHANNESSIAN, Brian FAHS
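    A small C++ sketch of the segmented population count described above, assuming a 64-bit register split into four 16-bit segments and the counted "characteristic" being a set bit; both choices are illustrative, not taken from the patent.

```cpp
// Segmented population count: count set bits per segment (conceptually in
// parallel), then optionally combine them into one register-wide count.
#include <array>
#include <bit>
#include <cstdint>
#include <numeric>

constexpr int kSegments = 4;                       // 4 x 16-bit segments
constexpr int kSegmentBits = 64 / kSegments;

// Per-segment counts, returned individually to the application.
std::array<int, kSegments> segment_counts(uint64_t reg) {
    std::array<int, kSegments> counts{};
    for (int s = 0; s < kSegments; ++s) {
        uint64_t mask = (uint64_t{1} << kSegmentBits) - 1;
        counts[s] = std::popcount((reg >> (s * kSegmentBits)) & mask);
    }
    return counts;
}

// Whole-register count obtained by combining the segment counts.
int register_count(uint64_t reg) {
    auto c = segment_counts(reg);
    return std::accumulate(c.begin(), c.end(), 0);
}

int main() {
    uint64_t reg = 0x00FF'0001'F000'0003;          // example register value
    return register_count(reg) == 8 + 1 + 4 + 2 ? 0 : 1;
}
```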
  • Publication number: 20150082010
    Abstract: Techniques for packing and unpacking data from a source register using a particular shift instruction are provided. The shift instruction takes, as input, a source register that contains a plurality of elements and a shift count register that contains a plurality of shift counts. Each shift count indicates how much to shift bits from the source register. Where "source" bits are shifted (or copied) to in an output register depends on the position of the shift count in the shift count register. The shift counts may correspond to one or more bytes from the source register. The shift instruction may initiate a left shift operation or a right shift operation.
    Type: Application
    Filed: September 16, 2013
    Publication date: March 19, 2015
    Applicant: Oracle International Corporation
    Inventor: Albert Martin
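    A hedged C++ sketch of a per-lane variable shift of the kind this abstract describes for packing and unpacking. The abstract does not pin down the lane semantics, so the sketch uses one plausible reading: output byte i is source byte i shifted by the i-th count in the shift-count register.

```cpp
// Per-lane variable shift: each byte lane is shifted by its own count.
#include <cstdint>

enum class Dir { Left, Right };

// src and counts are both treated as eight byte-wide lanes.
uint64_t lane_shift(uint64_t src, uint64_t counts, Dir dir) {
    uint64_t out = 0;
    for (int lane = 0; lane < 8; ++lane) {
        uint8_t b = static_cast<uint8_t>(src >> (lane * 8));
        uint8_t c = static_cast<uint8_t>(counts >> (lane * 8)) & 0x7;
        uint8_t shifted = (dir == Dir::Left) ? uint8_t(b << c) : uint8_t(b >> c);
        out |= uint64_t{shifted} << (lane * 8);
    }
    return out;
}

int main() {
    // Shift lane 0 left by 1 and lane 1 left by 4; other lanes use count 0.
    uint64_t r = lane_shift(0x0000'0000'0000'F003, 0x0000'0000'0000'0401,
                            Dir::Left);
    return (r & 0xFF) == 0x06 ? 0 : 1;
}
```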
  • Patent number: 8984261
    Abstract: Embodiments relate to loading data in a pipelined microprocessor. An aspect includes issuing a load request that comprises a load address requiring at least one block of data the same size as a largest contiguous granularity of data returned from a cache. Another aspect includes determining that the load address matches at least one block address. Another aspect includes, based on determining that there is an address match, reading a data block from a buffer register and sending the data to satisfy the load request; comparing a unique set id of the data block to the set id of the matching address after sending the data block; based on determining that there is a set id match, continuing the load request, or, based on determining that there is not a set id match, setting a store-forwarding state of the matching address to no store-forwarding and rejecting the load request.
    Type: Grant
    Filed: October 22, 2013
    Date of Patent: March 17, 2015
    Assignee: International Business Machines Corporation
    Inventors: Brian D. Barrick, Barry W. Krumm, James R. Mitchell, Bradley Nelson, Aaron Tsai, Chung-Lung K. Shum, Michael H. Wood
  • Patent number: 8977816
    Abstract: A cache and disk management method is provided. In the cache and disk management method, a command to delete all valid data stored in a cache, or specific data corresponding to a part of the valid data may be transmitted to a plurality of member disks. That is, all of the valid data or the specific data may exist in the cache only, and may be deleted from the plurality of member disks. Accordingly, the plurality of member disks may secure more space, an internal copy overhead may be reduced, and more particularly, solid state disks may achieve better performance.
    Type: Grant
    Filed: December 23, 2009
    Date of Patent: March 10, 2015
    Assignee: OCZ Storage Solutions Inc.
    Inventor: Soo Gil Jeong
  • Patent number: 8977815
    Abstract: A processing pipeline 6, 8, 10, 12 is provided with a main query stage 20 and a fetch stage 22. A buffer 24 stores program instructions which have missed within a cache memory 14. Query generation circuitry within the main query stage 20 and within a buffer query stage 26 serves to concurrently generate a main query request and a buffer query request sent to the cache memory 14. The cache memory returns a main query response and a buffer query response. Arbitration circuitry 28 controls multiplexers 30, 32 and 34 to direct the program instruction at the main query stage 20, and the program instruction stored within the buffer 24 and the buffer query stage 26, to pass either to the fetch stage 22 or to the buffer 24. The multiplexer 30 can also select a new instruction to be passed to the main query stage 20.
    Type: Grant
    Filed: November 29, 2010
    Date of Patent: March 10, 2015
    Assignee: ARM Limited
    Inventors: Frode Heggelund, Rune Holm, Andreas Due Engh-Halstvedt, Edvard Feilding
  • Patent number: 8977837
    Abstract: At least one instruction of a sequence of program instructions has a plurality of alternative outcomes including at least a first outcome that is independent of at least one operand and a second outcome that is dependent on the at least one operand. The at least one operand is a value generated by a preceding instruction in the sequence. The at least one instruction is issued for execution independently of when the at least one operand is generated by the preceding instruction. Recovery circuitry is provided to perform a recovery operation in the event that the second outcome is to be executed for the at least one instruction and the at least one operand has not been generated by the preceding instruction when the at least one instruction is to be executed by said instruction execution circuitry.
    Type: Grant
    Filed: May 27, 2009
    Date of Patent: March 10, 2015
    Assignee: ARM Limited
    Inventors: Robert Gregory McDonald, Paul Gilbert Meyer
  • Publication number: 20150067305
    Abstract: A system and method for efficient predicting and processing of memory access dependencies. A computing system includes control logic that marks a detected load instruction as a first type responsive to predicting the load instruction has high locality and is a candidate for store-to-load (STL) data forwarding. The control logic marks the detected load instruction as a second type responsive to predicting the load instruction has low locality and is not a candidate for STL data forwarding. The control logic processes a load instruction marked as the first type as if the load instruction is dependent on an older store operation. The control logic processes a load instruction marked as the second type as if the load instruction is independent of any older store operation.
    Type: Application
    Filed: August 30, 2013
    Publication date: March 5, 2015
    Applicant: Advanced Micro Devices, Inc.
    Inventors: Lena E. Olson, Yasuko Eckert, Srilatha Manne
  • Publication number: 20150067306
    Abstract: A microprocessor includes a plurality of processing cores and an uncore random access memory (RAM) readable and writable by each of the plurality of processing cores. Each core of the plurality of processing cores comprises microcode run by the core that implements architectural instructions of an instruction set architecture of the microprocessor. The microcode is configured to both read and write the uncore RAM to accomplish inter-core communication between the plurality of processing cores.
    Type: Application
    Filed: May 19, 2014
    Publication date: March 5, 2015
    Applicant: VIA TECHNOLOGIES, INC.
    Inventors: G. Glenn Henry, Terry Parks, Rodney E. Hooker, Stephan Gaskins
  • Patent number: 8972705
    Abstract: A constant data accessing system having a constant pool comprises a computer processor having a constant pool base register, a compiler having a constant pool handler, and an instruction set module having a constant pool instruction set unit. The constant pool base register is configured to store a value of constant pool base address of one or a plurality of subroutines which have constants to be accessed.
    Type: Grant
    Filed: November 16, 2011
    Date of Patent: March 3, 2015
    Assignee: Andes Technology Corporation
    Inventors: Wei-Hao Chiao, Haw-Luen Tsai, Chen-Wei Chang, Hong-Men Su
  • Patent number: 8972704
    Abstract: A code section of a computer program to be executed by a computing device includes memory barrier instructions. Where the code section satisfies a threshold, the code section is modified by enclosing the code section within a transaction that employs hardware transactional memory of the computing device, and removing the memory barrier instructions from the code section. Execution of the code section as has been enclosed within the transaction can be monitored to yield monitoring results. Where the monitoring results satisfy an abort threshold corresponding to excessive aborting of the execution of the code section as has been enclosed within the transaction, the code section is split into code sub-sections, and each code sub-section is enclosed within a separate transaction that employs the hardware transactional memory. Splitting the code section into code sub-sections and enclosing each code sub-section within a separate transaction can decrease occurrence of the code section aborting during execution.
    Type: Grant
    Filed: December 15, 2011
    Date of Patent: March 3, 2015
    Assignee: International Business Machines Corporation
    Inventors: Toshihiko Koju, Takuya Nakaike, Ali Ijaz Sheikh, Harold Wade Cain, III, Maged M. Michael
  • Patent number: 8966635
    Abstract: In one implementation, an object analysis system identifies an object within a software module, and determines a size of the object based on at least one operation within the software module. The object analysis system identifies the object and determines the size of the object without reference to source code of the software module.
    Type: Grant
    Filed: February 24, 2012
    Date of Patent: February 24, 2015
    Assignee: Hewlett-Packard Development Company, L.P.
    Inventor: David Bryan Dewey
  • Patent number: 8966231
    Abstract: The present disclosure includes methods, devices, modules, and systems for modifying commands. One device embodiment includes a memory controller including a channel, wherein the channel includes a command queue configured to hold commands, and circuitry configured to modify at least a number of commands in the queue and execute the modified commands.
    Type: Grant
    Filed: October 11, 2011
    Date of Patent: February 24, 2015
    Assignee: Micron Technology, Inc.
    Inventor: Mehdi Asnaashari
  • Patent number: 8959339
    Abstract: A system comprising a processor adapted to activate multiple security levels for the system and a monitoring device coupled to the processor and employing security rules pertaining to the multiple security levels. The monitoring device restricts usage of the system if the processor activates the security levels in a sequence contrary to the security rules.
    Type: Grant
    Filed: January 30, 2006
    Date of Patent: February 17, 2015
    Assignee: Texas Instruments Incorporated
    Inventor: Gregory R. Conti
  • Patent number: 8959304
    Abstract: A data processing apparatus comprises a primary processor, a secondary processor configured to perform secure data processing operations and non-secure data processing operations and a memory configured to store secure data used by the secondary processor when performing the secure data processing operations and configured to store non-secure data used by the secondary processor when performing the non-secure data processing operations, wherein the secure data cannot be accessed by the non-secure data processing operations, wherein the secondary processor comprises a memory management unit configured to administer accesses to the memory from the secondary processor, the memory management unit configured to perform translations between virtual memory addresses used by the secondary processor and physical memory addresses used by the memory, wherein the translations are configured in dependence on a page table base address, the page table base address identifying a storage location in the memory of a set of des
    Type: Grant
    Filed: February 26, 2013
    Date of Patent: February 17, 2015
    Assignee: ARM Limited
    Inventors: Dominic Hugo Symes, Ola Hugosson, Donald Felton, Sean Tristram Ellis
  • Publication number: 20150046687
    Abstract: A processor having a streaming unit is disclosed. In one embodiment, a processor includes one or more execution units configured to execute instructions of a processor instruction set. The processor further includes a streaming unit configured to execute a first instruction of the processor instruction set, wherein executing the first instruction comprises the streaming unit loading a first data stream from a memory of a computer system responsive to execution of a first instruction. The first data stream comprises a plurality of data elements. The first instruction includes a first argument indicating a starting address of the first stream, a second argument indicating a stride between the data elements, and a third argument indicative of an ending address of the stream. The streaming unit is configured to output a second data stream corresponding to the first data stream.
    Type: Application
    Filed: August 6, 2013
    Publication date: February 12, 2015
    Applicant: Oracle International Corporation
    Inventors: Darryl J. Gove, David L. Weaver, Gerald Zuraski
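    A minimal C++ sketch of the strided stream load described above, with the instruction's three arguments (start address, stride, end address) mapped onto plain function parameters; the element type, the exclusive end address, and the output stream being a plain copy of the loaded elements are assumptions made for illustration.

```cpp
// Strided stream load: walk memory from start to end by a fixed stride.
#include <cstddef>
#include <cstdint>
#include <vector>

std::vector<uint32_t> load_stream(const uint32_t* memory,
                                  std::size_t start,   // arg 1: start index
                                  std::size_t stride,  // arg 2: stride
                                  std::size_t end) {   // arg 3: end (exclusive)
    std::vector<uint32_t> stream;
    for (std::size_t addr = start; addr < end; addr += stride) {
        stream.push_back(memory[addr]);   // gather the next stream element
    }
    return stream;                        // output stream mirrors the input
}

int main() {
    uint32_t mem[16];
    for (uint32_t i = 0; i < 16; ++i) mem[i] = i * 10;
    auto s = load_stream(mem, 1, 4, 16);  // elements 1, 5, 9, 13
    return (s.size() == 4 && s[3] == 130) ? 0 : 1;
}
```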
  • Publication number: 20150039867
    Abstract: Techniques are disclosed relating to specification of instruction operands. In some embodiments, this may involve assigning operands to source inputs. In one embodiment, an instruction includes one or more mapping values, each of which corresponds to a source of the instruction and each of which specifies a location value. In this embodiment, the instruction includes one or more location values that are each usable to identify an operand for the instruction. In this embodiment, a method may include accessing operands using the location values and assigning accessed operands to sources using the mapping values. In one embodiment, the sources may correspond to inputs of an execution block. In one embodiment, a destination mapping value in the instruction may specify a location value that indicates a destination for storing an instruction result.
    Type: Application
    Filed: July 31, 2013
    Publication date: February 5, 2015
    Applicant: Apple Inc.
    Inventors: James S. Blomgren, Terence M. Potter
  • Publication number: 20150033001
    Abstract: Techniques and mechanisms for exchanging control signals in a data path module of a data stream processing engine. In an embodiment, the data path module may be configured to form a set of one or more data paths corresponding to an instruction which is to be executed. In another embodiment, data processing units of the data path module may be configured to exchange one or more control signals for elastic execution of the instruction.
    Type: Application
    Filed: December 29, 2011
    Publication date: January 29, 2015
    Inventor: Vladimir Ivanov
  • Publication number: 20150026438
    Abstract: A system, method, and computer program product for ensuring forward progress of threads that implement divergent operations in a single-instruction, multiple data (SIMD) architecture is disclosed. The method includes the steps of allocating a queue data structure to a thread block including a plurality of threads, determining that a current instruction specifies a yield operation, pushing a token onto the second side of the queue data structure, disabling any active threads in the thread block, popping a next pending token from the first side of the queue data structure, and activating one or more threads in the thread block according to a mask included in the next pending token.
    Type: Application
    Filed: July 18, 2013
    Publication date: January 22, 2015
    Inventors: Olivier Giroux, Gregory Frederick Diamos
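    A software sketch of the yield handling this abstract describes, modeling the per-thread-block queue as a double-ended container of mask-carrying tokens; token contents beyond the mask (such as a resume program counter) are omitted for brevity, and the push/pop sides follow the abstract's "second side" / "first side" wording.

```cpp
// Yield handling for divergent SIMD threads via a token queue per block.
#include <cstdint>
#include <deque>

struct Token {
    uint32_t mask;       // bit i set => thread i becomes active
};

struct ThreadBlock {
    uint32_t active = 0; // current activity mask for the block
    std::deque<Token> q; // queue data structure allocated to this block
};

// Executed when the current instruction specifies a yield operation.
void yield(ThreadBlock& tb) {
    tb.q.push_back({tb.active});   // push a token onto the "second" side
    tb.active = 0;                 // disable any active threads
    if (!tb.q.empty()) {
        Token next = tb.q.front(); // pop the next pending token, "first" side
        tb.q.pop_front();
        tb.active = next.mask;     // activate threads named by its mask
    }
}

int main() {
    ThreadBlock tb;
    tb.active = 0b0011;
    tb.q.push_back({0b1100});      // an older pending token
    yield(tb);
    return tb.active == 0b1100 ? 0 : 1;   // the older token resumes first
}
```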
  • Patent number: 8934332
    Abstract: A system is disclosed for concurrently processing order sensitive data packets. A first data packet from a plurality of sequentially ordered data packets is directed to a first offload engine. A second data packet from the plurality of sequentially ordered data packets is directed to a second offload engine, wherein the second data packet is sequentially subsequent to the first data packet. The second offload engine receives information from the first offload engine, wherein the information reflects that the first offload engine is processing the first data packet. Based on the information received at the second offload engine, the second offload engine processes the second data packet so that critical events in the processing of the first data packet by the first offload engine occur prior to critical events in the processing of the second data packet by the second offload engine.
    Type: Grant
    Filed: February 29, 2012
    Date of Patent: January 13, 2015
    Assignee: International Business Machines Corporation
    Inventors: Ronald E. Fuhs, Scott M. Willenborg
  • Patent number: 8935514
    Abstract: In one embodiment, the present invention includes an instruction decoder that can receive an incoming instruction and a path select signal and decode the incoming instruction into a first instruction code or a second instruction code responsive to the path select signal. The two different instruction codes, both representing the same incoming instruction may be used by an execution unit to perform an operation optimized for different data lengths. Other embodiments are described and claimed.
    Type: Grant
    Filed: August 28, 2013
    Date of Patent: January 13, 2015
    Assignee: Intel Corporation
    Inventors: Ohad Falik, Lihu Rappoport, Ron Gabor, Yulia Kurolap, Michael Mishaeli
  • Publication number: 20150012732
    Abstract: A method for recombining runtime instructions, comprising: an instruction running environment is buffered; the machine instruction segment to be scheduled is obtained; a second jump instruction, which directs to an entry address of an instruction recombining platform, is inserted before the last instruction of the obtained machine instruction segment to generate the recombined instruction segment comprising the address A′; the value A of the address register of the buffered instruction running environment is modified to the address A′; and the instruction running environment is recovered.
    Type: Application
    Filed: September 25, 2014
    Publication date: January 8, 2015
    Applicant: Beijing Zhongtian Antai Technology Co., Ltd.
    Inventor: Jiaxiang Wang
  • Patent number: 8930962
    Abstract: Methods, apparatuses, and computer program products for processing unexpected messages at a compute node of a parallel computer are provided. Embodiments include receiving, by the compute node, a portion of a message from another compute node of the parallel computer, the message comprising a plurality of separate portions; in response to receiving the portion of the message, determining, by the compute node, whether one of the applications executing on the compute node, has indicated that the message is expected; if one of the applications executing on the compute node has not indicated that the message is expected, storing, by the compute node, the portion of the message in an unexpected message buffer within the compute node; and if one of the applications executing on the compute node has indicated that the message is expected, storing the portion of the message at a storage destination indicated by the message.
    Type: Grant
    Filed: February 22, 2012
    Date of Patent: January 6, 2015
    Assignee: International Business Machines Corporation
    Inventors: Charles J. Archer, James E. Carey, Philip J. Sanders, Brian E. Smith
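    A hedged C++ sketch of the expected/unexpected split described above: an arriving message portion is either delivered to the destination the application indicated or parked in an unexpected-message buffer. The tag-based matching is an assumption made for the sketch; the abstract only says the application "indicates" that a message is expected.

```cpp
// Expected vs. unexpected message handling at a compute node.
#include <cstdint>
#include <map>
#include <vector>

struct Portion {
    uint64_t tag;                  // identifies which message this belongs to
    std::vector<uint8_t> bytes;    // payload of this portion
};

struct ComputeNode {
    std::map<uint64_t, std::vector<uint8_t>*> expected;   // tag -> destination
    std::vector<Portion> unexpected_buffer;

    void receive(const Portion& p) {
        auto it = expected.find(p.tag);
        if (it == expected.end()) {
            unexpected_buffer.push_back(p);        // not expected: park it
        } else {
            auto* dst = it->second;                // expected: deliver it
            dst->insert(dst->end(), p.bytes.begin(), p.bytes.end());
        }
    }
};

int main() {
    ComputeNode node;
    std::vector<uint8_t> dest;
    node.receive({7, {1, 2}});          // arrives before being expected
    node.expected[7] = &dest;
    node.receive({7, {3}});             // arrives after being expected
    return (node.unexpected_buffer.size() == 1 && dest.size() == 1) ? 0 : 1;
}
```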
  • Patent number: 8930638
    Abstract: A cache coherency controller, a system comprising such, and a method of its operation are disclosed. The coherency controller ensures that target-side security checking rules are not violated by the performance-improving processes commonly used in coherency controllers such as dropping, merging, invalidating, forwarding, and snooping. This is done by ensuring that requests marked for target-side security checking and any other requests to overlapping addresses are forwarded directly to the target-side security filter without modification or side effects.
    Type: Grant
    Filed: November 27, 2012
    Date of Patent: January 6, 2015
    Assignee: QUALCOMM Technologies, Inc.
    Inventors: Laurent Moll, Jean-Jacques Lecler, Philippe Boucard
  • Patent number: 8930682
    Abstract: In one embodiment, the present invention is directed to a bit processor that includes an execution unit to, responsive to an instruction for access of data of a first bit width, access data of a second bit width, the second bit width having a different number of bits than the first bit width when some of the data accessed includes non-stream data. Other embodiments are described and claimed.
    Type: Grant
    Filed: December 16, 2010
    Date of Patent: January 6, 2015
    Assignee: Intel Corporation
    Inventors: Raul Gutierrez, Suryaprasad Kareenahalli, Daniel Nemiroff, Balaji Vembu
  • Patent number: 8930664
    Abstract: Data is written from a first domain to a FIFO memory buffer in a second domain. The first domain uses a first clock signal, the second domain uses a second clock signal and the memory buffer uses the first clock signal that is delivered alongside the data. The data is read from the memory buffer using the second clock signal. A read pointer is adjusted and synchronised with the delivered first clock signal. A token is generated using the delivered first clock signal, based on the read pointer. The token represents a capacity of the memory buffer having been made available. The token is passed to the first domain and synchronised with the first clock signal. The writing of data to the memory buffer is controlled based on a comparison between the synchronised token and a previously received token.
    Type: Grant
    Filed: February 11, 2013
    Date of Patent: January 6, 2015
    Assignee: Broadcom Corporation
    Inventors: Ari Tapani Kulmala, Jaakko Illmari Sertamo
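    A software sketch of the token bookkeeping this abstract describes. The clock-domain crossing itself cannot be expressed in C++, so the sketch only models the credit accounting: the reader returns a cumulative token of freed buffer slots, and the writer compares it against the previously received token before allowing another write.

```cpp
// Token-based write control for a bounded FIFO shared between two domains.
#include <cstdint>

struct TokenFlowControl {
    const uint32_t capacity;
    uint32_t written = 0;        // writes issued by the first domain
    uint32_t last_token = 0;     // last token received from the second domain

    explicit TokenFlowControl(uint32_t cap) : capacity(cap) {}

    // Token arriving from the read side: cumulative count of freed slots.
    void receive_token(uint32_t token) {
        if (token > last_token) last_token = token;  // ignore stale tokens
    }

    // Writing is allowed only while outstanding entries fit in the buffer.
    bool can_write() const { return written - last_token < capacity; }

    bool try_write() {
        if (!can_write()) return false;
        ++written;
        return true;
    }
};

int main() {
    TokenFlowControl fc(2);
    fc.try_write();
    fc.try_write();
    bool blocked = !fc.try_write();   // buffer full, write must wait
    fc.receive_token(1);              // reader freed one slot
    return (blocked && fc.try_write()) ? 0 : 1;
}
```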
  • Publication number: 20150006861
    Abstract: A method is described that includes deciding to migrate a thread from a first processing core to a second processing core. The method also includes automatically in hardware migrating first context of the thread of the first processing core, whose register definition is also found on the second processing core, to the second processing core. The method also includes automatically in hardware migrating second context of the thread of the first processing core, whose register definition is not found on the second processing core, to a first storage location external to the second processing core. The method also includes automatically in hardware migrating third context of the thread from a second storage location external to the second processing core to a register definition found on the second processing core but not found on the first processing core.
    Type: Application
    Filed: June 29, 2013
    Publication date: January 1, 2015
    Inventors: Bret TOLL, Jason W. BRANDT, John HOLM
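    A C++ sketch of the three-way context split described above, assuming register state can be represented as name/value maps: registers defined on both cores migrate core to core, registers defined only on the source core spill to external storage, and registers defined only on the destination core are filled from external storage. The container types are illustrative; the patent performs the migration in hardware.

```cpp
// Partition a migrating thread's register context into three groups.
#include <cstdint>
#include <map>
#include <set>
#include <string>

using RegFile = std::map<std::string, uint64_t>;

struct MigrationPlan {
    RegFile move_to_dst_core;    // "first context": defined on both cores
    RegFile spill_to_external;   // "second context": only on the source core
    std::set<std::string> fill_from_external;  // "third context": only on dst
};

MigrationPlan plan_migration(const RegFile& src_state,
                             const std::set<std::string>& dst_regs) {
    MigrationPlan p;
    for (const auto& [reg, value] : src_state) {
        if (dst_regs.count(reg)) p.move_to_dst_core[reg] = value;
        else                     p.spill_to_external[reg] = value;
    }
    for (const auto& reg : dst_regs) {
        if (!src_state.count(reg)) p.fill_from_external.insert(reg);
    }
    return p;
}

int main() {
    RegFile src = {{"r0", 1}, {"acc", 2}};
    auto p = plan_migration(src, {"r0", "vec0"});
    return (p.move_to_dst_core.count("r0") && p.spill_to_external.count("acc")
            && p.fill_from_external.count("vec0")) ? 0 : 1;
}
```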
  • Publication number: 20140380026
    Abstract: A control device coupled between a first memory and a second memory and including an execution unit, a first storage unit, a second storage unit, a selection unit and a processing unit is disclosed. The execution unit executes a specific instruction set to access the first and the second memories. The first storage unit is configured to store a first instruction set. The second storage unit is configured to store a second instruction set. The selection unit outputs one of the first and the second instruction sets to serve as the specific instruction set according to a control signal. The processing unit generates the control signal according to an execution state of the execution unit.
    Type: Application
    Filed: March 3, 2014
    Publication date: December 25, 2014
    Applicant: Silicon Motion, Inc.
    Inventors: Yu-Wei CHYAN, Jiyun-Wei LIN
  • Publication number: 20140380022
    Abstract: A processor employs a prediction table at a front end of its instruction pipeline, whereby the prediction table stores address register and offset information for store instructions; and stack offset information for stack access instructions. The stack offset information for a corresponding instruction indicates the location of the data accessed by the instruction at the processor stack relative to a base location. The processor uses pattern matching to identify predicted dependencies between load/store instructions and predicted dependencies between stack access instructions. A scheduler unit of the instruction pipeline uses the predicted dependencies to perform store-to-load forwarding or other operations that increase efficiency and reduce power consumption at the processing system.
    Type: Application
    Filed: June 20, 2013
    Publication date: December 25, 2014
    Inventors: Kai Troester, Luke Yen
  • Patent number: 8918625
    Abstract: A processor that executes instructions out of program order is described. In some implementations, a processor detects whether a second memory operation is dependent on a first memory operation prior to memory address calculation. If the processor detects that the second memory operation is not dependent on the first memory operation, the processor is configured to allow the second memory operation to be scheduled. If the processor detects that the second memory operation is dependent on the first memory operation, the processor is configured to prevent the second memory operation from being scheduled until the first memory operation has been scheduled to reduce the likelihood of having to reexecute the second memory operation.
    Type: Grant
    Filed: November 15, 2011
    Date of Patent: December 23, 2014
    Assignee: Marvell International Ltd.
    Inventors: R. Frank O'Bleness, Sujat Jamil, Tom Hameenanttila
  • Patent number: 8918627
    Abstract: A multithreaded processor comprises a plurality of hardware thread units, an instruction decoder coupled to the thread units for decoding instructions received therefrom, and a plurality of execution units for executing the decoded instructions. The multithreaded processor is configured for controlling an instruction issuance sequence for threads associated with respective ones of the hardware thread units. On a given processor clock cycle, only a designated one of the threads is permitted to issue one or more instructions, but the designated thread that is permitted to issue instructions varies over a plurality of clock cycles in accordance with the instruction issuance sequence. The instructions are pipelined in a manner which permits at least a given one of the threads to support multiple concurrent instruction pipelines.
    Type: Grant
    Filed: October 15, 2009
    Date of Patent: December 23, 2014
    Assignee: QUALCOMM Incorporated
    Inventors: Erdem Hokenek, Mayan Moudgill, Michael J. Schulte, C. John Glossner
  • Patent number: 8914620
    Abstract: Hardware-based transactional memory mechanisms, such as Speculative Lock Elision (SLE), may allow multiple threads to concurrently execute critical sections protected by the same lock as speculative transactions. Such transactions may abort due to contention or due to misidentification of code as a critical section. In various embodiments, speculative execution mechanisms may be augmented with software and/or hardware contention management mechanisms to reduce abort rates. Speculative execution hardware may send a hardware interrupt signal to notify software components of a speculative execution event (e.g., abort). Software components may respond by implementing concurrency-throttling mechanisms and/or by determining a mode of execution (e.g., speculative, non-speculative) for a given section and communicating that determination to the hardware speculative execution mechanisms, e.g., by writing it into a lock predictor cache.
    Type: Grant
    Filed: December 29, 2008
    Date of Patent: December 16, 2014
    Assignee: Oracle America, Inc.
    Inventor: David Dice
  • Patent number: 8904154
    Abstract: An execution migration approach includes bringing the computation to the locus of the data: when a memory instruction requests an address not cached by the current core, the execution context (current program counter, register values, etc.) moves to the core where the data is cached.
    Type: Grant
    Filed: April 15, 2011
    Date of Patent: December 2, 2014
    Assignee: Massachusetts Institute of Technology
    Inventors: Srinivas Devadas, Omer Khan, Mieszko Lis, Keun Sup Shim, Myong Hyon Cho
  • Patent number: 8904153
    Abstract: Mechanisms for performing a scattered load operation are provided. With these mechanisms, an extended address is received in a cache memory of a processor. The extended address has a plurality of data element address portions that specify a plurality of data elements to be accessed using the single extended address. Each of the plurality of data element address portions is provided to corresponding data element selector logic units of the cache memory. Each data element selector logic unit in the cache memory selects a corresponding data element from a cache line buffer based on a corresponding data element address portion provided to the data element selector logic unit. Each data element selector logic unit outputs the corresponding data element for use by the processor.
    Type: Grant
    Filed: September 7, 2010
    Date of Patent: December 2, 2014
    Assignee: International Business Machines Corporation
    Inventors: Alexandre E. Eichenberger, Michael K. Gschwind, Valentina Salapura
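    A hedged C++ sketch of the scattered load described above, packing eight 8-bit byte offsets into one extended address and letting each "selector" pick one byte out of a cache line buffer; the field widths and element size are illustrative choices, not the patent's encoding.

```cpp
// Scattered load: one extended address selects several elements from a line.
#include <array>
#include <cstddef>
#include <cstdint>

constexpr std::size_t kLineBytes = 128;
using CacheLineBuffer = std::array<uint8_t, kLineBytes>;

// Each 8-bit field of `extended_addr` is one data element address portion;
// selector i returns the byte it names within the cache line buffer.
std::array<uint8_t, 8> scattered_load(const CacheLineBuffer& line,
                                      uint64_t extended_addr) {
    std::array<uint8_t, 8> out{};
    for (int i = 0; i < 8; ++i) {
        uint8_t offset = static_cast<uint8_t>(extended_addr >> (i * 8));
        out[i] = line[offset % kLineBytes];   // element selector logic unit i
    }
    return out;
}

int main() {
    CacheLineBuffer line{};
    for (std::size_t i = 0; i < kLineBytes; ++i) line[i] = uint8_t(i);
    // Portions 0..7 name offsets 0x00, 0x10, ..., 0x70 within the line.
    auto elems = scattered_load(line, 0x7060504030201000ULL);
    return (elems[0] == 0x00 && elems[7] == 0x70) ? 0 : 1;
}
```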
  • Publication number: 20140351567
    Abstract: A method of an aspect includes receiving a unique packed data element identification instruction. The unique packed data element identification instruction indicates a source packed data having a plurality of packed data elements and indicates a destination storage location. A unique packed data element identification result is stored in the destination storage location in response to the unique packed data element identification instruction. The unique packed data element identification result indicates which of the plurality of the packed data elements are unique in the source packed data. Other methods, apparatus, systems, and instructions are disclosed.
    Type: Application
    Filed: December 30, 2011
    Publication date: November 27, 2014
    Inventors: Mikhail Plotnikov, Andrey Naraikin, Elmoustapha Ould-Ahmed-Vall, Sergey Shalnov
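    A C++ sketch of unique-element identification as this abstract describes it, reading "unique" as occurring exactly once in the source packed data and encoding the result as a bitmask; the instruction's precise definition of uniqueness and its result encoding are not given by the abstract.

```cpp
// Identify which elements of a packed source occur exactly once.
#include <array>
#include <cstddef>
#include <cstdint>

template <std::size_t N>
uint32_t unique_element_mask(const std::array<uint32_t, N>& src) {
    uint32_t mask = 0;
    for (std::size_t i = 0; i < N; ++i) {
        std::size_t occurrences = 0;
        for (std::size_t j = 0; j < N; ++j) {
            if (src[j] == src[i]) ++occurrences;
        }
        if (occurrences == 1) mask |= (1u << i);  // element i is unique
    }
    return mask;
}

int main() {
    std::array<uint32_t, 4> src = {5, 7, 5, 9};
    // Elements 1 (value 7) and 3 (value 9) occur exactly once.
    return unique_element_mask(src) == 0b1010 ? 0 : 1;
}
```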
  • Patent number: 8898439
    Abstract: A serial flash memory and an address transmission method thereof. The serial flash memory selectively addresses a first memory space according to a first address length or addresses a second memory space according to a second address length longer than the first address length. If the first memory space is addressed according to the first address length, a first memory address is completely received within an address time duration so that data corresponding to the first memory address is initially outputted from a starting clock. In the address transmission method, if the second memory space is addressed according to the second address length, a portion of a second memory address is received within the address time duration. The other portion of the second memory address is received within a waiting time duration so that data corresponding to the second memory address is initially outputted from the starting clock.
    Type: Grant
    Filed: July 16, 2010
    Date of Patent: November 25, 2014
    Assignee: Macronix International Co., Ltd.
    Inventors: Kuen-Long Chang, Yufe-Feng Lin, Chun-Hsiung Hung
  • Patent number: 8893153
    Abstract: A first set of one or more hardware threads for receiving messages sent from hardware threads is registered. After receiving indications of a message location value and a number, the message location value is incremented and sent to a different hardware thread of the first set of one or more hardware threads until the message location value has been incremented the number of times or a criterion for interrupting the incrementing and sending is satisfied. An actual number of times the message location value was incremented is indicated to a hardware thread that sent the indications of the message location value and the number.
    Type: Grant
    Filed: October 11, 2013
    Date of Patent: November 18, 2014
    Assignee: International Business Machines Corporation
    Inventors: Patrick J. Bohrer, Ahmed Gheith, James L. Peterson
  • Patent number: 8891757
    Abstract: A cryptographic integrated circuit including a programmable main processor for executing cryptographic functions, an internal memory, and a data transmission bus to which the main processor and the internal memory are electrically connected. The cryptographic integrated circuit also includes a programmable arithmetic coprocessor that has specific hardware arithmetic units each being designed to carry out a predetermined arithmetical operation. The programmable arithmetic coprocessor is separate from the main processor and is also electrically connected to the data transmission bus.
    Type: Grant
    Filed: February 17, 2012
    Date of Patent: November 18, 2014
    Assignee: Bull SAS
    Inventor: Patrick Le Quéré
  • Patent number: 8892848
    Abstract: A device, system and method for assigning values to elements in a first register, where each data field in a first register corresponds to a data element to be written into a second register, and where for each data field in the first register, a first value may indicate that the corresponding data element has not been written into the second register and a second value indicates that the corresponding data element has been written into the second register, reading the values of each of the data fields in the first register, and for each data field in the first register having the first value, gathering the corresponding data element and writing the corresponding data element into the second register, and changing the value of the data field in the first register from the first value to the second value. Other embodiments are described and claimed.
    Type: Grant
    Filed: July 5, 2011
    Date of Patent: November 18, 2014
    Assignee: Intel Corporation
    Inventors: Eric Sprangle, Anwar Rohillah, Robert Cavin, Tom Forsyth, Michael Abrash
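    A C++ sketch of the masked gather this abstract describes: a field register records which destination elements are still unwritten, each pass gathers only those elements, and the field flips as each element completes. The set-bit/cleared-bit convention chosen here is an assumption; the patent only requires two distinguishable values.

```cpp
// Masked gather with a per-element completion field register.
#include <array>
#include <cstdint>
#include <vector>

constexpr int kElems = 4;

// One gather pass; in hardware the instruction may complete only some
// elements (e.g. after a fault) and be re-executed with the updated mask.
void gather_step(uint8_t& field_reg,                         // first register
                 std::array<uint32_t, kElems>& dst,          // second register
                 const std::vector<uint32_t>& memory,
                 const std::array<std::size_t, kElems>& idx) {
    for (int i = 0; i < kElems; ++i) {
        if (field_reg & (1u << i)) {        // first value: not yet written
            dst[i] = memory[idx[i]];        // gather the corresponding element
            field_reg &= ~(1u << i);        // flip to the second value
        }
    }
}

int main() {
    std::vector<uint32_t> mem = {10, 20, 30, 40, 50};
    std::array<uint32_t, kElems> dst{};
    uint8_t fields = 0b1011;                // element 2 was already written
    gather_step(fields, dst, mem, {4, 3, 2, 1});
    return (fields == 0 && dst[0] == 50 && dst[2] == 0) ? 0 : 1;
}
```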
  • Publication number: 20140331032
    Abstract: According to one general aspect, an apparatus may include a load/store unit, an execution unit, and a first and a second data path. The load/store unit may be configured to load/store data from/to a memory and transmit the data to/from an execution unit, wherein the data includes a plurality of elements. The execution unit may be configured to perform an operation upon the data. The load/store unit may be configured to transmit the data to/from the execution unit via either a first data path configured to communicate, without transposition, the data between the load/store unit and the execution unit, or a second data path configured to communicate, with transposition, the data between the load/store unit and the execution unit, wherein transposition includes dynamically distributing portions of the data amongst a plurality of elements according to an instruction.
    Type: Application
    Filed: September 3, 2013
    Publication date: November 6, 2014
    Inventors: Ashraf Ahmed, Nicholas Todd Humphries, Marc Michael Augustin
  • Patent number: 8880809
    Abstract: Embodiments are described for a method for controlling access to memory in a processor-based system comprising monitoring a number of interference events, such as bank contentions, bus contentions, row-buffer conflicts, and increased write-to-read turnaround time caused by a first core in the processor-based system that causes a delay in access to the memory by a second core in the processor-based system; deriving a control signal based on the number of interference events; and transmitting the control signal to one or more resources of the processor-based system to reduce the number of interference events from an original number of interference events.
    Type: Grant
    Filed: October 29, 2012
    Date of Patent: November 4, 2014
    Assignee: Advanced Micro Devices Inc.
    Inventors: Gabriel Loh, James O'Connor
  • Patent number: 8880850
    Abstract: One embodiment of the present invention includes a heterogeneous, high-performance, scalable processor having at least one W-type sub-processor capable of processing W bits in parallel, W being an integer value, and at least one N-type sub-processor capable of processing N bits in parallel, N being an integer value smaller than W by a factor of two. The processor further includes a shared bus coupling the at least one W-type sub-processor and the at least one N-type sub-processor, and shared memory coupled to the at least one W-type sub-processor and the at least one N-type sub-processor, wherein the W-type sub-processor rearranges memory to accommodate execution of applications allowing for fast operations.
    Type: Grant
    Filed: February 25, 2013
    Date of Patent: November 4, 2014
    Assignee: Icelero Inc
    Inventors: Amit Ramchandran, John Reid Hauser
  • Patent number: 8860558
    Abstract: A remote control device includes: a vibration power generator configured to convert externally applied vibrations to electric power; a storage section charged with the electric power obtained by the vibration power generator; a switch provided between the vibration power generator and the storage section; and a control circuit configured to output a vibration instruction signal and turn the switch on when the remaining power of the storage section becomes smaller than a predetermined amount. The electronic apparatus instructs the user to vibrate the remote control device in response to the vibration instruction signal.
    Type: Grant
    Filed: March 7, 2012
    Date of Patent: October 14, 2014
    Assignee: Panasonic Corporation
    Inventor: Tsuyoshi Katayama
  • Patent number: 8850167
    Abstract: Provided is a processor including an instruction issue unit that issues a vector load instruction read from a main memory based on branch target prediction of a branch target in a branch instruction, a data acquisition unit that starts issue of a plurality of acquisition requests for acquiring a plurality of vector data based on the issued vector load instruction from the main memory, a determination unit that determines a success or a failure of the branch target prediction after the branch target is determined, and a vector load management unit that, when the branch target prediction is determined to be a success, acquires all vector data based on the plurality of acquisition requests and then transfers all the vector data to a vector register, and, when the branch target prediction is determined to be a failure, discards the vector data acquired by the issued acquisition requests.
    Type: Grant
    Filed: September 22, 2011
    Date of Patent: September 30, 2014
    Assignee: NEC Corporation
    Inventor: Masao Fukagawa
  • Patent number: 8850166
    Abstract: A Load/Store Disjoint instruction, when executed by a CPU, accesses operands from two disjoint memory locations and sets condition code indicators to indicate whether or not the two operands appeared to be accessed atomically by means of block-concurrent interlocked fetch with no intervening stores to the operands from other CPUs. In a Load Pair Disjoint form of the instruction, the accesses are loads and the disjoint data is stored in general registers.
    Type: Grant
    Filed: February 18, 2010
    Date of Patent: September 30, 2014
    Assignee: International Business Machines Corporation
    Inventors: Christian Jacobi, Marcel Mitran, Timothy J. Slegel, Charles F. Webb
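    A software analogue of the Load Pair Disjoint behaviour described above. Hardware achieves the atomicity indication with block-concurrent interlocked fetches; a C++ sketch can only approximate it, here by re-reading the first operand and reporting whether an intervening store changed it. The re-read heuristic is an assumption made for illustration, not the patented mechanism.

```cpp
// Load two disjoint locations and indicate whether the pair appeared atomic.
#include <atomic>
#include <cstdint>

struct PairResult {
    uint64_t r1 = 0;
    uint64_t r2 = 0;
    bool appeared_atomic = false;   // condition-code-style indicator
};

PairResult load_pair_disjoint(const std::atomic<uint64_t>& a,
                              const std::atomic<uint64_t>& b) {
    PairResult res;
    uint64_t first = a.load(std::memory_order_acquire);
    res.r2 = b.load(std::memory_order_acquire);
    // If the first operand re-reads unchanged, treat the pair as having been
    // observed with no intervening store (an approximation only).
    res.appeared_atomic = (a.load(std::memory_order_acquire) == first);
    res.r1 = first;
    return res;
}

int main() {
    std::atomic<uint64_t> x{1}, y{2};
    PairResult r = load_pair_disjoint(x, y);
    return (r.r1 == 1 && r.r2 == 2 && r.appeared_atomic) ? 0 : 1;
}
```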
  • Patent number: 8843731
    Abstract: A memory device includes a serial interface buffer that receives a hardware-decodable command and an extended interface command. The memory device also includes a logic module that directs the hardware-decodable command to a register for execution by a microcontroller. The logic module additionally loads a command received following the extended interface command into a sub-op-code register, wherein the logic module remains passive after loading the command received following the extended interface command into the sub-op-code register. Also included is a microcontroller that interprets the command in the sub-op-code register.
    Type: Grant
    Filed: December 30, 2010
    Date of Patent: September 23, 2014
    Assignee: Micron Technology, Inc.
    Inventors: Christopher Bueb, Poorna Kale, Todd Legler
  • Publication number: 20140281426
    Abstract: A method for populating a source view data structure by using register template snapshots. The method includes receiving an incoming instruction sequence using a global front end; grouping the instructions to form instruction blocks; using a plurality of register templates to track instruction destinations and instruction sources by populating the register template with block numbers corresponding to the instruction blocks, wherein the block numbers corresponding to the instruction blocks indicate interdependencies among the blocks of instructions; populating a source view data structure, wherein the source view data structure stores sources corresponding to the instruction blocks as recorded by the plurality of register templates; and determining which of the plurality of instruction blocks are ready for dispatch by using the populated source view data structure.
    Type: Application
    Filed: March 14, 2014
    Publication date: September 18, 2014
    Applicant: Soft Machines, Inc.
    Inventor: Mohammad ABDALLAH
  • Publication number: 20140281423
    Abstract: A processor has a processing pipeline with first, second and third stages. An instruction at the first stage takes fewer cycles to reach the second stage than the third stage. The second and third stages each have a duplicated processing resource. For a pending instruction which requires the duplicated resource and can be processed using the duplicated resource at either of the second and third stages, the first stage determines whether a required operand would be available when the pending instruction would reach the second stage. If the operand would be available, then the pending instruction is processed using the duplicated resource at the second stage, while if the operand would not be available in time then the instruction is processed using the duplicated resource in the third pipeline stage. This technique helps to reduce delays caused by data dependency hazards.
    Type: Application
    Filed: March 14, 2013
    Publication date: September 18, 2014
    Inventors: Ian Michael CAULFIELD, Peter Richard GREENHALGH, Max John BATLEY