Patents Assigned to Soft Machines, Inc.

SINGLE CYCLE MULTI-BRANCH PREDICTION INCLUDING SHADOW CACHE FOR EARLY FAR BRANCH PREDICTION

Publication number: 20170262287

Abstract: A method of identifying instructions including accessing a plurality of instructions that comprise multiple branch instructions. For each branch instruction of the multiple branch instructions, a respective first mask is generated representing instructions that are executed if a branch is taken. A respective second mask is generated representing instructions that are executed if the branch is not taken. A prediction output is received that comprises a respective branch prediction for each branch instruction. For each branch instruction, the prediction output is used to select a respective resultant mask from among the respective first and second masks. For each branch instruction, a resultant mask of a subsequent branch is invalidated if a previous branch is predicted to branch over said subsequent branch. A logical operation is performed on all resultant masks to produce a final mask. The final mask is used to select a subset of instructions for execution.
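
Illustrative sketch (not taken from the application): the mask selection described in this abstract can be modeled with integer bitmasks. The sketch assumes forward branches only, one bit per instruction slot (bit i stands for instruction i), and a bitwise AND as the combining logical operation. The names `final_mask` and `selected` are hypothetical.

```python
def final_mask(n, branches, predictions):
    """Combine per-branch masks into one mask over n instruction slots.

    branches: list of (position, target) pairs sorted by position,
              with target > position (forward branches, an assumption).
    predictions: list of bools, True meaning "predicted taken".
    """
    full = (1 << n) - 1
    result = full
    skip_until = 0  # slots below this index were branched over
    for (pos, target), taken in zip(branches, predictions):
        if pos < skip_until:
            # An earlier predicted-taken branch jumps over this branch,
            # so its resultant mask is invalidated (ignored).
            continue
        # First mask: instructions executed if the branch is taken,
        # i.e. slots 0..pos plus slots target..n-1.
        low = (1 << (pos + 1)) - 1
        high = full & ~((1 << min(target, n)) - 1)
        taken_mask = low | high
        # Second mask: instructions executed if the branch is not taken.
        not_taken_mask = full
        resultant = taken_mask if taken else not_taken_mask
        result &= resultant
        if taken:
            skip_until = max(skip_until, target)
    return result


def selected(mask, n):
    """Return the indices of the instruction slots enabled in mask."""
    return [i for i in range(n) if (mask >> i) & 1]
```

For example, with eight slots, branches at slots 1, 2 and 5 targeting slots 4, 6 and 7, and predictions taken, taken, not taken, the second branch is skipped because the first one jumps over it, and `selected` returns slots 0, 1, 4, 5, 6 and 7.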

Type: Application

Filed: September 16, 2011

Publication date: September 14, 2017

Applicant: SOFT MACHINES, INC.

Inventor: Mohammad Abdallah
Method for populating a source view data structure by using register template snapshots

Patent number: 9569216

Abstract: A method for populating a source view data structure by using register template snapshots. The method includes receiving an incoming instruction sequence using a global front end; grouping the instructions to form instruction blocks; using a plurality of register templates to track instruction destinations and instruction sources by populating the register template with block numbers corresponding to the instruction blocks, wherein the block numbers corresponding to the instruction blocks indicate interdependencies among the blocks of instructions; populating a source view data structure, wherein the source view data structure stores sources corresponding to the instruction blocks as recorded by the plurality of register templates; and determining which of the plurality of instruction blocks are ready for dispatch by using the populated source view data structure.

Type: Grant

Filed: March 14, 2014

Date of Patent: February 14, 2017

Assignee: SOFT MACHINES, INC.

Inventor: Mohammad Abdallah
Guest instruction block with near branching and far branching sequence construction to native instruction block

Patent number: 9542187

Abstract: A method for translating instructions for a processor. The method includes accessing a plurality of guest instructions that comprise multiple guest branch instructions comprising at least one guest far branch, and building an instruction sequence from the plurality of guest instructions by using branch prediction on the at least one guest far branch. The method further includes assembling a guest instruction block from the instruction sequence. The guest instruction block is translated to a corresponding native conversion block, wherein at least one native far branch corresponds to the at least one guest far branch and includes an opposite guest address for an opposing branch path of the at least one guest far branch. Upon encountering a misprediction, a correct instruction sequence is obtained by accessing the opposite guest address.

Type: Grant

Filed: January 27, 2012

Date of Patent: January 10, 2017

Assignee: Soft Machines, Inc.

Inventor: Mohammad Abdallah
SYSTEMS AND METHODS FOR LOAD CANCELING IN A PROCESSOR THAT IS CONNECTED TO AN EXTERNAL INTERCONNECT FABRIC

Publication number: 20160357559

Abstract: Systems and methods for load canceling in a processor that is connected to an external interconnect fabric are disclosed. As a part of a method for load canceling in a processor that is connected to an external bus, and responsive to a flush request and a corresponding cancellation of pending speculative loads from a load queue, a type of one or more of the pending speculative loads that are positioned in the instruction pipeline external to the processor, is converted from load to prefetch. Data corresponding to one or more of the pending speculative loads that are positioned in the instruction pipeline external to the processor is accessed and returned to cache as prefetch data. The prefetch data is retired in a cache location of the processor.

Type: Application

Filed: August 23, 2016

Publication date: December 8, 2016

Applicant: SOFT MACHINES, INC.

Inventors: Karthikeyan Avudaiyappan, Mohammad Abdallah
Cache storing data fetched by address calculating load instruction with label used as associated name for consuming instruction to refer

Patent number: 9501280

Abstract: A unified architecture for dynamic generation, execution, synchronization and parallelization of complex instruction formats includes a virtual register file, register cache and register file hierarchy. A self-generating and synchronizing dynamic and static threading architecture provides efficient context switching.

Type: Grant

Filed: February 28, 2014

Date of Patent: November 22, 2016

Assignee: Soft Machines, Inc.

Inventor: Mohammad A. Abdallah
Systems and methods for supporting a plurality of load accesses of a cache in a single cycle

Patent number: 9430410

Abstract: A method for supporting a plurality of load accesses is disclosed. A plurality of requests to access a data cache is accessed, and in response, a tag memory is accessed that maintains a plurality of copies of tags for each entry in the data cache. Tags are identified that correspond to individual requests. The data cache is accessed based on the tags that correspond to the individual requests. A plurality of requests to access the same block of the plurality of blocks causes an access arbitration that is executed in the same clock cycle as is the access of the tag memory.

Type: Grant

Filed: July 30, 2012

Date of Patent: August 30, 2016

Assignee: SOFT MACHINES, INC.

Inventors: Karthikeyan Avudaiyappan, Mohammad Abdallah
Systems and methods for faster read after write forwarding using a virtual address

Patent number: 9361227

Abstract: Methods for read after write forwarding using a virtual address are disclosed. A method includes determining when a virtual address has been remapped from corresponding to a first physical address to a second physical address and determining if all stores occupying a store queue before the remapping have been retired from the store queue. Loads that are younger than the stores that occupied the store queue before the remapping are prevented from being dispatched and executed until the stores that occupied the store queue before the remapping have left the store queue and become globally visible.

Type: Grant

Filed: August 30, 2013

Date of Patent: June 7, 2016

Assignee: SOFT MACHINES, INC.

Inventors: Karthikeyan Avudaiyappan, Paul Chan
Memory fragments for supporting code block execution by using virtual cores instantiated by partitionable engines

Patent number: 9274793

Abstract: A system for executing instructions using a plurality of memory fragments for a processor. The system includes a global front end scheduler for receiving an incoming instruction sequence, wherein the global front end scheduler partitions the incoming instruction sequence into a plurality of code blocks of instructions and generates a plurality of inheritance vectors describing interdependencies between instructions of the code blocks. The system further includes a plurality of virtual cores of the processor coupled to receive code blocks allocated by the global front end scheduler, wherein each virtual core comprises a respective subset of resources of a plurality of partitionable engines, wherein the code blocks are executed by using the partitionable engines in accordance with a virtual core mode and in accordance with the respective inheritance vectors. A plurality of memory fragments are coupled to the partitionable engines for providing data storage.

Type: Grant

Filed: March 23, 2012

Date of Patent: March 1, 2016

Assignee: SOFT MACHINES, INC.

Inventor: Mohammad Abdallah
Systems and methods for supporting a plurality of load and store accesses of a cache

Patent number: 9229873

Abstract: Systems and methods for supporting a plurality of load and store accesses of a cache are disclosed. Responsive to a request of a plurality of requests to access a block of a plurality of blocks of a load cache, the block of the load cache and a logically and physically paired block of a store coalescing cache are accessed in parallel. The data that is accessed from the block of the load cache is overwritten by the data that is accessed from the block of the store coalescing cache by merging on a per byte basis. Access is provided to the merged data.
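
Illustrative sketch (not taken from the patent): the per-byte merge described in this abstract can be pictured as a byte-wise select between two equally sized blocks. The per-byte valid flags for the store coalescing cache block are an assumption of this sketch, and `merge_blocks` is a hypothetical name.

```python
def merge_blocks(load_block, store_block, store_valid):
    """Merge two paired blocks byte by byte.

    load_block:  bytes read from the load cache block.
    store_block: bytes read from the paired store coalescing cache block.
    store_valid: one bool per byte, True where the store block holds
                 newer data (an assumed bookkeeping structure).
    """
    if not (len(load_block) == len(store_block) == len(store_valid)):
        raise ValueError("blocks and valid flags must have the same length")
    # Store data overwrites load data wherever the store byte is valid.
    return bytes(
        s if valid else l
        for l, s, valid in zip(load_block, store_block, store_valid)
    )
```

Here the two blocks are read first and merged afterwards; the abstract describes the two accesses as happening in parallel, which a sequential software model cannot show.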

Type: Grant

Filed: July 30, 2012

Date of Patent: January 5, 2016

Assignee: Soft Machines, Inc.

Inventors: Karthikeyan Avudaiyappan, Mohammad Abdallah
Multilevel conversion table cache for translating guest instructions to native instructions

Patent number: 9207960

Abstract: A method for translating instructions for a processor. The method includes accessing a guest instruction and performing a first level translation of the guest instruction using a first level conversion table. The method further includes outputting a resulting native instruction when the first level translation proceeds to completion. A second level translation of the guest instruction is performed using a second level conversion table when the first level translation does not proceed to completion, wherein the second level translation further processes the guest instruction based upon a partial translation from the first level conversion table. The resulting native instruction is output when the second level translation proceeds to completion.
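
Illustrative sketch (not taken from the patent): the two-level lookup in this abstract can be modeled with two dictionaries. The opcode names, the table contents, and the function `translate` are all hypothetical; a first-level entry carries a flag saying whether the translation completed and either the native instruction or a partial result.

```python
# Hypothetical first-level table:
# guest opcode -> (translation complete?, native instruction or partial result)
FIRST_LEVEL = {
    "g_add": (True, "n_add"),
    "g_call": (False, "partial_call"),
}

# Hypothetical second-level table, keyed by the guest opcode together
# with the partial translation produced by the first level.
SECOND_LEVEL = {
    ("g_call", "partial_call"): "n_jump_and_link",
}


def translate(guest):
    """Return the native instruction for guest, or None if untranslated."""
    entry = FIRST_LEVEL.get(guest)
    if entry is None:
        return None
    complete, result = entry
    if complete:
        # First-level translation proceeded to completion.
        return result
    # Otherwise the second level continues from the partial translation.
    return SECOND_LEVEL.get((guest, result))
```

Keying the second table on the partial result keeps the second-level work dependent on what the first level already produced, which is the relationship the abstract describes.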

Type: Grant

Filed: January 27, 2012

Date of Patent: December 8, 2015

Assignee: Soft Machines, Inc.

Inventor: Mohammad Abdallah
CACHE REPLACEMENT POLICY

Publication number: 20150286576

Abstract: Cache replacement policy. In accordance with a first embodiment of the present invention, an apparatus comprises a queue memory structure configured to queue cache requests that miss a second cache after missing a first cache. The apparatus comprises additional memory, associated with the queue memory structure, that is configured to record an evict way of the cache requests for the cache. The apparatus may be further configured to lock the evict way recorded in the additional memory, for example, to prevent reuse of the evict way. The apparatus may be further configured to unlock the evict way responsive to a fill from the second cache to the cache. The additional memory may be a component of a higher level cache.

Type: Application

Filed: December 16, 2011

Publication date: October 8, 2015

Applicant: Soft Machines, Inc.

Inventors: Karthikeyan Avudaiyappan, Mohammad Abdallah
Apparatus and Method for Processing an Instruction Matrix Specifying Parallel and Dependent Operations

Publication number: 20150269118

Abstract: A matrix of execution blocks forms a set of rows and columns. The rows support parallel execution of instructions and the columns support execution of dependent instructions. The matrix of execution blocks processes a single block of instructions specifying parallel and dependent instructions.

Type: Application

Filed: June 8, 2015

Publication date: September 24, 2015

Applicant: SOFT MACHINES, INC.

Inventor: Mohammad A. Abdallah
Processor executing super instruction matrix with register file configurable for single or multiple threads operations

Patent number: 9053292

Abstract: A processor has a register file configurable for different execution modes. In one mode the multiple register segments form a single register file where each register segment stores a Multiple Instructions Multiple Data (MIMD) super instruction matrix issuing four simultaneous instruction matrices where each individual instruction within each of the four simultaneous instruction matrices is a scalar or Single Instruction Multiple Data (SIMD). Another execution mode has the multiple register segments forming individual independent register tiles with individual register state to support simultaneous processing of separate threads, where each instruction matrix is associated with a separate thread and a separate register file segment.

Type: Grant

Filed: November 30, 2012

Date of Patent: June 9, 2015

Assignee: Soft Machines, Inc.

Inventor: Mohammad A. Abdallah
SYSTEMS AND METHODS FOR FASTER READ AFTER WRITE FORWARDING USING A VIRTUAL ADDRESS

Publication number: 20150067230

Abstract: Methods for read after write forwarding using a virtual address are disclosed. A method includes determining when a virtual address has been remapped from corresponding to a first physical address to a second physical address and determining if all stores occupying a store queue before the remapping have been retired from the store queue. Loads that are younger than the stores that occupied the store queue before the remapping are prevented from being dispatched and executed until the stores that occupied the store queue before the remapping have left the store queue and become globally visible.

Type: Application

Filed: August 30, 2013

Publication date: March 5, 2015

Applicant: Soft Machines, Inc.

Inventors: Karthikeyan Avudaiyappan, Paul Chan
SYSTEMS AND METHODS FOR INVASIVE DEBUG OF A PROCESSOR WITHOUT PROCESSOR EXECUTION OF INSTRUCTIONS

Publication number: 20150052401

Abstract: Methods for invasive debug of a processor without processor execution of instructions are disclosed. As a part of a method, a memory mapped I/O of the processor is accessed using a debug bus and an operation is initiated that causes a debug port to gain access to registers of the processor using the memory mapped I/O. The invasive debug of the processor is executed from the debug port via registers of the processor.

Type: Application

Filed: August 19, 2013

Publication date: February 19, 2015

Applicant: Soft Machines, Inc.

Inventors: Karthikeyan AVUDAIYAPPAN, Brian MCGEE
SYSTEMS AND METHODS FOR ACQUIRING DATA FOR LOADS AT DIFFERENT ACCESS TIMES FROM HIERARCHICAL SOURCES USING A LOAD QUEUE AS A TEMPORARY STORAGE BUFFER AND COMPLETING THE LOAD EARLY

Publication number: 20150052303

Abstract: A method for acquiring cache line data associated with a load from respective hierarchical cache data storage components. As a part of the method, a store queue is accessed for one or more portions of a cache line associated with a load, and, if the one or more portions of the cache line is held in the store queue, the one or more portions of the cache line is stored in a load queue location associated with the load. The load is completed if the one or more portions of the cache line stored in the load queue location includes all portions of the cache line associated with the load.
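
Illustrative sketch (not taken from the application): the load queue entry can be pictured as a small buffer that collects cache line portions until the load has everything it needs. The abstract only describes the store queue lookup; searching further sources in order (suggested by the title's "hierarchical sources") is an assumption of this sketch, as are the name `gather_for_load` and the use of dictionaries keyed by portion index.

```python
def gather_for_load(needed, sources):
    """Collect cache line portions for one load.

    needed:  set of portion indices the load requires.
    sources: list of dicts (portion index -> data), nearest source first,
             for example the store queue followed by cache levels.
    Returns (load_queue_entry, completed).
    """
    entry = {}  # models the load queue location associated with the load
    for source in sources:
        for portion in needed:
            # Keep the copy from the nearest source that holds the portion.
            if portion not in entry and portion in source:
                entry[portion] = source[portion]
        if needed.issubset(entry):
            break  # every portion is present, so later sources are skipped
    return entry, needed.issubset(entry)
```

With a store queue holding only portion 0 and a further source holding portions 0 and 1, a load that needs both completes with portion 0 taken from the store queue and portion 1 from the further source.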

Type: Application

Filed: August 19, 2013

Publication date: February 19, 2015

Applicant: Soft Machines, Inc.

Inventors: Karthikeyan Avudaiyappan, Paul G. Chan
SYSTEMS AND METHODS FOR READ REQUEST BYPASSING A LAST LEVEL CACHE THAT INTERFACES WITH AN EXTERNAL FABRIC

Publication number: 20150052304

Abstract: Methods for read request bypassing a last level cache which interfaces with an external fabric are disclosed. A method includes identifying a read request for a read transaction, generating a phantom read transaction identifier for the read transaction and forwarding the read transaction with the phantom read transaction identifier beyond a last level cache before detection of a hit or miss with respect to the read transaction. The phantom read transaction identifier acts as a pointer to a real read transaction identifier.

Type: Application

Filed: August 19, 2013

Publication date: February 19, 2015

Applicant: Soft Machines, Inc.

Inventor: Karthikeyan Avudaiyappan
METHOD FOR EXECUTING BLOCKS OF INSTRUCTIONS USING A MICROPROCESSOR ARCHITECTURE HAVING A REGISTER VIEW, SOURCE VIEW, INSTRUCTION VIEW, AND A PLURALITY OF REGISTER TEMPLATES

Publication number: 20150046686

Abstract: A method for executing blocks of instructions using a microprocessor architecture having a register view, source view, instruction view, and a plurality of register templates.

Type: Application

Filed: March 14, 2014

Publication date: February 12, 2015

Applicant: Soft Machines, Inc.

Inventor: Mohammad A. Abdallah
METHOD FOR USING REGISTER TEMPLATES TO TRACK INTERDEPENDENCIES AMONG BLOCKS OF INSTRUCTIONS

Publication number: 20150046683

Abstract: A method for executing instructions using register templates to track interdependencies among blocks of instructions. The method includes receiving an incoming instruction sequence using a global front end; grouping the instructions to form instruction blocks; and using a register template to track instruction destinations and instruction sources by populating the register template with block numbers corresponding to the instruction blocks, wherein the block numbers corresponding to the instruction blocks indicate interdependencies among the blocks of instructions.
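
Illustrative sketch (not taken from the application): a register template can be modeled as a list with one slot per register, where each slot holds the number of the block that most recently wrote that register. Reading the slots for a block's sources then yields the blocks it depends on. The block representation as (sources, destinations) pairs and the name `track_dependencies` are assumptions.

```python
def track_dependencies(blocks, num_regs):
    """Populate a register template and derive block interdependencies.

    blocks:   list of (source_registers, destination_registers) pairs,
              in program order; the list index is the block number.
    num_regs: number of architectural registers in the template.
    Returns (template, deps), where deps maps each block number to the
    sorted list of block numbers it depends on.
    """
    template = [None] * num_regs  # register -> last writing block number
    deps = {}
    for block_no, (sources, dests) in enumerate(blocks):
        # Sources read the template to find their producer blocks.
        producers = {template[r] for r in sources if template[r] is not None}
        deps[block_no] = sorted(producers)
        # Destinations record this block number in the template.
        for r in dests:
            template[r] = block_no
    return template, deps
```

For three blocks where block 0 writes registers 1 and 2, block 1 reads register 1 and writes register 3, and block 2 reads registers 2 and 3, block 1 depends on block 0 and block 2 depends on blocks 0 and 1.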

Type: Application

Filed: March 14, 2014

Publication date: February 12, 2015

Applicant: Soft Machines, Inc.

Inventor: Mohammad A. Abdallah
MULTIPORT MEMORY CELL HAVING IMPROVED DENSITY AREA

Publication number: 20150023086

Abstract: A multiport memory cell having improved density area is disclosed. The memory cell includes a data storing component, a first memory access component coupled to a first side of the data storing component, a second memory access component coupled to a second side of the data storing component, first and second bit lines coupled to the first memory access component, first and second bit lines coupled to the second memory access component, first and second write lines coupled to the first memory access component and first and second write lines coupled to the second memory access component. The multiport memory cell also includes a read/write assist transistor, coupled to load transistors of the data storing component, that during read operations is activated for the duration of the read operation and during write operations is activated to impress the desired voltage level before or after one or more memory access components activated as a part of the write operation are deactivated.

Type: Application

Filed: March 17, 2014

Publication date: January 22, 2015

Applicant: SOFT MACHINES, INC.

Inventor: Dennis Wendell
