Prefetching Patents (Class 712/207)

Intelligent storage of messages in page sets

Patent number: 11941296

Abstract: Intelligent storage of messages is provided. A spill-over page set is selected to store received messages corresponding to a predefined target page set associated with an application workload in response to the predefined target page set reaching a predefined minimum unused page threshold level. The spill-over page set is utilized as a message storage destination for the received messages corresponding to the predefined target page set associated with the application workload to extend message storage for the predefined target page set after the predefined target page set reached the predefined minimum unused page threshold level.

Type: Grant

Filed: January 3, 2022

Date of Patent: March 26, 2024

Assignee: International Business Machines Corporation

Inventors: Juan Zhang, Anthony John Sharkey
Methods and systems for nested stream prefetching for general purpose central processing units

Patent number: 11740906

Abstract: A method and hardware system to remove the overhead caused by having stream handling instructions in nested loops. Where code contains inner loops, nested in outer loops, a compiler pass identifies qualified nested streams and generates ISA specific instructions for transferring stream information linking an inner loop stream with an outer loop stream, to hardware components of a co-designed prefetcher. The hardware components include a frontend able to decode and execute instructions for a stream linking information transfer mechanism, a stream engine unit with a streams configuration table (SCT) having a field for allowing a subordinate stream to stay pending for values from its master stream, and a stream prefetch manager with buffers for storing values of current elements of a master stream, and with a nested streams control unit for reconfiguring and iterating the streams.

Type: Grant

Filed: February 22, 2022

Date of Patent: August 29, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Maziar Goudarzi, Zhizhao Qian, Reza Azimi, Billy Mengxuan Cai, Man Pok Ho
Dynamically generating an optimized processing pipeline for tasks

Patent number: 11500673

Abstract: A method for dynamically generating an optimized processing pipeline for tasks is provided. The method identifies one or more tasks to be executed from defined tasks that are defined declaratively as a number of stages of input data, data transformations, and output data. The method processes the identified tasks to determine dependencies between the tasks based on their defined stages and creates one or more optimized data processing pipelines by performing a dependency resolution procedure on stages of all tasks in parallel using the task dependencies to determine the order of the stages and removing duplication of stages between tasks.

Type: Grant

Filed: September 2, 2020

Date of Patent: November 15, 2022

Assignee: International Business Machines Corporation

Inventors: Luke Taher, Diogo Alexandre Ferreira Ramos, Vinh Tuan Thai
Circuitry and method for selectively controlling prefetching of program instructions

Patent number: 11461101

Abstract: Circuitry comprises processing circuitry to process program instructions, in which the program instructions are stored by storage circuitry and the processing circuitry comprises execute-in-place processing circuitry configured to execute the program instructions from the storage circuitry; a prefetch buffer to store program instructions for execution by the processing circuitry; prefetch circuitry to control prefetching of program instructions to the prefetch buffer and to select a next instruction for prefetching; and replacement circuitry to control deletion of instructions from the prefetch buffer to provide storage for newly prefetched program instructions; in which one or both of the prefetch circuitry and the replacement circuitry are configured so that one or both of a selection operation by the prefetch circuitry of a next program instruction for prefetching, and a deletion operation to delete an instruction from the prefetch buffer by the replacement circuitry is dependent upon previously executed p

Type: Grant

Filed: September 27, 2019

Date of Patent: October 4, 2022

Assignee: ARM TECHNOLOGY (CHINA) CO. LTD

Inventors: Chi-Ming Li, Chi-Chang Lai
Apparatus and method for managing a capability domain

Patent number: 11347508

Abstract: An apparatus and method are provided for managing a capability domain. The apparatus has processing circuitry for executing instructions, the processing circuitry when in a default state being arranged to operate in a capability domain comprising capabilities used to constrain operations performed by the processing circuitry when executing the instructions. A program counter capability storage element is also provided to store a program counter capability used by the processing circuitry to determine a program counter value. The program counter capability is arranged to identify a capability state for the processing circuitry. The processing circuitry is then arranged, when the capability state indicates the default state, to operate in the capability domain. However, when the capability state indicates the executive state, the processing circuitry is arranged to operate in a manner less constrained than when in the default state so as to allow modification of the capability domain.

Type: Grant

Filed: April 27, 2018

Date of Patent: May 31, 2022

Assignee: Arm Limited

Inventor: Graeme Peter Barnes
Registers for restricted memory

Patent number: 11294599

Abstract: Provided are integrated circuits and methods for operating integrated circuits. An integrated circuit can include a plurality of memory banks and an execution engine including a set of execution components. Each execution component can be associated with a respective memory bank and can read from and write to the respective memory bank. The integrated circuit can further include a set of registers each associated with a respective memory bank from the plurality of memory banks. The integrated circuit can further be operable to load to or store from the set of registers in parallel, and load to or store from the set of registers serially. A parallel operation followed by a serial operation enables data to be moved from many memory banks into one memory bank. A serial operation followed by a parallel operation enables data to be moved from one memory bank into many memory banks.

Type: Grant

Filed: June 3, 2020

Date of Patent: April 5, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Ron Diamant, Randy Renfu Huang, Sundeep Amirineni, Jeffrey T. Huynh
System and method for prefetching instructions and data

Patent number: 11288071

Abstract: Systems and methods for prefetching instructions and data run a program in a test environment, execute a tracing application on the program to collect processor trace data, use the processor trace data to extract a plurality of instruction addresses and a single continuous stack range to be prefetched during a run-time of the program, at initialization of the program, load the extracted plurality of instruction addresses and the extracted stack range, relocate the plurality of extracted instruction addresses and the extracted stack range into the program, and at run-time, prefetch one or more of the plurality of extracted instruction addresses and the extracted stack range during idle time of the program.

Type: Grant

Filed: May 4, 2020

Date of Patent: March 29, 2022

Assignee: Morgan Stanley Services Group Inc.

Inventor: Jordan Homer
Loop thread order execution control of a multi-threaded, self-scheduling reconfigurable computing fabric

Patent number: 11275710

Abstract: Representative apparatus, method, and system embodiments are disclosed for configurable computing. A representative system includes an interconnection network; a processor; and a plurality of configurable circuit clusters. Each configurable circuit cluster includes a plurality of configurable circuits arranged in an array; a synchronous network coupled to each configurable circuit of the array; and an asynchronous packet network coupled to each configurable circuit of the array.

Type: Grant

Filed: March 31, 2019

Date of Patent: March 15, 2022

Assignee: Micron Technology, Inc.

Inventor: Tony M. Brewer
Digital advertising optimization

Patent number: 11232168

Abstract: Systems, methods, and software for optimizing rendering performance of a web page containing one or more advertisements. Creative code related to an advertisement is received and a web page containing the creative code is virtually rendered while performance metrics regarding a plurality of connections to servers made during the rendering is collected. The collected performance metrics regarding the plurality of connections are analyzed to identify optimizations to be added to the creative code, and the creative code is modified by applying the identified optimizations.

Type: Grant

Filed: November 13, 2019

Date of Patent: January 25, 2022

Assignee: Introspective Analytics Inc.

Inventor: Robert James Dailey
Pre-fetch for memory sub-system with cache where the pre-fetch does not send data and response signal to host

Patent number: 11210225

Abstract: Various embodiments described herein provide for a pre-fetch operation on a memory sub-system, which can help avoid a cache miss when the memory sub-system subsequently processes a read command from a host system.

Type: Grant

Filed: November 25, 2019

Date of Patent: December 28, 2021

Assignee: Micron Technology, Inc.

Inventor: Dhawal Bavishi
Processor and interrupt controller

Patent number: 11182318

Abstract: Embodiments of the present disclosure provide an interrupt controller in a processor, comprising: an interrupt sampling circuitry configured to receive one or more interrupts from one or more interrupt sources that are communicatively coupled to the interrupt controller; and an arbitration circuitry configured to select a to-be-responded interrupt from the received one or more interrupts, the arbitration circuitry comprising: a selection circuitry configured to select from the one or more interrupts a highest-priority interrupt that has a highest priority among the one or more interrupts; and a threshold comparison circuitry communicatively coupled to the selection circuitry, the threshold comparison circuitry configured to compare the priority of the highest-priority interrupt with a preset priority threshold, wherein the arbitration circuitry is configured to select the highest-priority interrupt as the to-be-responded interrupt in response to the threshold comparison circuitry determining that the priority

Type: Grant

Filed: March 24, 2020

Date of Patent: November 23, 2021

Assignee: Alibaba Group Holding Limited

Inventors: Chaojun Zhao, Xiaoyan Xiang, Chen Chen, Taotao Zhu
Decoding method and apparatus

Patent number: 11171673

Abstract: A decoding method and apparatus are provided, to improve a degree of parallelism in decoded bit decisions and reduce a decoding delay. The method includes: performing a hard decision on each LLR in an inputted LLR vector having a length of M to obtain a first vector, where M?N and N is a length of to-be-decoded information; sequentially performing negation of some elements of the first vector to obtain L vectors; and then determining decoding results of the LLR vector based on the L vectors.

Type: Grant

Filed: July 8, 2020

Date of Patent: November 9, 2021

Assignee: Huawei Technologies Co., Ltd.

Inventors: Jiajie Tong, Huazi Zhang, Yunfei Qiao, Rong Li, Xiaocheng Liu, Jun Wang
Determining the effectiveness of prefetch instructions

Patent number: 11080052

Abstract: Effectiveness of prefetch instructions is determined. A prefetch instruction is executed to request that data be fetched into a cache of the computing environment. The effectiveness of the prefetch instruction is determined. This includes updating, based on executing the prefetch instruction, a cache directory of the cache. The updating includes, in the cache directory, effectiveness data relating to the data. The effectiveness data includes whether the data was installed in the cache based on the prefetch instruction. Additionally, the determining the effectiveness includes obtaining at least a portion of the effectiveness data from the cache directory, and using the at least a portion of effectiveness data to determine the effectiveness of the prefetch instruction.

Type: Grant

Filed: October 30, 2019

Date of Patent: August 3, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael K. Gschwind, Christian Jacobi, Anthony Saporito, Chung-Lung K. Shum, Timothy J. Slegel
Control flow graph analysis

Patent number: 11068246

Abstract: A method and systems generate a control flow graph including an edge of the control flow graph from a branch instruction to a target address of the branch instruction in an abstract interpretation for an assignment instruction to a branch target variable of a program. The program allocates a particular branch target variable to a branch instruction having a plurality of branch targets. The branch target address is loaded from the branch target variable upon branching, a branch address of a branch instruction having one branch target as well as the address assigned by the assignment instruction to the branch target variable being determined as certain constant values determined by compiling the program. The target address assigned by the assignment instruction is added to an object of the abstract interpretation. A current abstract interpretation is terminated if the abstract interpretation reaches an instruction already subjected to the abstract interpretation.

Type: Grant

Filed: January 17, 2018

Date of Patent: July 20, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Reid T. Copeland, Toshihiko Koju
Data processing apparatus and prefetch method

Patent number: 11055224

Abstract: An area for prefetching is determined while accommodating an increase in a block address space. A prediction model predicts prefetch addresses for each of bit ranges into which block addresses are split by using a plurality of neural networks assuming charge of the different bit ranges having performed machine learning on I/O trace data, a prediction accuracy determination section determines a size of an area for prefetching on the basis of addresses in the bit range for which prediction accuracy in prefetch is lower than a predetermined value, a predicted value determination section determines addresses of the area for prefetching on the basis of addresses in the bit range for which the prediction accuracy in the prefetch is equal to or higher than the predetermined value, and a prefetch issuance section caches data in the area for prefetching in a storage class memory from a NAND flash memory.

Type: Grant

Filed: March 6, 2019

Date of Patent: July 6, 2021

Assignee: HITACHI, LTD.

Inventors: Yuji Saeki, Takashige Baba
Electronic apparatus can execute instruction and instruction executing method

Patent number: 11010162

Abstract: An electronic device can execute instructions, comprising: a processing circuit; a first storage device, coupled to the processing circuit, configured to store at least one instruction and first operation data; and a second storage device, coupled to the processing circuit. The processing circuit reads at least one of the instruction and the first operation data corresponding to the read instruction from the first storage device, and the second storage device does not store the first operation data corresponding to the read instruction, the processing circuit backs up the read first operation data to the second storage device.

Type: Grant

Filed: September 17, 2019

Date of Patent: May 18, 2021

Assignee: Realtek Semiconductor Corp.

Inventor: Yen-Ju Lu
Effectiveness and prioritization of prefetches

Patent number: 11010168

Abstract: A method, system, and computer program product are provided for prioritizing prefetch instructions. The method includes a processor issuing a prefetch instruction and fetching elements from a cache that can include a memory or a higher level cache. The processor stores the elements in temporary storage and monitors for accesses by an instruction. The processor stores a record representing the prefetch instruction. The processor updates the record with an indicator and issues a new prefetch instruction by comparing the new prefetch instruction to the record, based on the new prefetch instruction matching the prefetch instruction, assigning the indicator to the new prefetch instruction as a priority value, based on the new prefetch instruction not matching the prefetch instruction, assigning a default value to the new prefetch instruction as the priority value, and determining whether to execute the new prefetch instruction, based on the priority value of the new prefetch instruction.

Type: Grant

Filed: June 18, 2019

Date of Patent: May 18, 2021

Assignee: International Business Machines Corporation

Inventors: Michael K. Gschwind, Christian Jacobi, Anthony Saporito, Chung-Lung K. Shum
Arithmetic processing device and arithmetic processing method of controlling prefetch of cache memory

Patent number: 11003581

Abstract: An arithmetic processing device includes circuitry configured to add an identifier of a request source that generates a prefetch request into the prefetch request, and output, in response to detecting a certain number of cache hits less than a first threshold, each of the cache hits occurring in a first cache memory provided at a lower hierarchical level than a second cache memory by each prefetch request into which a first identifier is added, a notification for suppressing a prefetch request issued for the lower hierarchical level of the first cache memory from a first request source identified by the first identifier.

Type: Grant

Filed: June 26, 2019

Date of Patent: May 11, 2021

Assignee: FUJITSU LIMITED

Inventors: Masakazu Tanomoto, Hideki Okawara
Securely sharing a memory between an embedded controller (EC) and a platform controller hub (PCH)

Patent number: 10916280

Abstract: Systems and methods for securely sharing a memory between an Embedded Controller (EC) and a Platform Controller Hub (PCH). In some embodiments, an IHS may include: a chipset; a flash device coupled to the chipset; and an EC coupled to the flash device via a first bus and to the chipset via a second bus, wherein the EC comprises a Read-Only Memory (ROM) portion and a Random Access Memory (RAM) portion, the EC configured to: retrieve EC firmware from the flash device via the first bus; store the retrieved EC firmware in the RAM portion; and prior to the execution of any instruction stored in the RAM portion, relinquish access to the flash device via the first bus.

Type: Grant

Filed: March 15, 2018

Date of Patent: February 9, 2021

Assignee: Dell Products, L.P.

Inventors: Adolfo S. Montero, Mark Tracy Ellis
Control of speculative demand loads

Patent number: 10884749

Abstract: Aspects of the present disclosure relate to control of speculative demand loads. In some embodiments, the method includes receiving instructions for a branch in a program, detecting the branch load is in the cache, monitoring a number of completed loads for the program, determining a cache pollution ratio of executed loads to completed loads, providing the cache pollution ratio to a branch prediction unit, and altering load instructions for the branch based on the cache pollution ratio.

Type: Grant

Filed: March 26, 2019

Date of Patent: January 5, 2021

Assignee: International Business Machines Corporation

Inventors: Satish Kumar Sadasivam, Puneeth A. H. Bhat, Shruti Saxena, Sangram Alapati
Conserving memory resources of a provider edge device in an ethernet tree

Patent number: 10887223

Abstract: A provider edge device, capable of accessing a first type of memory and a second type of memory, may determine a network address associated with a customer edge device. The provider edge device may determine whether the customer edge device is categorized as a leaf device in an Ethernet Tree service provided by the provider edge device. The provider edge device may selectively store the network address in the first type of memory or the second type of memory based on determining whether the customer edge device is categorized as a leaf device in the Ethernet Tree service.

Type: Grant

Filed: September 14, 2018

Date of Patent: January 5, 2021

Assignee: Juniper Networks, Inc.

Inventors: Manoj Sharma, Poorna Pushkala Balasubramanian, Nitin Singh, Xiaomin Wu
Controlling segment layout in a stress test for a processor memory with a link stack

Patent number: 10877864

Abstract: A processor memory is stress tested with a variable link stack depth using test code segments and link stack test segments on non-naturally aligned data boundaries. Link stack test segments are interspersed into test code segments of a processor memory test to change the link stack depth without changing results of the test code. The link stack test segments include branch to target, push/pop, push and pop segments. The depth of the link stack is varied independent of the memory test code by changing the number to branches in the branch to target segment and varying the number of the push/pop segments. The link stack test segments and test segments may be placed randomly with a recursive algorithm to intersperse the link stack test segments in the test code segments and to reduce the amount of data to be saved and restored for all subroutine calls, push and pop segments.

Type: Grant

Filed: December 20, 2018

Date of Patent: December 29, 2020

Assignee: International Business Machines Corporation

Inventors: Shakti Kapoor, Manoj Dusanapudi
Hardware processor architecture having a hint cache

Patent number: 10838871

Abstract: Aspects of the disclosure relate to a hardware processor architecture. The hardware processor architecture may include a processor cache to manage a set of instructions. The hardware processor architecture may include a hint cache to manage a set of hints associated with the set of instructions. Disclosed aspects relate to establishing a hint cache which has a set of hints associated with the set of instructions. The hint cache may be established with respect to the processor cache which has a set of instructions. The set of instructions may be accessed from the processor cache. From the hint cache, the set of hints associated with the set of instructions may be communicated. The set of instructions may be processed by the hardware processor using the set of hints associated with the set of instructions. In embodiments, static or dynamic hint cache bits can be utilized.

Type: Grant

Filed: November 7, 2016

Date of Patent: November 17, 2020

Assignee: International Business Machines Corporation

Inventor: Satish K. Sadasivam
Multiple stride prefetching

Patent number: 10769070

Abstract: Apparatuses and methods for prefetch generation are disclosed. Prefetching circuitry receives addresses specified by load instructions and can cause retrieval of a data value from an address before that address is received. Stride determination circuitry determines stride values as a difference between a current address and a previously received address. Plural stride values corresponding to a sequence of received addresses are determined. Multiple stride storage circuitry stores the plurality of stride values determined by the stride determination circuitry. New address comparison circuitry determines whether a current address corresponds to a matching stride value based on the plurality of stride values stored in the multiple stride storage circuitry. Prefetch initiation circuitry can causes a data value to be retrieved from a further address, wherein the further address is the current address modified by the matching stride value of the plurality of stride values.

Type: Grant

Filed: September 25, 2018

Date of Patent: September 8, 2020

Assignee: Arm Limited

Inventors: Joseph Michael Pusdesris, Miles Robert Dooley, Alexander Cole Shulyak, Krishnendra Nathella, Dam Sunwoo
Automatic predication of hard-to-predict convergent branches

Patent number: 10754655

Abstract: A processing device includes a branch IP table and branch predication circuitry coupled to the branch IP table. The branch predication circuitry to: determine a dynamic convergence point in a conditional branch of set of instructions; store the dynamic convergence point in the branch IP table; fetch a first and second speculative path of the conditional branch; while determining which of the first speculative path and the second speculative path is a taken path of the conditional branch and determining whether a dynamic convergence point is fetched corresponding to the stored dynamic convergence point, stall scheduling of instructions of the first speculative path and the second speculative path; and in response to determining that one of the first speculative path and the second speculative path is the taken path and the fetched dynamic convergence point corresponds to the stored convergence point, resume scheduling of the instructions of the taken path.

Type: Grant

Filed: June 28, 2018

Date of Patent: August 25, 2020

Assignee: Intel Corporation

Inventors: Adarsh Chauhan, Hong Wang, Jayesh Gaur, Zeev Sperber, Sumeet Bandishte, Lihu Rappoport, Stanislav Shwartsman, Kamil Garifullin, Sreenivas Subramoney, Adi Yoaz
Processor subroutine cache

Patent number: 10740105

Abstract: A processor includes an execution unit and a subroutine cache. The execution unit is configured to execute instructions. The subroutine cache us configured to provide instructions of a subroutine to the execution unit for execution. The subroutine cache includes subroutine instruction storage, a subroutine address register, and subroutine cache control logic. The subroutine control logic is configured to: identify a subroutine call instruction provided to the execution unit; determine whether an instruction of a subroutine invoked by the subroutine call instruction is stored in the subroutine instruction storage by evaluating a subroutine validity indicator that indicates whether at least a portion of the subroutine is stored in the subroutine instruction storage; and provide the instruction of the subroutine to the execution unit based on the subroutine validity indicator indicating that at least a portion of the subroutine is stored in the subroutine instruction storage.

Type: Grant

Filed: April 4, 2014

Date of Patent: August 11, 2020

Assignee: TEXAS INSTRUMENTS INCORPORATED

Inventor: Christian Wiencke
Virtual processor cache reuse

Patent number: 10740234

Abstract: An approach is provided in which a first core broadcasts a cache line request in response to detecting a cache miss corresponding to a first virtual central processing unit (VCPU) executing on the first core. Next, the first core receives a cache line response from the second core responding to the cache line request that includes tag extension data. The first core determines a cache miss type of the cache miss based on the tag extension data and, in turn, sends the cache miss type to a hypervisor that utilizes the cache miss type during a future VCPU dispatch selection.

Type: Grant

Filed: September 4, 2018

Date of Patent: August 11, 2020

Assignee: International Business Machines Corporation

Inventors: Bret R. Olszewski, Ram Raghavan, Maria Lorena Pesantez, Gayathri Mohan
Pointer associated branch line jumps for accelerated line jumps

Patent number: 10691460

Abstract: A method includes a processor providing at least one line entry address tag in each line of a branch predictor; indexing into the branch predictor with a current line address to predict a taken branch's target address and a next line address; re-indexing into the branch predictor with one of a predicted next line address or a sequential next line address when the at least one line entry address tag does not match the current line address; using branch prediction content compared against a search address to predict a direction and targets of branches and determining when a new line address is generated; and re-indexing into the branch predictor with a corrected next line address when it is determined that one of the predicted next line address or the sequential next line address differs from the new line address.

Type: Grant

Filed: December 13, 2016

Date of Patent: June 23, 2020

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: James J. Bonanno, Brian R. Prasky
Data prefetching that stores memory addresses in a first table and responsive to the occurrence of loads corresponding to the memory addresses stores the memory addresses in a second table

Patent number: 10684857

Abstract: A method includes storing a first address of a first instruction executed by a processor core in a first table, where the first instruction writes a value into a register for utilization in addressing memory. The method stores the first address of the first instruction executed by the processor core in a second table with multiple entries, where a register value loaded into the register is utilized as a second address by a second instruction executed by the processor core to access a main memory. The method determines whether an instruction address associated with an instruction executed by the processor core is present in the second table, where the instruction address is the second address. Responsive to determining the instruction address is present in the second table, the method prefetches data from the main memory, where the register value is utilized as the second address in the main memory.

Type: Grant

Filed: February 1, 2018

Date of Patent: June 16, 2020

Assignee: International Business Machines Corporation

Inventors: Wolfgang Gellerich, Gerrit Koch, Peter M. Held, Martin Schwidefsky
Registers for restricted memory

Patent number: 10678479

Abstract: Provided are integrated circuits and methods for operating integrated circuits. An integrated circuit can include a plurality of memory banks and an execution engine including a set of execution components. Each execution component can be associated with a respective memory bank, and can read from and write to only the respective memory bank. The integrated circuit can further include a set of registers each associated with a respective memory bank from the plurality of memory banks. The integrated circuit can further be operable to load to or store from the set of registers in parallel, and load to or store from the set of registers serially. A parallel operation followed by a serial operation enables data to be moved from many memory banks into one memory bank. A serial operation followed by a parallel operation enables data to be moved from one memory bank into many memory banks.

Type: Grant

Filed: November 29, 2018

Date of Patent: June 9, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Ron Diamant, Randy Renfu Huang, Sundeep Amirineni, Jeffrey T. Huynh
Computing system and controller thereof

Patent number: 10664283

Abstract: Computing system and controller thereof are disclosed for ensuring the correct logical relationship between multiple instructions during their parallel execution. The computing system comprises: a plurality of functional modules each performing a respective function in response to an instruction for the given functional module; and a controller for determining whether or not to send an instruction to a corresponding functional module according to dependency relationship between the plurality of instructions.

Type: Grant

Filed: June 12, 2017

Date of Patent: May 26, 2020

Assignee: BEIJING DEEPHI TECHNOLOGY CO., LTD.

Inventors: Kaiyuan Guo, Song Yao
Technologies for automatic processor core association management and communication using direct data placement in private caches

Patent number: 10652353

Abstract: Technologies for communication with direct data placement include a number of computing nodes in communication over a network. Each computing node includes a many-core processor having an integrated host fabric interface (HFI) that maintains an association table (AT). In response to receiving a message from a remote device, the HFI determines whether the AT includes an entry associating one or more parameters of the message to a destination processor core. If so, the HFI causes a data transfer agent (DTA) of the destination core to receive the message data. The DTA may place the message data in a private cache of the destination core. Message parameters may include a destination process identifier or other network address and a virtual memory address range. The HFI may automatically update the AT based on communication operations generated by software executed by the processor cores. Other embodiments are described and claimed.

Type: Grant

Filed: September 24, 2015

Date of Patent: May 12, 2020

Assignee: Intel Corporation

Inventors: James Dinan, Venkata Krishnan, Srinivas Sridharan, David A. Webb
Processor with variable pre-fetch threshold

Patent number: 10628163

Abstract: A method and apparatus for controlling pre-fetching in a processor. A processor includes an execution pipeline and an instruction pre-fetch unit. The execution pipeline is configured to execute instructions. The instruction pre-fetch unit is coupled to the execution pipeline. The instruction pre-fetch unit includes instruction storage to store pre-fetched instructions, and pre-fetch control logic. The pre-fetch control logic is configured to fetch instructions from memory and store the fetched instructions in the instruction storage. The pre-fetch control logic is also configured to provide instructions stored in the instruction storage to the execution pipeline for execution. The pre-fetch control logic is further configured set a maximum number of instruction words to be pre-fetched for execution subsequent to execution of an instruction currently being executed in the execution pipeline.

Type: Grant

Filed: April 17, 2014

Date of Patent: April 21, 2020

Assignee: TEXAS INSTRUMENTS INCORPORATED

Inventors: Christian Wiencke, Johann Zipperer
Instruction prefetch halting upon predecoding predetermined instruction types

Patent number: 10620953

Abstract: A data processing apparatus has prefetch circuitry for prefetching instructions from a data store into an instruction queue. Branch prediction circuitry is provided for predicting outcomes of branch instructions and the prefetch circuitry may prefetch instructions subsequent to the branch based on the predicted outcome. Instruction identifying circuitry identifies whether a given instruction prefetched from the data store is a predetermined type of program flow altering instruction and if so then controls the prefetch circuitry to halt prefetching of subsequent instructions into the instruction queue.

Type: Grant

Filed: February 14, 2017

Date of Patent: April 14, 2020

Assignee: ARM Limited

Inventors: Kauser Yakub Johar, Antony John Penton
Building automation system with a parallel relationship computation engine

Patent number: 10612805

Abstract: A method for executing computations in parallel for a building management system of a building includes receiving a computing job request to determine values for one or more particular properties, receiving a property model indicating dependencies between a plurality of properties, the plurality of properties including the one or more particular properties, wherein the plurality of properties include building data for the building, and generating one or more computing threads based on the property model, wherein each computing thread includes a sequence of computations for determining values for the plurality of properties. The method further includes executing the computing threads in parallel to determine the values for the particular properties.

Type: Grant

Filed: February 14, 2018

Date of Patent: April 7, 2020

Assignee: Johnson Controls Technology Company

Inventor: Andrew J. Przybylski
Packed data operation mask shift processors, methods, systems, and instructions

Patent number: 10564966

Abstract: A method of an aspect includes receiving a packed data operation mask shift instruction. The packed data operation mask shift instruction indicates a source having a packed data operation mask, indicates a shift count number of bits, and indicates a destination. The method further includes storing a result in the destination in response to the packed data operation mask shift instruction. The result includes a sequence of bits of the packed data operation mask that have been shifted by the shift count number of bits. Other methods, apparatus, systems, and instructions are disclosed.

Type: Grant

Filed: December 22, 2011

Date of Patent: February 18, 2020

Assignee: Intel Corporation

Inventors: Bret L. Toll, Robert Valentine, Jesus Corbal San Adrian, Elmoustapha Ould-Ahmed Vall, Mark J. Charney
Communication between threads of multi-thread processor

Patent number: 10558463

Abstract: Embodiments of the present disclosure support hardware based thread switching in a multithreading environment. The thread switching is implemented on a multithread microprocessor by utilizing thread mailbox registers and other auxiliary registers that can be pre-programmed for hardware based thread switching. A set of mailbox registers can be allocated to each thread of a plurality of threads that can be executed in the microprocessor. A mailbox register in the set of mailbox registers comprises an identifier of a next thread of the plurality of threads to which an active thread switches based on a thread switch condition further indicated in the mailbox register. The auxiliary registers in the microprocessor can be used to configure a number of threads for simultaneous execution in the microprocessor, a priority for thread switching, and to store a program counter of each thread and states of registers of each thread.

Type: Grant

Filed: June 3, 2016

Date of Patent: February 11, 2020

Assignee: Synopsys, Inc.

Inventor: Thang Tran
Efficient work unit processing in a multicore system

Patent number: 10540288

Abstract: Techniques are described in which a system having multiple processing units processes a series of work units in a processing pipeline, where some or all of the work units access or manipulate data stored in non-coherent memory. In one example, this disclosure describes a method that includes identifying, prior to completing processing of a first work unit with a processing unit of a processor having multiple processing units, a second work unit that is expected to be processed by the processing unit after the first work unit. The method also includes processing the first work unit, and prefetching, from non-coherent memory, data associated with the second work unit into a second cache segment of the buffer cache, wherein prefetching the data associated with the second work unit occurs concurrently with at least a portion of the processing of the first work unit by the processing unit.

Type: Grant

Filed: April 10, 2018

Date of Patent: January 21, 2020

Assignee: Fungible, Inc.

Inventors: Wael Noureddine, Jean-Marc Frailong, Felix A. Marti, Charles Edward Gray, Paul Kim
PC-relative addressing and transmission

Patent number: 10534610

Abstract: Techniques for processing instructions include receiving a plurality of instructions from a program counter (PC) operable to be fused into a PC-relative plus offset instruction. The technique also includes fusing the plurality of instructions into an internal operation (IOP) that specifies PC-relative addressing with an offset. The technique also includes computing a shared PC portion that includes one or more common upper bits of a PC address of each of the plurality of instructions. If the shared PC portion is different than a previously computed shared PC portion, the technique transmits the shared PC portion to one or more downstream components in the processor pipeline. The technique further includes transmitting the IOP with a representation of lower order bits of the PC address and processing the IOP.

Type: Grant

Filed: July 20, 2016

Date of Patent: January 14, 2020

Assignee: International Business Machines Corporation

Inventor: Michael Karl Gschwind
Blocking instruction fetching in a computer processor

Patent number: 10528352

Abstract: Blocking instruction fetching in a computer processor, includes: receiving a non-branching instruction to be executed by the computer processor; determining whether executing the non-branching instruction will cause a flush; and responsive to determining that executing the non-branching instruction will cause a flush, disabling instruction fetching for the computer processor for a time, including recoding the instruction such that the recoded instruction will be interpreted by an instruction fetch unit as an unconditional branch instruction.

Type: Grant

Filed: March 8, 2016

Date of Patent: January 7, 2020

Assignee: International Business Machines Corporation

Inventors: Bryan G. Hickerson, Sheldon Levenstein, David S. Levitan, Albert J. Van Norstrand, Jr.
Determining the effectiveness of prefetch instructions

Patent number: 10521350

Abstract: Effectiveness of prefetch instructions is determined. A prefetch instruction is executed to request that data be fetched into a cache of the computing environment. The effectiveness of the prefetch instruction is determined. This includes updating, based on executing the prefetch instruction, a cache directory of the cache. The updating includes, in the cache directory, effectiveness data relating to the data. The effectiveness data includes whether the data was installed in the cache based on the prefetch instruction. Additionally, the determining the effectiveness includes obtaining at least a portion of the effectiveness data from the cache directory, and using the at least a portion of effectiveness data to determine the effectiveness of the prefetch instruction.

Type: Grant

Filed: July 20, 2016

Date of Patent: December 31, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Michael K. Gschwind, Christian Jacobi, Anthony Saporito, Chung-Lung K. Shum, Timothy J. Slegel
Program interruption filtering in transactional execution

Patent number: 10437602

Abstract: Program exception conditions cause a transaction to abort and typically result in an interruption in which the operating system obtains control. A program interruption filtering control is provided to selectively present the interrupt. That is, the interrupt from the program exception condition may or may not be presented depending at least on the program interruption filtering control and a transaction class associated with the program exception condition. The program interruption filtering control is provided by a TRANSACTION BEGIN instruction.

Type: Grant

Filed: June 15, 2012

Date of Patent: October 8, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Dan F. Greiner, Christian Jacobi, Marcel Mitran, Timothy J. Slegel
Stream based branch prediction index accelerator with power prediction

Patent number: 10430195

Abstract: A computer-implemented method for predicting a taken branch that ends an instruction stream in a pipelined high frequency microprocessor includes receiving, by a processor, a first instruction within a first instruction stream, the first instruction including a first instruction address. The computer-implemented method further includes searching, by the processor, a stream-based index accelerator predictor one time for the stream; determining, by the processor, a prediction for a branch ending the branch stream; influencing, by the processor, a metadata prediction engine based on the prediction; and updating, by the processor, a stream-based index accelerator predictor with information indicative of the prediction.

Type: Grant

Filed: June 27, 2016

Date of Patent: October 1, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: James J. Bonanno, Michael J. Cadigan, Jr., Adam B. Collura, Christian Jacobi, Daniel Lipetz, Anthony Saporito
Stream based branch prediction index accelerator for multiple stream exits

Patent number: 10423420

Abstract: A computer-implemented method for predicting a taken branch that ends an instruction stream in a pipelined high frequency microprocessor includes receiving, by a processor, a first instruction within a first instruction stream, the first instruction comprising a first instruction address; searching, by the processor, an index accelerator predictor one time for the stream; determining, by the processor, a prediction for a taken branch ending the branch stream; influencing, by the processor, a metadata prediction engine based on the prediction; observing a plurality of taken branches from the exit accelerator predictor; maintaining frequency information based on the observed taken branches; determining, based on the frequency information, an updated prediction of the observed plurality of taken branches; and updating, by the processor, the index accelerator predictor with the the updated prediction.

Type: Grant

Filed: March 1, 2017

Date of Patent: September 24, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: James J. Bonanno, Michael J. Cadigan, Jr., Adam B. Collura, Daniel Lipetz
Stream based branch prediction index accelerator for multiple stream exits

Patent number: 10423419

Abstract: A computer-implemented method for predicting a taken branch that ends an instruction stream in a pipelined high frequency microprocessor includes receiving, by a processor, a first instruction within a first instruction stream, the first instruction comprising a first instruction address; searching, by the processor, an index accelerator predictor one time for the stream; determining, by the processor, a prediction for a taken branch ending the branch stream; influencing, by the processor, a metadata prediction engine based on the prediction; observing a plurality of taken branches from the exit accelerator predictor; maintaining frequency information based on the observed taken branches; determining, based on the frequency information, an updated prediction of the observed plurality of taken branches; and updating, by the processor, the index accelerator predictor with the updated prediction.

Type: Grant

Filed: June 27, 2016

Date of Patent: September 24, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: James J. Bonanno, Michael J. Cadigan, Jr., Adam B. Collura, Daniel Lipetz
Branch predictor search qualification using stream length prediction

Patent number: 10394559

Abstract: A computer-implemented method includes determining, by a stream-based index accelerator predictor of a processor, a predicted stream length between an instruction address and a taken branch ending an instruction stream. A first-level branch predictor of a hierarchical asynchronous lookahead branch predictor of the processor is searched for a branch prediction in one or more entries in a search range bounded by the instruction address and the predicted stream length. A search of a second-level branch predictor of the hierarchical asynchronous lookahead branch predictor is triggered based on failing to locate the branch prediction in the search range.

Type: Grant

Filed: December 13, 2016

Date of Patent: August 27, 2019

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: James J. Bonanno, Michael J. Cadigan, Jr., Adam B. Collura, Daniel Lipetz
Effectiveness and prioritization of prefeteches

Patent number: 10379862

Abstract: A method, system, and computer program product are provided for prioritizing prefetch instructions. The method includes a processor issuing a prefetch instruction and fetching elements from a cache that can include a memory or a higher level cache. The processor stores the elements in temporary storage and monitors for accesses by an instruction. The processor stores a record representing the prefetch instruction. The processor updates the record with an indicator and issues a new prefetch instruction by comparing the new prefetch instruction to the record, based on the new prefetch instruction matching the prefetch instruction, assigning the indicator to the new prefetch instruction as a priority value, based on the new prefetch instruction not matching the prefetch instruction, assigning a default value to the new prefetch instruction as the priority value, and determining whether to execute the new prefetch instruction, based on the priority value of the new prefetch instruction.

Type: Grant

Filed: June 28, 2016

Date of Patent: August 13, 2019

Assignee: International Business Machines Corporation

Inventors: Michael K. Gschwind, Christian Jacobi, Anthony Saporito, Chung-Lung K. Shum
Apparatuses for integrating arithmetic with logic operations

Patent number: 10379852

Abstract: An apparatus for integrating arithmetic with logic operations contains at least a calculation device and a post-logic unit. The calculation device calculates source data to generate and output first destination data. The post-logic unit, coupled to the calculation device, performs a comparison operation for comparing the first destination data with 0 and outputs a comparison result.

Type: Grant

Filed: August 24, 2017

Date of Patent: August 13, 2019

Assignee: VIA ALLIANCE SEMICONDUCTOR CO., LTD.

Inventors: Huaisheng Zhang, Dacheng Liang, Boming Chen, Renyu Bian
Slice construction for pre-executing data dependent loads

Patent number: 10379863

Abstract: Systems and methods for constructing an instruction slice for prefetching data of a data-dependent load instruction include a slicer for identifying a load instruction in an instruction sequence as a first occurrence of a qualified load instruction which will miss in a last-level cache. A commit buffer stores information pertaining to the first occurrence of the qualified load instruction and shadow instructions which follow. For a second occurrence of the qualified load instruction, an instruction slice is constructed from the information in the commit buffer to form a slice payload. A pre-execution engine pre-executes the instruction slice based on the slice payload to determine an address from which data is to be fetched for execution of a third and any subsequent occurrences of the qualified load instruction. The data is prefetched from the determined address for the third and any subsequent occurrence of the qualified load instruction.

Type: Grant

Filed: September 21, 2017

Date of Patent: August 13, 2019

Assignee: QUALCOMM Incorporated

Inventors: Shivam Priyadarshi, Rami Mohammad A. Al Sheikh, Brandon Dwiel, Derek Hower
Effectiveness and prioritization of prefetches

Patent number: 10372457

Abstract: A method, system, and computer program product are provided for prioritizing prefetch instructions. The method includes a processor issuing a prefetch instruction and fetching elements from a cache that can include a memory or a higher level cache. The processor stores the elements in temporary storage and monitors for accesses by an instruction. The processor stores a record representing the prefetch instruction. The processor updates the record with an indicator and issues a new prefetch instruction by comparing the new prefetch instruction to the record, based on the new prefetch instruction matching the prefetch instruction, assigning the indicator to the new prefetch instruction as a priority value, based on the new prefetch instruction not matching the prefetch instruction, assigning a default value to the new prefetch instruction as the priority value, and determining whether to execute the new prefetch instruction, based on the priority value of the new prefetch instruction.

Type: Grant

Filed: November 3, 2017

Date of Patent: August 6, 2019

Assignee: international Business Machines Corporation

Inventors: Michael K. Gschwind, Christian Jacobi, Anthony Saporito, Chung-Lung K. Shum

1 2 3 4 5 … next