Patents Examined by Jacob A. Petranek
-
Patent number: 11461143Abstract: A computing system is provided, including a processor configured to generate a directed weighted graph indicating a plurality of functions configured to be executed on a plurality of communicatively connected processing devices. For each of a plurality of pairs of the functions, the processor may determine a shortest path between the pair of functions. The processor may generate a second graph indicating the plurality of pairs of functions connected by the shortest paths. The processor may receive a pipeline directed acyclic graph (DAG) specifying a data pipeline of a plurality of processing stages. The processor may determine a subgraph isomorphism between the pipeline DAG and the second graph. The processor may convey, to one or more processing devices of the plurality of processing devices, instructions to execute the plurality of processing stages as specified by the subgraph isomorphism.Type: GrantFiled: September 8, 2020Date of Patent: October 4, 2022Assignee: MEGH COMPUTING, INC.Inventor: Jonathan Beare
-
Patent number: 11449337Abstract: A pseudorandom logic circuit may be embedded as a hardware within an emulation system, which may generate pseudorandom keephot instructions. A masking logic may mask out portions in each pseudorandom keephot instruction, which may change state elements during execution. A cluster of emulation processors may execute masked pseudorandom keephot instructions to consume power when not executing mission instructions. The cluster of emulation processors may run keephot cycles, during which the cluster of emulation processors may execute the pseudorandom keephot instructions causing the cluster of emulation processors to continue consuming a roughly constant amount of power, either at a same or different voltage level, but supposed outputs of the pseudorandom keephot instructions may have no impact upon inputs and outputs generated during mission cycles.Type: GrantFiled: December 19, 2019Date of Patent: September 20, 2022Assignee: CADENCE DESIGN SYSTEMS, INC.Inventors: Mitchell Poplack, Yuhei Hayashi
-
Patent number: 11449343Abstract: A system and method for efficiently protecting branch prediction information. In various embodiments, a computing system includes at least one processor with a branch predictor storing branch target addresses and security tags in a table. The security tag includes one or more components of machine context. When the branch predictor receives a portion of a first program counter of a first branch instruction, and hits on a first table entry during an access, the branch predictor reads out a first security tag. The branch predictor compares one or more components of machine context of the first security tag to one or more components of machine context of the first branch instruction. When there is at least one mismatch, the branch prediction information of the first table entry is not used. Additionally, there is no updating of any branch prediction training information of the first table entry.Type: GrantFiled: December 14, 2018Date of Patent: September 20, 2022Assignee: Apple Inc.Inventors: Jeffry E. Gonion, Ian D. Kountanis, Conrado Blasco, Steven Andrew Myers, Yannick L. Sierra
-
Patent number: 11442735Abstract: This application provides an instruction processing method and a chip. The method includes sending, by the thread unit, a search instruction to the search engine unit. The search instruction includes a data address and a first search field, and the thread unit switches from a RUN state to a WAIT state. The method also includes receiving, by the thread unit, data and a program counter that are sent by the search engine unit. The thread unit switches from the WAIT state to the RUN state.Type: GrantFiled: July 7, 2020Date of Patent: September 13, 2022Assignee: Huawei Technologies Co., Ltd.Inventors: Xing Tong, Hongliang Gao
-
Patent number: 11442890Abstract: On-circuit data activity monitoring may be performed for a systolic array. A current data activity measurement may be determined for changes in input data for processing at a systolic array and compared with a prior data activity measurement. Based on the comparison, a throttling recommendation may be provided to a management component to determine whether to perform the throttling recommendation.Type: GrantFiled: November 6, 2020Date of Patent: September 13, 2022Assignee: Amazon Technologies, Inc.Inventor: Thomas A Volpe
-
Patent number: 11436012Abstract: When a transformation job of flow logs generated for a cloud environment is triggered, a security service determines a parameterized template for batch data processing operations offered by the cloud service provider (CSP) to use based on the type of transformation job. The security service communicates an indication of the template and the corresponding parameter values to a data processing service/pipeline offered by the CSP. The provisioned processing resources retrieve the flow logs from a designated location in cloud storage, complete the transformation, and store the transformed flow logs in a new storage location. If the CSP does not provide a data processing service/pipeline which can perform bulk data transformation, the security service uses a generic parameterized template specifying a transformation job to be run on a cluster. Upon completion, the security service retrieves and analyzes the transformed flow logs as part of threat detection performed for securing the cloud environment.Type: GrantFiled: May 14, 2021Date of Patent: September 6, 2022Assignee: Palo Alto Networks, Inc.Inventor: Krishnan Shankar Narayan
-
Patent number: 11429393Abstract: An apparatus for data processing and a method of data processing are provided. Data processing operations are performed in response to instructions which reference architectural registers using physical registers to store data values when performing the data processing operations. Mappings between the architectural registers and the physical registers are stored, and when a data hazard condition is identified with respect to out-of-order program execution of an instruction, an architectural register specified in the instruction is remapped to an available physical register. A reorder buffer stores an entry for each destination architectural register specified by the instruction, entries being stored in program order, and an entry specifies a destination architectural register and an original physical register to which the destination architectural register was mapped before the architectural register remapped to an available physical register.Type: GrantFiled: November 11, 2015Date of Patent: August 30, 2022Assignee: ARM LIMITEDInventors: Vladimir Vasekin, Ian Michael Caulfield, Chiloda Ashan Senarath Pathirane
-
Patent number: 11429413Abstract: Technologies for dynamic statistics management include a computing device with a network interface controller (NIC) and a compute engine having a memory. The NIC is to provision a counter window to a software consumer executing in the computing device. The counter window is used to track a plurality of active counters associated with a network flow. The NIC determines whether one or more flush criteria are triggered. In so determining, the NIC transfers a value for each active counter to the memory, where global counter values are maintained.Type: GrantFiled: March 30, 2018Date of Patent: August 30, 2022Assignee: Intel CorporationInventors: Linden Cornett, Parthasarathy Sarangam
-
Patent number: 11422819Abstract: Disclosed herein are embodiments related to a power efficient multi-bit storage system. In one configuration, the multi-bit storage system includes a first storage circuit, a second storage circuit, a prediction circuit, and a clock gating circuit. In one aspect, the first storage circuit updates a first output bit according to a first input bit, in response to a trigger signal, and the second storage circuit updates a second output bit according to a second input bit, in response to the trigger signal. In one aspect, the prediction circuit generates a trigger enable signal indicating whether at least one of the first output bit or the second output bit is predicted to change a state. In one aspect, the clock gating circuit generates the trigger signal based on the trigger enable signal.Type: GrantFiled: June 12, 2020Date of Patent: August 23, 2022Assignee: Taiwan Semiconductor Manufacturing Company LimitedInventors: Kai-Chi Huang, Chi-Lin Liu, Wei-Hsiang Ma, Shang-Chih Hsieh
-
Patent number: 11422809Abstract: An apparatus and method for processing efficient multicast operation.Type: GrantFiled: May 13, 2020Date of Patent: August 23, 2022Assignee: INTEL CORPORATIONInventors: Christopher J. Hughes, Dan Baum
-
Patent number: 11416251Abstract: A data processing system utilizes non-volatile storage to store constant values. An instruction decoder decodes program instructions to generate control signals to control processing circuitry to perform processing operations which may include processing operations corresponding to constant-using program instructions. Such constant-using program instructions may include one or more operation specifying fields and one or more argument specifying fields which control the processing circuitry to generate an output value equal to that given by reading one or more constant values from the non-volatile storage, optionally modifying such a value, and then performing the processing operation upon the value, or the modified value, to generate an output value.Type: GrantFiled: November 16, 2017Date of Patent: August 16, 2022Assignee: Arm LimitedInventors: Sean Tristram LeGuay Ellis, Andrew James Booker
-
Patent number: 11416254Abstract: Systems, apparatuses, and methods for implementing zero cycle load bypass operations are described. A system includes a processor with at least a decode unit, control logic, mapper, and free list. When a load operation is detected, the control logic determines if the load operation qualifies to be converted to a zero cycle load bypass operation. Conditions for qualifying include the load operation being in the same decode group as an older store operation to the same address. Qualifying load operations are converted to zero cycle load bypass operations. A lookup of the free list is prevented for a zero cycle load bypass operation and a destination operand of the load is renamed with a same physical register identifier used for a source operand of the store. Also, the data of the store is bypassed to the load.Type: GrantFiled: December 5, 2019Date of Patent: August 16, 2022Assignee: Apple Inc.Inventors: Deepankar Duggal, Kulin N. Kothari, Conrado Blasco, Muawya M. Al-Otoom
-
Patent number: 11409694Abstract: A processor is provided. The processor includes a plurality of processing elements configured to be arranged in a matrix form, and a controller configured to control the plurality of processing elements during a plurality of cycles to process a target data, control first processing elements so that each of the first processing elements operates data provided from adjacent first processing elements and the input first element and inputs each of second elements included in a second row among the plurality of elements to second processing elements arranged in the second row among the plurality of processing elements, control the second processing elements so that each of the second processing elements operates data provided from adjacent second processing elements and the input second element, and operates data provided from the adjacent first processing elements in the same column among the first processing elements and pre-stored operation data.Type: GrantFiled: July 23, 2020Date of Patent: August 9, 2022Assignee: Samsung Electronics Co., Ltd.Inventors: Kyounghoon Kim, Gyeonghoon Kim, Hyunsik Kim, Haksup Song, Guyeon Wei, Jonghun Lee, Jinsae Jung, Junguk Cho, Sangbok Han
-
Patent number: 11409536Abstract: A method and apparatus for performing a multi-precision computation in a plurality of arithmetic logic units (ALUs) includes pairing a first Single Instruction/Multiple Data (SIMD) block channel device with a second SIMD block channel device to create a first block pair having one-level staggering between the first and second channel devices. A third SIMD block channel device is paired with a fourth SIMD block channel device to create a second block pair having one-level staggering between the third and fourth channel devices. A plurality of source inputs are received at the first block pair and the second block pair. The first block pair computes a first result, and the second block pair computes a second result.Type: GrantFiled: November 3, 2016Date of Patent: August 9, 2022Assignee: ADVANCED MICRO DEVICES, INC.Inventors: Bin He, YunXiao Zou, Jiasheng Chen, Michael Mantor
-
Patent number: 11409525Abstract: An apparatus and method for performing multiply-accumulate operations.Type: GrantFiled: January 24, 2018Date of Patent: August 9, 2022Assignee: Intel CorporationInventors: Alexander Heinecke, Dipankar Das, Robert Valentine, Mark Charney
-
Patent number: 11403255Abstract: Provided herein are implementations for managing discovered chiplets based on a physical topology. In particular, a method includes receiving, by a control chiplet, a discovery signal from a first subordinate chiplet, the control chiplet being a portion of a processing unit of a package. The method further includes determining, by the control chiplet, a physical topology of the first subordinate chiplet relative to the control chiplet based on the discovery signal. The method further includes managing, by the control chiplet, operation of the first subordinate chiplet based on the physical topology of the first subordinate chiplet.Type: GrantFiled: June 29, 2021Date of Patent: August 2, 2022Assignee: Red Hat, Inc.Inventor: Jonathan C. Masters
-
Patent number: 11397578Abstract: An apparatus such as a graphics processing unit (GPU) includes a plurality of processing elements configured to concurrently execute a plurality of first waves and accumulators associated with the plurality of processing elements. The accumulators are configured to store accumulated values representative of behavioral characteristics of the plurality of first waves that are concurrently executing on the plurality of processing elements. The apparatus also includes a dispatcher configured to dispatch second waves to the plurality of processing elements based on comparisons of values representative of behavioral characteristics of the second waves and the accumulated values stored in the accumulators. In some cases, the behavioral characteristics of the plurality of first waves comprise at least one of fetch bandwidths, usage of an arithmetic logic unit (ALU), and number of export operations.Type: GrantFiled: August 30, 2019Date of Patent: July 26, 2022Assignee: Advanced Micro Devices, Inc.Inventors: Randy Ramsey, William David Isenberg, Michael Mantor
-
Patent number: 11397577Abstract: A method, computer program product, and system for managing parallel microservices are provided. The method may include identifying information pertaining to each of a plurality of target microservices to be invoked by an issuer microservice, a predefined condition associated with the plurality of target microservices, and an action to be executed by the issuer microservice in response to the predefined condition being satisfied. The method may also include sending a first request to available target microservices of the plurality of target microservices based on the information pertaining to the respective available target microservices. The method may also include, in response to receiving a response to the first request from an available target microservice of the available target microservices, determining whether the predefined condition is satisfied, and in response to determining that the predefined condition is satisfied, causing the action to be executed by the issuer microservice.Type: GrantFiled: August 31, 2020Date of Patent: July 26, 2022Assignee: International Business Machines CorporationInventors: Xin Peng Liu, Yue Wang, Shuo Li, Xiaobo Wang
-
Patent number: 11379242Abstract: An integrated circuit may include elastic datapaths or pipelines, through which software threads or iterations of loops, may be executed. Throttling circuitry may be coupled along an elastic pipeline in the integrated circuit. The throttling circuitry may include dependency detection circuitry that dynamically detect memory dependency issues that may arise during runtime. To mitigate these dependency issues, the throttling circuitry may assert stall signals to upstream stages in the pipeline. Additionally, the throttling circuitry may control the pipeline to resolve a store operation prior to a corresponding load operation in order to avoid store/load conflicts. In an embodiment, the throttling circuitry may include a validator circuit, a rewind block, a revert block, and a flush block. The throttling circuitry may pass speculative iterations through the rewind block, and later validate the speculative iterations using the validator block.Type: GrantFiled: June 29, 2017Date of Patent: July 5, 2022Assignee: Intel CorporationInventors: Andrei Mihai Hagiescu Miriste, Byron Sinclair, Joseph Garvey
-
Patent number: 11372644Abstract: A processor system comprises a shared memory and a processing element. The processing element includes a matrix processor unit and is in communication with the shared memory. The processing element is configured to receive a processor instruction specifying a data matrix and a matrix manipulation operation. A manipulation matrix based on the processor instruction is identified. The data matrix and the manipulation matrix are loaded into the matrix processor unit and a matrix operation is performed to determine a result matrix. The result matrix is outputted to a destination location.Type: GrantFiled: December 9, 2019Date of Patent: June 28, 2022Assignee: Meta Platforms, Inc.Inventors: Thomas Mark Ulrich, Krishnakumar Narayanan Nair, Yuchen Hao