Patents Examined by Chuong D Ngo

High performance QR decomposition systems and methods

Patent number: 10872130

Abstract: Based on a Modified Gram-Schmidt (MGS) algorithm, QR decomposition techniques are optimized for parallel structures that provide arithmetic-logic unit (ALU) to ALU connectivity. The techniques utilize a different loop organization, but the dependent functional sequences of the algorithm are unchanged, thereby reducing likelihood of affecting error analysis and/or numerical stability. Some integrated circuit devices (e.g., FPGA) may implement hard floating-point (HFP) circuitry, such as a digital signal processing (DSP) block, distributed memories, and/or flexible internal connectivity, which can support the discussed high performance matrix arithmetic.

Type: Grant

Filed: August 31, 2017

Date of Patent: December 22, 2020

Assignee: Intel Corporation

Inventor: Martin Langhammer
Hardware-efficient variational quantum eigenvalue solver for quantum computing machines

Patent number: 10839306

Abstract: Generating trial states for a variational quantum Eigenvalue solver (VQE) using a quantum computer is described. An example method includes selecting a number of samples S to capture from qubits for a particular trial state. The method further includes mapping a Hamiltonian to the qubits according the trial state. The method further includes setting up an entangler in the quantum computer, the entangler defining an entangling interaction between a subset of the qubits of the quantum computer. The method further includes reading out qubit states after post-rotations associated with Pauli terms in the target Hamiltonian, the reading out being performed for S samples. The method further includes computing an energy state using the S qubit states. The method further includes, in response to the estimated energy state not converging with an expected energy state, computing a new trial state for the VQE and iterating to compute the estimated energy using the new trial state.

Type: Grant

Filed: April 17, 2019

Date of Patent: November 17, 2020

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: Antonio Mezzacapo, Jay M. Gambetta, Abhinav Kandala, Maika Takita, Paul K. Temme
Low power digital interpolation/decimation apparatus and method

Patent number: 10840891

Abstract: An apparatus performs interpolation/decimation in a digital circuit that receives an input signal and includes upsampling/downsampling and filtering stages. First and second paths include distinct first and second portions of the upsampling/downsampling and filtering stages. The first path consumes less quiescent state power. A selection circuit uses the first or second path and turns off the unused first or second path based on input signal spectral content or level. A mode includes applying a front-end digital/analog gain and a corresponding back-end analog/digital attenuation in conjunction with the first path being used and the second path being turned off. A cross-fader uses the first and second paths in a weighted mix manner while making a transition between using the first and second paths. The second path has higher filtering performance (e.g., superior stopband attenuation, passband ripple, transition band, e.g., via higher order or greater bit-width filtering).

Type: Grant

Filed: December 24, 2018

Date of Patent: November 17, 2020

Inventors: Aniruddha Satoskar, John L. Melanson, Siva Venkata Subbarao Bonasu
Fixed-point and floating-point arithmetic operator circuits in specialized processing blocks

Patent number: 10838695

Abstract: The present embodiments relate to circuitry that efficiently performs floating-point arithmetic operations and fixed-point arithmetic operations. Such circuitry may be implemented in specialized processing blocks. If desired, the specialized processing blocks may include configurable interconnect circuitry to support a variety of different use modes. For example, the specialized processing block may efficiently perform a fixed-point or floating-point addition operation or a portion thereof, a fixed-point or floating-point multiplication operation or a portion thereof, a fixed-point or floating-point multiply-add operation or a portion thereof, just to name a few. In some embodiments, two or more specialized processing blocks may be arranged in a cascade chain and perform together more complex operations such as a recursive mode dot product of two vectors of floating-point numbers or a Radix-2 Butterfly circuit, just to name a few.

Type: Grant

Filed: June 4, 2019

Date of Patent: November 17, 2020

Assignee: Altera Corporation

Inventor: Martin Langhammer
Multiply-accumulate operation device

Patent number: 10831447

Abstract: Electric charges depending on values of N+ electric signals and values of corresponding positive loads are held in first capture-and-storage circuitry. Electric charges having a size depending on values of (N?N+) electric signals and corresponding absolute values of negative loads are held in second capture-and-storage circuitry. A sum of N+ multiplied values obtained by multiplying each of the positive loads by each of the values of the N+ electric signals is calculated when a voltage held in the first capture-and-storage circuitry reaches a first threshold. A sum of (N?N+) multiplied values obtained by multiplying each of the absolute values by each of the values of the (N?N+) electric signals is calculated when a voltage held in the second capture-and-storage circuitry reaches a second threshold A sum of N multiplied values is obtained by subtracting the sum of (N?N+) multiplied values from the sum of N+ multiplied values.

Type: Grant

Filed: August 3, 2017

Date of Patent: November 10, 2020

Assignee: Sony Corporation

Inventors: Takashi Morie, Quan Wang, Hakaru Tamukoh
Low-power adder circuit

Patent number: 10824692

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for a circuit configured to add multiple inputs. The circuit includes a first adder section that receives a first input and a second input and adds the inputs to generate a first sum. The circuit also includes a second adder section that receives the first and second inputs and adds the inputs to generate a second sum. An input processor of the circuit receives the first and second inputs, determines whether a relationship between the first and second inputs satisfies a set of conditions, and selects a high-power mode of the adder circuit or a low-power mode of the adder circuit using the determined relationship between the first and second inputs. The high-power mode is selected and the first and second inputs are routed to the second adder section when the relationship satisfies the set of conditions.

Type: Grant

Filed: October 12, 2018

Date of Patent: November 3, 2020

Assignee: Google LLC

Inventors: Anand Suresh Kane, Ravi Narayanaswami
Reconfigurable matrix multiplier system and method

Patent number: 10817587

Abstract: A reconfigurable matrix multiplier (RMM) system/method allowing tight or loose coupling to supervisory control processor application control logic (ACL) in a system-on-a-chip (SOC) environment is disclosed. The RMM provides for C=A*B matrix multiplication operations having A-multiplier-matrix (AMM), B-multiplicand-matrix (BMM), and C-product-matrix (CPM), as well as C=A*B+D operations in which D-summation-matrix (DSM) represents the result of a previous multiplication operation or another previously defined matrix. The RMM provides for additional CPM LOAD/STORE paths allowing overlapping of compute/data transfer operations and provides for CPM data feedback to the AMM or BMM operand inputs from a previously calculated CPM result.

Type: Grant

Filed: February 26, 2018

Date of Patent: October 27, 2020

Assignee: TEXAS INSTRUMENTS INCORPORATED

Inventors: Arthur John Redfern, Donald Edward Steiss, Timothy David Anderson, Kai Chirca
Reducing dynamic power consumption in arrays

Patent number: 10817260

Abstract: Systems and methods are provided to skip multiplication operations with zeros in processing elements of the systolic array to reduce dynamic power consumption. A value of zero can be detected on an input data element entering each row of the array and respective zero indicators may be generated. These respective zero indicators may be passed to all the processing elements in the respective rows. The multiplication operation with the zero value can be skipped in each processing element based on the zero indicators, thus reducing dynamic power consumption.

Type: Grant

Filed: June 13, 2018

Date of Patent: October 27, 2020

Assignee: Amazon Technologies, Inc.

Inventors: Randy Huang, Ron Diamant, Thomas Elmer, Sundeep Amirineni, Thomas A. Volpe
Square root digit recurrence

Patent number: 10809980

Abstract: A data processing apparatus is provided, for performing a digit-recurrence square root operation on an input value. Receiver circuitry receives a remainder value of a previous iteration of the digit-recurrence square root operation. Comparison circuitry compares most significant bits of the remainder value of the previous iteration with a number of selection constants, in order to output a next digit of a result of the digit-recurrence square root operation. The comparison circuitry compares at most 3 fractional bits of the remainder value of the previous iteration with the plurality of selection constants.

Type: Grant

Filed: June 14, 2017

Date of Patent: October 20, 2020

Assignee: ARM Limited

Inventors: Javier Diaz Bruguera, David M. Russinoff
Variable group calculation apparatus, variable group calculation method, variable group calculation program, and data structure

Patent number: 10783218

Abstract: The present disclosure is applied to a variable group calculation apparatus for calculating an undetermined variable group that simultaneously minimizes a difference value and a data value. The difference value is a difference between an added composite value, which is obtained by adding and combining the undetermined variable group and a dictionary data group, and an observation data group. The data value includes the difference value and a regularization term of the undetermined variable group. The variable group calculation apparatus of the present disclosure includes a convolution unit configured to convert the regularization term to a convolution value for an L1 norm using the undetermined variable group and a mollifier function, and a calculation unit configured to perform the calculation using the regularization term, which is converted to the convolution value by the convolution unit.

Type: Grant

Filed: May 24, 2018

Date of Patent: September 22, 2020

Assignee: TOYOTA JIDOSHA KABUSHIKI KAISHA

Inventors: Shintaro Yoshizawa, Norimasa Kobori
Optimized neural network input stride method and apparatus

Patent number: 10776694

Abstract: A convolutional layer in a convolutional neural network uses a predetermined horizontal input stride and a predetermined vertical input stride that are greater than 1 while the hardware forming the convolutional layer operates using an input stride of 1. Each original weight kernel of a plurality of sets of original weight kernels is subdivided based on the predetermined horizontal and vertical input strides to form a set of a plurality of sub-kernels for each set of original weight kernels. Each of a plurality of IFMs is subdivided based on the predetermined horizontal and vertical input strides to form a plurality of sub-maps. Each sub-map is convolved by the corresponding sub-kernel for a set of original weight kernels using an input stride of 1. A convolved result of each sub-map and the corresponding sub-kernel is summed to form an output feature map.

Type: Grant

Filed: August 8, 2017

Date of Patent: September 15, 2020

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: John Wakefield Brothers
Dot product calculators and methods of operating the same

Patent number: 10768895

Abstract: Methods, apparatus, systems and articles of manufacture to perform dot product calculations using sparse vectors are disclosed. An example dot product calculator includes a counter to determine a trailing binary count of a control vector, the control vector corresponding to a first result of a first logic AND operation on a first bitmap of a first sparse vector and a second bitmap of a second sparse vector. The example dot product calculator further includes a mask generator to generate a mask vector based on the trailing binary count. The example dot product calculator further includes an interface to access a first value of the first sparse vector based on a second result of a second logic AND operation on the first bitmap and the mask vector and access a second value of the second sparse vector based on a third result of a third logic AND operation on the second bitmap and the mask vector.

Type: Grant

Filed: November 8, 2018

Date of Patent: September 8, 2020

Assignee: Movidius Limited

Inventors: Fergal Connor, David Bernard, Niall Hanrahan
Machine learning accelerator architecture

Patent number: 10769526

Abstract: An apparatus to facilitate acceleration of machine learning operations is disclosed. The apparatus comprises accelerator circuitry including a first set of processing elements to perform first computations including matrix multiplication operations, a second set of processing elements to perform second computations including sum of elements of weights and offset multiply operations and a third set of processing elements to perform third computations including sum of elements of inputs and offset multiply operations, wherein the second and third computations are performed in parallel with the first computations.

Type: Grant

Filed: April 24, 2018

Date of Patent: September 8, 2020

Assignee: Intel Corporation

Inventors: Bharat Daga, Pradeep Janedula, Aravind Babu Srinivasan, Ambili Vengallur
Reduced floating-point precision arithmetic circuitry

Patent number: 10761805

Abstract: The present embodiments relate to performing reduced-precision floating-point arithmetic operations using specialized processing blocks with higher-precision floating-point arithmetic circuitry. A specialized processing block may receive four floating-point numbers that represent two single-precision floating-point numbers, each separated into an LSB portion and an MSB portion, or four half-precision floating-point numbers. A first partial product generator may generate a first partial product of first and second input signals, while a second partial product generator may generate a second partial product of third and fourth input signals.

Type: Grant

Filed: September 26, 2018

Date of Patent: September 1, 2020

Assignee: Altera Corporation

Inventor: Martin Langhammer
Random number generator

Patent number: 10761809

Abstract: A random number generator includes an entropy source comprising a first digital device arranged to apply to an input signal a first delay value to obtain a first signal and a second digital device arranged to apply to the input signal a second delay value different from the first delay value to obtain a second signal; a sampling unit configured to sample one of the first and second signals using the other signal as reference clock, thereby obtaining a sampled signal; measurement means to perform measurements of the sampled signal's delay difference with respect to the reference clock; a controller circuit arranged to monitor the measured delay difference of the sampled signal and to check the values of the measured delay difference and, once a given condition related to the values is met, to output a configuration signal.

Type: Grant

Filed: March 12, 2020

Date of Patent: September 1, 2020

Assignee: KATHOLIEKE UNIVERSITEIT LEUVEN

Inventors: Adriaan Peetermans, Vladimir Rozic, Ingrid Verbauwhede
Tiled switch matrix data permutation circuit

Patent number: 10754621

Abstract: Embodiments of the present disclosure pertain to switch matrix circuit including a data permutation circuit. In one embodiment, the switch matrix comprises a plurality of adjacent switching blocks configured along a first axis, wherein the plurality of adjacent switching blocks each receive data and switch control settings along a second axis. The switch matrix includes a permutation circuit comprising, in each switching block, a plurality of switching stages spanning a plurality of adjacent switching blocks and at least one switching stage that does not span to adjacent switching blocks. The permutation circuit receives data in a first pattern and outputs the data in a second pattern. The data permutation performed by the switching stages is based on the particular switch control settings received in the adjacent switching blocks along the second axis.

Type: Grant

Filed: August 30, 2018

Date of Patent: August 25, 2020

Assignee: Groq, Inc.

Inventor: Gregory Michael Thorson
Generating of random numbers

Patent number: 10754617

Abstract: A device for generating a random number is suggested, the device comprising at least two shift registers, a transformation function that generates the random number based on at least one cell of each of the at least two shift registers.

Type: Grant

Filed: January 13, 2016

Date of Patent: August 25, 2020

Assignee: Infineon Technologies AG

Inventors: Berndt Gammel, Rainer Goettfert
Hybrid non-uniform convolution transform engine for deep learning applications

Patent number: 10755169

Abstract: A system performs convolution operations based on an analysis of the input size. The input includes data elements and filter weights. The system includes multiple processing elements. Each processing element includes multipliers and adders, with more of the adders than the multipliers. According to at least the analysis result which indicates whether the input size matches a predetermined size, the system is operative to select a first mode or a second mode. In the first mode, a greater number of the adders than the multipliers are enabled for each processing element to multiply transformed input and to perform an inverse transformation. In the second mode, an equal number of the adders and the multipliers are enabled for each processing element to multiply-and-accumulate the input. One or more of the multipliers are shared by the first mode and the second mode.

Type: Grant

Filed: December 14, 2017

Date of Patent: August 25, 2020

Assignee: MediaTek Inc.

Inventors: Pei-Kuei Tsung, Chien-Hung Lin, Yao-Sheng Wang, Po-Yu Chen
Optimizing linear programming models using different solvers

Patent number: 10747842

Abstract: Some embodiments provide a program. The program receives from a client device a request specifying a plurality of variables comprising set of sources and a set of objects. The program further receives a set of values from the set of sources. Each value in the set of values specifies an object in set of objects. The program also receives a selection of a set of defined scenarios for the request and the set of values. The program further generates a set of linear programming models based on the set of defined scenarios, the request, and the set of values.

Type: Grant

Filed: November 27, 2018

Date of Patent: August 18, 2020

Assignee: SAP SE

Inventors: Swapnil Laddha, Ajay Jose, Aarathi Vidyasagar, Rajendra Vuppala, Sudhir Bhojwani
Vectorization of wide integer data paths into parallel operations with value extraction for maintaining valid guard bands

Patent number: 10747534

Abstract: The embodiments herein describe techniques for monitoring guard bits in multi-result vectors generated by a first arithmetic unit in a chain and using side band logic to add or subtract offset values from guard bits in a second, subsequent arithmetic unit in the chain. In this manner, the guard bits can be adjusted on the fly (e.g., without interrupting or terminating the chain) to ensure the guard bits do not overflow. The side band logic can maintain a guard bits overflow value which is then combined with the output vector from the final arithmetic unit in the chain to compensate for adjusting the guard bits at the various arithmetic units in the chain. In this manner, the chain can have any desired length.

Type: Grant

Filed: November 26, 2018

Date of Patent: August 18, 2020

Assignee: XILINX, INC.

Inventors: Thomas B. Preusser, Thomas A. Branca

prev … 3 4 5 6 7 8 9 10 11 … next