Patents by Inventor Ashutosh Garg

Ashutosh Garg has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SHARING REGISTER FILE USAGE BETWEEN FUSED PROCESSING RESOURCES

Publication number: 20220206795

Abstract: Embodiments described herein provide an apparatus comprising a plurality of processing resources including a first processing resource and a second processing resource, a shared local memory communicatively coupled to the first processing resource and the second processing resource, and a processor to receive an instruction to initiate a matrix multiplication operation, write a first set of matrix data into a first set of registers, and share the first set of matrix data between the first processing resource and the second processing resource for use in the matrix multiplication operation. Other embodiments may be described and claimed.

Type: Application

Filed: January 5, 2022

Publication date: June 30, 2022

Applicant: Intel Corporation

Inventors: SUBRAMANIAM MAIYURAN, VARGHESE GEORGE, JOYDEEP RAY, ASHUTOSH GARG, JORGE PARRA, SHUBH SHAH, SHUBRA MARWAHA
Graphics processors and graphics processing units having dot product accumulate instruction for hybrid floating point format

Patent number: 11361496

Abstract: Described herein is a graphics processing unit (GPU) comprising a single instruction, multiple thread (SIMT) multiprocessor comprising an instruction cache, a shared memory coupled with the instruction cache, and circuitry coupled with the shared memory and the instruction cache, the circuitry including multiple texture units, a first core including hardware to accelerate matrix operations, and a second core configured to receive an instruction having multiple operands in a bfloat16 (BF16) number format, wherein the multiple operands include a first source operand, a second source operand, and a third source operand, and the BF16 number format is a sixteen-bit floating point format having an eight-bit exponent and process the instruction, wherein to process the instruction includes to multiply the second source operand by the third source operand and add a first source operand to a result of the multiply.

Type: Grant

Filed: June 14, 2021

Date of Patent: June 14, 2022

Assignee: Intel Corporation

Inventors: Subramaniam Maiyuran, Shubra Marwaha, Ashutosh Garg, Supratim Pal, Jorge Parra, Chandra Gurram, Varghese George, Darin Starkey, Guei-Yuan Lueh
SPARSE MATRIX MULTIPLICATION ACCELERATION MECHANISM

Publication number: 20220171827

Abstract: An apparatus to facilitate acceleration of matrix multiplication operations. The apparatus comprises a systolic array including matrix multiplication hardware to perform multiply-add operations on received matrix data comprising data from a plurality of input matrices and sparse matrix acceleration hardware to detect zero values in the matrix data and perform one or more optimizations on the matrix data to reduce multiply-add operations to be performed by the matrix multiplication hardware.

Type: Application

Filed: November 16, 2021

Publication date: June 2, 2022

Applicant: Intel Corporation

Inventors: SUBRAMANIAM MAIYURAN, MATHEW NEVIN, JORGE PARRA, ASHUTOSH GARG, SHUBRA MARWAHA, SHUBH SHAH
SCALABLE SPARSE MATRIX MULTIPLY ACCELERATION USING SYSTOLIC ARRAYS WITH FEEDBACK INPUTS

Publication number: 20220156343

Abstract: Described herein is an accelerator device including a host interface, a fabric interconnect coupled with the host interface, and one or more hardware tiles coupled with the fabric interconnect, the one or more hardware tiles including sparse matrix multiply acceleration hardware including a systolic array with feedback inputs.

Type: Application

Filed: November 16, 2021

Publication date: May 19, 2022

Applicant: Intel Corporation

Inventors: SUBRAMANIAM MAIYURAN, JORGE PARRA, SUPRATIM PAL, ASHUTOSH GARG, SHUBRA MARWAHA, CHANDRA GURRAM, DARIN STARKEY, DURGESH BORKAR, VARGHESE GEORGE
Compiler assisted register file write reduction

Patent number: 11321799

Abstract: Examples described herein relate to a software and hardware optimization that manages scenarios where a write operation to a register is less than an entirety of the register. A compiler detects instructions that make partial writes to the same register, groups such instructions, and provides hints to hardware of the partial write. The execution unit combines the output data for grouped instructions and updates the destination register as single write instead of multiple separate partial writes.

Type: Grant

Filed: December 24, 2019

Date of Patent: May 3, 2022

Assignee: Intel Corporation

Inventors: Chandra S. Gurram, Gang Y. Chen, Subramaniam Maiyuran, Supratim Pal, Ashutosh Garg, Jorge E. Parra, Darin M. Starkey, Guei-Yuan Lueh, Wei-Yu Chen
GRAPHICS PROCESSORS AND GRAPHICS PROCESSING UNITS HAVING DOT PRODUCT ACCUMULATE INSTRUCTION FOR HYBRID FLOATING POINT FORMAT

Publication number: 20220129266

Abstract: Graphics processors and graphics processing units having dot product accumulate instructions for a hybrid floating point format are disclosed. In one embodiment, a graphics multiprocessor comprises an instruction unit to dispatch instructions and a processing resource coupled to the instruction unit. The processing resource is configured to receive a dot product accumulate instruction from the instruction unit and to process the dot product accumulate instruction using a bfloat16 number (BF16) format.

Type: Application

Filed: March 14, 2020

Publication date: April 28, 2022

Applicant: Intel Corporation

Inventors: Subramaniam Maiyuran, Shubra Marwaha, Ashutosh Garg, Supratim Pal, Jorge Parra, Chandra Gurram, Varghese George, Darin Starkey, Guei-Yuan Lueh
Instructions and logic for vector multiply add with zero skipping

Patent number: 11314515

Abstract: Embodiments described herein provide for an instruction and associated logic to enable a vector multiply add instructions with automatic zero skipping for sparse input. One embodiment provides for a general-purpose graphics processor comprising logic to perform operations comprising fetching a hardware macro instruction having a predicate mask, a repeat count, and a set of initial operands, where the initial operands include a destination operand and multiple source operands. The hardware macro instruction is configured to perform one or more multiply/add operations on input data associated with a set of matrices.

Type: Grant

Filed: December 23, 2019

Date of Patent: April 26, 2022

Assignee: Intel Corporation

Inventors: Supratim Pal, Sasikanth Avancha, Ishwar Bhati, Wei-Yu Chen, Dipankar Das, Ashutosh Garg, Chandra S. Gurram, Junjie Gu, Guei-Yuan Lueh, Subramaniam Maiyuran, Jorge E. Parra, Sudarshan Srinivasan, Varghese George
SYSTEM, METHOD, AND COMPUTER PROGRAM FOR PROCESSING COMPENSATION DATA

Publication number: 20220092547

Abstract: A system and method for implementing a talent management platform to determine, from a pool of applicants to a job opening, matching candidates, determine, using a neural network module, a set of feature values from the one or more feature values contained in the talent profiles associated with the matching candidates, generate a query based on the set of feature values, retrieve, based on the query, compensation data from a compensation database, and present, in a user interface, the compensation data as part of potential offers to the matching candidates.

Type: Application

Filed: September 17, 2021

Publication date: March 24, 2022

Applicant: Eightfold AI Inc.

Inventors: Ashutosh Garg, Ruoyu Roy Wang
Sharing register file usage between fused processing resources

Patent number: 11221848

Abstract: Embodiments described herein provide an apparatus comprising a plurality of processing resources including a first processing resource and a second processing resource, a shared local memory communicatively coupled to the first processing resource and the second processing resource, and a processor to receive an instruction to initiate a matrix multiplication operation, write a first set of matrix data into a first set of registers, and share the first set of matrix data between the first processing resource and the second processing resource for use in the matrix multiplication operation. Other embodiments may be described and claimed.

Type: Grant

Filed: September 25, 2019

Date of Patent: January 11, 2022

Assignee: INTEL CORPORATION

Inventors: Subramaniam Maiyuran, Varghese George, Joydeep Ray, Ashutosh Garg, Jorge Parra, Shubh Shah, Shubra Marwaha
Scalable sparse matrix multiply acceleration using systolic arrays with feedback inputs

Patent number: 11204977

Abstract: Described herein is an accelerator device including a host interface, a fabric interconnect coupled with the host interface, and one or more hardware tiles coupled with the fabric interconnect, the one or more hardware tiles including sparse matrix multiply acceleration hardware including a systolic array with feedback inputs.

Type: Grant

Filed: June 26, 2020

Date of Patent: December 21, 2021

Assignee: Intel Corporation

Inventors: Subramaniam Maiyuran, Jorge Parra, Supratim Pal, Ashutosh Garg, Shubra Marwaha, Chandra Gurram, Darin Starkey, Durgesh Borkar, Varghese George
SPARSE OPTIMIZATIONS FOR A MATRIX ACCELERATOR ARCHITECTURE

Publication number: 20210374897

Abstract: Embodiments described herein include, software, firmware, and hardware logic that provides techniques to perform arithmetic on sparse data via a systolic processing unit. Embodiment described herein provided techniques to skip computational operations for zero filled matrices and sub-matrices. Embodiments additionally provide techniques to maintain data compression through to a processing unit. Embodiments additionally provide an architecture for a sparse aware logic unit.

Type: Application

Filed: June 3, 2021

Publication date: December 2, 2021

Applicant: Intel Corporation

Inventors: Joydeep Ray, Scott Janus, Varghese George, Subramaniam Maiyuran, Altug Koker, Abhishek Appu, Prasoonkumar Surti, Vasanth Ranganathan, Andrei Valentin, Ashutosh Garg, Yoav Harel, Arthur Hunter, JR., SungYe Kim, Mike Macpherson, Elmoustapha Ould-Ahmed-Vall, William Sadler, Lakshminarayanan Striramassarma, Vikranth Vemulapalli
Sparse matrix multiplication acceleration mechanism

Patent number: 11188618

Abstract: An apparatus to facilitate acceleration of matrix multiplication operations. The apparatus comprises a systolic array including matrix multiplication hardware to perform multiply-add operations on received matrix data comprising data from a plurality of input matrices and sparse matrix acceleration hardware to detect zero values in the matrix data and perform one or more optimizations on the matrix data to reduce multiply-add operations to be performed by the matrix multiplication hardware.

Type: Grant

Filed: September 5, 2019

Date of Patent: November 30, 2021

Assignee: Intel Corporation

Inventors: Subramaniam Maiyuran, Mathew Nevin, Jorge Parra, Ashutosh Garg, Shubra Marwaha, Shubh Shah
System, method, and computer program for enabling a candidate to anonymously apply for a job

Patent number: 11176271

Abstract: A system and computer program enable a candidate to anonymously apply for a job position at an organization. In response to submitting a resume or other data for the purpose of applying anonymously for a job, the system generates an anonymous profile for the candidate and provides it to the organization to which the candidate is applying. The system excludes the candidate's name from the anonymous profile. In certain embodiments, the system also excludes one or more of the following: the candidate's address, data that is indicative of a candidate's race, age, or gender, and data that is not relevant for the job role for which the candidate is applying. After reviewing the anonymous profile, the organization has the option to reject the candidate or explore the candidate further. In response to the organization rejecting the candidate, the system notifies the candidate of the rejection without revealing the candidate's identity to the organization.

Type: Grant

Filed: December 4, 2018

Date of Patent: November 16, 2021

Assignee: EIGHTFOLD AI INC.

Inventors: Ashutosh Garg, Varun Kacholia
SCALABLE SPARSE MATRIX MULTIPLY ACCELERATION USING SYSTOLIC ARRAYS WITH FEEDBACK INPUTS

Publication number: 20210349966

Abstract: Described herein is an accelerator device including a host interface, a fabric interconnect coupled with the host interface, and one or more hardware tiles coupled with the fabric interconnect, the one or more hardware tiles including sparse matrix multiply acceleration hardware including a systolic array with feedback inputs.

Type: Application

Filed: June 26, 2020

Publication date: November 11, 2021

Applicant: Intel Corporation

Inventors: SUBRAMANIAM MAIYURAN, JORGE PARRA, SUPRATIM PAL, ASHUTOSH GARG, SHUBRA MARWAHA, CHANDRA GURRAM, DARIN STARKEY, DURGESH BORKAR, VARGHESE GEORGE
GRAPHICS PROCESSORS AND GRAPHICS PROCESSING UNITS HAVING DOT PRODUCT ACCUMULATE INSTRUCTION FOR HYBRID FLOATING POINT FORMAT

Publication number: 20210312697

Abstract: Described herein is a graphics processing unit (GPU) comprising a single instruction, multiple thread (SIMT) multiprocessor comprising an instruction cache, a shared memory coupled with the instruction cache, and circuitry coupled with the shared memory and the instruction cache, the circuitry including multiple texture units, a first core including hardware to accelerate matrix operations, and a second core configured to receive an instruction having multiple operands in a bfloat16 (BF16) number format, wherein the multiple operands include a first source operand, a second source operand, and a third source operand, and the BF16 number format is a sixteen-bit floating point format having an eight-bit exponent and process the instruction, wherein to process the instruction includes to multiply the second source operand by the third source operand and add a first source operand to a result of the multiply.

Type: Application

Filed: June 14, 2021

Publication date: October 7, 2021

Applicant: Intel Corporation

Inventors: Subramaniam Maiyuran, Shubra Marwaha, Ashutosh Garg, Supratim Pal, Jorge Parra, Chandra Gurram, Varghese George, Darin Starkey, Guei-Yuan Lueh
INSTRUCTION AND LOGIC FOR SYSTOLIC DOT PRODUCT WITH ACCUMULATE

Publication number: 20210303299

Abstract: Embodiments described herein provided for an instruction and associated logic to enable GPGPU program code to access special purpose hardware logic to accelerate dot product operations. One embodiment provides for a graphics processing unit comprising a fetch unit to fetch an instruction for execution and a decode unit to decode the instruction into a decoded instruction. The decoded instruction is a matrix instruction to cause the graphics processing unit to perform a parallel dot product operation. The GPGPU also includes systolic dot product circuitry to execute the decoded instruction across one or more SIMD lanes using multiple systolic layers, wherein to execute the decoded instruction, a dot product computed at a first systolic layer is to be output to a second systolic layer, wherein each systolic layer includes one or more sets of interconnected multipliers and adders, each set of multipliers and adders to generate a dot product.

Type: Application

Filed: June 15, 2021

Publication date: September 30, 2021

Applicant: Intel Corporation

Inventors: SUBRAMANIAM MAIYURAN, GUEI-YUAN LUEH, SUPRATIM PAL, ASHUTOSH GARG, CHANDRA S. GURRAM, JORGE E. PARRA, JUNJIE GU, KONRAD TRIFUNOVIC, HONG BIN LIAO, MIKE B. MACPHERSON, SHUBH B. SHAH, SHUBRA MARWAHA, STEPHEN JUNKINS, TIMOTHY R. BAUER, VARGHESE GEORGE, WEIYU CHEN
Sparse optimizations for a matrix accelerator architecture

Patent number: 11113784

Abstract: Embodiments described herein include, software, firmware, and hardware logic that provides techniques to perform arithmetic on sparse data via a systolic processing unit. Embodiment described herein provided techniques to skip computational operations for zero filled matrices and sub-matrices. Embodiments additionally provide techniques to maintain data compression through to a processing unit. Embodiments additionally provide an architecture for a sparse aware logic unit.

Type: Grant

Filed: October 6, 2020

Date of Patent: September 7, 2021

Assignee: Intel Corporation

Inventors: Joydeep Ray, Scott Janus, Varghese George, Subramaniam Maiyuran, Altug Koker, Abhishek Appu, Prasoonkumar Surti, Vasanth Ranganathan, Andrei Valentin, Ashutosh Garg, Yoav Harel, Arthur Hunter, Jr., SungYe Kim, Mike Macpherson, Elmoustapha Ould-Ahmed-Vall, William Sadler, Lakshminarayanan Striramassarma, Vikranth Vemulapalli
SYSTEM, METHOD, AND COMPUTER PROGRAM FOR AUTOMATICALLY REMOVING DATA FROM CANDIDATE PROFILES THAT MAY INFLUENCE BIAS

Publication number: 20210264373

Abstract: A system and method relate to excluding biasing data from talent profiles, including identifying one or more classes of bias, obtaining a first talent profile, wherein the first talent profile comprises an identifier of a person, and a plurality of values characterizing aspects of the person, determining, from the plurality of values, a first value that is indicative of influences over at least one of the one or more classes of bias, and removing or substituting the first value in the first talent profile to generate a second talent profile.

Type: Application

Filed: May 13, 2021

Publication date: August 26, 2021

Applicant: Eightfold AI Inc.

Inventors: Ashutosh GARG, Varun KACHOLIA
System, method and computer program product for providing thematic landing pages

Patent number: 11100554

Abstract: Techniques for thematic landing pages are disclosed. In some embodiments, a process for providing thematic landing pages includes receiving a user query for a theme; determining products (e.g., using a processor) that are relevant to the theme (e.g., based on a content relevancy); and generating a thematic web page for the theme based on the relevant products. For example, the thematic landing page can be associated with a merchant web site, and the relevant products can be products that are available via the merchant web site.

Type: Grant

Filed: March 12, 2015

Date of Patent: August 24, 2021

Assignee: BloomReach Inc.

Inventors: Mohit Gupta, Fei Chen, Fei Xie, Shao-Chuan Wang, Vache Moroyan, Ashutosh Garg, Stormy Shippy, Wally Ye, Ramkumar Rajendran
COMPILER ASSISTED REGISTER FILE WRITE REDUCTION

Publication number: 20210192673

Abstract: Examples described herein relate to a software and hardware optimization that manages scenarios where a write operation to a register is less than an entirety of the register. A compiler detects instructions that make partial writes to the same register, groups such instructions, and provides hints to hardware of the partial write. The execution unit combines the output data for grouped instructions and updates the destination register as single write instead of multiple separate partial writes.

Type: Application

Filed: December 24, 2019

Publication date: June 24, 2021

Inventors: Chandra S. GURRAM, Gang Y. CHEN, Subramaniam MAIYURAN, Supratim PAL, Ashutosh GARG, Jorge E. PARRA, Darin M. STARKEY, Guei-Yuan LUEH, Wei-Yu CHEN

prev 1 2 3 4 5 6 … next