Patents by Inventor Yaniv Fais

Yaniv Fais has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS AND APPARATUS TO TILE WALK A TENSOR FOR CONVOLUTION OPERATIONS

Publication number: 20240119255

Abstract: An example apparatus to perform a convolution on an input tensor includes a parameters generator to: generate a horizontal hardware execution parameter for a horizontal dimension of the input tensor based on a kernel parameter and a layer parameter; and generate a vertical hardware execution parameter for a vertical dimension of the input tensor based on the kernel parameter and the layer parameter; an accelerator interface to configure a hardware accelerator circuitry based on the horizontal and vertical hardware execution parameters; a horizontal Iterator controller to determine when the hardware accelerator circuitry completes the first horizontal iteration of the convolution; and a vertical Iterator controller to determine when the hardware accelerator circuitry completes the first vertical iteration of the convolution.

Type: Application

Filed: December 21, 2023

Publication date: April 11, 2024

Applicant: Intel Corporation

Inventors: Yaniv Fais, Moshe Maor
HARDWARE IP OPTIMIZED CONVOLUTIONAL NEURAL NETWORK

Publication number: 20240112033

Abstract: In an example, an apparatus comprises at least one execution platform; and logic, at least partially including hardware logic, to receive a trained neural network model in a model optimizer and convert the trained neural network model to an optimized model comprising parameters that are fit to the at least one execution platform. Other embodiments are also disclosed and claimed.

Type: Application

Filed: November 20, 2023

Publication date: April 4, 2024

Applicant: Intel Corporation

Inventors: Amit Bleiweiss, Itamar Ben-Ari, Michael Behar, Guy Jacob, Gal Leibovich, Jacob Subag, Lev Faivishevsky, Yaniv Fais, Tomer Schwartz
Variable precision and mix type representation of multiple layers in a network

Patent number: 11886984

Abstract: In an example, an apparatus comprises a plurality of execution units comprising at least a first type of execution unit and a second type of execution unit and logic, at least partially including hardware logic, to expose embedded cast operations in at least one of a load instruction or a store instruction; determine a target precision level for the cast operations; and load the cast operations at the target precision level. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: August 10, 2021

Date of Patent: January 30, 2024

Assignee: INTEL CORPORATION

Inventors: Uzi Sarel, Ehud Cohen, Tomer Schwartz, Amitai Armon, Yahav Shadmiy, Amit Bleiweiss, Gal Leibovich, Jeremie Dreyfuss, Lev Faivishevsky, Tomer Bar-On, Yaniv Fais, Jacob Subag
SMART MEMORY HANDLING AND DATA MANAGEMENT FOR MACHINE LEARNING NETWORKS

Publication number: 20240028883

Abstract: A mechanism is described for facilitating memory handling and data management in machine learning at autonomous machines. A method of embodiments, as described herein, includes detecting multiple tables associated with multiple neural networks at multiple autonomous machines, where each of the multiple tables include an index. The method may further include combining the multiple tables and multiple indexes associated with the multiple tables into a single table and a single index, respectively, where the single table is communicated to the multiple autonomous machines to allow simultaneous processing of one or more portions of the single table using one or more memory devices and one or more processors of one or more of the multiple autonomous machines.

Type: Application

Filed: July 26, 2023

Publication date: January 25, 2024

Applicant: Intel Corporation

Inventors: Tomer Schwartz, Ehud Cohen, Uzi Sarel, Amitai Armon, Yaniv Fais, Lev Faivishevsky, Amit Bleiweiss, Yahav Shadmiy, Jacob Subag
REAL TIME CONTEXT DEPENDENT DEEP LEARNING

Publication number: 20230394305

Abstract: In an example, an apparatus comprises a plurality of execution units comprising and logic, at least partially including hardware logic, to receive a plurality of data inputs for training a neural network, wherein the data inputs comprise training data and weights inputs; represent the data inputs in a first form; and represent the weight inputs in a second form. Other embodiments are also disclosed and claimed.

Type: Application

Filed: May 30, 2023

Publication date: December 7, 2023

Applicant: Intel Corporation

Inventors: Lev Faivishevsky, Tomer Bar-On, Yaniv Fais, Jacob Subag, Jeremie Dreyfuss, Amit Bleiweiss, Tomer Schwartz, Raanan Yonatan Yehezkel Rohekar, Michael Behar, Amitai Armon, Uzi Sarel
LOW RANK MATRIX COMPRESSION

Publication number: 20230316589

Abstract: In an example, an apparatus comprises logic, at least partially including hardware logic, to implement a lossy compression algorithm which utilizes a data transform and quantization process to compress data in a convolutional neural network (CNN) layer. Other embodiments are also disclosed and claimed.

Type: Application

Filed: March 28, 2023

Publication date: October 5, 2023

Applicant: Intel Corporation

Inventors: Tomer Bar-On, Jacob Subag, Yaniv Fais, Jeremie Dreyfuss, Gal Novik, Gal Leibovich, Tomer Schwartz, Ehud Cohen, Lev Faivishevsky, Uzi Sarel, Amitai Armon, Yahav Shadmiy
Smart memory handling and data management for machine learning networks

Patent number: 11763140

Abstract: A mechanism is described for facilitating memory handling and data management in machine learning at autonomous machines. A method of embodiments, as described herein, includes detecting multiple tables associated with multiple neural networks at multiple autonomous machines, where each of the multiple tables include an index. The method may further include combining the multiple tables and multiple indexes associated with the multiple tables into a single table and a single index, respectively, where the single table is communicated to the multiple autonomous machines to allow simultaneous processing of one or more portions of the single table using one or more memory devices and one or more processors of one or more of the multiple autonomous machines.

Type: Grant

Filed: August 5, 2021

Date of Patent: September 19, 2023

Assignee: INTEL CORPORATION

Inventors: Tomer Schwartz, Ehud Cohen, Uzi Sarel, Amitai Armon, Yaniv Fais, Lev Faivishevsky, Amit Bleiweiss, Yahav Shadmiy, Jacob Subag
SCHEDULING CONFIGURATION FOR DEEP LEARNING NETWORKS

Publication number: 20230281435

Abstract: In an example, an apparatus comprises a plurality of execution units comprising and logic, at least partially including hardware logic, to traverse a solution space, score a plurality of solutions to a scheduling deep learning network execution, and select a preferred solution from the plurality of solutions to implement the deep learning network. Other embodiments are also disclosed and claimed.

Type: Application

Filed: February 24, 2023

Publication date: September 7, 2023

Applicant: Intel Corporation

Inventors: Eran Ben-Avi, Neta Zmora, Guy Jacob, Lev Faivishevsky, Jeremie Dreyfuss, Tomer Bar-On, Jacob Subag, Yaniv Fais, Shira Hirsch, Orly Weisel, Zigi Walter, Yarden Oren
Real time context dependent deep learning

Patent number: 11704564

Abstract: In an example, an apparatus comprises a plurality of execution units comprising and logic, at least partially including hardware logic, to receive a plurality of data inputs for training a neural network, wherein the data inputs comprise training data and weights inputs; represent the data inputs in a first form; and represent the weight inputs in a second form. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: August 17, 2021

Date of Patent: July 18, 2023

Assignee: INTEL CORPORATION

Inventors: Lev Faivishevsky, Tomer Bar-On, Yaniv Fais, Jacob Subag, Jeremie Dreyfuss, Amit Bleiweiss, Tomer Schwartz, Raanan Yonatan Yehezkel Rohekar, Michael Behar, Amitai Armon, Uzi Sarel
Storage system of DNN outputs for black box

Patent number: 11669719

Abstract: In an example, an apparatus comprises logic, at least partially including hardware logic, to save one or more outputs of a deep learning neural network in a storage system of an autonomous vehicle and upload the one or more outputs to a remote server. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: February 12, 2021

Date of Patent: June 6, 2023

Assignee: INTEL CORPORATION

Inventors: Jeremie Dreyfuss, Amit Bleiweiss, Lev Faivishevsky, Tomer Bar-On, Yaniv Fais, Jacob Subag, Eran Ben-Avi, Neta Zmora, Tomer Schwartz
Multiply-accumulate “0” data gating

Patent number: 11656846

Abstract: In an example, an apparatus comprises a plurality of execution units and logic, at least partially including hardware logic, to gate at least one of a multiply unit or an accumulate unit in response to an input of value zero. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: November 24, 2020

Date of Patent: May 23, 2023

Assignee: INTEL CORPORATION

Inventors: Yaniv Fais, Tomer Bar-On, Jacob Subag, Jeremie Dreyfuss, Lev Faivishevsky, Michael Behar, Amit Bleiweiss, Guy Jacob, Gal Leibovich, Itamar Ben-Ari, Galina Ryvchin, Eyal Yaacoby
Low rank matrix compression

Patent number: 11620766

Abstract: In an example, an apparatus comprises logic, at least partially including hardware logic, to implement a lossy compression algorithm which utilizes a data transform and quantization process to compress data in a convolutional neural network (CNN) layer.

Type: Grant

Filed: June 10, 2021

Date of Patent: April 4, 2023

Assignee: INTEL CORPORATION

Inventors: Tomer Bar-On, Jacob Subag, Yaniv Fais, Jeremie Dreyfuss, Gal Novik, Gal Leibovich, Tomer Schwartz, Ehud Cohen, Lev Faivishevsky, Uzi Sarel, Amitai Armon, Yahav Shadmiy
Sub-graph in frequency domain and dynamic selection of convolution implementation on a GPU

Patent number: 11600035

Abstract: In an example, an apparatus comprises a plurality of execution units; and logic, at least partially including hardware logic, to determine a sub-graph of a network that can be executed in a frequency domain and apply computations in the sub-graph in the frequency domain. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: February 10, 2022

Date of Patent: March 7, 2023

Assignee: INTEL CORPORATION

Inventors: Uzi Sarel, Ehud Cohen, Tomer Schwartz, Amitai Armon, Yahav Shadmiy, Itamar Ben-Ari, Amit Bleiweiss, Lev Faivishevsky, Tomer Bar-On, Yaniv Fais, Jacob Subag, Michael Behar, Guy Jacob, Gal Leibovich, Jeremie Dreyfuss
Scheduling configuration for deep learning networks

Patent number: 11599777

Abstract: In an example, an apparatus comprises a plurality of execution units comprising and logic, at least partially including hardware logic, to traverse a solution space, score a plurality of solutions to a scheduling deep learning network execution, and select a preferred solution from the plurality of solutions to implement the deep learning network. Other embodiments are also disclosed and claimed.

Type: Grant

Filed: April 28, 2017

Date of Patent: March 7, 2023

Assignee: Intel Corporation

Inventors: Eran Ben-Avi, Neta Zmora, Guy Jacob, Lev Faivishevsky, Jeremie Dreyfuss, Tomer Bar-On, Jacob Subag, Yaniv Fais, Shira Hirsch, Orly Weisel, Zigi Walter, Yarden Oren
METHODS AND APPARATUS TO TILE WALK A TENSOR FOR CONVOLUTION OPERATIONS

Publication number: 20230067421

Abstract: An example apparatus to perform a convolution on an input tensor includes a parameters generator to: generate a horizontal hardware execution parameter for a horizontal dimension of the input tensor based on a kernel parameter and a layer parameter; and generate a vertical hardware execution parameter for a vertical dimension of the input tensor based on the kernel parameter and the layer parameter; an accelerator interface to configure a hardware accelerator circuitry based on the horizontal and vertical hardware execution parameters; a horizontal Iterator controller to determine when the hardware accelerator circuitry completes the first horizontal iteration of the convolution; and a vertical Iterator controller to determine when the hardware accelerator circuitry completes the first vertical iteration of the convolution.

Type: Application

Filed: September 28, 2022

Publication date: March 2, 2023

Inventors: Yaniv Fais, Moshe Maor
APPLYING A TWO DIMENSIONAL (2D) KERNEL ON AN INPUT FEATURE MAP

Publication number: 20230046558

Abstract: A method, integrated circuit, and a computer readable medium that stores instructions for reducing IO traffic from a global or remote memory unit to a buffer of a neural network unit, by using overlap rows of an input feature map tile.

Type: Application

Filed: August 10, 2022

Publication date: February 16, 2023

Inventors: Orly Weisel, Yaniv FAIS, Shira HIRSCH
TRANSPOSED CONVOLUTION ON DOWNSAMPLED DATA

Publication number: 20220366534

Abstract: The present subject matter provides technical solutions facing technical problems associated with preserving spatial dimension of images used in CNNs. Transposed convolutional layers may be used to provide improved spatial dimension preservation or reconstruction. In contrast with image interpolation, transposed convolutional layers may use a set of weights to reconstruct input images. When using CNNs for ADAS and AV applications, the transposed convolutional layers may be trained jointly with convolutional layers during the CNN training process. This may provide the ability to use a lower-dimensional representation of input images, while preserving the spatial dimension of images for use in ADAS and AV systems.

Type: Application

Filed: May 9, 2022

Publication date: November 17, 2022

Inventors: Yaniv FAIS, Ariel Binenfeld, Liran Levy
APPLYING A CONVOLUTION KERNEL ON INPUT DATA

Publication number: 20220366215

Abstract: A method for neural network convolution, the method may include receiving input data that is a 3D input data and comprises input data segments associated with different input data depth values; receiving a convolution kernel that is a 3D convolution kernel and comprises kernel segments associated with different kernel depth values; performing multiple 3D convolution iteration, wherein each of 3D convolution iteration comprises: determining whether the 3D convolution iteration is of a first type or of a second type; executing the 3D convolution iteration of the first type when determining that the 3D convolution iteration is of the first type; and executing the 3D convolution iteration of the second type when determining that the 3D convolution iteration is of the second type.

Type: Application

Filed: May 9, 2022

Publication date: November 17, 2022

Inventors: Orly WEISEL, Yaniv FAIS, Shira HIRSCH, Daniel SREBNIK
Methods and apparatus to tile walk a tensor for convolution operations

Patent number: 11494608

Abstract: An example apparatus to perform a convolution on an input tensor includes a parameters generator to: generate a horizontal hardware execution parameter for a horizontal dimension of the input tensor based on a kernel parameter and a layer parameter; and generate a vertical hardware execution parameter for a vertical dimension of the input tensor based on the kernel parameter and the layer parameter; an accelerator interface to configure a hardware accelerator circuitry based on the horizontal and vertical hardware execution parameters; a horizontal Iterator controller to determine when the hardware accelerator circuitry completes the first horizontal iteration of the convolution; and a vertical Iterator controller to determine when the hardware accelerator circuitry completes the first vertical iteration of the convolution.

Type: Grant

Filed: August 14, 2019

Date of Patent: November 8, 2022

Assignee: Intel Corporation

Inventors: Yaniv Fais, Moshe Maor
SUB-GRAPH IN FREQUENCY DOMAIN AND DYNAMIC SELECTION OF CONVOLUTION IMPLEMENTATION ON A GPU

Publication number: 20220237850

Abstract: In an example, an apparatus comprises a plurality of execution units; and logic, at least partially including hardware logic, to determine a sub-graph of a network that can be executed in a frequency domain and apply computations in the sub-graph in the frequency domain. Other embodiments are also disclosed and claimed.

Type: Application

Filed: February 10, 2022

Publication date: July 28, 2022

Applicant: Intel Corporation

Inventors: Uzi Sarel, Ehud Cohen, Tomer Schwartz, Amitai Armon, Yahav Shadmiy, Itamar Ben-Ari, Amit Bleiweiss, Lev Faivishevsky, Tomer Bar-On, Yaniv Fais, Jacob Subag, Michael Behar, Guy Jacob, Gal Leibovich, Jeremie Dreyfuss

1 2 3 4 next