Patents by Inventor Ayon Basumallik

Ayon Basumallik has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Vector processor data storage

Patent number: 11899967

Abstract: Aspects of the present disclosure provide an aligned storage strategy for stripes within a long vector for a vector processor, such that the extra computation needed to track strides between input stripes and output stripes may be eliminated. As a result, the stripe locations are located in a more predictable memory access pattern such that memory access bandwidth may be improved and the tendency for memory error may be reduced.

Type: Grant

Filed: November 15, 2021

Date of Patent: February 13, 2024

Assignee: Lightmatter, Inc.

Inventors: Nicholas Moore, Gongyu Wang, Bradley Dobbie, Tyler J. Kenney, Ayon Basumallik
Residue number system in a photonic matrix accelerator

Patent number: 11836466

Abstract: A photonic processor uses light signals and a residue number system (RNS) to perform calculations. The processor sums two or more values by shifting the phase of a light signal with phase shifters and reading out the summed phase with a coherent detector. Because phase winds back every 2? radians, the photonic processor performs addition modulo 2?. A photonic processor may use the summation of phases to perform dot products and correct erroneous residues. A photonic processor may use the RNS in combination with a positional number system (PNS) to extend the numerical range of the photonic processor, which may be used to accelerate homomorphic encryption (HE)-based deep learning.

Type: Grant

Filed: December 6, 2021

Date of Patent: December 5, 2023

Assignee: Lightmatter, Inc.

Inventors: Eric Hein, Ayon Basumallik, Nicholas C. Harris, Darius Bunandar, Cansu Demirkiran
TECHNIQUES OF PERFORMING OPERATIONS USING A HYBRID ANALOG-DIGITAL PROCESSOR

Publication number: 20230177284

Abstract: Described herein are techniques of using a hybrid analog-digital processor to perform matrix operations. The hybrid analog-digital may store digital values in memory encoded in a low bit number format. The hybrid analog-digital processor may perform, using an analog processor, a matrix operation to obtain output(s). The output(s) may be encoded in the number format. The hybrid analog-digital processor may determine, using the output(s), an unbiased estimate of a matrix operation result. The hybrid analog-digital processor may store, in the memory, the unbiased estimate of the matrix operation result encoded in the number format.

Type: Application

Filed: December 7, 2022

Publication date: June 8, 2023

Applicant: Lightmatter, Inc.

Inventors: Darius Bunandar, Calvin McCarter, Ayon Basumallik
SWITCHED SPATIAL TENSOR DATA MANIPULATION WITH PHOTONICS

Publication number: 20230071600

Abstract: A method for manipulating an input vector is described. The method involves controlling a plurality of optical switches to obtain a nominal orientation vector or a transpose orientation vector based on a plurality of input optical signals encoding the input vector and received at the plurality of optical switches. The nominal orientation vector and the transpose orientation vector represent transposed versions of one another. A memory system comprising a first section configured to store vectors in accordance with a nominal orientation and a second section configured to store vectors in accordance with a transpose orientation. A controller stores the nominal orientation vector in the first section of the memory system or stores the transpose orientation vector in the second section of the memory system.

Type: Application

Filed: September 6, 2022

Publication date: March 9, 2023

Applicant: Lightmatter, Inc.

Inventors: Nicholas C. Harris, Carl Ramey, Darius Bunandar, Ayon Basumallik
THE ACCURACY OF LOW-BITWIDTH NEURAL NETWORKS BY REGULARIZING THE HIGHER-ORDER MOMENTS OF WEIGHTS AND HIDDEN STATES

Publication number: 20220261645

Abstract: Methods and systems for training neural networks using low-bitwidth accelerators are described. The methods described herein use moment-penalization functions. For example, a method comprises producing a modified data set by training a neural network using a moment-penalization function and the data set. The moment-penalization function is configured to penalize a moment associated with the neural network. Training the neural network in turn comprises quantizing the data set to obtain a fixed-point data set so that the fixed-point data set represents the data set in a fixed-point representation, and passing the fixed-point data set through an analog accelerator. The inventors have recognized that training a neural network using a modified objective function augments the accuracy and robustness of the neural network notwithstanding the use of low-bitwidth accelerators.

Type: Application

Filed: February 15, 2022

Publication date: August 18, 2022

Applicant: Lightmatter, Inc.

Inventors: Nicholas Dronen, Tyler J. Kenney, Tomo Lazovich, Ayon Basumallik, Darius Bunandar
RESIDUE NUMBER SYSTEM IN A PHOTONIC MATRIX ACCELERATOR

Publication number: 20220229634

Abstract: A photonic processor uses light signals and a residue number system (RNS) to perform calculations. The processor sums two or more values by shifting the phase of a light signal with phase shifters and reading out the summed phase with a coherent detector. Because phase winds back every 2? radians, the photonic processor performs addition modulo 2?. A photonic processor may use the summation of phases to perform dot products and correct erroneous residues. A photonic processor may use the RNS in combination with a positional number system (PNS) to extend the numerical range of the photonic processor, which may be used to accelerate homomorphic encryption (HE)-based deep learning.

Type: Application

Filed: December 6, 2021

Publication date: July 21, 2022

Applicant: Lightmatter, Inc.

Inventors: Eric Hein, Ayon Basumallik, Nicholas C. Harris, Darius Bunandar, Cansu Demirkiran
MACHINE LEARNING MODEL TRAINING USING AN ANALOG PROCESSOR

Publication number: 20220172052

Abstract: Described herein are techniques of training a machine learning model and performing inference using an analog processor. Some embodiments mitigate the loss in performance of a machine learning model resulting from a lower precision of an analog processor by using an adaptive block floating-point representation of numbers for the analog processor. Some embodiments mitigate the loss in performance of a machine learning model due to noise that is present when using an analog processor. The techniques involve training the machine learning model such that it is robust to noise.

Type: Application

Filed: November 29, 2021

Publication date: June 2, 2022

Applicant: Lightmatter, Inc.

Inventors: Darius Bunandar, Ludmila Levkova, Nicholas Dronen, Lakshmi Nair, David Widemann, David Walter, Martin B.Z. Forsythe, Tomo Lazovich, Ayon Basumallik, Nicholas C. Harris
VECTOR PROCESSOR DATA STORAGE

Publication number: 20220155996

Abstract: Aspects of the present disclosure provide an aligned storage strategy for stripes within a long vector for a vector processor, such that the extra computation needed to track strides between input stripes and output stripes may be eliminated. As a result, the stripe locations are located in a more predictable memory access pattern such that memory access bandwidth may be improved and the tendency for memory error may be reduced.

Type: Application

Filed: November 15, 2021

Publication date: May 19, 2022

Applicant: Lightmatter, Inc.

Inventors: Nicholas Moore, Gongyu Wang, Bradley Dobbie, Tyler J. Kenney, Ayon Basumallik
PARALLELIZATION AND PIPELINING STRATEGIES FOR AN EFFICIENT ANALOG NEURAL NETWORK ACCELERATOR

Publication number: 20220156469

Abstract: Parallelization and pipelining techniques that can be applied to multi-core analog accelerators are described. The techniques descried herein improve performance of matrix multiplication (e.g., tensor-tensor multiplication, matrix-matrix multiplication or matrix-vector multiplication). The parallelization and pipelining techniques developed by the inventors and described herein focus on maintaining a high utilization of the processing cores. A representative processing systemin includes an analog accelerator, a digital processor, and a controller. The controller is configured to control the analog accelerator to output data using linear operations and to control the digital processor to perform non-linear operations based on the output data.

Type: Application

Filed: November 15, 2021

Publication date: May 19, 2022

Applicant: Lightmatter, Inc.

Inventors: Gongyu Wang, Cansu Demirkiran, Nicholas Moore, Ayon Basumallik, Darius Bunandar
EFFICIENT BUFFERING TECHNIQUE FOR TRANSFERRING DATA

Publication number: 20220147280

Abstract: Aspects of the present disclosure are directed to an efficient data transfer strategy in which data transfer is scheduled based on a prediction of the internal memory utilization due to computational workload throughout its runtime. According to one aspect, the DMA transfer may be performed opportunistically: whenever internal buffer memory is available and the additional internal memory usage due to DMA transfer isn't interfering with the processor's ability to complete the workload. In some embodiments, an opportunistic transfer schedule may be found by solving an optimization problem.

Type: Application

Filed: November 9, 2021

Publication date: May 12, 2022

Applicant: Lightmatter, Inc.

Inventors: Darius Bunandar, Cansu Demirkiran, Gongyu Wang, Nicholas Moore, Ayon Basumallik
TECHNIQUES FOR ADAPTING NEURAL NETWORKS TO DEVICES

Publication number: 20220036185

Abstract: A training system for training a machine learning model such as a neural network may have a different configuration and/or hardware components than a target device that employs the trained neural network. For example, the training system may use a higher precision format to represent neural network parameters than the target device. In another example, the target device may use analog and digital processing hardware to compute an output of the neural network whereas the training system may have used only digital processing hardware to train the neural network. The difference in configuration and/or hardware components of the target device may introduce quantization error into parameters of the neural network, and thus affect performance of the neural network on the target device. Described herein is a training system that trains a neural network for use on a target device that reduces loss in performance resulting from quantization error.

Type: Application

Filed: July 30, 2021

Publication date: February 3, 2022

Applicant: Lightmatter, Inc.

Inventors: Nicholas Dronen, Tomo Lazovich, Ayon Basumallik, Darius Bunandar
Correlating contextual information of invoking code in a programming language with contextual information of at least one invoked code in a different programming language

Patent number: 11055092

Abstract: The exemplary embodiments may provide an approach to finding and identifying the correlation between the invoking code and the invoked code by correlating the timestamps of contextual information of code in the invoking code and invoked code. As a result, developers have information during investigating the programs and can use the information to identify a region of interest to narrow down a performance problem in the invoking code efficiently. As a result, development productivity can be improved.

Type: Grant

Filed: February 19, 2020

Date of Patent: July 6, 2021

Assignee: The MathWorks, Inc.

Inventors: Ayon Basumallik, Meng-Ju Wu
Computing performance for execution of functions called by function handles

Patent number: 10949181

Abstract: Extended types are defined for functions that are called by function handles in a programming environment. The extended types can be accessed and used by a computing system to improve compile-time and run-time performance of the computing system.

Type: Grant

Filed: November 13, 2018

Date of Patent: March 16, 2021

Assignee: The MathWorks, Inc.

Inventors: Rajeshwar Vanka, Ayon Basumallik, Brett Baker
Composite-trace just-in-time compilation

Patent number: 10929160

Abstract: Systems and methods for just-in-time compilation are disclosed. The systems and methods can be used to generate composite blocks, reducing program execution time. The systems and methods can include generating single-trace blocks during program execution. Upon satisfaction of a trigger criterion, single-trace blocks can be selected for compilation into a composite block. The trigger criterion can be a number of executions of a trigger block. Selecting the single-trace blocks can include identifying blocks reachable from the trigger block, selecting a subset of the reachable blocks, and selecting an entry point for the composite block. The composite block can be generated from the single-trace blocks and incorporated into the program control flow, such that the composite block is executed in place of the selected single-trace blocks.

Type: Grant

Filed: December 12, 2018

Date of Patent: February 23, 2021

Assignee: The MathWorks, Inc.

Inventors: Nikolay Mateev, Ayon Basumallik, Aaditya Kalsi, Prabhakar Kumar
Code generation and execution for dynamic programming languages

Patent number: 9135027

Abstract: A device may identify a first compiled block with an original constraint and an additional constraint. The first compiled block may be identified based on a program counter value and may include compiled information relating to a first segment of program code, linking information associated with a second compiled block, and information distinguishing the original constraint from the additional constraint. The original constraint may relate to a type of variable used in the first segment of programming code. The additional constraint may relate to a variable used in a second segment of programming code associated with the second compiled block. The device may copy information of the first compiled block to generate a third compiled block that lacks the additional constraint. The device may execute the third compiled block to execute a program associated with the programming code.

Type: Grant

Filed: January 14, 2015

Date of Patent: September 15, 2015

Assignee: The MathWorks, Inc.

Inventors: Ayon Basumallik, Nikolay Mateev
Code generation and execution for dynamic programming languages

Patent number: 8943474

Abstract: A device receives programming code, corresponding to a dynamic programming language, that is to be executed by a computing environment, and executes the programming code. When executing the programming code, the device maintains a program counter that identifies an execution location within the programming code, and select blocks of the programming code based on the program counter. The blocks correspond to segments of the programming code, and are associated with type-based constraints that relate to types of variables that are used by the block. When executing the programming code, the device also compiles the selected blocks, caches the compiled blocks along with the type-based constraints, generates linking information between certain ones of the compiled blocks based on the type-based constraints, and executes the compiled blocks in an order based on the program counter, the type-based constraints, and the linking information.

Type: Grant

Filed: October 24, 2012

Date of Patent: January 27, 2015

Assignee: The MathWorks, Inc.

Inventors: Ayon Basumallik, Brett W. Baker, Nikolay Mateev, Hongjun Zheng