Patents by Inventor Marc Tremblay

Marc Tremblay has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11520592
    Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be manually or automatically adjusted to reduce the communication overhead.
    Type: Grant
    Filed: September 20, 2019
    Date of Patent: December 6, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Gautham Popuri, Layali Rashid, Tiyasa Mitra, Mohit Mittal, Maral Mesmakhosroshahi
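The portion-at-a-time execution with microbatching described above can be sketched as follows. This is an illustrative toy, not the patented implementation: the model "portions" are stand-in callables, and the names are hypothetical.

```python
def run_model_in_portions(portions, samples, microbatch_size):
    """Run each model portion over all microbatches before fetching the next.

    `portions` is a list of callables standing in for downloaded layers or
    sub-layers; `samples` is a list of inputs.
    """
    # Divide the input samples into microbatches; the sequence of
    # microbatches together forms the minibatch.
    microbatches = [samples[i:i + microbatch_size]
                    for i in range(0, len(samples), microbatch_size)]
    # Execute one portion at a time over every microbatch, so only one
    # portion needs to reside in the memory-constrained device at once.
    activations = microbatches
    for portion in portions:  # the next portion would be downloaded here
        activations = [[portion(x) for x in mb] for mb in activations]
    return [x for mb in activations for x in mb]

# Example: two "layers" applied to six samples in microbatches of two.
out = run_model_in_portions([lambda x: x + 1, lambda x: x * 2],
                            [0, 1, 2, 3, 4, 5], microbatch_size=2)
```

Adjusting `microbatch_size` trades per-portion compute against the number of portion swaps, which is the communication-overhead knob the abstract refers to.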
  • Publication number: 20220346209
    Abstract: A computer implemented method for managing horticultural lighting scenarios including the steps of receiving lighting scenarios and storing lighting scenario attributes thereof in a data storage; transmitting the lighting scenarios to a horticultural structure for deployment on at least one horticultural lighting apparatus; acquiring runtime data generated during the execution of the lighting scenarios and storing the runtime data on the data storage.
    Type: Application
    Filed: April 22, 2022
    Publication date: October 27, 2022
    Inventors: Gabriel Dupras, Francois Roy-Moisan, Jacques Poirier, Charles Smith, Louis Brun, Patrick Menard, Marc Tremblay
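The receive/store, deploy, and runtime-acquisition steps above can be sketched as a small manager class. This is a hypothetical illustration; the class and field names are not from the application.

```python
class ScenarioManager:
    """Toy sketch of the described lighting-scenario flow."""

    def __init__(self):
        self.scenarios = {}      # scenario id -> stored attributes
        self.runtime_data = []   # data acquired during scenario execution

    def receive(self, scenario_id, attributes):
        # Receive a lighting scenario and store its attributes.
        self.scenarios[scenario_id] = attributes

    def deploy(self, scenario_id):
        # Stand-in for transmitting the scenario to a horticultural
        # lighting apparatus for execution.
        return self.scenarios[scenario_id]

    def record(self, scenario_id, sample):
        # Acquire runtime data generated during execution and store it.
        self.runtime_data.append((scenario_id, sample))

mgr = ScenarioManager()
mgr.receive("veg-18h", {"photoperiod_h": 18, "intensity": 0.8})
deployed = mgr.deploy("veg-18h")
mgr.record("veg-18h", {"ppfd": 412})
```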
  • Patent number: 11475303
    Abstract: Techniques for training neural networks are provided. According to one set of embodiments, a first array is processed in a spreading component to produce a second array, where a first dimension of the first array corresponds to at least one sequence of approximately orthogonal numeric vectors representing tokens, and where the spreading component combines values along the first dimension. The second array is processed in a transformer neural network to determine correlations within the sequence, which produces a third array. One or more batches of the third array are processed in a de-spreading component to produce a fourth array.
    Type: Grant
    Filed: April 14, 2020
    Date of Patent: October 18, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Andrew Wagner, Tiyasa Mitra, Sujeeth Subramanya Bharadwaj, Saurabh Mohan Kulkarni, Marc Tremblay
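The spreading/de-spreading idea can be illustrated with orthogonal codes, by loose analogy to CDMA spreading; the patent's actual arithmetic may differ, and the code values below are purely illustrative.

```python
# Orthogonal +/-1 codes, one per sequence slot.
codes = [[1, 1], [1, -1]]

def spread(tokens):
    # Combine token values along the sequence dimension: each token is
    # multiplied by its own code and the results are summed.
    n = len(codes[0])
    out = [0] * n
    for tok, code in zip(tokens, codes):
        for j in range(n):
            out[j] += tok * code[j]
    return out

def despread(mixed, slot):
    # Recover the token at `slot` by correlating with its code; the
    # orthogonality of the codes cancels the other slots' contributions.
    code = codes[slot]
    return sum(m * c for m, c in zip(mixed, code)) // len(code)

mixed = spread([3, 5])
recovered = (despread(mixed, 0), despread(mixed, 1))
```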
  • Publication number: 20220322610
    Abstract: There are provided methods and systems for controlling light sources within a horticultural structure. The method includes: illuminating the plant or crop in a first zone with a first light source to meet initial lighting conditions of the plant or crop, the first light source being driven according to a first portion of a lighting scenario; monitoring information representative of illumination delivered to the plant or crop during the first portion of the scenario; receiving inputs indicating that the plant or crop is to be moved towards a second zone and determining subsequent lighting conditions of the plant or crop in the second zone, based on the monitored information; and illuminating the plant or crop in the second zone with a second light source to meet the subsequent lighting conditions of the plant or crop, the second light source being driven according to a second portion of the scenario.
    Type: Application
    Filed: April 13, 2022
    Publication date: October 13, 2022
    Inventors: Patrick Menard, Marco Lafond, Kassim Tremblay, Louis Brun, Marc Tremblay
  • Publication number: 20220330406
    Abstract: The present techniques concern methods and systems for controlling horticultural light sources illuminating a plant or crop during a distribution process including one or more phases, and more specifically relate to methods and systems for controlling horticultural lighting sources during an initial phase, a transit phase and a selling phase of the distribution process, each phase taking place at a different location from the others. In some implementations, the present techniques allow illuminating the plant or crop according to a global dynamic lighting scenario, wherein at least a portion of the global dynamic lighting scenario is associated with a corresponding phase of the distribution process.
    Type: Application
    Filed: April 13, 2022
    Publication date: October 13, 2022
    Inventors: Marc Tremblay, Kassim Tremblay
  • Patent number: 11449752
    Abstract: Methods for gradient accumulation with free momentum are performed by systems and devices during neural network model training. An accumulator that includes a processor circuit and a memory element generates free momentum between passes of a neural network model training process. The processor circuit receives a difference weight (gradient) and generates a first input by applying a weighting parameter thereto. The processor circuit obtains a prior weight from the memory element and generates a second input by applying another weighting parameter thereto. The processor circuit generates a filtered input with momentum by filtering the first and second input. The memory element generates a stored next pass weight by accumulating the filtered input with the prior weight. A computing resource then processes the next pass of the neural network model training using the stored next pass weight. The methods, systems, and devices are applicable to pipelined model parallelism training processes.
    Type: Grant
    Filed: March 31, 2020
    Date of Patent: September 20, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Andrew Wagner, Marc Tremblay, Saurabh M. Kulkarni, Tiyasa Mitra, Sujeeth S. Bharadwaj
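The accumulator described above can be sketched as follows: the gradient and the prior weight are each scaled by a weighting parameter, combined into a filtered input, and accumulated with the prior weight to form the next-pass weight. The parameter values below are illustrative assumptions, not taken from the patent.

```python
class MomentumAccumulator:
    """Hedged sketch of the gradient accumulator with free momentum."""

    def __init__(self, weight, grad_scale=-0.1, weight_scale=0.0):
        self.weight = weight            # memory element: prior weight
        self.grad_scale = grad_scale    # weighting parameter for gradient
        self.weight_scale = weight_scale  # weighting parameter for weight

    def step(self, gradient):
        first = self.grad_scale * gradient        # scaled difference weight
        second = self.weight_scale * self.weight  # scaled prior weight
        filtered = first + second                 # filtered input
        self.weight += filtered  # accumulate into stored next-pass weight
        return self.weight

acc = MomentumAccumulator(weight=1.0)
w1 = acc.step(gradient=2.0)  # 1.0 + (-0.1 * 2.0) = 0.8
```

With a nonzero `weight_scale`, the filter mixes the prior weight into each update, which is the "free momentum" effect the abstract describes arising between training passes.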
  • Publication number: 20220283820
    Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be adjusted to reduce the communication overhead. Multi-level parallel parameters reduction may be performed at the parameter server and the target device.
    Type: Application
    Filed: May 24, 2022
    Publication date: September 8, 2022
    Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Sujeeth Subramanya Bharadwaj, Devangkumar Patel, Jinwen Xi, Maral Mesmakhosroshahi
  • Patent number: 11436491
    Abstract: Improved convolutional neural network-based machine learning models are disclosed herein. A convolutional neural network is configured to decompose feature maps generated based on a data item to be classified. The feature maps are decomposed into first and second subsets. The first subset is representative of high frequency components of the data item, and the second subset is representative of low frequency components of the data item. The second subset is upsampled and is combined with the first subset. The combined feature maps are convolved with a filter to extract a set of features associated with the data item. The first subset is also downsampled and combined with the second subset. The combined feature maps are convolved with a filter to extract another set of features. The data item is classified based on the sets of features extracted based on the convolution operations.
    Type: Grant
    Filed: March 13, 2020
    Date of Patent: September 6, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Sujeeth S. Bharadwaj, Bharadwaj Pudipeddi, Marc Tremblay
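A toy 1-D sketch of the decomposition, similar in spirit to octave convolutions: the low-frequency subset runs at half resolution, each path mixes in the other, and a simple scaling stands in for the convolution filter. The function names and arithmetic are illustrative assumptions.

```python
def downsample(xs):
    # Halve resolution by averaging adjacent pairs (low-pass + subsample).
    return [(a + b) / 2 for a, b in zip(xs[0::2], xs[1::2])]

def upsample(xs):
    # Double resolution by nearest-neighbor repetition.
    return [v for v in xs for _ in range(2)]

def split_paths(feature_map):
    low = downsample(feature_map)  # low-frequency subset, half resolution
    # High-frequency subset: the residual after removing the low band.
    high = [x - u for x, u in zip(feature_map, upsample(low))]
    # High path: combine high with upsampled low, then "convolve" (scale).
    high_out = [2 * (h + u) for h, u in zip(high, upsample(low))]
    # Low path: combine low with downsampled high, then "convolve" (scale).
    low_out = [2 * (l + d) for l, d in zip(low, downsample(high))]
    return high_out, low_out

high_feats, low_feats = split_paths([1.0, 3.0, 5.0, 7.0])
```

Running the low-frequency path at reduced resolution is what makes this style of decomposition cheaper than convolving every feature map at full resolution.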
  • Patent number: 11436019
    Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be adjusted to reduce the communication overhead. Multi-level parallel parameters reduction may be performed at the parameter server and the target device.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: September 6, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Sujeeth Subramanya Bharadwaj, Devangkumar Patel, Jinwen Xi, Maral Mesmakhosroshahi
  • Patent number: 11354579
    Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. This paradigm of executing one portion of the AI model at a time allows for dynamic execution of the large AI model.
    Type: Grant
    Filed: September 30, 2019
    Date of Patent: June 7, 2022
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Sujeeth Subramanya Bharadwaj, Jinwen Xi, Maral Mesmakhosroshahi
  • Publication number: 20220067280
    Abstract: Embodiments of the present disclosure include systems and methods for training transformer models. In some embodiments, a set of input data are received. The input data comprises a plurality of tokens including masked tokens. The plurality of tokens in an embedding layer are processed. The embedding layer is coupled to a transformer layer. The plurality of tokens are processed in the transformer layer, which is coupled to a classifier layer. The plurality of tokens are processed in the classifier layer. The classifier layer is coupled to a loss layer. At least one of the embedding layer and the classifier layer combine masked tokens at a current position with tokens at one or more of a previous position and a subsequent position.
    Type: Application
    Filed: August 25, 2020
    Publication date: March 3, 2022
    Inventors: Andy Wagner, Tiyasa Mitra, Marc Tremblay
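The combining step can be illustrated with a toy sequence in which a masked token's value is mixed with the tokens at the previous and subsequent positions; the averaging rule here is an assumption for illustration, not the patent's arithmetic.

```python
MASK = None  # sentinel standing in for a masked token

def combine_masked(tokens):
    """Replace each masked slot with a combination of its neighbors."""
    out = []
    for i, tok in enumerate(tokens):
        if tok is MASK:
            prev_tok = tokens[i - 1] if i > 0 else 0.0
            next_tok = tokens[i + 1] if i + 1 < len(tokens) else 0.0
            # Combine the previous- and subsequent-position tokens.
            out.append((prev_tok + next_tok) / 2)
        else:
            out.append(tok)
    return out

combined = combine_masked([1.0, MASK, 3.0])
```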
  • Patent number: 11226859
    Abstract: Embodiments of the present disclosure include an error recovery method comprising detecting a computing error, restarting a first artificial intelligence processor of a plurality of artificial intelligence processors processing a data set, and loading a model in the artificial intelligence processor, wherein the model corresponds to a same model processed by the plurality of artificial intelligence processors during a previous processing iteration by the plurality of artificial intelligence processors on data from the data set.
    Type: Grant
    Filed: March 27, 2020
    Date of Patent: January 18, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Bharadwaj Pudipeddi, Maral Mesmakhosroshahi, Jinwen Xi, Saurabh M. Kulkarni, Marc Tremblay, Matthias Baenninger, Nuno Claudino Pereira Lopes
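The recovery flow can be sketched with plain dictionaries standing in for AI processors: on a computing error, one processor is restarted and reloaded with the same model state the group used on the previous iteration. All names here are hypothetical.

```python
def recover(processors, failed_idx, checkpoint):
    """Restart one processor and reload the prior-iteration model."""
    proc = processors[failed_idx]
    proc["running"] = False           # restart: take the processor down...
    proc["model"] = dict(checkpoint)  # ...reload the previous-iteration model
    proc["running"] = True            # ...and bring it back up in step
    return proc

procs = [{"running": True, "model": {"step": 7}} for _ in range(4)]
procs[2]["model"] = {"step": 6}  # processor 2 hit a computing error
recovered = recover(procs, failed_idx=2, checkpoint={"step": 7})
```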
  • Publication number: 20210303991
    Abstract: Methods for gradient accumulation with free momentum are performed by systems and devices during neural network model training. An accumulator that includes a processor circuit and a memory element generates free momentum between passes of a neural network model training process. The processor circuit receives a difference weight (gradient) and generates a first input by applying a weighting parameter thereto. The processor circuit obtains a prior weight from the memory element and generates a second input by applying another weighting parameter thereto. The processor circuit generates a filtered input with momentum by filtering the first and second input. The memory element generates a stored next pass weight by accumulating the filtered input with the prior weight. A computing resource then processes the next pass of the neural network model training using the stored next pass weight. The methods, systems, and devices are applicable to pipelined model parallelism training processes.
    Type: Application
    Filed: March 31, 2020
    Publication date: September 30, 2021
    Inventors: Andrew Wagner, Marc Tremblay, Saurabh M. Kulkarni, Tiyasa Mitra, Sujeeth S. Bharadwaj
  • Patent number: 11132701
    Abstract: A method for generating predicted survey participation data at a current user device. A server collects training behavioral data related to a website from a plurality of user devices. The server also collects training survey participation data related to the website from at least some of the plurality of user devices. The server analyzes the training survey participation data and the corresponding training behavioral data to infer correlations between the training survey participation data and the corresponding training behavioral data. The server further generates predictive survey participation patterns based on the inferred correlations. The server transmits the predictive survey participation patterns to the current user device. The current user device collects current behavioral data related to the website.
    Type: Grant
    Filed: December 15, 2016
    Date of Patent: September 28, 2021
    Assignee: EMPLIFI INC.
    Inventor: Marc Tremblay
  • Publication number: 20210287083
    Abstract: Improved convolutional neural network-based machine learning models are disclosed herein. A convolutional neural network is configured to decompose feature maps generated based on a data item to be classified. The feature maps are decomposed into first and second subsets. The first subset is representative of high frequency components of the data item, and the second subset is representative of low frequency components of the data item. The second subset is upsampled and is combined with the first subset. The combined feature maps are convolved with a filter to extract a set of features associated with the data item. The first subset is also downsampled and combined with the second subset. The combined feature maps are convolved with a filter to extract another set of features. The data item is classified based on the sets of features extracted based on the convolution operations.
    Type: Application
    Filed: March 13, 2020
    Publication date: September 16, 2021
    Inventors: Sujeeth S. Bharadwaj, Bharadwaj Pudipeddi, Marc Tremblay
  • Publication number: 20210097366
    Abstract: Systems and methods for pipelined neural network processing with continuous and asynchronous updates are described. A method for processing a neural network comprising L layers, where L is an integer greater than two, includes partitioning the L layers among a set of computing resources configured to process forward passes and backward passes associated with each of the L layers. The method further includes initiating processing of the forward passes and the backward passes using the set of computing resources. The method further includes upon completion of a first set of forward passes and a first set of backward passes associated with a first layer of the L layers, initiating update of parameters associated with the first layer when gradients are available for updating the parameters associated with the first layer without waiting to calculate gradients associated with any of remaining L layers.
    Type: Application
    Filed: September 27, 2019
    Publication date: April 1, 2021
    Inventors: Andy Wagner, Tiyasa Mitra, Saurabh M. Kulkarni, Marc Tremblay, Sujeeth S. Bharadwaj
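The asynchronous-update idea can be sketched as follows: each layer's parameters are updated the moment that layer's gradient becomes available, rather than after gradients for all L layers have been computed. The gradient values and learning rate below are illustrative.

```python
def train_step(params, grads_in_arrival_order, lr=0.1):
    """Apply each layer's update as soon as its gradient arrives."""
    updated_order = []
    for layer, grad in grads_in_arrival_order:
        params[layer] -= lr * grad   # update immediately, no global barrier
        updated_order.append(layer)
    return updated_order

params = {0: 1.0, 1: 1.0, 2: 1.0}
# During the backward pass, gradients arrive last-layer-first.
order = train_step(params, [(2, 0.5), (1, 0.2), (0, 0.1)])
```

In a pipelined setting this lets early pipeline stages start the next forward pass with fresh parameters instead of idling until every layer's gradient is ready.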
  • Publication number: 20210064986
    Abstract: Systems, methods, and apparatuses are provided for compressing values. A plurality of parameters may be obtained from a memory, each parameter comprising a floating-point number that is used in a relationship between artificial neurons or nodes in a model. A mantissa value and an exponent value may be extracted from each floating-point number to generate a set of mantissa values and a set of exponent values. The set of mantissa values may be compressed to generate a mantissa lookup table (LUT) and a plurality of mantissa LUT index values. The set of exponent values may be encoded to generate an exponent LUT and a plurality of exponent LUT index values. The mantissa LUT, mantissa LUT index values, exponent LUT, and exponent LUT index values may be provided to one or more processing entities to train the model.
    Type: Application
    Filed: September 3, 2019
    Publication date: March 4, 2021
    Inventors: Jinwen Xi, Bharadwaj Pudipeddi, Marc Tremblay
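The mantissa/exponent LUT scheme can be sketched in a few lines: split each float32 parameter into its exponent and mantissa fields, deduplicate each set into a lookup table, and keep per-parameter index streams. The patent's actual encoding likely differs; this only illustrates the LUT idea.

```python
import struct

def split_float(value):
    """Extract the mantissa and exponent fields of an IEEE 754 float32."""
    bits = struct.unpack("<I", struct.pack("<f", value))[0]
    exponent = (bits >> 23) & 0xFF  # 8-bit exponent field
    mantissa = bits & 0x7FFFFF      # 23-bit mantissa field
    return mantissa, exponent

def build_luts(parameters):
    mantissas, exponents = zip(*(split_float(p) for p in parameters))
    mant_lut = sorted(set(mantissas))  # deduplicated mantissa values
    exp_lut = sorted(set(exponents))   # deduplicated exponent values
    mant_idx = [mant_lut.index(m) for m in mantissas]
    exp_idx = [exp_lut.index(e) for e in exponents]
    return mant_lut, mant_idx, exp_lut, exp_idx

# Repeated field values share LUT entries, so the index streams are small
# and compress well when shipped to the processing entities.
mant_lut, mant_idx, exp_lut, exp_idx = build_luts([0.5, 0.5, 2.0, 0.75])
```

Exponents of trained weights tend to cluster in a narrow range, so the exponent LUT in particular stays tiny compared with the raw 8 bits per parameter.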
  • Publication number: 20210019634
    Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. This paradigm of executing one portion of the AI model at a time allows for dynamic execution of the large AI model.
    Type: Application
    Filed: September 30, 2019
    Publication date: January 21, 2021
    Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Sujeeth Subramanya Bharadwaj, Jinwen Xi, Maral Mesmakhosroshahi
  • Publication number: 20210019151
    Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be manually or automatically adjusted to reduce the communication overhead.
    Type: Application
    Filed: September 20, 2019
    Publication date: January 21, 2021
    Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Gautham Popuri, Layali Rashid, Tiyasa Mitra, Mohit Mittal, Maral Mesmakhosroshahi
  • Publication number: 20210019152
    Abstract: Methods, systems, apparatuses, and computer program products are described herein that enable execution of a large AI model on a memory-constrained target device that is communicatively connected to a parameter server, which stores a master copy of the AI model. The AI model may be dissected into smaller portions (e.g., layers or sub-layers), and each portion may be executed as efficiently as possible on the target device. After execution of one portion of the AI model is finished, another portion of the AI model may be downloaded and executed at the target device. To improve efficiency, the input samples may be divided into microbatches, and a plurality of microbatches executing in sequential order may form a minibatch. The size of the group of microbatches or minibatch can be adjusted to reduce the communication overhead. Multi-level parallel parameters reduction may be performed at the parameter server and the target device.
    Type: Application
    Filed: September 30, 2019
    Publication date: January 21, 2021
    Inventors: Bharadwaj Pudipeddi, Marc Tremblay, Sujeeth Subramanya Bharadwaj, Devangkumar Patel, Jinwen Xi, Maral Mesmakhosroshahi