Patents Assigned to Latent AI, Inc.

Generating and executing context-specific neural network models based on target runtime parameters

Patent number: 12190228

Abstract: The disclosed embodiments relate to a system that generates and executes a deep neural network (DNN) based on target runtime parameters. During operation, the system receives a trained original model and a set of target runtime parameters for the DNN, wherein the target runtime parameters are associated with one or more of the following for the DNN: desired operating conditions, desired resource utilization, and desired accuracy of results. Next, the system generates a context-specific model based on the original model and the set of target runtime parameters. The system also generates an operational plan for executing both the original model and the context-specific model to meet requirements of the target runtime parameters. Finally, the system controls execution of the original model and the context-specific model based on the operational plan.

Type: Grant

Filed: April 22, 2021

Date of Patent: January 7, 2025

Assignee: Latent AI, Inc.

Inventors: Sek Meng Chai, Jagadeesh Kandasamy
SECURING A NEURAL NETWORK RUNTIME ENGINE

Publication number: 20240070266

Abstract: A system, apparatus and method are provided for securing a neural network (or other artificial intelligence model) against malicious activity, such as piracy, theft of intellectual property, sabotage, etc. One or more security elements or features (e.g., digital watermarks, encryption, obfuscation) are applied to the neural network model during training and/or optimization. Therefore, the model is enhanced with robust security before it is linked or merged with application software for performing inference processing using the model.

Type: Application

Filed: August 22, 2023

Publication date: February 29, 2024

Applicant: Latent AI, Inc.

Inventors: Sek Meng Chai, Jonathan D. Brookshire, Abelardo Lopez-Lagunas
Optimizing execution of a neural network based on operational performance parameters

Patent number: 11816568

Abstract: The disclosed embodiments relate to a system that optimizes execution of a DNN based on operational performance parameters. During operation, the system collects the operational performance parameters from the DNN during operation of the DNN, wherein the operational performance parameters include parameters associated with operating conditions for the DNN, parameters associated with resource utilization during operation of the DNN, and parameters associated with accuracy of results produced by the DNN. Next, the system uses the operational performance parameters to update the DNN model to improve performance and efficiency during execution of the DNN.

Type: Grant

Filed: September 10, 2020

Date of Patent: November 14, 2023

Assignee: Latent AI, Inc.

Inventors: Sek Meng Chai, Jagadeesh Kandasamy
NEURAL NETWORK OPTIMIZATION USING KNOWLEDGE REPRESENTATIONS

Publication number: 20230297835

Abstract: Systems, tools and methods are provided for optimizing neural networks (NNs) to run efficiently on target hardware such as central processing units (CPUs), graphics processing units (GPUs), digital signal processors (DSPs), etc. The provided software tools are implemented as part of a machine-learning operations (MLOps) workflow for building a neural network, and include optimization algorithms (e.g., for quantization and/or pruning) and compiler processes that reduce memory requirements and processing latency.

Type: Application

Filed: March 17, 2023

Publication date: September 21, 2023

Applicant: Latent AI, Inc.

Inventors: Sek Meng Chai, Jan Ernst, Abelardo Lopez-Lagunas, Ryan M. Dailey
GENERATING AND EXECUTING CONTEXT-SPECIFIC NEURAL NETWORK MODELS BASED ON TARGET RUNTIME PARAMETERS

Publication number: 20210241108

Abstract: The disclosed embodiments relate to a system that generates and executes a deep neural network (DNN) based on target runtime parameters. During operation, the system receives a trained original model and a set of target runtime parameters for the DNN, wherein the target runtime parameters are associated with one or more of the following for the DNN: desired operating conditions, desired resource utilization, and desired accuracy of results. Next, the system generates a context-specific model based on the original model and the set of target runtime parameters. The system also generates an operational plan for executing both the original model and the context-specific model to meet requirements of the target runtime parameters. Finally, the system controls execution of the original model and the context-specific model based on the operational plan.

Type: Application

Filed: April 22, 2021

Publication date: August 5, 2021

Applicant: Latent AI, Inc.

Inventors: Sek Meng Chai, Jagadeesh Kandasamy
USING A RUNTIME ENGINE TO FACILITATE DYNAMIC ADAPTATION OF DEEP NEURAL NETWORKS FOR EFFICIENT PROCESSING

Publication number: 20210081806

Abstract: The disclosed embodiments relate to a system that facilitates dynamic runtime execution of a deep neural network (DNN). During operation, the system receives a model, a set of weights and runtime metadata for the DNN. The system also obtains code to perform inference-processing operations for the DNN. Next, the system compiles code to implement a runtime engine that facilitates throttling operations during execution of the inference-processing operations, wherein the runtime engine conserves computing resources by selecting portions of the inference-processing operations to execute based on the runtime metadata.

Type: Application

Filed: September 10, 2020

Publication date: March 18, 2021

Applicant: Latent AI, Inc.

Inventors: Sek Meng Chai, Jagadeesh Kandasamy
OPTIMIZING EXECUTION OF A NEURAL NETWORK BASED ON OPERATIONAL PERFORMANCE PARAMETERS

Publication number: 20210081789

Abstract: The disclosed embodiments relate to a system that optimizes execution of a DNN based on operational performance parameters. During operation, the system collects the operational performance parameters from the DNN during operation of the DNN, wherein the operational performance parameters include parameters associated with operating conditions for the DNN, parameters associated with resource utilization during operation of the DNN, and parameters associated with accuracy of results produced by the DNN. Next, the system uses the operational performance parameters to update the DNN model to improve performance and efficiency during execution of the DNN.

Type: Application

Filed: September 10, 2020

Publication date: March 18, 2021

Applicant: Latent AI, Inc.

Inventors: Sek Meng Chai, Jagadeesh Kandasamy

Generating and executing context-specific neural network models based on target runtime parameters

SECURING A NEURAL NETWORK RUNTIME ENGINE

Optimizing execution of a neural network based on operational performance parameters

NEURAL NETWORK OPTIMIZATION USING KNOWLEDGE REPRESENTATIONS

GENERATING AND EXECUTING CONTEXT-SPECIFIC NEURAL NETWORK MODELS BASED ON TARGET RUNTIME PARAMETERS

USING A RUNTIME ENGINE TO FACILITATE DYNAMIC ADAPTATION OF DEEP NEURAL NETWORKS FOR EFFICIENT PROCESSING

OPTIMIZING EXECUTION OF A NEURAL NETWORK BASED ON OPERATIONAL PERFORMANCE PARAMETERS