Patents by Inventor Mojan Javaheripi

Mojan Javaheripi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

GENERATING SMALL LANGUAGE MODEL VIA TWO-PHASE TRAINING

Publication number: 20250086471

Abstract: Systems and methods for generating a small language model are provided. In particular, a computing device may obtain a general dataset including a plurality of general data, annotate a subset of the general dataset based on one or more classifier metrics indicative of a quality of the general dataset, train a classifier based on the annotated subset of the general dataset and the one or more classifier metrics, analyze each general data of the general dataset to determine a score for each of the one or more classifier metrics associated with the respective general data using the trained classifier, generate a filtered general dataset by filtering the general dataset based on one or more filters, train the small language model with the filtered general dataset, generate a synthetic dataset for refining the small language model, and train the small language model with the synthetic dataset.

Type: Application

Filed: June 4, 2024

Publication date: March 13, 2025

Applicant: Microsoft Technology Licensing, LLC

Inventors: Sébastien BUBECK, Ronen ELDAN, Allison DEL GIORNO, Suriya GUNASEKAR, Yin Tat LEE, Yuanzhi Li, Mojan JAVAHERIPI
QUANTIZATION AND CRYPTOGRAPHIC PROTOCOL BASED MACHINE LEARNING MODELS FOR CONFIDENTIAL DATA ANALYSIS AND INFERENCE

Publication number: 20250005200

Abstract: In some embodiments, there is provided a system, which comprises a processor, and at least one non-transitory computer readable media storing instructions.

Type: Application

Filed: November 8, 2022

Publication date: January 2, 2025

Inventors: Mojan Javaheripi, Mohammad Samragh Razlighi, Siam Umar Hussain, Farinaz Koushanfar
ZERO PEEK ROBUSTNESS CHECKS FOR FEDERATED LEARNING

Publication number: 20240346379

Abstract: A computing system determines a median of a first number of mean values received from a first number of clusters, where each cluster of the first number of clients includes a first plurality of clients. Also, a threshold is determined based on the median, where the threshold applies to model updates. The median and the threshold are broadcast to all clients. Next, one or more clients that fail to provide a proof attesting that their model update is within the threshold of the median are dropped. Then, a second plurality of clients, not including the one or more dropped clients, participate in a final round of secure aggregation. Next, a final aggregate result is obtained, where the final aggregate result is based on the final round of secure aggregation. Then, one or more actions are performed based on the final aggregate result.

Type: Application

Filed: April 12, 2024

Publication date: October 17, 2024

Inventors: Zahra Ghodsi, Mojan Javaheripi, Nojan Sheybani, Xinqiao Zhang, Farinaz Koushanfar
TRANSFORMER-BASED AUTOREGRESSIVE LANGUAGE MODEL SELECTION

Publication number: 20230214629

Abstract: Generally discussed herein are devices, systems, and methods for improving architecture search and identification with constraints. A method can include receiving, at a compute device, a request for a transformer-based autoregressive language model (TBALM), the request specifying a maximum latency, identifying TBALM architectures that satisfies the maximum latency, identifying a TBALM architecture of the identified TBALM architectures that has a greatest number of decoder parameters resulting in an identified TBALM architecture, and providing the identified TBALM architecture.

Type: Application

Filed: December 30, 2021

Publication date: July 6, 2023

Inventors: Debadeepta Dey, Shital Rajnikant Shah, Gustavo Henrique De Rosa, Caio César Teodoro Mendes, Sebastien Bubeck, Tomasz Lukasz Religa, Saurabh Vasant Naik, Yan He, Subhabrata Mukherjee, Mojan Javaheripi
Small-world nets for fast neural network training and execution

Patent number: 11625614

Abstract: A method, a system, and a computer program product for fast training and/or execution of neural networks. A description of a neural network architecture is received. Based on the received description, a graph representation of the neural network architecture is generated. The graph representation includes one or more nodes connected by one or more connections. At least one connection is modified. Based on the generated graph representation, a new graph representation is generated using the modified at least one connection. The new graph representation has a small-world property. The new graph representation is transformed into a new neural network architecture.

Type: Grant

Filed: October 23, 2019

Date of Patent: April 11, 2023

Assignee: The Regents of the University of California

Inventors: Mojan Javaheripi, Farinaz Koushanfar, Bita Darvish Rouhani
SMALL-WORLD NETS FOR FAST NEURAL NETWORK TRAINING AND EXECUTION

Publication number: 20200125960

Abstract: A method, a system, and a computer program product for fast training and/or execution of neural networks. A description of a neural network architecture is received. Based on the received description, a graph representation of the neural network architecture is generated. The graph representation includes one or more nodes connected by one or more connections. At least one connection is modified. Based on the generated graph representation, a new graph representation is generated using the modified at least one connection. The new graph representation has a small-world property. The new graph representation is transformed into a new neural network architecture.

Type: Application

Filed: October 23, 2019

Publication date: April 23, 2020

Inventors: Mojan Javaheripi, Farinaz Koushanfar, Bita Darvish Rouhani

GENERATING SMALL LANGUAGE MODEL VIA TWO-PHASE TRAINING

QUANTIZATION AND CRYPTOGRAPHIC PROTOCOL BASED MACHINE LEARNING MODELS FOR CONFIDENTIAL DATA ANALYSIS AND INFERENCE

ZERO PEEK ROBUSTNESS CHECKS FOR FEDERATED LEARNING

TRANSFORMER-BASED AUTOREGRESSIVE LANGUAGE MODEL SELECTION

Small-world nets for fast neural network training and execution

SMALL-WORLD NETS FOR FAST NEURAL NETWORK TRAINING AND EXECUTION