Patents by Inventor Mojan Javaheripi

Mojan Javaheripi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250086471
    Abstract: Systems and methods for generating a small language model are provided. In particular, a computing device may obtain a general dataset including a plurality of general data, annotate a subset of the general dataset based on one or more classifier metrics indicative of a quality of the general dataset, train a classifier based on the annotated subset of the general dataset and the one or more classifier metrics, analyze each general data of the general dataset to determine a score for each of the one or more classifier metrics associated with the respective general data using the trained classifier, generate a filtered general dataset by filtering the general dataset based on one or more filters, train the small language model with the filtered general dataset, generate a synthetic dataset for refining the small language model, and train the small language model with the synthetic dataset.
    Type: Application
    Filed: June 4, 2024
    Publication date: March 13, 2025
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Sébastien BUBECK, Ronen ELDAN, Allison DEL GIORNO, Suriya GUNASEKAR, Yin Tat LEE, Yuanzhi Li, Mojan JAVAHERIPI
  • Publication number: 20250005200
    Abstract: In some embodiments, there is provided a system, which comprises a processor, and at least one non-transitory computer readable media storing instructions.
    Type: Application
    Filed: November 8, 2022
    Publication date: January 2, 2025
    Inventors: Mojan Javaheripi, Mohammad Samragh Razlighi, Siam Umar Hussain, Farinaz Koushanfar
  • Publication number: 20240346379
    Abstract: A computing system determines a median of a first number of mean values received from a first number of clusters, where each cluster of the first number of clients includes a first plurality of clients. Also, a threshold is determined based on the median, where the threshold applies to model updates. The median and the threshold are broadcast to all clients. Next, one or more clients that fail to provide a proof attesting that their model update is within the threshold of the median are dropped. Then, a second plurality of clients, not including the one or more dropped clients, participate in a final round of secure aggregation. Next, a final aggregate result is obtained, where the final aggregate result is based on the final round of secure aggregation. Then, one or more actions are performed based on the final aggregate result.
    Type: Application
    Filed: April 12, 2024
    Publication date: October 17, 2024
    Inventors: Zahra Ghodsi, Mojan Javaheripi, Nojan Sheybani, Xinqiao Zhang, Farinaz Koushanfar
  • Publication number: 20230214629
    Abstract: Generally discussed herein are devices, systems, and methods for improving architecture search and identification with constraints. A method can include receiving, at a compute device, a request for a transformer-based autoregressive language model (TBALM), the request specifying a maximum latency, identifying TBALM architectures that satisfies the maximum latency, identifying a TBALM architecture of the identified TBALM architectures that has a greatest number of decoder parameters resulting in an identified TBALM architecture, and providing the identified TBALM architecture.
    Type: Application
    Filed: December 30, 2021
    Publication date: July 6, 2023
    Inventors: Debadeepta Dey, Shital Rajnikant Shah, Gustavo Henrique De Rosa, Caio César Teodoro Mendes, Sebastien Bubeck, Tomasz Lukasz Religa, Saurabh Vasant Naik, Yan He, Subhabrata Mukherjee, Mojan Javaheripi
  • Patent number: 11625614
    Abstract: A method, a system, and a computer program product for fast training and/or execution of neural networks. A description of a neural network architecture is received. Based on the received description, a graph representation of the neural network architecture is generated. The graph representation includes one or more nodes connected by one or more connections. At least one connection is modified. Based on the generated graph representation, a new graph representation is generated using the modified at least one connection. The new graph representation has a small-world property. The new graph representation is transformed into a new neural network architecture.
    Type: Grant
    Filed: October 23, 2019
    Date of Patent: April 11, 2023
    Assignee: The Regents of the University of California
    Inventors: Mojan Javaheripi, Farinaz Koushanfar, Bita Darvish Rouhani
  • Publication number: 20200125960
    Abstract: A method, a system, and a computer program product for fast training and/or execution of neural networks. A description of a neural network architecture is received. Based on the received description, a graph representation of the neural network architecture is generated. The graph representation includes one or more nodes connected by one or more connections. At least one connection is modified. Based on the generated graph representation, a new graph representation is generated using the modified at least one connection. The new graph representation has a small-world property. The new graph representation is transformed into a new neural network architecture.
    Type: Application
    Filed: October 23, 2019
    Publication date: April 23, 2020
    Inventors: Mojan Javaheripi, Farinaz Koushanfar, Bita Darvish Rouhani