Patents by Inventor Devi Niru Parikh

Devi Niru Parikh has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS, APPARATUSES AND COMPUTER PROGRAM PRODUCTS FOR IMAGE EDITING VIA RECOGNITION AND GENERATION TASKS

Publication number: 20260127792

Abstract: Methods and systems are provided to edit or update images or videos based on instructions. A system may analyze an input image and may determine an instruction associated with the input image. The instruction may include content to edit or update the input image. The system may select an edit task, among predetermined edit tasks associated with changes to images, based on a description of the content of the instruction. The system may generate an output image, based on implementing the selected edit task, including an update to the input image depicting the description of the content of the instruction.

Type: Application

Filed: November 3, 2025

Publication date: May 7, 2026

Inventors: Adam Polyak, Yuval Kirstain, Yaniv Nechemia Taigman, Shelly Sheynin, Uriel Singer, Amit Zohar, Devi Niru Parikh
EQUIPPING MACHINE LEARNING MODELS WITH SOCIAL NETWORK KNOWLEDGE, VIDEO EDITING VIA FACTORIZED DIFFUSION DISTILLATION & EFFICIENT DEPTH STABILIZER FOR MIXED REALITY & AUGMENTED REALITY

Publication number: 20260006286

Abstract: Various systems, methods, and devices are described for utilizing artificial intelligence (AI) bot (e.g., a chatbot) to fetch or create content associated with a third-party platform based on an input associated with an electronic device. In an example, systems and methods of AI bot fetching or creating content may include receiving an input, via a user device. The input may be textual, audible, or any other suitable method. Based on the input, one or more content items may be fetched or created. The machine learning model may be utilized to determine context associated with the input. The machine leaning model may determine a number of content items associated with the input and data sources related to the retrieval generators. A result may be presented to a user, where the result may comprise the one or more content items determined.

Type: Application

Filed: May 20, 2025

Publication date: January 1, 2026

Inventors: Hong Yan, Adam Polyak, Yaniv Nechemia Taigman, Devi Niru Parikh, Rakesh Ranjan, Hao Jiang, Shelly Sheynin, Uriel Singer, Yuval Kirstain, Jingqing Huang, Amit Zohar
Task-specific text generation based on multimodal inputs

Patent number: 12236192

Abstract: A system and method for generating task-specific text by processing multimodal inputs using machine-learning models is provided. The method may include accessing first sets of tokens associated with a desired task and one or more modalities associated with a context of the desired task. The method may further include determining a second set of tokens for each of the one or more modalities using a classifier network associated with the modality. The method may further include generating a number of embedding vectors by mapping the first sets of tokens and the second set of tokens associated with each of the one or more modalities to an embedding space. The method may further include producing a sequence of words addressing the desired task by processing the number of embedding vectors with an encoder-decoder network.

Type: Grant

Filed: June 4, 2021

Date of Patent: February 25, 2025

Assignee: Meta Platforms, Inc.

Inventors: Xudong Lin, Gediminas Bertasius, Jue Wang, Devi Niru Parikh, Lorenzo Torresani
AVATAR PERSONALIZATION USING IMAGE GENERATION

Publication number: 20240257470

Abstract: A method and system for personalized avatar generation. The method includes receiving a text prompt and generating a first pose based on the text prompt using a first model. The method also includes re-targeting the first pose to a target avatar body. The method also includes identifying a predefined avatar configuration corresponding to a user based on a profile of the user. The method also includes converting the target avatar body by applying the predefined avatar configuration to the target avatar body. The method also includes rendering an avatar, using a second model, based on the target avatar body with the predefined avatar configuration, wherein the avatar is in the first pose.

Type: Application

Filed: August 30, 2023

Publication date: August 1, 2024

Inventors: Sonal Gupta, Samaneh Azadi, Mian Akbar Shah, Thomas Falstad Hayes, Devi Niru Parikh
TEXT TO VIDEO GENERATION

Publication number: 20240155071

Abstract: A method and system for text-to-video generation. The method includes receiving a text input, generating a representation frame based on the text input using a model trained on text-image pairs, generating a set of frames based on the representation frame and a first frame rate, interpolating the set of frames to a higher frame rate, generating a first video based on the interpolated set of frames, increasing a resolution of the first video based on a first and second super-resolution model, and generating an output video based on a result of the super-resolution models.

Type: Application

Filed: September 29, 2023

Publication date: May 9, 2024

Inventors: Sonal Gupta, Adam Polyak, Thomas Falstad Hayes, Xi Yin, Jie An, Chao Yang, Oron Ashual, Oran Gafni, Devi Niru Parikh, Yaniv Nechemia Taigman, Uriel Singer, Songyang Zhang, Qiyuan Hu
GENERATING AUDIO FILES FROM TEXT INPUT

Publication number: 20240112687

Abstract: Methods, systems, and storage media for generating audio data includes receiving a text input. The method also includes receiving a plurality of representative audio sources and encoding the plurality of representative audio sources into a plurality of audio tokens. The method includes encoding the text input into a plurality of text representations. The method comprises mapping each audio tokens of the plurality of audio tokens to a text representation of the plurality of text representations. The method also comprises determining a relationship score based on mapping each audio tokens to the text representation, wherein the relationship score identifies a distribution of audio tokens from the plurality of audio tokens. The method and systems can also comprise decoding the subgroup of audio tokens to yield a reconstructed audio source.

Type: Application

Filed: September 29, 2023

Publication date: April 4, 2024

Inventors: Yaniv Nechemia Taigman, Felix Kruk, Yossef Mordechay Adi, Gabriel Synnaeve, Adam Polyak, Uriel Singer, Devi Niru Parikh, Alexandre Défossez, Jade Copet
Task-Specific Text Generation Based On Multimodal Inputs

Publication number: 20220222435

Abstract: In one embodiment, a method includes accessing first sets of tokens associated with a desired task and one or more modalities associated with a context of the desired task, determining a second set of tokens for each of the one or more modalities using a classifier network associated with the modality, generating a number of embedding vectors by mapping the first sets of tokens and the second set of tokens associated with each of the one or more modalities to an embedding space, and producing a sequence of words addressing the desired task by processing the number of embedding vectors with an encoder-decoder network.

Type: Application

Filed: June 4, 2021

Publication date: July 14, 2022

Inventors: Xudong Lin, Gediminas Bertasius, Jue Wang, Devi Niru Parikh, Lorenzo Torresani

METHODS, APPARATUSES AND COMPUTER PROGRAM PRODUCTS FOR IMAGE EDITING VIA RECOGNITION AND GENERATION TASKS

EQUIPPING MACHINE LEARNING MODELS WITH SOCIAL NETWORK KNOWLEDGE, VIDEO EDITING VIA FACTORIZED DIFFUSION DISTILLATION & EFFICIENT DEPTH STABILIZER FOR MIXED REALITY & AUGMENTED REALITY

Task-specific text generation based on multimodal inputs

AVATAR PERSONALIZATION USING IMAGE GENERATION

TEXT TO VIDEO GENERATION

GENERATING AUDIO FILES FROM TEXT INPUT

Task-Specific Text Generation Based On Multimodal Inputs