Patents by Inventor Krishna Kumar Singh

Krishna Kumar Singh has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

AFFORDANCE-BASED REPOSING OF AN OBJECT IN A SCENE

Publication number: 20240169701

Abstract: Systems and methods for inserting an object into a background are described. Examples of the systems and methods include obtaining a background image including a region for inserting the object, and encoding the background image to obtain an encoded background. A modified image is then generated based on the encoded background using a diffusion model. The modified image depicts the object within the region.

Type: Application

Filed: November 23, 2022

Publication date: May 23, 2024

Inventors: SUMITH KULAL, KRISHNA KUMAR SINGH, JIMEL YANG, JINGWAN LU, ALEXEI EFROS
MODIFYING POSES OF TWO-DIMENSIONAL HUMANS IN TWO-DIMENSIONAL IMAGES BY REPOSING THREE-DIMENSIONAL HUMAN MODELS REPRESENTING THE TWO-DIMENSIONAL HUMANS

Publication number: 20240144623

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify two-dimensional images via scene-based editing using three-dimensional representations of the two-dimensional images. For instance, in one or more embodiments, the disclosed systems utilize three-dimensional representations of two-dimensional images to generate and modify shadows in the two-dimensional images according to various shadow maps. Additionally, the disclosed systems utilize three-dimensional representations of two-dimensional images to modify humans in the two-dimensional images. The disclosed systems also utilize three-dimensional representations of two-dimensional images to provide scene scale estimation via scale fields of the two-dimensional images. In some embodiments, the disclosed systems utilizes three-dimensional representations of two-dimensional images to generate and visualize 3D planar surfaces for modifying objects in two-dimensional images.

Type: Application

Filed: April 20, 2023

Publication date: May 2, 2024

Inventors: Giorgio Gori, Yi Zhou, Yangtuanfeng Wang, Yang Zhou, Krishna Kumar Singh, Jae Shin Yoon, Duygu Ceylan Aksit
GENERATING THREE-DIMENSIONAL HUMAN MODELS REPRESENTING TWO-DIMENSIONAL HUMANS IN TWO-DIMENSIONAL IMAGES

Publication number: 20240144520

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify two-dimensional images via scene-based editing using three-dimensional representations of the two-dimensional images. For instance, in one or more embodiments, the disclosed systems utilize three-dimensional representations of two-dimensional images to generate and modify shadows in the two-dimensional images according to various shadow maps. Additionally, the disclosed systems utilize three-dimensional representations of two-dimensional images to modify humans in the two-dimensional images. The disclosed systems also utilize three-dimensional representations of two-dimensional images to provide scene scale estimation via scale fields of the two-dimensional images. In some embodiments, the disclosed systems utilizes three-dimensional representations of two-dimensional images to generate and visualize 3D planar surfaces for modifying objects in two-dimensional images.

Type: Application

Filed: April 20, 2023

Publication date: May 2, 2024

Inventors: Giorgio Gori, Yi Zhou, Yangtuanfeng Wang, Yang Zhou, Krishna Kumar Singh, Jae Shin Yoon, Duygu Ceylan Aksit
UTILIZING A GENERATIVE MACHINE LEARNING MODEL TO CREATE MODIFIED DIGITAL IMAGES FROM AN INFILL SEMANTIC MAP

Publication number: 20240135509

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Application

Filed: March 27, 2023

Publication date: April 25, 2024

Inventors: Qing Liu, Jianming Zhang, Krishna Kumar Singh, Scott Cohen, Zhe Lin
HUMAN INPAINTING UTILIZING A SEGMENTATION BRANCH FOR GENERATING AN INFILL SEGMENTATION MAP

Publication number: 20240135512

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Application

Filed: March 27, 2023

Publication date: April 25, 2024

Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
GENERATIVE MODEL FOR MULTI-MODALITY OUTPUTS FROM A SINGLE INPUT

Publication number: 20240135672

Abstract: An image generation system implements a multi-branch GAN to generate images that each express visually similar content in a different modality. A generator portion of the multi-branch GAN includes multiple branches that are each tasked with generating one of the different modalities. A discriminator portion of the multi-branch GAN includes multiple fidelity discriminators, one for each of the generator branches, and a consistency discriminator, which constrains the outputs generated by the different generator branches to appear visually similar to one another. During training, outputs from each of the fidelity discriminators and the consistency discriminator are used to compute a non-saturating GAN loss. The non-saturating GAN loss is used to refine parameters of the multi-branch GAN during training until model convergence. The trained multi-branch GAN generates multiple images from a single input, where each of the multiple images depicts visually similar content expressed in a different modality.

Type: Application

Filed: October 20, 2022

Publication date: April 25, 2024

Applicant: Adobe Inc.

Inventors: Yijun Li, Zhixin Shu, Zhen Zhu, Krishna Kumar Singh
UTILIZING A WARPED DIGITAL IMAGE WITH A REPOSING MODEL TO SYNTHESIZE A MODIFIED DIGITAL IMAGE

Publication number: 20240135513

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Application

Filed: March 27, 2023

Publication date: April 25, 2024

Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz
UTILIZING A GENERATIVE MACHINE LEARNING MODEL AND GRAPHICAL USER INTERFACE FOR CREATING MODIFIED DIGITAL IMAGES FROM AN INFILL SEMANTIC MAP

Publication number: 20240135510

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Application

Filed: March 27, 2023

Publication date: April 25, 2024

Inventors: Qing Liu, Jianming Zhang, Krishna Kumar Singh, Scott Cohen, Zhe Lin
MODIFYING DIGITAL IMAGES VIA MULTI-LAYERED SCENE COMPLETION FACILITATED BY ARTIFICIAL INTELLIGENCE

Publication number: 20240135514

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via multi-layered scene completion techniques facilitated by artificial intelligence. For instance, in some embodiments, the disclosed systems receive a digital image portraying a first object and a second object against a background, where the first object occludes a portion of the second object. Additionally, the disclosed systems pre-process the digital image to generate a first content fill for the portion of the second object occluded by the first object and a second content fill for a portion of the background occluded by the second object. After pre-processing, the disclosed systems detect one or more user interactions to move or delete the first object from the digital image. The disclosed systems further modify the digital image by moving or deleting the first object and exposing the first content fill for the portion of the second object.

Type: Application

Filed: September 1, 2023

Publication date: April 25, 2024

Inventors: Daniil Pakhomov, Qing Liu, Zhihong Ding, Scott Cohen, Zhe Lin, Jianming Zhang, Zhifei Zhang, Ohiremen Dibua, Mariette Souppe, Krishna Kumar Singh, Jonathan Brandt
SYNTHESIZING A MODIFIED DIGITAL IMAGE UTILIZING A REPOSING MODEL

Publication number: 20240135572

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Application

Filed: March 27, 2023

Publication date: April 25, 2024

Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz
GENERATING A MODIFIED DIGITAL IMAGE UTILIZING A HUMAN INPAINTING MODEL

Publication number: 20240135511

Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.

Type: Application

Filed: March 27, 2023

Publication date: April 25, 2024

Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz, Qing Liu, Jianming Zhang, Zhe Lin
VIDEO SUMMARIZATION USING SEMANTIC INFORMATION

Publication number: 20240127061

Abstract: Example apparatus disclosed herein are to process a first image of a first video segment from the image capture sensor with a machine learning algorithm to determine a first score for the first image, the machine learning algorithm to detect actions associated with images, the actions associated with labels. Disclosed example apparatus are also to determine a second score for the first video segment based on respective first scores for corresponding images in the first video segment. Disclosed example apparatus are further to determine, based on the second score, whether to retain the first video segment in the memory.

Type: Application

Filed: November 15, 2023

Publication date: April 18, 2024

Inventors: Myung Hwangbo, Krishna Kumar Singh, Teahyung Lee, Omesh Tickoo
DEBIASING IMAGE TO IMAGE TRANSLATION MODELS

Publication number: 20240046412

Abstract: A system debiases image translation models to produce generated images that contain minority attributes. A balanced batch for a minority attribute is created by over-sampling images having the minority attribute from an image dataset. An image translation model is trained using images from the balanced batch by applying supervised contrastive loss to output of an encoder of the image translation model and an auxiliary classifier loss based on predicted attributes in images generated by a decoder of the image translation model. Once trained, the image translation model is used to generate images with the minority image when given an input image having the minority attribute.

Type: Application

Filed: August 3, 2022

Publication date: February 8, 2024

Inventors: Md Mehrab Tanjim, Krishna Kumar Singh, Kushal Kafle, Ritwik Sinha
Generating synthesized digital images utilizing class-specific machine-learning models

Patent number: 11861762

Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that generate synthetized digital images using class-specific generators for objects of different classes. The disclosed system modifies a synthesized digital image by utilizing a plurality of class-specific generator neural networks to generate a plurality of synthesized objects according to object classes identified in the synthesized digital image. The disclosed system determines object classes in the synthesized digital image such as via a semantic label map corresponding to the synthesized digital image. The disclosed system selects class-specific generator neural networks corresponding to the classes of objects in the synthesized digital image. The disclosed system also generates a plurality of synthesized objects utilizing the class-specific generator neural networks based on contextual data associated with the identified objects.

Type: Grant

Filed: August 12, 2021

Date of Patent: January 2, 2024

Assignee: Adobe Inc.

Inventors: Yuheng Li, Yijun Li, Jingwan Lu, Elya Shechtman, Krishna Kumar Singh
Video summarization using semantic information

Patent number: 11861495

Abstract: Example apparatus disclosed herein are to process a first image of a first video segment from the image capture sensor with a machine learning algorithm to determine a first score for the first image, the machine learning algorithm to detect actions associated with images, the actions associated with labels. Disclosed example apparatus are also to determine a second score for the first video segment based on respective first scores for corresponding images in the first video segment. Disclosed example apparatus are further to determine, based on the second score, whether to retain the first video segment in the memory.

Type: Grant

Filed: March 15, 2021

Date of Patent: January 2, 2024

Assignee: Intel Corporation

Inventors: Myung Hwangbo, Krishna Kumar Singh, Teahyung Lee, Omesh Tickoo
Synthesizing digital images utilizing image-guided model inversion of an image classifier

Patent number: 11842468

Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that utilize image-guided model inversion of an image classifier with a discriminator. The disclosed systems utilize a neural network image classifier to encode features of an initial image and a target image. The disclosed system also reduces a feature distance between the features of the initial image and the features of the target image at a plurality of layers of the neural network image classifier by utilizing a feature distance regularizer. Additionally, the disclosed system reduces a patch difference between image patches of the initial image and image patches of the target image by utilizing a patch-based discriminator with a patch consistency regularizer. The disclosed system then generates a synthesized digital image based on the constrained feature set and constrained image patches of the initial image.

Type: Grant

Filed: February 18, 2021

Date of Patent: December 12, 2023

Assignee: Adobe Inc.

Inventors: Pei Wang, Yijun Li, Jingwan Lu, Krishna Kumar Singh
Diverse Image Inpainting Using Contrastive Learning

Publication number: 20230342884

Abstract: An image inpainting system is described that receives an input image that includes a masked region. From the input image, the image inpainting system generates a synthesized image that depicts an object in the masked region by selecting a first code that represents a known factor characterizing a visual appearance of the object and a second code that represents an unknown factor characterizing the visual appearance of the object apart from the known factor in latent space. The input image, the first code, and the second code are provided as input to a generative adversarial network that is trained to generate the synthesized image using contrastive losses. Different synthesized images are generated from the same input image using different combinations of first and second codes, and the synthesized images are output for display.

Type: Application

Filed: April 21, 2022

Publication date: October 26, 2023

Applicant: Adobe Inc.

Inventors: Krishna Kumar Singh, Yuheng Li, Yijun Li, Jingwan Lu, Elya Shechtman
Generating synthesized digital images utilizing a multi-resolution generator neural network

Patent number: 11769227

Abstract: This disclosure describes methods, non-transitory computer readable storage media, and systems that generate synthetized digital images via multi-resolution generator neural networks. The disclosed system extracts multi-resolution features from a scene representation to condition a spatial feature tensor and a latent code to modulate an output of a generator neural network. For example, the disclosed systems utilizes a base encoder of the generator neural network to generate a feature set from a semantic label map of a scene. The disclosed system then utilizes a bottom-up encoder to extract multi-resolution features and generate a latent code from the feature set. Furthermore, the disclosed system determines a spatial feature tensor by utilizing a top-down encoder to up-sample and aggregate the multi-resolution features. The disclosed system then utilizes a decoder to generate a synthesized digital image based on the spatial feature tensor and the latent code.

Type: Grant

Filed: August 12, 2021

Date of Patent: September 26, 2023

Assignee: Adobe Inc.

Inventors: Yuheng Li, Yijun Li, Jingwan Lu, Elya Shechtman, Krishna Kumar Singh
Image Inversion Using Multiple Latent Spaces

Publication number: 20230289970

Abstract: In implementations of systems for image inversion using multiple latent spaces, a computing device implements an inversion system to generate a segment map that segments an input digital image into a first image region and a second image region and assigns the first image region to a first latent space and the second image region to a second latent space that corresponds to a layer of a convolutional neural network. An inverted latent representation of the input digital image is computed using a binary mask for the second image region. The inversion system modifies the inverted latent representation of the input digital image using an edit direction vector that corresponds to a visual feature. An output digital image is generated that depicts a reconstruction of the input digital image having the visual feature based on the modified inverted latent representation of the input digital image.

Type: Application

Filed: March 14, 2022

Publication date: September 14, 2023

Applicant: Adobe Inc.

Inventors: Gaurav Parmar, Krishna Kumar Singh, Yijun Li, Richard Zhang, Jingwan Lu
GENERATING ANIMATED DIGITAL VIDEOS UTILIZING A CHARACTER ANIMATION NEURAL NETWORK INFORMED BY POSE AND MOTION EMBEDDINGS

Publication number: 20230123820

Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and method that utilize a character animation neural network informed by motion and pose signatures to generate a digital video through person-specific appearance modeling and motion retargeting. In particular embodiments, the disclosed systems implement a character animation neural network that includes a pose embedding model to encode a pose signature into spatial pose features. The character animation neural network further includes a motion embedding model to encode a motion signature into motion features. In some embodiments, the disclosed systems utilize the motion features to refine per-frame pose features and improve temporal coherency. In certain implementations, the disclosed systems also utilize the motion features to demodulate neural network weights used to generate an image frame of a character in motion based on the refined pose features.

Type: Application

Filed: October 15, 2021

Publication date: April 20, 2023

Inventors: Yangtuanfeng Wang, Duygu Ceylan Aksit, Krishna Kumar Singh, Niloy J Mitra

1 2 next