Patents by Inventor Krishna Kumar Singh
Krishna Kumar Singh has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 12657882Abstract: Systems and methods for training a Generative Adversarial Network (GAN) using feature regularization are described herein. Embodiments are configured to generate a candidate image using a generator network of a GAN, classify the candidate image as real or generated using a discriminator network of the GAN, and train the GAN to generate realistic images based on the classifying of the candidate image. The training process includes regularizing a gradient with respect to features extracted using a discriminator network of the GAN.Type: GrantFiled: July 24, 2023Date of Patent: June 16, 2026Assignee: ADOBE INC.Inventors: Min Jin Chong, Krishna Kumar Singh, Yijun Li, Jingwan Lu
-
Publication number: 20260134589Abstract: A method, apparatus, non-transitory computer readable medium, and system for image generation includes obtaining a source image and a lighting input that indicates a lighting condition for the source image. An image generation model generates a relighted foreground image based on the source image. A relighted background image is also generated based on the lighting input. The relighted foreground image depicts a foreground element with the lighting condition and the relighted background image depicts a background element with the lighting condition. The relighted foreground image and the relighted background image are combined to obtain a relighted image, wherein the relighted image depicts the foreground element and the background element with the lighting condition.Type: ApplicationFiled: November 14, 2024Publication date: May 14, 2026Inventors: Junuk Cha, Jae Shin Yoon, Mengwei Ren, Krishna Kumar Singh, Seunghyun Yoon, He Zhang, Yannick Hold-Geoffroy, HyunJoon Jung
-
Patent number: 12626461Abstract: A modeling system accesses a two-dimensional (2D) input image displayed via a user interface, the 2D input image depicting, at a first view, a first object. At least one region of the first object is not represented by pixel values of the 2D input image. The modeling system generates, by applying a 3D representation generation model to the 2D input image, a three-dimensional (3D) representation of the first object that depicts an entirety of the first object including the first region. The modeling system displays, via the user interface, the 3D representation, wherein the 3D representation is viewable via the user interface from a plurality of views including the first view.Type: GrantFiled: September 5, 2023Date of Patent: May 12, 2026Assignee: Adobe Inc.Inventors: Jae Shin Yoon, Yangtuanfeng Wang, Krishna Kumar Singh, Junying Wang, Jingwan Lu
-
Patent number: 12626423Abstract: Systems and methods for image processing (e.g., image extension or image uncropping) using neural networks are described. One or more aspects include obtaining an image (e.g., a source image, a user provided image, etc.) having an initial aspect ratio, and identifying a target aspect ratio (e.g., via user input) that is different from the initial aspect ratio. The image may be positioned in an image frame having the target aspect ratio, where the image frame includes an image region containing the image and one or more extended regions outside the boundaries of the image. An extended image may be generated (e.g., using a generative neural network), where the extended image includes the image in the image region as well as generated image portions in the extended regions and the one or more generated image portions comprise an extension of a scene element depicted in the image.Type: GrantFiled: March 20, 2024Date of Patent: May 12, 2026Assignee: ADOBE INC.Inventors: Yuqian Zhou, Elya Shechtman, Zhe Lin, Krishna Kumar Singh, Jingwan Lu, Connelly Stuart Barnes, Sohrab Amirghodsi
-
Patent number: 12626431Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for utilizing machine learning models to generate modified digital images. In particular, in some embodiments, the disclosed systems generate image editing directions between textual identifiers of two visual features utilizing a language prediction machine learning model and a text encoder. In some embodiments, the disclosed systems generated an inversion of a digital image utilizing a regularized inversion model to guide forward diffusion of the digital image. In some embodiments, the disclosed systems utilize cross-attention guidance to preserve structural details of a source digital image when generating a modified digital image with a diffusion neural network.Type: GrantFiled: March 3, 2023Date of Patent: May 12, 2026Assignee: Adobe Inc.Inventors: Yijun Li, Richard Zhang, Krishna Kumar Singh, Jingwan Lu, Gaurav Parmar, Jun-Yan Zhu
-
Patent number: 12614301Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify digital images via scene-based editing using image understanding facilitated by artificial intelligence. For example, in one or more embodiments the disclosed systems utilize generative machine learning models to create modified digital images portraying human subjects. In particular, the disclosed systems generate modified digital images by performing infill modifications to complete a digital image or human inpainting for portions of a digital image that portrays a human. Moreover, in some embodiments, the disclosed systems perform reposing of subjects portrayed within a digital image to generate modified digital images. In addition, the disclosed systems in some embodiments perform facial expression transfer and facial expression animations to generate modified digital images or animations.Type: GrantFiled: March 27, 2023Date of Patent: April 28, 2026Assignee: Adobe Inc.Inventors: Krishna Kumar Singh, Yijun Li, Jingwan Lu, Duygu Ceylan Aksit, Yangtuanfeng Wang, Jimei Yang, Tobias Hinz
-
Patent number: 12608858Abstract: An image processing system obtains an input image (e.g., a user provided image, etc.) and a mask indicating an edit region of the image. A user selects an image editing mode for an image generation network from a plurality of image editing modes. The image generation network generates an output image using the input image, the mask, and the image editing mode.Type: GrantFiled: September 26, 2023Date of Patent: April 21, 2026Assignee: ADOBE INC.Inventors: Yuqian Zhou, Krishna Kumar Singh, Zhifei Zhang, Difan Liu, Zhe Lin, Jianming Zhang, Qing Liu, Jingwan Lu, Elya Shechtman, Sohrab Amirghodsi, Connelly Stuart Barnes
-
Publication number: 20260105630Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that modify two-dimensional images via scene-based editing using three-dimensional representations of the two-dimensional images. For instance, in one or more embodiments, the disclosed systems utilize three-dimensional representations of two-dimensional images to generate and modify shadows in the two-dimensional images according to various shadow maps. Additionally, the disclosed systems utilize three-dimensional representations of two-dimensional images to modify humans in the two-dimensional images. The disclosed systems also utilize three-dimensional representations of two-dimensional images to provide scene scale estimation via scale fields of the two-dimensional images. In some embodiments, the disclosed systems utilizes three-dimensional representations of two-dimensional images to generate and visualize 3D planar surfaces for modifying objects in two-dimensional images.Type: ApplicationFiled: October 14, 2025Publication date: April 16, 2026Inventors: Giorgio Gori, Yi Zhou, Yangtuanfeng Wang, Yang Zhou, Krishna Kumar Singh, Jae Shin Yoon, Duygu Ceylan Aksit
-
Publication number: 20260105661Abstract: A method, apparatus, non-transitory computer readable medium, and system for image processing include obtaining an input image and a modification prompt, where the input image depicts an object with a first attribute and the modification prompt describes a modification from the first attribute to a second attribute different from the first attribute, encoding the modification prompt to obtain a text embedding, where the text embedding represents the modification in an embedding space, and generating a modified image based on the input image and the text embedding, where the modified image depicts the object with the second attribute.Type: ApplicationFiled: June 26, 2025Publication date: April 16, 2026Inventors: Nanxuan Zhao, Yilin Wang, Hui Qu, Yufan Zhou, Zhe Lin, Krishna Kumar Singh, Qing Liu, Yuheng Li, Yu-Teng Li, Wei-An Lin
-
Publication number: 20260094308Abstract: A method, apparatus, non-transitory computer readable medium, and system for text generation includes obtaining an input image including a first element having a first value of an attribute and a second element having a second value of the attribute. An image encoder of a multi-modal machine learning model then encodes the input image to obtain an image embedding and a text decoder of the multi-modal machine learning model generates an output text based on the image embedding. The output text indicates that the first element has the first value of the attribute and that the second element has the second value of the attribute.Type: ApplicationFiled: September 27, 2024Publication date: April 2, 2026Inventors: Yuheng Li, Krishna Kumar Singh, Yijun Li, Elya Shechtman, Zhe Lin
-
Patent number: 12586344Abstract: An image generation system implements a multi-branch GAN to generate images that each express visually similar content in a different modality. A generator portion of the multi-branch GAN includes multiple branches that are each tasked with generating one of the different modalities. A discriminator portion of the multi-branch GAN includes multiple fidelity discriminators, one for each of the generator branches, and a consistency discriminator, which constrains the outputs generated by the different generator branches to appear visually similar to one another. During training, outputs from each of the fidelity discriminators and the consistency discriminator are used to compute a non-saturating GAN loss. The non-saturating GAN loss is used to refine parameters of the multi-branch GAN during training until model convergence. The trained multi-branch GAN generates multiple images from a single input, where each of the multiple images depicts visually similar content expressed in a different modality.Type: GrantFiled: October 21, 2022Date of Patent: March 24, 2026Assignee: Adobe Inc.Inventors: Yijun Li, Zhixin Shu, Zhen Zhu, Krishna Kumar Singh
-
Publication number: 20260080578Abstract: The present disclosure relates to systems, methods, and non-transitory computer-readable media that performs shadow removal and harmonizes lighting properties of a foreground with a background. Furthermore, the disclosed systems receive a shadow removal request for an input digital image that includes a foreground object with a shadow occluding at least part of the foreground object. Moreover, the disclosed systems generate a combined embedding from a mask of the foreground object and the input digital image. Further, the disclosed systems generate a modified digital image without the shadow occluding at least part of the foreground object and lighting properties of the foreground object harmonized with lighting properties of a background.Type: ApplicationFiled: September 16, 2024Publication date: March 19, 2026Inventors: Jae Shin Yoon, Zhixin Shu, Yannick Hold-Geoffroy, Xuaner Zhang, Srujani Kamineni, Mengwei Ren, Krishna Kumar Singh, He Zhang
-
Patent number: 12573004Abstract: Embodiments include systems and methods for generative image filling based on text and a reference image. In one aspect, the system obtains an input image, a reference image, and a text prompt. Then, the system encodes the reference image to obtain an image embedding and encodes the text prompt to obtain a text embedding. Subsequently, a composite image is generated based on the input image, the image embedding, and the text embedding.Type: GrantFiled: November 21, 2023Date of Patent: March 10, 2026Assignee: ADOBE INC.Inventors: Yuqian Zhou, Krishna Kumar Singh, Zhe Lin, Qing Liu, Zhifei Zhang, Sohrab Amirghodsi, Elya Shechtman, Jingwan Lu
-
Patent number: 12561956Abstract: Systems and methods for inserting an object into a background are described. Examples of the systems and methods include obtaining a background image including a region for inserting the object, and encoding the background image to obtain an encoded background. A modified image is then generated based on the encoded background using a diffusion model. The modified image depicts the object within the region.Type: GrantFiled: November 23, 2022Date of Patent: February 24, 2026Assignee: ADOBE INC.Inventors: Sumith Kulal, Krishna Kumar Singh, Jimei Yang, Jingwan Lu, Alexei Efros
-
Publication number: 20260051091Abstract: A method, apparatus, non-transitory computer readable medium, and system for image generation includes obtaining an input image and an input prompt. In some cases, the input image depicts a scene and the input prompt indicates a target element to be added to the scene. The image generation model generates a normalized output based on the input image and the input prompt by performing a channel shift on a preliminary output of the image generation model. A synthetic image is generated including the scene of the input image and the target element of the input prompt that is harmonized with the scene.Type: ApplicationFiled: August 16, 2024Publication date: February 19, 2026Inventors: Zhe Lin, Yuqian Zhou, Krishna Kumar Singh
-
Patent number: 12555288Abstract: A method, apparatus, and non-transitory computer readable medium for image generation are described. Embodiments of the present disclosure obtain a content input and a style input via a user interface or from a database. The content input includes a target spatial layout and the style input includes a target style. A content encoder of an image processing apparatus encodes the content input to obtain a spatial layout mask representing the target spatial layout. A style encoder of the image processing apparatus encodes the style input to obtain a style embedding representing the target style. An image generation model of the image processing apparatus generates an image based on the spatial layout mask and the style embedding, where the image includes the target spatial layout and the target style.Type: GrantFiled: September 1, 2023Date of Patent: February 17, 2026Assignee: ADOBE INC.Inventors: Wonwoong Cho, Hareesh Ravi, Midhun Harikumar, Vinh Ngoc Khuc, Krishna Kumar Singh, Jingwan Lu, Ajinkya Gorakhnath Kale
-
Publication number: 20260038126Abstract: In one implementation of subject-aware background video generation, a processing device generates mask data and foreground feature data from frames of a subject video. The mask data separates a subject depicted in the subject video from an environment therein. The foreground feature data describes the features of the subject. The processing device receives a condition frame that depicts a different environment. A machine-learning model generates a composite video by aligning the subject's movement with the different environment from inputs of the foreground feature data, the mask data, and the condition frame, which conditions the generation of the different environment for the composite video. The processing device then presents the composite video via a user interface.Type: ApplicationFiled: July 30, 2024Publication date: February 5, 2026Applicant: Adobe Inc.Inventors: Zhan Xu, Yang Zhou, Krishna Kumar Singh, Jimei Yang, Chun-hao Huang, Boxiao Pan
-
Publication number: 20260024171Abstract: A method, apparatus, non-transitory computer readable medium, and system for image generation includes obtaining an object image and a target lighting indicator, generating a shading map based on the object image and the target lighting indicator, and generating a relighted image based on the object image and the shading map. The relighted image depicts an object from the object image with lighting based on the target lighting indicator.Type: ApplicationFiled: July 17, 2024Publication date: January 22, 2026Inventors: Junying Wang, Jae Shin Yoon, Jingyuan Liu, Xin Sun, Krishna Kumar Singh, Zhixin Shu, He Zhang, Jimei Yang, Yangtuanfeng Wang, Nanxuan Zhao, Simon Chen
-
Patent number: 12524839Abstract: Embodiments include systems and methods for generative image filling based on text and a reference image. In one aspect, the system obtains an input image, a reference image, and a text prompt. Then, the system encodes the reference image to obtain an image embedding and encodes the text prompt to obtain a text embedding. Subsequently, a composite image is generated based on the input image, the image embedding, and the text embedding.Type: GrantFiled: November 21, 2023Date of Patent: January 13, 2026Assignee: ADOBE INC.Inventors: Yuqian Zhou, Krishna Kumar Singh, Zhe Lin, Qing Liu, Zhifei Zhang, Sohrab Amirghodsi, Elya Shechtman, Jingwan Lu
-
Patent number: 12518358Abstract: The present disclosure relates to systems, non-transitory computer-readable media, and methods for utilizing machine learning models to generate modified digital images. In particular, in some embodiments, the disclosed systems generate image editing directions between textual identifiers of two visual features utilizing a language prediction machine learning model and a text encoder. In some embodiments, the disclosed systems generated an inversion of a digital image utilizing a regularized inversion model to guide forward diffusion of the digital image. In some embodiments, the disclosed systems utilize cross-attention guidance to preserve structural details of a source digital image when generating a modified digital image with a diffusion neural network.Type: GrantFiled: March 3, 2023Date of Patent: January 6, 2026Assignee: Adobe Inc.Inventors: Yijun Li, Richard Zhang, Krishna Kumar Singh, Jingwan Lu, Gaurav Parmar, Jun-Yan Zhu