Patents by Inventor Niloy Mitra
Niloy Mitra has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11880766
Abstract: An improved system architecture uses a pipeline including a Generative Adversarial Network (GAN) including a generator neural network and a discriminator neural network to generate an image. An input image in a first domain and information about a target domain are obtained. The domains correspond to image styles. An initial latent space representation of the input image is produced by encoding the input image. An initial output image is generated by processing the initial latent space representation with the generator neural network. Using the discriminator neural network, a score is computed indicating whether the initial output image is in the target domain. A loss is computed based on the computed score. The loss is minimized to compute an updated latent space representation. The updated latent space representation is processed with the generator neural network to generate an output image in the target domain.
Type: Grant
Filed: July 23, 2021
Date of Patent: January 23, 2024
Assignee: Adobe Inc.
Inventors: Cameron Smith, Ratheesh Kalarot, Wei-An Lin, Richard Zhang, Niloy Mitra, Elya Shechtman, Shabnam Ghadar, Zhixin Shu, Yannick Hold-Geoffrey, Nathan Carr, Jingwan Lu, Oliver Wang, Jun-Yan Zhu
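The loop the abstract describes (score the generated image with the discriminator, turn the score into a loss, and minimize over the latent code) can be sketched with toy linear stand-ins for the networks. Everything below is illustrative, not from the patent: `generator`, `discriminator`, and all coefficients are invented, and finite-difference descent stands in for backpropagation.

```python
import math

def generator(z):
    # toy stand-in for the generator network: latent code -> "image" (2 numbers)
    return [2.0 * z[0] + 0.5, -1.0 * z[1] + 0.3]

def discriminator(x):
    # toy stand-in for the domain discriminator: score in (0, 1),
    # higher = more likely the image is in the target domain
    s = 1.5 * x[0] - 0.8 * x[1]
    return 1.0 / (1.0 + math.exp(-s))

def domain_loss(z):
    # low when the generated image scores as "in the target domain"
    return (1.0 - discriminator(generator(z))) ** 2

def optimize_latent(z, steps=200, lr=0.5, eps=1e-5):
    # minimize the loss over the latent code by finite-difference descent
    z = list(z)
    for _ in range(steps):
        grad = []
        for i in range(len(z)):
            zp = list(z)
            zp[i] += eps
            grad.append((domain_loss(zp) - domain_loss(z)) / eps)
        z = [zi - lr * g for zi, g in zip(z, grad)]
    return z

z_init = [0.0, 0.0]                   # initial latent space representation
z_updated = optimize_latent(z_init)   # updated latent space representation
styled_image = generator(z_updated)   # output image in the target domain
```

The key design point the abstract hinges on: the generator and discriminator are frozen, and only the latent code is updated.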
-
Patent number: 11875221
Abstract: Systems and methods generate a filtering function for editing an image with reduced attribute correlation. An image editing system groups training data into bins according to a distribution of a target attribute. For each bin, the system samples a subset of the training data based on a pre-determined target distribution of a set of additional attributes in the training data. The system identifies a direction in the sampled training data corresponding to the distribution of the target attribute to generate a filtering vector for modifying the target attribute in an input image, obtains a latent space representation of an input image, applies the filtering vector to the latent space representation of the input image to generate a filtered latent space representation of the input image, and provides the filtered latent space representation as input to a neural network to generate an output image with a modification to the target attribute.
Type: Grant
Filed: September 7, 2021
Date of Patent: January 16, 2024
Assignee: Adobe Inc.
Inventors: Wei-An Lin, Baldo Faieta, Cameron Smith, Elya Shechtman, Jingwan Lu, Jun-Yan Zhu, Niloy Mitra, Ratheesh Kalarot, Richard Zhang, Shabnam Ghadar, Zhixin Shu
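The binning-and-resampling idea can be sketched on synthetic data: bin by the target attribute, rebalance a confounding attribute inside each bin, then take the direction between bin means as the filtering vector. The 2-D latent space, the Gaussian data, and the difference-of-means direction are all illustrative assumptions, not the patent's construction.

```python
import random

random.seed(0)

# toy training set: latent code = [attribute-correlated dim, noise dim],
# plus a target attribute (0 or 1) and a confounding binary attribute
data = [([random.gauss(a, 0.1), random.gauss(0.0, 0.3)], a, random.choice([0, 1]))
        for a in [0.0] * 50 + [1.0] * 50]

# group the training data into bins by the target attribute
low = [d for d in data if d[1] < 0.5]
high = [d for d in data if d[1] >= 0.5]

def balanced(bin_):
    # sample equal numbers of each confound value within the bin, so the
    # filtering direction does not pick up the correlated attribute
    zeros = [d for d in bin_ if d[2] == 0]
    ones = [d for d in bin_ if d[2] == 1]
    n = min(len(zeros), len(ones))
    return zeros[:n] + ones[:n]

def mean(samples):
    return [sum(s[0][i] for s in samples) / len(samples) for i in range(2)]

m_lo, m_hi = mean(balanced(low)), mean(balanced(high))
filtering_vector = [h - l for h, l in zip(m_hi, m_lo)]

# edit: shift a latent code along the filtering vector; the shifted code
# would then be fed to the generator network to produce the output image
w = [0.0, 0.0]
w_filtered = [wi + 1.0 * vi for wi, vi in zip(w, filtering_vector)]
```

After balancing, the recovered direction points almost entirely along the attribute-correlated dimension, which is the decorrelation effect the abstract claims.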
-
Patent number: 11682166
Abstract: Embodiments provide systems, methods, and computer storage media for fitting 3D primitives to a 3D point cloud. In an example embodiment, 3D primitives are fit to a 3D point cloud using a global primitive fitting network that evaluates the entire 3D point cloud and a local primitive fitting network that evaluates local patches of the 3D point cloud. The global primitive fitting network regresses a representation of larger (global) primitives that fit the global structure. To identify smaller 3D primitives for regions with fine detail, local patches are constructed by sampling from a pool of points likely to contain fine detail, and the local primitive fitting network regresses a representation of smaller (local) primitives that fit the local structure of each of the local patches. The global and local primitives are merged into a combined, multi-scale set of fitted primitives, and representative primitive parameters are computed for each fitted primitive.
Type: Grant
Filed: March 15, 2021
Date of Patent: June 20, 2023
Assignee: Adobe Inc.
Inventors: Eric-Tuan Le, Duygu Ceylan Aksit, Tamy Boubekeur, Radomir Mech, Niloy Mitra, Minhyuk Sung
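The global/local split can be illustrated in a deliberately tiny setting: least-squares lines stand in for the fitting networks, a 2-D point set stands in for the 3-D point cloud, and points the global primitive explains poorly form the "fine detail" patch. All data and the residual threshold below are invented for illustration.

```python
def fit_line(pts):
    # least-squares fit of y = a*x + b; a "primitive" here is a line
    n = len(pts)
    sx = sum(p[0] for p in pts)
    sy = sum(p[1] for p in pts)
    sxx = sum(p[0] * p[0] for p in pts)
    sxy = sum(p[0] * p[1] for p in pts)
    a = (n * sxy - sx * sy) / (n * sxx - sx * sx)
    b = (sy - a * sx) / n
    return a, b

def residuals(pts, line):
    a, b = line
    return [abs(p[1] - (a * p[0] + b)) for p in pts]

# toy "point cloud": a coarse line plus a small offset detail structure
coarse = [(i / 10, i / 10) for i in range(40)]
detail = [(2.0 + j / 100, 3.5 + j / 100) for j in range(10)]
cloud = coarse + detail

# global fitting: one primitive regressed from the whole cloud
g_line = fit_line(cloud)

# local fitting: build a patch from points the global primitive fits poorly
# (the pool of points likely to contain fine detail), fit a local primitive
res = residuals(cloud, g_line)
patch = [p for p, r in zip(cloud, res) if r > 0.5]
l_line = fit_line(patch)

# merge into a combined, multi-scale set of fitted primitives
primitives = [g_line, l_line]
```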
-
Patent number: 11640684
Abstract: A method, apparatus, and non-transitory computer readable medium for image processing are described. Embodiments of the method, apparatus, and non-transitory computer readable medium include identifying an original image including a plurality of semantic attributes, wherein each of the semantic attributes represents a complex set of features of the original image; identifying a target attribute value that indicates a change to a target attribute of the semantic attributes; computing a modified feature vector based on the target attribute value, wherein the modified feature vector incorporates the change to the target attribute while holding at least one preserved attribute of the semantic attributes substantially unchanged; and generating a modified image based on the modified feature vector, wherein the modified image includes the change to the target attribute and retains the at least one preserved attribute from the original image.
Type: Grant
Filed: July 21, 2020
Date of Patent: May 2, 2023
Assignees: Adobe Inc., King Abdullah University of Science and Technology
Inventors: Rameen Abdal, Peter Wonka, Niloy Mitra, Peihao Zhu
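One standard way to change a target attribute while holding a preserved attribute fixed is to project the preserved attribute's direction out of the edit direction before applying it. The sketch below uses that projection trick with invented 3-D attribute directions; the patent does not specify this particular construction.

```python
def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def scale(u, s):
    return [a * s for a in u]

def sub(u, v):
    return [a - b for a, b in zip(u, v)]

# assumed attribute directions in feature space (toy 3-D example)
target_dir = [1.0, 1.0, 0.0]      # direction of the target attribute
preserved_dir = [0.0, 1.0, 0.0]   # direction of the preserved attribute

def orthogonalize(d, keep):
    # remove the component along the preserved direction, so moving along
    # the result leaves the preserved attribute unchanged
    return sub(d, scale(keep, dot(d, keep) / dot(keep, keep)))

edit_dir = orthogonalize(target_dir, preserved_dir)

w = [0.2, -0.1, 0.5]              # feature vector of the original image
target_attribute_value = 2.0      # requested change to the target attribute
w_mod = [a + target_attribute_value * b for a, b in zip(w, edit_dir)]
```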
-
Publication number: 20220292765
Abstract: Embodiments provide systems, methods, and computer storage media for fitting 3D primitives to a 3D point cloud. In an example embodiment, 3D primitives are fit to a 3D point cloud using a global primitive fitting network that evaluates the entire 3D point cloud and a local primitive fitting network that evaluates local patches of the 3D point cloud. The global primitive fitting network regresses a representation of larger (global) primitives that fit the global structure. To identify smaller 3D primitives for regions with fine detail, local patches are constructed by sampling from a pool of points likely to contain fine detail, and the local primitive fitting network regresses a representation of smaller (local) primitives that fit the local structure of each of the local patches. The global and local primitives are merged into a combined, multi-scale set of fitted primitives, and representative primitive parameters are computed for each fitted primitive.
Type: Application
Filed: March 15, 2021
Publication date: September 15, 2022
Inventors: Eric-Tuan Le, Duygu Ceylan Aksit, Tamy Boubekeur, Radomir Mech, Niloy Mitra, Minhyuk Sung
-
Publication number: 20220122222
Abstract: An improved system architecture uses a Generative Adversarial Network (GAN) including a specialized generator neural network to generate multiple resolution output images. The system produces a latent space representation of an input image. The system generates a first output image at a first resolution by providing the latent space representation of the input image as input to a generator neural network comprising an input layer, an output layer, and a plurality of intermediate layers, and taking the first output image from an intermediate layer of the plurality of intermediate layers of the generator neural network. The system generates a second output image at a second resolution different from the first resolution by providing the latent space representation of the input image as input to the generator neural network and taking the second output image from the output layer of the generator neural network.
Type: Application
Filed: July 23, 2021
Publication date: April 21, 2022
Inventors: Cameron Smith, Ratheesh Kalarot, Wei-An Lin, Richard Zhang, Niloy Mitra, Elya Shechtman, Shabnam Ghadar, Zhixin Shu, Yannick Hold-Geoffrey, Nathan Carr, Jingwan Lu, Oliver Wang, Jun-Yan Zhu
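Tapping images from different layers works because progressive generators grow resolution layer by layer. A minimal sketch, using 1-D "images" and nearest-neighbour doubling as a stand-in for the generator's upsampling layers (the layer count and data are invented):

```python
def upsample(img):
    # nearest-neighbour doubling of a 1-D "image"
    out = []
    for v in img:
        out += [v, v]
    return out

def generator_layers(latent, n_layers=4):
    # run the latent code through all layers, keeping every activation
    activations = []
    img = latent
    for _ in range(n_layers):
        img = upsample(img)
        activations.append(img)
    return activations

latent = [0.1, 0.9]        # latent space representation (length 2)
acts = generator_layers(latent)

low_res = acts[1]          # first output image, taken from an intermediate layer
high_res = acts[-1]        # second output image, taken from the output layer
```

Both outputs come from a single forward pass, which is the efficiency point of reading an intermediate layer instead of rerunning a smaller generator.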
-
Publication number: 20220122232
Abstract: Systems and methods generate a filtering function for editing an image with reduced attribute correlation. An image editing system groups training data into bins according to a distribution of a target attribute. For each bin, the system samples a subset of the training data based on a pre-determined target distribution of a set of additional attributes in the training data. The system identifies a direction in the sampled training data corresponding to the distribution of the target attribute to generate a filtering vector for modifying the target attribute in an input image, obtains a latent space representation of an input image, applies the filtering vector to the latent space representation of the input image to generate a filtered latent space representation of the input image, and provides the filtered latent space representation as input to a neural network to generate an output image with a modification to the target attribute.
Type: Application
Filed: September 7, 2021
Publication date: April 21, 2022
Inventors: Wei-An Lin, Baldo Faieta, Cameron Smith, Elya Shechtman, Jingwan Lu, Jun-Yan Zhu, Niloy Mitra, Ratheesh Kalarot, Richard Zhang, Shabnam Ghadar, Zhixin Shu
-
Publication number: 20220122221
Abstract: An improved system architecture uses a pipeline including a Generative Adversarial Network (GAN) including a generator neural network and a discriminator neural network to generate an image. An input image in a first domain and information about a target domain are obtained. The domains correspond to image styles. An initial latent space representation of the input image is produced by encoding the input image. An initial output image is generated by processing the initial latent space representation with the generator neural network. Using the discriminator neural network, a score is computed indicating whether the initial output image is in the target domain. A loss is computed based on the computed score. The loss is minimized to compute an updated latent space representation. The updated latent space representation is processed with the generator neural network to generate an output image in the target domain.
Type: Application
Filed: July 23, 2021
Publication date: April 21, 2022
Inventors: Cameron Smith, Ratheesh Kalarot, Wei-An Lin, Richard Zhang, Niloy Mitra, Elya Shechtman, Shabnam Ghadar, Zhixin Shu, Yannick Hold-Geoffrey, Nathan Carr, Jingwan Lu, Oliver Wang, Jun-Yan Zhu
-
Publication number: 20220122305
Abstract: An improved system architecture uses a pipeline including an encoder and a Generative Adversarial Network (GAN) including a generator neural network to generate edited images with improved speed, realism, and identity preservation. The encoder produces an initial latent space representation of an input image by encoding the input image. The generator neural network generates an initial output image by processing the initial latent space representation of the input image. The system generates an optimized latent space representation of the input image using a loss minimization technique that minimizes a loss between the input image and the initial output image. The loss is based on target perceptual features extracted from the input image and initial perceptual features extracted from the initial output image. The system outputs the optimized latent space representation of the input image for downstream use.
Type: Application
Filed: July 23, 2021
Publication date: April 21, 2022
Inventors: Cameron Smith, Ratheesh Kalarot, Wei-An Lin, Richard Zhang, Niloy Mitra, Elya Shechtman, Shabnam Ghadar, Zhixin Shu, Yannick Hold-Geoffrey, Nathan Carr, Jingwan Lu, Oliver Wang, Jun-Yan Zhu
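Here the loss compares perceptual features of the input image and the regenerated image, and the latent code is refined until they match. The sketch below uses tiny linear stand-ins for the generator and the feature extractor, and finite differences for the gradient; all functions and values are illustrative assumptions.

```python
def generator(z):
    # toy stand-in for the generator: latent code -> "image"
    return [z[0] + 0.2, 2.0 * z[1]]

def perceptual_features(img):
    # toy stand-in for a pretrained feature extractor
    return [img[0] + img[1], img[0] - img[1]]

def feature_loss(z, target_feats):
    # distance between features of the generated image and the target features
    f = perceptual_features(generator(z))
    return sum((a - b) ** 2 for a, b in zip(f, target_feats))

def refine_latent(z, target_feats, steps=300, lr=0.1, eps=1e-5):
    # minimize the perceptual loss over the latent code
    z = list(z)
    for _ in range(steps):
        grad = []
        for i in range(len(z)):
            zp = list(z)
            zp[i] += eps
            grad.append((feature_loss(zp, target_feats)
                         - feature_loss(z, target_feats)) / eps)
        z = [zi - lr * g for zi, g in zip(z, grad)]
    return z

input_image = [1.0, 0.6]
target = perceptual_features(input_image)  # target perceptual features
z_init = [0.0, 0.0]                        # encoder's initial latent (stand-in)
z_opt = refine_latent(z_init, target)      # optimized latent space representation
```

The encoder's role in the pipeline is to supply a good `z_init`, so the optimization needs far fewer steps than starting from scratch.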
-
Publication number: 20220122306
Abstract: Systems and methods dynamically adjust an available range for editing an attribute in an image. An image editing system computes a metric for an attribute in an input image as a function of a latent space representation of the input image and a filtering vector for editing the input image. The image editing system compares the metric to a threshold. If the metric exceeds the threshold, then the image editing system selects a first range for editing the attribute in the input image. If the metric does not exceed the threshold, a second range is selected. The image editing system causes display of a user interface for editing the input image comprising an interface element for editing the attribute within the selected range.
Type: Application
Filed: September 7, 2021
Publication date: April 21, 2022
Inventors: Wei-An Lin, Baldo Faieta, Cameron Smith, Elya Shechtman, Jingwan Lu, Jun-Yan Zhu, Niloy Mitra, Ratheesh Kalarot, Richard Zhang, Shabnam Ghadar, Zhixin Shu
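The control flow of this abstract is compact enough to state directly. In the sketch the metric is taken to be the projection of the latent code onto the filtering vector; that choice, the threshold, and both ranges are illustrative assumptions, not values from the publication.

```python
def edit_range(latent, filter_vec, threshold=0.5,
               wide=(-3.0, 3.0), narrow=(-1.0, 1.0)):
    # metric: projection of the latent code onto the filtering vector
    # (an assumed form of "a function of the latent representation and
    # the filtering vector")
    metric = sum(a * b for a, b in zip(latent, filter_vec))
    # first range if the metric exceeds the threshold, else second range;
    # the UI slider for the attribute would be bounded by the result
    return wide if metric > threshold else narrow
```

For example, a face already far along the attribute direction gets the wide range, while a neutral one gets the narrow range.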
-
Publication number: 20220121931
Abstract: Systems and methods train and apply a specialized encoder neural network for fast and accurate projection into the latent space of a Generative Adversarial Network (GAN). The specialized encoder neural network includes an input layer, a feature extraction layer, and a bottleneck layer positioned after the feature extraction layer. The projection process includes providing an input image to the encoder and producing, by the encoder, a latent space representation of the input image. Producing the latent space representation includes extracting a feature vector from the feature extraction layer, providing the feature vector to the bottleneck layer as input, and producing the latent space representation as output. The latent space representation produced by the encoder is provided as input to the GAN, which generates an output image based upon the latent space representation. The encoder is trained using specialized loss functions including a segmentation loss and a mean latent loss.Type: Application
Filed: July 23, 2021
Publication date: April 21, 2022
Inventors: Ratheesh Kalarot, Wei-An Lin, Cameron Smith, Zhixin Shu, Baldo Faieta, Shabnam Ghadar, Jingwan Lu, Aliakbar Darabi, Jun-Yan Zhu, Niloy Mitra, Richard Zhang, Elya Shechtman
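The data flow (input layer, feature extraction layer, then a bottleneck that compresses the feature vector into the latent code) can be shown with two fixed matrices standing in for the trained layers. The layer sizes and weights below are invented for illustration.

```python
def matvec(M, v):
    # multiply a weight matrix by a vector (one dense layer, no bias)
    return [sum(m * x for m, x in zip(row, v)) for row in M]

# toy encoder: input (3) -> feature extraction layer (4) -> bottleneck (2)
FEATURES = [[1.0, 0.0, 1.0],
            [0.0, 1.0, -1.0],
            [0.5, 0.5, 0.0],
            [1.0, 1.0, 1.0]]
BOTTLENECK = [[0.5, 0.0, 0.5, 0.0],
              [0.0, 0.5, 0.0, 0.5]]

def encode(image):
    feats = matvec(FEATURES, image)   # feature vector from the extraction layer
    return matvec(BOTTLENECK, feats)  # latent space representation (compressed)
```

The latent code returned by `encode` is what would be handed to the GAN's generator; the bottleneck after feature extraction is the structural detail this publication highlights.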
-
Publication number: 20220121932
Abstract: Systems and methods train an encoder neural network for fast and accurate projection into the latent space of a Generative Adversarial Network (GAN). The encoder is trained by providing an input training image to the encoder and producing, by the encoder, a latent space representation of the input training image. The latent space representation is provided as input to the GAN to generate a generated training image. A latent code is sampled from a latent space associated with the GAN and the sampled latent code is provided as input to the GAN. The GAN generates a synthetic training image based on the sampled latent code. The sampled latent code is provided as input to the encoder to produce a synthetic training code. The encoder is updated by minimizing a loss between the generated training image and the input training image, and the synthetic training code and the sampled latent code.
Type: Application
Filed: July 23, 2021
Publication date: April 21, 2022
Inventors: Ratheesh Kalarot, Wei-An Lin, Cameron Smith, Zhixin Shu, Baldo Faieta, Shabnam Ghadar, Jingwan Lu, Aliakbar Darabi, Jun-Yan Zhu, Niloy Mitra, Richard Zhang, Elya Shechtman
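The two-branch loss (reconstruct real images through generator-of-encoder, and recover sampled latent codes through encoder-of-generator) can be reduced to a 1-D caricature: a fixed scalar generator and a single learned encoder weight. Everything here (the linear models, learning rate, data) is an illustrative assumption; only the loss structure mirrors the abstract.

```python
def g(z):
    # fixed toy generator: latent -> "image"
    return 2.0 * z

def e(x, w):
    # toy encoder with one learnable weight w
    return w * x

def combined_loss(w, x, z):
    recon = (g(e(x, w)) - x) ** 2   # generated training image vs input image
    cycle = (e(g(z), w) - z) ** 2   # synthetic training code vs sampled code
    return recon + cycle

def train(w, data, steps=50, lr=0.02, eps=1e-6):
    # finite-difference gradient descent on the encoder weight
    for _ in range(steps):
        for x, z in data:
            grad = (combined_loss(w + eps, x, z) - combined_loss(w, x, z)) / eps
            w -= lr * grad
    return w

# (input image, sampled latent code) pairs; the two branches are independent
data = [(1.0, 0.5), (2.0, -1.0), (0.5, 1.5)]
w = train(0.0, data)   # learned encoder weight; ideal value inverts g, i.e. 0.5
```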
-
Publication number: 20220028139
Abstract: A method, apparatus, and non-transitory computer readable medium for image processing are described. Embodiments of the method, apparatus, and non-transitory computer readable medium include identifying an original image including a plurality of semantic attributes, wherein each of the semantic attributes represents a complex set of features of the original image; identifying a target attribute value that indicates a change to a target attribute of the semantic attributes; computing a modified feature vector based on the target attribute value, wherein the modified feature vector incorporates the change to the target attribute while holding at least one preserved attribute of the semantic attributes substantially unchanged; and generating a modified image based on the modified feature vector, wherein the modified image includes the change to the target attribute and retains the at least one preserved attribute from the original image.
Type: Application
Filed: July 21, 2020
Publication date: January 27, 2022
Inventors: Niloy Mitra, Peter Wonka, Rameen Abdal, Peihao Zhu
-
Patent number: 10867444
Abstract: This application relates generally to augmenting images and videos with dynamic object compositing, and more specifically, to generating synthetic training data to train a machine learning model to automatically augment an image or video with a dynamic object. The synthetic training data may contain multiple data points from thousands of simulated dynamic object movements within a virtual environment. Based on the synthetic training data, the machine learning model may determine the movement of a new dynamic object within a new virtual environment.
Type: Grant
Filed: November 29, 2018
Date of Patent: December 15, 2020
Assignee: Adobe Inc.
Inventors: Bryan Russell, Daniel Kaufman, Carlo Innamorati, Niloy Mitra
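The synthetic-data pipeline (simulate many object movements, collect data points, learn motion from them, apply to a new object) can be caricatured in one dimension with gravity as the only dynamics. The simulator, the (velocity, next velocity) data format, and the averaged "model" are all illustrative stand-ins for the patent's machine learning model.

```python
G = -9.8  # gravity in the toy 1-D "virtual environment"

def simulate(v0, steps=10, dt=0.1):
    # simulate one dynamic object movement, recording (height, velocity)
    y, v, traj = 0.0, v0, []
    for _ in range(steps):
        v += G * dt
        y += v * dt
        traj.append((y, v))
    return traj

# synthetic training data: (velocity, velocity at next step) data points
# drawn from several simulated movements
pairs = []
for v0 in [1.0, 3.0, 5.0, 7.0]:
    traj = simulate(v0)
    for (y0, va), (y1, vb) in zip(traj, traj[1:]):
        pairs.append((va, vb))

# "learn" the per-step velocity change from the synthetic data
delta = sum(b - a for a, b in pairs) / len(pairs)

# apply the learned model to a new dynamic object
v_new = 2.0
predicted_next_v = v_new + delta
```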
-
Publication number: 20200175759
Abstract: This application relates generally to augmenting images and videos with dynamic object compositing, and more specifically, to generating synthetic training data to train a machine learning model to automatically augment an image or video with a dynamic object. The synthetic training data may contain multiple data points from thousands of simulated dynamic object movements within a virtual environment. Based on the synthetic training data, the machine learning model may determine the movement of a new dynamic object within a new virtual environment.
Type: Application
Filed: November 29, 2018
Publication date: June 4, 2020
Inventors: Bryan Russell, Daniel Kaufman, Carlo Innamorati, Niloy Mitra
-
Patent number: 10380317
Abstract: Methods and systems for generating digital models from objects. In particular, one or more embodiments determine a plurality of correspondences for first and second components of an object. One or more embodiments estimate a joint connecting the first and second components based on the correspondences. One or more embodiments jointly determine a global transformation and one or more joint parameters that map the plurality of components of the object from the first digital scan to the second digital scan. One or more embodiments also update the correspondences based on the determined global transformation and parameter(s). One or more embodiments re-estimate the joint based on the updated correspondences. One or more embodiments select a candidate joint with the lowest error estimate from a plurality of candidate joints according to determined global transformations and joint parameter(s) for the candidate joints.
Type: Grant
Filed: March 7, 2016
Date of Patent: August 13, 2019
Assignee: ADOBE INC.
Inventors: Duygu Ceylan, Byungmoon Kim, Aron Monszpart, Vladimir Kim, Niloy Mitra
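The final step (evaluate candidate joints and keep the one with the lowest error) can be shown with a 2-D hinge: two "scans" of points related by a rotation about an unknown pivot, and a grid search over candidate pivots. The patent jointly estimates the transformation and joint parameters; this sketch simplifies by fixing the rotation angle as known, and all points and the grid are invented.

```python
import math

def rotate(p, pivot, theta):
    # rotate point p by theta around pivot
    dx, dy = p[0] - pivot[0], p[1] - pivot[1]
    c, s = math.cos(theta), math.sin(theta)
    return (pivot[0] + c * dx - s * dy, pivot[1] + s * dx + c * dy)

# scan 1: points on the moving component; scan 2: the same points after
# rotating 30 degrees about the true (unknown) hinge at (1, 0)
true_pivot, theta = (1.0, 0.0), math.radians(30)
scan1 = [(2.0, 0.0), (3.0, 0.5), (2.5, -0.5)]
scan2 = [rotate(p, true_pivot, theta) for p in scan1]

def joint_error(pivot):
    # error estimate for a candidate joint: residual between the second scan
    # and the first scan moved by this candidate's transformation
    moved = [rotate(p, pivot, theta) for p in scan1]
    return sum((a[0] - b[0]) ** 2 + (a[1] - b[1]) ** 2
               for a, b in zip(moved, scan2))

# evaluate candidate joints on a grid and select the lowest-error one
candidates = [(x / 2, y / 2) for x in range(-4, 8) for y in range(-4, 4)]
best = min(candidates, key=joint_error)
```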
-
Patent number: 10192355
Abstract: The systems and techniques disclosed herein provide tutorials for drawing three dimensional objects with accurate proportions and perspective. A user is able to select an object and a viewpoint to automatically generate a tutorial. Regardless of the object and viewpoint, an easy-to-use tutorial is produced that guides the user to draw the object with accurate proportions and perspective. Given a segmented 3D model of the object and a camera viewpoint, a sequence of steps for constructing the scaffold is determined. The sequence of steps is based on an intelligent selection of primitives and inter-primitive anchorings that provides an order for drawing the primitives and makes the scaffold easy to construct. The primitives and inter-primitive anchorings are selected from a rich set of possibilities that allow for some inaccuracies to reduce the difficulty of the tutorial. The primitives and inter-primitive anchorings are selected to balance the difficulty and the potential inaccuracy.
Type: Grant
Filed: January 16, 2017
Date of Patent: January 29, 2019
Assignee: ADOBE INC.
Inventors: Holger Winnemöller, Niloy Mitra, Lubomira Dontcheva, James Hennessey
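The ordering constraint behind the step sequence is that each primitive is anchored to one already on the page, which makes sequencing a dependency-ordering problem. The sketch below shows that ordering on an invented part list; the primitive names and anchorings are hypothetical, and the patent's actual selection also weighs difficulty against inaccuracy, which is not modeled here.

```python
# primitives of a segmented 3-D model and their anchorings: each primitive
# is anchored to a previously drawn primitive (None = anchored to the page)
anchors = {
    "body": None,
    "neck": "body",
    "head": "neck",
    "spout": "body",
    "lid": "head",
}

def drawing_order(anchors):
    # emit primitives in an order where every anchor is already drawn
    ordered = []
    remaining = dict(anchors)
    while remaining:
        for prim, anchor in list(remaining.items()):
            if anchor is None or anchor in ordered:
                ordered.append(prim)
                del remaining[prim]
    return ordered

steps = drawing_order(anchors)
```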
-
Publication number: 20180204376
Abstract: The systems and techniques disclosed herein provide tutorials for drawing three dimensional objects with accurate proportions and perspective. A user is able to select an object and a viewpoint to automatically generate a tutorial. Regardless of the object and viewpoint, an easy-to-use tutorial is produced that guides the user to draw the object with accurate proportions and perspective. Given a segmented 3D model of the object and a camera viewpoint, a sequence of steps for constructing the scaffold is determined. The sequence of steps is based on an intelligent selection of primitives and inter-primitive anchorings that provides an order for drawing the primitives and makes the scaffold easy to construct. The primitives and inter-primitive anchorings are selected from a rich set of possibilities that allow for some inaccuracies to reduce the difficulty of the tutorial. The primitives and inter-primitive anchorings are selected to balance the difficulty and the potential inaccuracy.
Type: Application
Filed: January 16, 2017
Publication date: July 19, 2018
Inventors: Holger Winnemöller, Niloy Mitra, Lubomira Dontcheva, James Hennessey
-
Publication number: 20170255712
Abstract: Methods and systems for generating digital models from objects. In particular, one or more embodiments determine a plurality of correspondences for first and second components of an object. One or more embodiments estimate a joint connecting the first and second components based on the correspondences. One or more embodiments jointly determine a global transformation and one or more joint parameters that map the plurality of components of the object from the first digital scan to the second digital scan. One or more embodiments also update the correspondences based on the determined global transformation and parameter(s). One or more embodiments re-estimate the joint based on the updated correspondences. One or more embodiments select a candidate joint with the lowest error estimate from a plurality of candidate joints according to determined global transformations and joint parameter(s) for the candidate joints.
Type: Application
Filed: March 7, 2016
Publication date: September 7, 2017
Inventors: Duygu Ceylan, Byungmoon Kim, Aron Monszpart, Vladimir Kim, Niloy Mitra