Patents by Inventor Sameh Khamis

Sameh Khamis has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12626445
    Abstract: Apparatuses, systems, and techniques to generate a surface. In at least one embodiment, one or more neural networks are used to generate a surface of an object based, at least in part, on motion of the object.
    Type: Grant
    Filed: January 27, 2022
    Date of Patent: May 12, 2026
    Assignee: NVIDIA Corporation
    Inventors: Sameh Khamis, Sourav Biswas, Kangxue Yin, Maria Shugrina, Sanja Fidler
  • Publication number: 20260112137
    Abstract: Generation of three-dimensional (3D) object models may be challenging for users without a sufficient skill set for content creation and may also be resource intensive. One or more style transfer networks may be combined with a generative network to generate objects based on parameters associated with a textual input. An input including a 3D mesh and texture may be provided to a trained system along with a textual input that includes parameters for object generation. Features of the input object may be identified and then tuned in accordance with the textual input to generate a modified 3D object that includes a new texture along with one or more geometric adjustments.
    Type: Application
    Filed: September 15, 2025
    Publication date: April 23, 2026
    Inventors: Kangxue Yin, Huan Ling, Masha Shugrina, Sameh Khamis, Sanja Fidler
  • Patent number: 12592010
    Abstract: Apparatuses, systems, and techniques are presented to generate image data. In at least one embodiment, one or more neural networks are used to cause a lighting effect to be applied to one or more objects within one or more images based, at least in part, on synthetically generated images of the one or more objects.
    Type: Grant
    Filed: July 12, 2022
    Date of Patent: March 31, 2026
    Assignee: NVIDIA Corporation
    Inventors: Ting-Chun Wang, Ming-Yu Liu, Koki Nagano, Sameh Khamis, Jan Kautz
  • Patent number: 12536733
    Abstract: A single two-dimensional (2D) image can be used as input to obtain a three-dimensional (3D) representation of the 2D image. This is done by extracting features from the 2D image by an encoder and determining a 3D representation of the 2D image utilizing a trained 2D convolutional neural network (CNN). Volumetric rendering is then run on the 3D representation to combine features within one or more viewing directions, and the combined features are provided as input to a multilayer perceptron (MLP) that predicts and outputs color (or multi-dimensional neural features) and density values for each point within the 3D representation. As a result, single-image inverse rendering may be performed using only a single 2D image as input to create a corresponding 3D representation of the scene in the single 2D image.
    Type: Grant
    Filed: December 14, 2021
    Date of Patent: January 27, 2026
    Assignee: NVIDIA CORPORATION
    Inventors: Koki Nagano, Eric Ryan Chan, Sameh Khamis, Shalini De Mello, Tero Tapani Karras, Orazio Gallo, Jonathan Tremblay
  • Publication number: 20250378632
    Abstract: Embodiments of the present disclosure provide techniques for performing virtual object placement in a video sequence using generative artificial intelligence models. An example method generally includes receiving an input prompt specifying an object to insert into a scene depicted in an input image stream; decoding, using a generative artificial intelligence model, perspective and lighting information for the input image stream; determining, based on the decoded perspective and lighting information, a location in the scene in which the object is to be inserted; and generating, using the generative artificial intelligence model, an output image stream including the object into the scene at the determined location, wherein visual effects for the object are based on the perspective and lighting information for the input image stream.
    Type: Application
    Filed: June 4, 2025
    Publication date: December 11, 2025
    Inventors: Sameh KHAMIS, Abdelrahman Samir Abdelrahman MOHAMED, Ahmed Aly Saad AHMED
  • Patent number: 12417602
    Abstract: Generation of three-dimensional (3D) object models may be challenging for users without a sufficient skill set for content creation and may also be resource intensive. One or more style transfer networks may be combined with a generative network to generate objects based on parameters associated with a textual input. An input including a 3D mesh and texture may be provided to a trained system along with a textual input that includes parameters for object generation. Features of the input object may be identified and then tuned in accordance with the textual input to generate a modified 3D object that includes a new texture along with one or more geometric adjustments.
    Type: Grant
    Filed: February 27, 2023
    Date of Patent: September 16, 2025
    Assignee: Nvidia Corporation
    Inventors: Kangxue Yin, Huan Ling, Masha Shugrina, Sameh Khamis, Sanja Fidler
  • Publication number: 20250200866
    Abstract: In various examples, information may be received for a 3D model, such as 3D geometry information, lighting information, and material information. A machine learning model may be trained to disentangle the 3D geometry information, the lighting information, and/or material information from input data to provide the information, which may be used to project geometry of the 3D model onto an image plane to generate a mapping between pixels and portions of the 3D model. Rasterization may then use the mapping to determine which pixels are covered and in what manner, by the geometry. The mapping may also be used to compute radiance for points corresponding to the one or more 3D models using light transport simulation. Disclosed approaches may be used in various applications, such as image editing, 3D model editing, synthetic data generation, and/or data set augmentation.
    Type: Application
    Filed: March 3, 2025
    Publication date: June 19, 2025
    Inventors: Wenzheng Chen, Joey Litalien, Jun Gao, Zian Wang, Clement Tse Tsian Christophe Louis Fuji Tsang, Sameh Khamis, Or Litany, Sanja Fidler
  • Patent number: 12316825
    Abstract: An electronic device estimates a depth map of an environment based on matching reduced-resolution stereo depth images captured by depth cameras to generate a coarse disparity (depth) map. The electronic device downsamples depth images captured by the depth cameras and matches sections of the reduced-resolution images to each other to generate a coarse depth map. The electronic device upsamples the coarse depth map to a higher resolution and refines the upsampled depth map to generate a high-resolution depth map to support location-based functionality.
    Type: Grant
    Filed: February 17, 2023
    Date of Patent: May 27, 2025
    Assignee: Google LLC
    Inventors: Sameh Khamis, Yinda Zhang, Christoph Rhemann, Julien Valentin, Adarsh Kowdle, Vladimir Tankovich, Michael Schoenberg, Shahram Izadi, Thomas Funkhouser, Sean Fanello
  • Publication number: 20250086896
    Abstract: In various examples, systems and methods are disclosed relating to neural networks for three-dimensional (3D) scene representations and modifying the 3D scene representations. In some implementations, a diffusion model can be configured to modify selected portions of 3D scenes represented using neural radiance fields, without painting back in content of the selected portions that was originally present. A first view of the neural radiance fields can be inpainted to remove a target feature from the first view, and used as guidance for updating the neural radiance field so that the target feature can be realistically removed from various second views of the neural radiance fields while context is retained outside of the selected portions.
    Type: Application
    Filed: September 12, 2023
    Publication date: March 13, 2025
    Applicant: NVIDIA Corporation
    Inventors: Or LITANY, Sanja FIDLER, Cho-Ying WU, Huan LING, Zan GOJCIC, Riccardo DE LUTIO, Sameh KHAMIS
  • Publication number: 20250086905
    Abstract: The present invention sets forth a technique for performing virtual object placement in a video sequence. The technique includes identifying a planar surface depicted in an input video sequence and selecting a virtual object included in an object library. The technique also includes generating, for a combination of the planar surface and the virtual object, a suitability metric associated with the combination, wherein the suitability metric is based at least on a semantic compatibility between the virtual object and the planar surface. The technique further includes generating, via one or more machine learning models, a modified video sequence based on the suitability metric, where the modified video sequence depicts the virtual object placed on the planar surface.
    Type: Application
    Filed: September 11, 2024
    Publication date: March 13, 2025
    Inventors: Abdelrahman Samir Abdelrahman MOHAMED, Sameh KHAMIS, Ahmed Aly Saad AHMED, David Abraham WIENER
  • Patent number: 12243152
    Abstract: In various examples, information may be received for a 3D model, such as 3D geometry information, lighting information, and material information. A machine learning model may be trained to disentangle the 3D geometry information, the lighting information, and/or material information from input data to provide the information, which may be used to project geometry of the 3D model onto an image plane to generate a mapping between pixels and portions of the 3D model. Rasterization may then use the mapping to determine which pixels are covered and in what manner, by the geometry. The mapping may also be used to compute radiance for points corresponding to the one or more 3D models using light transport simulation. Disclosed approaches may be used in various applications, such as image editing, 3D model editing, synthetic data generation, and/or data set augmentation.
    Type: Grant
    Filed: February 14, 2024
    Date of Patent: March 4, 2025
    Assignee: NVIDIA Corporation
    Inventors: Wenzheng Chen, Joey Litalien, Jun Gao, Zian Wang, Clement Tse Tsian Christophe Louis Fuji Tsang, Sameh Khamis, Or Litany, Sanja Fidler
  • Publication number: 20250029351
    Abstract: Generation of three-dimensional (3D) object models may be challenging for users without a sufficient skill set for content creation and may also be resource intensive. One or more style transfer networks may be used for part-aware style transformation of both geometric features and textural components of a source asset to a target asset. The source asset may be segmented into particular parts and then ellipsoid approximations may be warped according to correspondence of the particular parts to the target assets. Moreover, a texture associated with the target asset may be used to warp or adjust a source texture, where the new texture can be applied to the warped parts.
    Type: Application
    Filed: October 3, 2024
    Publication date: January 23, 2025
    Inventors: Kangxue Yin, Jun Gao, Masha Shugrina, Sameh Khamis, Sanja Fidler
  • Publication number: 20240371096
    Abstract: Approaches presented herein provide systems and methods for disentangling identity from expression input models. One or more machine learning systems may be trained directly from three-dimensional (3D) points to develop unique latent codes for expressions associated with different identities. These codes may then be mapped to different identities to independently model an object, such as a face, to generate a new mesh including an expression for an independent identity. A pipeline may include a set of machine learning systems to determine model parameters and also adjust input expression codes using gradient backpropagation in order train models for incorporation into a content development pipeline.
    Type: Application
    Filed: May 4, 2023
    Publication date: November 7, 2024
    Inventors: Sameh Khamis, Koki Nagano, Jan Kautz, Sanja Fidler
  • Publication number: 20240355067
    Abstract: One embodiment of the present invention sets forth a technique for estimating a real-world size of an object included in an input scene. The technique includes identifying one or more depictions of human faces included in a two-dimensional input scene and generating one or more bounding boxes associated with the input scene, where each bounding box represents a head size associated with a different one of the one or more depictions of human faces. The technique also includes calculating a relative depth value for each of one or more pixels included in the input scene. The technique further includes calculating an average relative head size based on the one or more bounding boxes and relative depth values associated with the one or more pixels and generating a depth scale based on the average relative head size and a known real-world dimension of an average human head.
    Type: Application
    Filed: April 24, 2024
    Publication date: October 24, 2024
    Inventors: Ahmed Aly Saad AHMED, Sameh Khamis, Abdelrahman Samir Abdelrahman Mohamed
  • Publication number: 20240354993
    Abstract: One embodiment of the present invention sets forth a technique for performing estimation of scene parameters associated with a two-dimensional (2D) input scene. technique includes identifying, based on the input scene, one or more line segments included in the input scene and generating one or more vanishing points associated with the input scene based on the one or more line segments. The technique also includes estimating, based on the one or more vanishing points, one or more scene parameters associated with the scene and inserting a world object into the input scene based on the one or more scene parameters.
    Type: Application
    Filed: April 24, 2024
    Publication date: October 24, 2024
    Inventors: Ahmed Aly Saad AHMED, Sameh Khamis, Abdelrahman Samir Abdelrahman Mohamed
  • Patent number: 12112445
    Abstract: Generation of three-dimensional (3D) object models may be challenging for users without a sufficient skill set for content creation and may also be resource intensive. One or more style transfer networks may be used for part-aware style transformation of both geometric features and textural components of a source asset to a target asset. The source asset may be segmented into particular parts and then ellipsoid approximations may be warped according to correspondence of the particular parts to the target assets. Moreover, a texture associated with the target asset may be used to warp or adjust a source texture, where the new texture can be applied to the warped parts.
    Type: Grant
    Filed: September 7, 2021
    Date of Patent: October 8, 2024
    Assignee: Nvidia Corporation
    Inventors: Kangxue Yin, Jun Gao, Masha Shugrina, Sameh Khamis, Sanja Fidler
  • Publication number: 20240290054
    Abstract: Generation of three-dimensional (3D) object models may be challenging for users without a sufficient skill set for content creation and may also be resource intensive. One or more style transfer networks may be combined with a generative network to generate objects based on parameters associated with a textual input. An input including a 3D mesh and texture may be provided to a trained system along with a textual input that includes parameters for object generation. Features of the input object may be identified and then tuned in accordance with the textual input to generate a modified 3D object that includes a new texture along with one or more geometric adjustments.
    Type: Application
    Filed: February 27, 2023
    Publication date: August 29, 2024
    Inventors: Kangxue Yin, Huan Ling, Masha Shugrina, Sameh Khamis, Sanja Fidler
  • Publication number: 20240185506
    Abstract: In various examples, information may be received for a 3D model, such as 3D geometry information, lighting information, and material information. A machine learning model may be trained to disentangle the 3D geometry information, the lighting information, and/or material information from input data to provide the information, which may be used to project geometry of the 3D model onto an image plane to generate a mapping between pixels and portions of the 3D model. Rasterization may then use the mapping to determine which pixels are covered and in what manner, by the geometry. The mapping may also be used to compute radiance for points corresponding to the one or more 3D models using light transport simulation. Disclosed approaches may be used in various applications, such as image editing, 3D model editing, synthetic data generation, and/or data set augmentation.
    Type: Application
    Filed: February 14, 2024
    Publication date: June 6, 2024
    Inventors: Wenzheng Chen, Joey Litalien, Jun Gao, Zian Wang, Clement Tse Tsian Christophe Louis Fuji Tsang, Sameh Khamis, Or Litany, Sanja Fidler
  • Publication number: 20240153188
    Abstract: In various examples, systems and methods are disclosed relating to generating physics-plausible whole body motion, including determining a mesh sequence corresponding to a motion of at least one dynamic character of one or more dynamic characters and a mesh of a terrain using a video sequence, determining using a generative model and based at least one the mesh sequence and the mesh of the terrain, an occlusion-free motion of the at least one dynamic character by infilling physics-plausible character motions in the mesh sequence for at least one frame of the video sequence that includes an occlusion of at least a portion of the at least one dynamic character, and determining physics-plausible whole body motion of the at least one dynamic character by applying physics-based imitation upon the occlusion-free motion.
    Type: Application
    Filed: August 24, 2023
    Publication date: May 9, 2024
    Applicant: NVIDIA Corporation
    Inventors: Jingbo WANG, Ye YUAN, Cheng XIE, Sanja FIDLER, Jan KAUTZ, Umar IQBAL, Zan GOJCIC, Sameh KHAMIS
  • Publication number: 20240104842
    Abstract: A method for generating, by an encoder-based model, a three-dimensional (3D) representation of a two-dimensional (2D) image is provided. The encoder-based model is trained to infer the 3D representation using a synthetic training data set generated by a pre-trained model. The pre-trained model is a 3D generative model that produces a 3D representation and a corresponding 2D rendering, which can be used to train a separate encoder-based model for downstream tasks like estimating a triplane representation, neural radiance field, mesh, depth map, 3D key points, or the like, given a single input image, using the pseudo ground truth 3D synthetic training data set. In a particular embodiment, the encoder-based model is trained to predict a triplane representation of the input image, which can then be rendered by a volume renderer according to pose information to generate an output image of the 3D scene from the corresponding viewpoint.
    Type: Application
    Filed: September 22, 2023
    Publication date: March 28, 2024
    Inventors: Koki Nagano, Alexander Trevithick, Chao Liu, Eric Ryan Chan, Sameh Khamis, Michael Stengel, Zhiding Yu