Patents by Inventor Orazio Gallo

Orazio Gallo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

IMAGE-BASED THREE-DIMENSIONAL OCCUPANT ASSESSMENT FOR IN-CABIN MONITORING SYSTEMS AND APPLICATIONS

Publication number: 20250022290

Abstract: In various examples, image-based three-dimensional occupant assessment for in-cabin monitoring systems and applications are provided. An evaluation function may determine a 3D representation of an occupant of a machine by evaluating sensor data comprising an image frame from an optical image sensor. The 3D representation may comprise at least one characteristic representative of a size of the occupant, (e.g., a 3D pose and/or 3D shape), which may be used to derive other characteristics such as, but not limited to weight, height, and/or age. A first processing path may generate a representation of one or more features corresponding to at least a portion of the occupant based on optical image data, and a second processing path may determine a depth corresponding to the one or more features based on depth data derived from the optical image data and ground truth depth data corresponding to the interior of the machine.

Type: Application

Filed: July 10, 2023

Publication date: January 16, 2025

Inventors: Sakthivel SIVARAMAN, Arjun Guru, Rajath Shetty, Umar Iqbal, Orazio Gallo, Hang Su, Abhishek Badki, Varsha Hedau
TECHNIQUES FOR AUTHENTICATING USERS DURING COMPUTER-MEDIATED INTERACTIONS

Publication number: 20240370536

Abstract: Techniques are disclosed herein for authenticating users. The techniques include generating a first fingerprint that represents one or more motions of a first avatar that is driven by a first user, and determining an identity of the first user based on the first fingerprint and a second fingerprint associated with the first user.

Type: Application

Filed: April 26, 2024

Publication date: November 7, 2024

Inventors: Ekta PRASHNANI, Orazio GALLO, Shalini DE MELLO, Koki NAGANO, David LUEBKE
MESH RECONSTRUCTION USING DATA-DRIVEN PRIORS

Publication number: 20240265561

Abstract: One embodiment of a method includes predicting one or more three-dimensional (3D) mesh representations based on a plurality of digital images, wherein the one or more 3D mesh representations are refined by minimizing at least one difference between the one or more 3D mesh representations and the plurality of digital images.

Type: Application

Filed: April 17, 2024

Publication date: August 8, 2024

Inventors: Orazio Gallo, Abhishek Badki
Mesh reconstruction using data-driven priors

Patent number: 11995854

Abstract: One embodiment of a method includes predicting one or more three-dimensional (3D) mesh representations based on a plurality of digital images, wherein the one or more 3D mesh representations are refined by minimizing at least one difference between the one or more 3D mesh representations and the plurality of digital images.

Type: Grant

Filed: December 19, 2018

Date of Patent: May 28, 2024

Assignee: NVIDIA Corporation

Inventors: Orazio Gallo, Abhishek Badki
POSE TRANSFER FOR THREE-DIMENSIONAL CHARACTERS USING A LEARNED SHAPE CODE

Publication number: 20240070987

Abstract: Transferring pose to three-dimensional characters is a common computer graphics task that typically involves transferring the pose of a reference avatar to a (stylized) three-dimensional character. Since three-dimensional characters are created by professional artists through imagination and exaggeration, and therefore, unlike human or animal avatars, have distinct shape and features, matching the pose of a three-dimensional character to that of a reference avatar generally requires manually creating shape information for the three-dimensional character that is required for pose transfer. The present disclosure provides for the automated transfer of a reference pose to a three-dimensional character, based specifically on a learned shape code for the three-dimensional character.

Type: Application

Filed: February 15, 2023

Publication date: February 29, 2024

Inventors: Xueting Li, Sifei Liu, Shalini De Mello, Orazio Gallo, Jiashun Wang, Jan Kautz
View synthesis using attribute correspondences and geometric relationship constraints

Patent number: 11783532

Abstract: A target image corresponding to a novel view may be synthesized from two source images, corresponding source camera poses, and pixel attribute correspondences between the two source images. A particular object in the target image need only be visible in one of the two source images for successful synthesis. Each pixel in the target image is defined according to an identified pixel in one of the two source images. The identified source pixel provides attributes such as color, texture, and feature descriptors for the target pixel. The source and target camera poses are used to define geometric relationships for identifying the source pixels. In an embodiment, the pixel attribute correspondences are optical flow that defines movement of attributes from a first image of the two source images to a second image of the two source images.

Type: Grant

Filed: December 22, 2021

Date of Patent: October 10, 2023

Assignee: NVIDIA Corporation

Inventors: Hang Su, Zitian Chen, Orazio Gallo
IMAGE STITCHING WITH AN ADAPTIVE THREE-DIMENSIONAL BOWL MODEL OF THE SURROUNDING ENVIRONMENT FOR SURROUND VIEW VISUALIZATION

Publication number: 20230316635

Abstract: In various examples, an environment surrounding an ego-object is visualized using an adaptive 3D bowl that models the environment with a shape that changes based on distance (and direction) to one or more representative point(s) on detected objects. Distance (and direction) to detected objects may be determined using 3D object detection or a top-down 2D or 3D occupancy grid, and used to adapt the shape of the adaptive 3D bowl in various ways (e.g., by sizing its ground plane to fit within the distance to the closest detected object, fitting a shape using an optimization algorithm). The adaptive 3D bowl may be enabled or disabled during each time slice (e.g., based on ego-speed), and the 3D bowl for each time slice may be used to render a visualization of the environment (e.g., a top-down projection image, a textured 3D bowl, and/or a rendered view thereof).

Type: Application

Filed: February 23, 2023

Publication date: October 5, 2023

Inventors: Hairong JIANG, Nuri Murat ARAR, Orazio GALLO, Jan KAUTZ, Ronan LETOQUIN
IMAGE STITCHING WITH DYNAMIC SEAM PLACEMENT BASED ON OBJECT SALIENCY FOR SURROUND VIEW VISUALIZATION

Publication number: 20230316458

Abstract: In various examples, dynamic seam placement is used to position seams in regions of overlapping image data to avoid crossing salient objects or regions. Objects may be detected from image frames representing overlapping views of an environment surrounding an ego-object such as a vehicle. The images may be aligned to create an aligned composite image or surface (e.g., a panorama, a 360° image, bowl shaped surface) with regions of overlapping image data, and a representation of the detected objects and/or salient regions (e.g., a saliency mask) may be generated and projected onto the aligned composite image or surface. Seams may be positioned in the overlapping regions to avoid or minimize crossing salient pixels represented in the projected masks, and the image data may be blended at the seams to create a stitched image or surface (e.g., a stitched panorama, stitched 360° image, stitched textured surface).

Type: Application

Filed: February 23, 2023

Publication date: October 5, 2023

Inventors: Yuzhuo REN, Kenneth TURKOWSKI, Nuri Murat ARAR, Orazio GALLO, Jan KAUTZ, Niranjan AVADHANAM, Hang SU
IMAGE STITCHING WITH DYNAMIC SEAM PLACEMENT BASED ON EGO-VEHICLE STATE FOR SURROUND VIEW VISUALIZATION

Publication number: 20230319218

Abstract: In various examples, a state machine is used to select between a default seam placement or dynamic seam placement that avoids salient regions, and to enable and disable dynamic seam placement based on speed of ego-motion, direction of ego-motion, proximity to salient objects, active viewport, driver gaze, and/or other factors. Images representing overlapping views of an environment may be aligned to create an aligned composite image or surface (e.g., a panorama, a 360° image, bowl shaped surface) with overlapping regions of image data, and a default or dynamic seam placement may be selected based on driving scenario (e.g., driving direction, speed, proximity to nearby objects). As such, seams may be positioned in the overlapping regions of image data, and the image data may be blended at the seams to create a stitched image or surface (e.g., a stitched panorama, stitched 360° image, stitched textured surface).

Type: Application

Filed: February 23, 2023

Publication date: October 5, 2023

Inventors: Yuzhuo REN, Nuri Murat ARAR, Orazio GALLO, Jan KAUTZ, Niranjan AVADHANAM, Hang SU
NOVEL VIEW SYNTHESIS USING ATTRIBUTE CORRESPONDENCES AND GEOMETRIC RELATIONSHIP CONSTRAINTS

Publication number: 20230306678

Abstract: A target image corresponding to a novel view may be synthesized from two source images, corresponding source camera poses, and pixel attribute correspondences between the two source images. A particular object in the target image need only be visible in one of the two source images for successful synthesis. Each pixel in the target image is defined according to an identified pixel in one of the two source images. The identified source pixel provides attributes such as color, texture, and feature descriptors for the target pixel. The source and target camera poses are used to define geometric relationships for identifying the source pixels. In an embodiment, the pixel attribute correspondences are optical flow that defines movement of attributes from a first image of the two source images to a second image of the two source images.

Type: Application

Filed: December 22, 2021

Publication date: September 28, 2023

Inventors: Hang Su, Zitian Chen, Orazio Gallo
VIEW GENERATION USING ONE OR MORE NEURAL NETWORKS

Publication number: 20230137403

Abstract: Apparatuses, systems, and techniques are presented to generate one or more images. In at least one embodiment, one or more neural networks are used to generate one or more images of one or more objects in two or more different poses from two or more different points of view.

Type: Application

Filed: October 29, 2021

Publication date: May 4, 2023

Inventors: Orazio Gallo, Umar Iqbal, Atsuhiro Noguchi
SINGLE-IMAGE INVERSE RENDERING

Publication number: 20230081641

Abstract: A single two-dimensional (2D) image can be used as input to obtain a three-dimensional (3D) representation of the 2D image. This is done by extracting features from the 2D image by an encoder and determining a 3D representation of the 2D image utilizing a trained 2D convolutional neural network (CNN). Volumetric rendering is then run on the 3D representation to combine features within one or more viewing directions, and the combined features are provided as input to a multilayer perceptron (MLP) that predicts and outputs color (or multi-dimensional neural features) and density values for each point within the 3D representation. As a result, single-image inverse rendering may be performed using only a single 2D image as input to create a corresponding 3D representation of the scene in the single 2D image.

Type: Application

Filed: December 14, 2021

Publication date: March 16, 2023

Inventors: Koki Nagano, Eric Ryan Chan, Sameh Khamis, Shalini De Mello, Tero Tapani Karras, Orazio Gallo, Jonathan Tremblay
OBJECT DETECTION AND COLLISION AVOIDANCE USING A NEURAL NETWORK

Publication number: 20220153262

Abstract: Apparatuses, systems, and techniques to identify objects in view of a camera associated with a vehicle. In at least one embodiment, objects with which a vehicle may collide are identified, based on, for example, a difference between a size of an image of the objects detected at a first point in time and a size of an image of the objects detected at a subsequent point in time.

Type: Application

Filed: November 19, 2020

Publication date: May 19, 2022

Inventors: Orazio Gallo, Abhishek Haridas Badki
Deep-learning method for separating reflection and transmission images visible at a semi-reflective surface in a computer image of a real-world scene

Patent number: 11270161

Abstract: When a computer image is generated from a real-world scene having a semi-reflective surface (e.g. window), the computer image will create, at the semi-reflective surface from the viewpoint of the camera, both a reflection of a scene in front of the semi-reflective surface and a transmission of a scene located behind the semi-reflective surface. Similar to a person viewing the real-world scene from different locations, angles, etc., the reflection and transmission may change, and also move relative to each other, as the viewpoint of the camera changes. Unfortunately, the dynamic nature of the reflection and transmission negatively impacts the performance of many computer applications, but performance can generally be improved if the reflection and transmission are separated. The present disclosure uses deep learning to separate reflection and transmission at a semi-reflective surface of a computer image generated from a real-world scene.

Type: Grant

Filed: July 8, 2020

Date of Patent: March 8, 2022

Assignee: NVIDIA Corporation

Inventors: Orazio Gallo, Jinwei Gu, Jan Kautz, Patrick Wieschollek
DEPTH ESTIMATION USING A NEURAL NETWORK

Publication number: 20210183088

Abstract: Apparatuses, systems, and techniques to identify object distance with one or more cameras. In at least one embodiment, one or more cameras capture at least two images, where one image is transformed to the other, and a neural network determines whether said object is in front of or behind a known distance, whereby an object's distance may be determined after a set of known distances are analyzed.

Type: Application

Filed: December 13, 2019

Publication date: June 17, 2021

Inventors: Orazio Gallo, Abhishek Badki, Alejandro Troccoli
GAZE DETERMINATION USING ONE OR MORE NEURAL NETWORKS

Publication number: 20210132688

Abstract: Apparatuses, systems, and techniques are presented to modify media content using inferred attention. In at least one embodiment, a network is trained to predict a gaze of one or more users on one or more image features based, at least in part, on one or more prior gazes of the one or more users, wherein the prediction is to be used to modify at least one of the one or more image features.

Type: Application

Filed: October 31, 2019

Publication date: May 6, 2021

Inventors: Joohwan Kim, Josef Spjut, Iuri Frosio, Orazio Gallo, Ekta Prashnani
Guided hallucination for missing image content using a neural network

Patent number: 10922793

Abstract: Missing image content is generated using a neural network. In an embodiment, a high resolution image and associated high resolution semantic label map are generated from a low resolution image and associated low resolution semantic label map. The input image/map pair (low resolution image and associated low resolution semantic label map) lacks detail and is therefore missing content. Rather than simply enhancing the input image/map pair, data missing in the input image/map pair is improvised or hallucinated by a neural network, creating plausible content while maintaining spatio-temporal consistency. Missing content is hallucinated to generate a detailed zoomed in portion of an image. Missing content is hallucinated to generate different variations of an image, such as different seasons or weather conditions for a driving video.

Type: Grant

Filed: March 14, 2019

Date of Patent: February 16, 2021

Assignee: NVIDIA Corporation

Inventors: Seung-Hwan Baek, Kihwan Kim, Jinwei Gu, Orazio Gallo, Alejandro Jose Troccoli, Ming-Yu Liu, Jan Kautz
DEEP-LEARNING METHOD FOR SEPARATING REFLECTION AND TRANSMISSION IMAGES VISIBLE AT A SEMI-REFLECTIVE SURFACE IN A COMPUTER IMAGE OF A REAL-WORLD SCENE

Publication number: 20200342263

Abstract: When a computer image is generated from a real-world scene having a semi-reflective surface (e.g. window), the computer image will create, at the semi-reflective surface from the viewpoint of the camera, both a reflection of a scene in front of the semi-reflective surface and a transmission of a scene located behind the semi-reflective surface. Similar to a person viewing the real-world scene from different locations, angles, etc., the reflection and transmission may change, and also move relative to each other, as the viewpoint of the camera changes. Unfortunately, the dynamic nature of the reflection and transmission negatively impacts the performance of many computer applications, but performance can generally be improved if the reflection and transmission are separated. The present disclosure uses deep learning to separate reflection and transmission at a semi-reflective surface of a computer image generated from a real-world scene.

Type: Application

Filed: July 8, 2020

Publication date: October 29, 2020

Inventors: Orazio Gallo, Jinwei Gu, Jan Kautz, Patrick Wieschollek
VIEW SYNTHESIS USING NEURAL NETWORKS

Publication number: 20200294194

Abstract: A video stitching system combines video from different cameras to form a panoramic video that, in various embodiments, is temporally stable and tolerant to strong parallax. In an embodiment, the system provides a smooth spatial interpolation that can be used to connect the input video images. In an embodiment, the system applies an interpolation layer to slices of the overlapping video sources, and the network learns a dense flow field to smoothly align the input videos with spatial interpolation. Various embodiments are applicable to areas such as virtual reality, immersive telepresence, autonomous driving, and video surveillance.

Type: Application

Filed: March 11, 2019

Publication date: September 17, 2020

Inventors: Deqing Sun, Orazio Gallo, Jan Kautz, Jinwei GU, Wei-Sheng Lai
Deep-learning method for separating reflection and transmission images visible at a semi-reflective surface in a computer image of a real-world scene

Patent number: 10762620

Abstract: When a computer image is generated from a real-world scene having a semi-reflective surface (e.g. window), the computer image will create, at the semi-reflective surface from the viewpoint of the camera, both a reflection of a scene in front of the semi-reflective surface and a transmission of a scene located behind the semi-reflective surface. Similar to a person viewing the real-world scene from different locations, angles, etc., the reflection and transmission may change, and also move relative to each other, as the viewpoint of the camera changes. Unfortunately, the dynamic nature of the reflection and transmission negatively impacts the performance of many computer applications, but performance can generally be improved if the reflection and transmission are separated. The present disclosure uses deep learning to separate reflection and transmission at a semi-reflective surface of a computer image generated from a real-world scene.

Type: Grant

Filed: November 26, 2018

Date of Patent: September 1, 2020

Assignee: NVIDIA Corporation

Inventors: Orazio Gallo, Jinwei Gu, Jan Kautz, Patrick Wieschollek

1 2 3 next