Patents by Inventor Jérôme Revaud

Jérôme Revaud has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

METHODS AND SYSTEMS FOR GENERATING 3D REPRESENTATIONS OF SCENES FROM A PLURALITY OF IMAGES USING POINTMAPS

Publication number: 20250209740

Abstract: A computer-implemented method for reconstructing a scene in three dimensions from a plurality of images of one or more viewpoints of the scene acquired using an imaging device includes: receiving the plurality of images without receiving extrinsic or intrinsic properties of the imaging device; and processing the plurality of images using a neural network to produce a plurality of pointmaps of the scene that correspond to the plurality of images and that are aligned in a common coordinate frame, where each pointmap is a one-to-one mapping between pixels of one of the plurality of images and three-dimensional points of the scene.

Type: Application

Filed: December 11, 2024

Publication date: June 26, 2025

Applicant: NAVER CORPORATION

Inventors: Jérome REVAUD, Vincent LEROY, Yohann CABON, Boris CHIDLOVSKI, Shuzhe WANG, Philippe WEINZAEPFEL, Lojze ZUST, Bardienus Pieter DUISTERHOF
SYSTEMS AND METHODS FOR MULTI WINDOW TRAINING OF VISION MODELS

Publication number: 20250111660

Abstract: A training system includes: a transformer module having the transformer architecture and configured to perform a vision task; and a training module configured to: receive a training image having a predetermined resolution; determine N windows of tokens of pixels in the training image and mask the tokens of all of the other pixels of the training image that are outside of the N windows, where N is an integer greater than or equal to 2; input the N windows of tokens to the transformer module; train the transformer module based on an output of the transformer module generated based on the N windows of tokens; and test the transformer module using a test image having the predetermined resolution.

Type: Application

Filed: August 8, 2024

Publication date: April 3, 2025

Applicant: NAVER CORPORATION

Inventors: Vincent LEROY, Philippe Weinzaepfel, Thomas Lucas, Jèrome Revaud
SYSTEM AND METHOD FOR MODEL-FREE, ONE-SHOT OBJECT POSE ESTIMATION VIA COORDINATE REGRESSION

Publication number: 20240404104

Abstract: A computer implemented method and system using an object-agnostic model for predicting a pose of an object in an image receives a query image having a target object therein; receives a set of reference images of the target object from different viewpoints; encodes, using a vision transformer, the received query image and the received set of reference images to generate a set of token features for the received query image and a set of token features for the received set of reference images; extracts, using a transformer decoder, information from the set of token features for the encoded reference images with respect to a set of token features for the received query image; processes, using a prediction head, the combined set of token features to generate a 2D-3D mapping and a confidence map of the query image; and processes the 2D-3D mapping and confidence map to determine the pose of the target object in the query image.

Type: Application

Filed: April 25, 2024

Publication date: December 5, 2024

Applicant: Naver Labs Corporation

Inventors: Jérome Revaud, Romain Brégier, Yohann Cabon, Philippe Weinzaepfel, JongMin Lee
UNSUPERVISED PRE-TRAINING OF GEOMETRIC VISION MODELS

Publication number: 20240144019

Abstract: A training system includes: a model; and a training module configured to: construct a first pair of images of at least a first portion of a first human captured at different times; construct a second pair of images of at least a second portion of a second human captured at the same time from different points of view; input the first and second pairs of images to the model; the model configured to: generate first and second reconstructed images of the at least the first portion of the first human based on the first and second pairs, respectively, and the training module is configured to selectively adjust one or more parameters of the model based on: the first reconstructed image and the second reconstructed image.

Type: Application

Filed: August 29, 2023

Publication date: May 2, 2024

Applicants: NAVER CORPORATION, NAVER LABS CORPORATION

Inventors: Philippe WEINZAEPFEL, Vincent Leroy, Romain Brègier, Yohann Cabon, Thomas Lucas, Leonid Antsfield, Boris Chidlovskii, Gabriela Csurka Khedari, Jèrôme Revaud, Matthieu Armando, Fabien Baradel, Salma Galaaoui, Gregory Rogez
UNSUPERVISED PRE-TRAINING OF GEOMETRIC VISION MODELS

Publication number: 20240135695

Abstract: A method includes: performing unsupervised pre-training of a model, the model including and a decoder including: obtaining a first image and a second image under different conditions or from different viewpoints; encoding, by the encoder, the first image into a representation of the first image and the second image into a representation of the second image; transforming the representation of the first image into a transformed representation; decoding, by the decoder, the transformed representation into a reconstructed image, where the transforming of the representation of the first image and the decoding of the transformed representation is based on the representation of the first image and the representation of the second image; and adjusting one or more parameters of at least one of the encoder and the decoder based on minimizing a loss; and fine-tuning the model, initialized with a set of task specific encoder parameters, for a geometric vision task.

Type: Application

Filed: August 4, 2023

Publication date: April 25, 2024

Applicants: NAVER CORPORATION, NAVER LABS CORPORATION

Inventors: Romain BRÉGIER, Yohann CABON, Thomas LUCAS, Jérôme REVAUD, Philippe WEINZAEPFEL, Boris CHIDLOVSKII, Vincent LEROY, Leonid ANTSFELD, Gabriela CSURKA KHEDARI
SYSTEM AND METHOD USING PYRAMIDAL AND UNIQUENESS MATCHING PRIORS FOR IDENTIFYING CORRESPONDENCES BETWEEN IMAGES

Publication number: 20230306718

Abstract: A computer-implemented method includes: obtaining a pair of images depicting a same scene, the pair of images including a first image with a first pixel grid and a second image with a second pixel grid, the first pixel grid different than the second pixel grid; by a neural network module having a first set of parameters: generating a first feature map based on the first image; and generating a second feature map based on the second image; determining a first correlation volume based on the first and second feature maps; iteratively determining a second correlation volume based on the first correlation volume; determining a loss for the first and second feature maps based on the second correlation volume; generating a second set of the parameters based on minimizing a loss function using the loss; and updating the neural network module to include the second set of parameters.

Type: Application

Filed: January 30, 2023

Publication date: September 28, 2023

Applicant: NAVER CORPORATION

Inventors: Jérome REVAUD, Vincent LEROY, Philippe WEINZAEPFEL, Boris CHIDLOVSKII
Untrained systems and methods for vehicle speed estimation

Patent number: 11715215

Abstract: A speed estimation system includes: a detection module configured to determine bounding boxes of an object moving on a surface in images, respectively, captured using a camera; a solver module configured to, based on the bounding boxes, determine a homography of the surface by solving an optimization problem, where the solver module is not trained; and a speed module configured to, using the homography, determine a speed that the object is moving on the surface.

Type: Grant

Filed: July 1, 2021

Date of Patent: August 1, 2023

Assignee: NAVER CORPORATION

Inventors: Jérome Revaud, Yohann Cabon
METHOD AND SYSTEM FOR MACHINE LEARNING FROM IMBALANCED DATA WITH NOISY LABELS

Publication number: 20230169332

Abstract: A computer-implemented method for training an artificial neural network with training data including samples and corresponding labels for performing a task includes: pre-training the artificial neural network to generate matrix representations that are invariant to a predetermined set of data augmentations applied to a sample, where the artificial neural network includes an encoder module and a projection module configured to generate the matrix representations based on ones of the samples, respectively; and after the pre-training, fine-tune training the artificial neural network using a loss function, wherein fine-tuning the artificial neural network includes adjusting, based on the labels, one or more weights of the projection module while maintaining constant weights of the encoder module, and where the loss function is based on a logit adjustment loss that is based on logits that are adjusted based on a class distribution of the training data.

Type: Application

Filed: June 1, 2022

Publication date: June 1, 2023

Applicant: NAVER CORPORATION

Inventors: Shyamgopal Karthik, Jérome Revaud, Boris Chidlovskii
UNTRAINED SYSTEMS AND METHODS FOR VEHICLE SPEED ESTIMATION

Publication number: 20230032420

Abstract: A speed estimation system includes: a detection module configured to determine bounding boxes of an object moving on a surface in images, respectively, captured using a camera; a solver module configured to, based on the bounding boxes, determine a homography of the surface by solving an optimization problem, where the solver module is not trained; and a speed module configured to, using the homography, determine a speed that the object is moving on the surface.

Type: Application

Filed: July 1, 2021

Publication date: February 2, 2023

Applicant: NAVER CORPORATION

Inventors: Jérome REVAUD, Yohann CABON
VEHICLE SPEED ESTIMATION SYSTEMS AND METHODS

Publication number: 20230019731

Abstract: A speed estimation system includes: a detection module configured to: detect an object on a surface in an image captured using a camera; and generate a bounding box around the object; a Jacobian module configured to generate a Jacobian for the object based on the bounding box; and a speed module configured to determine a speed that the object is traveling on the surface based on the Jacobian.

Type: Application

Filed: July 1, 2021

Publication date: January 19, 2023

Applicant: NAVER CORPORATION

Inventor: Jérome REVAUD
Training a convolutional neural network for image retrieval with a listwise ranking loss function

Patent number: 11521072

Abstract: A method of performing image retrieval includes: obtaining a query image; generating a global feature descriptor of the query image by inputting the query image into a convolutional neural network (CNN) and obtaining the global feature descriptor as an output of the CNN, where parameters of the CNN are learned during training of the CNN on a batch of training images using a listwise ranking loss function and optimizing a quantized mean average precision ranking evaluation metric; determining similarities between the query image and other images based on distances between the global feature descriptor of the query image and global feature descriptors of the other images, respectively; ranking the other images based on the similarities, respectively; and selecting a set of the other images based on the similarities between the query image and the other images.

Type: Grant

Filed: February 11, 2020

Date of Patent: December 6, 2022

Assignee: NAVER CORPORATION

Inventors: Jérome Revaud, Jon Almazan, Cesar De Souza, Rafael Sampaio De Rezende
SPEED ESTIMATION SYSTEMS AND METHODS WITHOUT CAMERA CALIBRATION

Publication number: 20220245831

Abstract: A speed estimation system includes: a detection module having a neural network configured to: receive a time series of images, the images including a surface having a local geometry; detect an object in the time series of images on the surface; determine pixel coordinates of the object in the time series of images, respectively; determine bounding boxes around the object in the time series of images, respectively; determine local mappings, which are not a function of global parameters describing the local geometry of the surface, between pixel coordinates and distance coordinates for the time series of images based on the bounding boxes around the object in the time series of images, respectively; and a speed module configured to determine a speed of the object traveling relative to the surface based on the distance coordinates determined for the time series of images.

Type: Application

Filed: January 29, 2021

Publication date: August 4, 2022

Applicant: NAVER CORPORATION

Inventors: Jérome REVAUD, Yohann CABON, Julien MORAT
SUPERLOSS: A GENERIC LOSS FOR ROBUST CURRICULUM LEARNING

Publication number: 20220114444

Abstract: A computer-implemented method for training a neural network to perform a data processing task includes: for each data sample of a set of labeled data samples: by a first loss function for the data processing task, computing a first loss for that data sample; and by a second loss function, automatically computing a weight value for the data sample based on the first loss, the weight value indicative of a reliability of a label of the data sample predicted by the neural network for the data sample and dictating the extent to which that data sample impacts training of the neural network; and training the neural network with the set of labelled data samples according to their respective weight value.

Type: Application

Filed: July 23, 2021

Publication date: April 14, 2022

Applicant: NAVER CORPORATION

Inventors: Philippe WEINZAEPFEL, Jérome REVAUD, Thibault CASTELLS
Joint detection and description systems and methods

Patent number: 11176425

Abstract: A system for detecting and describing keypoints in images is described. A camera is configured to capture an image including a plurality of pixels. A fully convolutional network is configured to jointly and concurrently: generate descriptors for each of the pixels, respectively; generate reliability scores for each of the pixels, respectively; and generate repeatability scores for each of the pixels, respectively. A scoring module is configured to generate scores for the pixels, respectively, based on the reliability scores and the repeatability scores of the pixels, respectively. A keypoint list module is configured to: select X of the pixels having the X highest scores, where X is an integer greater than 1; and generate a keypoint list including: locations of the selected X pixels; and the descriptors of the selected X pixels.

Type: Grant

Filed: December 11, 2019

Date of Patent: November 16, 2021

Assignees: NAVER CORPORATION, NAVER LABS CORPORATION

Inventors: Jérome Revaud, Cesar De Souza, Martin Humenberger, Philippe Weinzaepfel
Systems and methods for detecting a point of interest change using a convolutional neural network

Patent number: 11055569

Abstract: A method for detecting a point of interest (POI) change in a pair of inputted POI images. A first processor of the method: generates triplets of training POI images using a base of training POI images and trains a convolutional neural network (CNN) of three-stream Siamese type based on the triplets of training POI images. A second processor of the method: computes, for each image of the pair of inputted POI images, a descriptor of that image using a stream of the CNN of three-stream Siamese type, computes a similarity score based on the descriptors of the images of the pair of inputted POI images using a similarity score function, and selectively detects the POI change based on the similarity score. A third processor of the method generates the base of training POI images to include an initial set of POI images and a set of synthetic POI images.

Type: Grant

Filed: September 19, 2019

Date of Patent: July 6, 2021

Assignee: NAVER CORPORATION

Inventors: Jérôme Revaud, Rafael Sampaio De Rezende
JOINT DETECTION AND DESCRIPTION SYSTEMS AND METHODS

Publication number: 20210182626

Abstract: A system for detecting and describing keypoints in images is described. A camera is configured to capture an image including a plurality of pixels. A fully convolutional network is configured to jointly and concurrently: generate descriptors for each of the pixels, respectively; generate reliability scores for each of the pixels, respectively; and generate repeatability scores for each of the pixels, respectively. A scoring module is configured to generate scores for the pixels, respectively, based on the reliability scores and the repeatability scores of the pixels, respectively. A keypoint list module is configured to: select X of the pixels having the X highest scores, where X is an integer greater than 1; and generate a keypoint list including: locations of the selected X pixels; and the descriptors of the selected X pixels.

Type: Application

Filed: December 11, 2019

Publication date: June 17, 2021

Applicants: Naver Corporation, Naver Labs Corporation

Inventors: Jérome Revaud, Cesar De Souza, Martin Humenberger, Philippe Weinzaepfel
System and method for training a convolutional neural network and classifying an action performed by a subject in a video using the trained convolutional neural network

Patent number: 10867184

Abstract: A method for training a convolutional neural network for classification of actions performed by subjects in a video is realized by (a) for each frame of the video, for each key point of the subject, generating a heat map of the key point representing a position estimation of the key point within the frame; (b) colorizing each heat map as a function of the relative time of the corresponding frame in the video; (c) for each key point, aggregating all the colorized heat maps of the key point into at least one image representing the evolution of the position estimation of the key point during the video; and training the convolutional neural network using as input the sets associated to each training video of images representing the evolution of the position estimation of each key point during the video.

Type: Grant

Filed: March 20, 2019

Date of Patent: December 15, 2020

Inventors: Vasileios Choutas, Philippe Weinzaepfel, Jérôme Revaud, Cordelia Schmid
TRAINING A CONVOLUTIONAL NEURAL NETWORK FOR IMAGE RETRIEVAL WITH A LISTWISE RANKING LOSS FUNCTION

Publication number: 20200342328

Abstract: A method of performing image retrieval includes: obtaining a query image; generating a global feature descriptor of the query image by inputting the query image into a convolutional neural network (CNN) and obtaining the global feature descriptor as an output of the CNN, where parameters of the CNN are learned during training of the CNN on a batch of training images using a listwise ranking loss function and optimizing a quantized mean average precision ranking evaluation metric; determining similarities between the query image and other images based on distances between the global feature descriptor of the query image and global feature descriptors of the other images, respectively; ranking the other images based on the similarities, respectively; and selecting a set of the other images based on the similarities between the query image and the other images.

Type: Application

Filed: February 11, 2020

Publication date: October 29, 2020

Applicant: NAVER CORPORATION

Inventors: Jérome REVAUD, Jon ALMAZAN, Cesar DE SOUZA, Rafael SAMPAIO DE REZENDE
SYSTEMS AND METHODS FOR DETECTING A POINT OF INTEREST CHANGE USING A CONVOLUTIONAL NEURAL NETWORK

Publication number: 20200110966

Abstract: A method for detecting a point of interest (POI) change in a pair of inputted POI images. A first processor of the method: generates triplets of training POI images using a base of training POI images and trains a convolutional neural network (CNN) of three-stream Siamese type based on the triplets of training POI images. A second processor of the method: computes, for each image of the pair of inputted POI images, a descriptor of that image using a stream of the CNN of three-stream Siamese type, computes a similarity score based on the descriptors of the images of the pair of inputted POI images using a similarity score function, and selectively detects the POI change based on the similarity score. A third processor of the method generates the base of training POI images to include an initial set of POI images and a set of synthetic POI images.

Type: Application

Filed: September 19, 2019

Publication date: April 9, 2020

Applicant: Naver Corporation

Inventors: Jérôme Revaud, Rafael Sampaio De Rezende
SYSTEM AND METHOD FOR TRAINING A CONVOLUTIONAL NEURAL NETWORK AND CLASSIFYING AN ACTION PERFORMED BY A SUBJECT IN A VIDEO USING THE TRAINED CONVOLUTIONAL NEURAL NETWORK

Publication number: 20190303677

Abstract: A method for training a convolutional neural network for classification of actions performed by subjects in a video is realized by (a) for each frame of the video, for each key point of the subject, generating a heat map of the key point representing a position estimation of the key point within the frame; (b) colorizing each heat map as a function of the relative time of the corresponding frame in the video; (c) for each key point, aggregating all the colorized heat maps of the key point into at least one image representing the evolution of the position estimation of the key point during the video; and training the convolutional neural network using as input the sets associated to each training video of images representing the evolution of the position estimation of each key point during the video.

Type: Application

Filed: March 20, 2019

Publication date: October 3, 2019

Applicant: Naver Corporation

Inventors: Vasileios Choutas, Philippe Weinzaepfel, Jérôme Revaud, Cordelia Schmid