Patents by Inventor Klaus Greff

Klaus Greff has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240119697
    Abstract: Example embodiments of the present disclosure provide an example computer-implemented method for constructing a three-dimensional semantic segmentation of a scene from two-dimensional inputs. The example method includes obtaining, by a computing system comprising one or more processors, an image set comprising one or more views of a subject scene. The example method includes generating, by the computing system and based at least in part on the image set, a scene representation describing the subject scene in three dimensions. The example method includes generating, by the computing system and using a machine-learned semantic segmentation model framework, a multidimensional field of probability distributions over semantic categories, the multidimensional field defined over the three dimensions of the subject scene. The example method includes outputting, by the computing system, classification data for at least one location in the subject scene.
    Type: Application
    Filed: October 10, 2022
    Publication date: April 11, 2024
    Inventors: Daniel Christopher Duckworth, Suhani Deepak-Ranu Vora, Noha Radwan, Klaus Greff, Henning Meyer, Kyle Adam Genova, Seyed Mohammad Mehdi Sajjadi, Etienne François Régis Pot, Andrea Tagliasacchi
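    A minimal sketch (not the patented implementation) of the idea described in the abstract above: a scene representation is queried at 3D locations and a semantic head returns a probability distribution over categories for each location. The names here (SemanticField, num_categories, the random weights) are illustrative assumptions.

    import numpy as np

    class SemanticField:
        """Maps 3D locations to a softmax distribution over semantic categories."""
        def __init__(self, num_categories, hidden=64, seed=0):
            rng = np.random.default_rng(seed)
            self.w1 = rng.normal(scale=0.1, size=(3, hidden))        # 3D coords -> hidden features
            self.w2 = rng.normal(scale=0.1, size=(hidden, num_categories))

        def __call__(self, xyz):
            h = np.tanh(xyz @ self.w1)                               # scene features at each point
            logits = h @ self.w2                                     # per-category scores
            e = np.exp(logits - logits.max(axis=-1, keepdims=True))
            return e / e.sum(axis=-1, keepdims=True)                 # field of probability distributions

    field = SemanticField(num_categories=5)
    points = np.array([[0.1, 0.2, 0.3], [0.5, -0.4, 1.0]])           # locations in the subject scene
    probs = field(points)                                            # (2, 5) distributions over categories
    labels = probs.argmax(axis=-1)                                   # classification data per location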
  • Publication number: 20240096001
    Abstract: Provided are machine learning models that generate geometry-free neural scene representations through efficient object-centric novel-view synthesis. In particular, one example aspect of the present disclosure provides a novel framework in which an encoder model (e.g., an encoder transformer network) processes one or more RGB images (with or without pose) to produce a fully latent scene representation that can be passed to a decoder model (e.g., a decoder transformer network). Given one or more target poses, the decoder model can synthesize images in a single forward pass. In some example implementations, because transformers are used rather than convolutional or MLP networks, the encoder can learn an attention model that extracts enough 3D information about a scene from a small set of images to render novel views with correct projections, parallax, occlusions, and even semantics, without explicit geometry.
    Type: Application
    Filed: November 15, 2022
    Publication date: March 21, 2024
    Inventors: Seyed Mohammad Mehdi Sajjadi, Henning Meyer, Etienne François Régis Pot, Urs Michael Bergmann, Klaus Greff, Noha Radwan, Suhani Deepak-Ranu Vora, Mario Lučić, Daniel Christopher Duckworth, Thomas Allen Funkhouser, Andrea Tagliasacchi
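    A hedged sketch of the encoder/decoder pattern in the abstract above: patches from a few input views are encoded into a fully latent set of scene tokens (no explicit geometry), and a decoder cross-attends from target-ray queries into those tokens to predict colours for a novel view. The single-head attention, shapes, and names are illustrative simplifications, not the patented architecture.

    import numpy as np

    rng = np.random.default_rng(0)

    def attention(q, k, v):
        """Scaled dot-product attention: queries attend over the latent scene tokens."""
        scores = q @ k.T / np.sqrt(k.shape[-1])
        w = np.exp(scores - scores.max(axis=-1, keepdims=True))
        return (w / w.sum(axis=-1, keepdims=True)) @ v

    # "Encoder": project flattened patches from the input RGB views into latent tokens.
    patches = rng.normal(size=(48, 192))                  # 48 patches from a handful of views
    tokens = patches @ rng.normal(scale=0.05, size=(192, 64))
    scene_latent = attention(tokens, tokens, tokens)      # fully latent scene representation

    # "Decoder": one query per ray of the target pose, answered in a single forward pass.
    rays = rng.normal(size=(1024, 6))                     # ray origins + directions for the novel view
    queries = rays @ rng.normal(scale=0.05, size=(6, 64))
    pixels = attention(queries, scene_latent, scene_latent) @ rng.normal(scale=0.05, size=(64, 3))
    # pixels has shape (1024, 3): one RGB colour per queried target ray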
  • Publication number: 20220383628
    Abstract: A method includes obtaining first feature vectors and second feature vectors representing contents of a first and second image frame, respectively, of an input video. The method may also include generating, based on the first feature vectors, first slot vectors, where each slot vector represents attributes of a corresponding entity as represented in the first image frame, and generating, based on the first slot vectors, predicted slot vectors including a corresponding predicted slot vector that represents a transition of the attributes of the corresponding entity from the first to the second image frame. The method may additionally include generating, based on the predicted slot vectors and the second feature vectors, second slot vectors including a corresponding slot vector that represents the attributes of the corresponding entity as represented in the second image frame, and determining an output based on the predicted slot vectors or the second slot vectors.
    Type: Application
    Filed: April 21, 2022
    Publication date: December 1, 2022
    Inventors: Thomas Kipf, Gamaleldin Elsayed, Aravindh Mahendran, Austin Charles Stone, Sara Sabour Rouh Aghdam, Georg Heigold, Rico Jonschkowski, Alexey Dosovitskiy, Klaus Greff
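    A minimal sketch of the predict/correct loop described in the abstract above: slot vectors summarise entities in the first frame, a predictor rolls them forward, and the predicted slots are corrected against the second frame's feature vectors. The single-head attention update and the linear predictor are illustrative stand-ins, not the claimed model.

    import numpy as np

    rng = np.random.default_rng(0)
    D = 32                                                 # feature / slot dimensionality

    def correct(slots, features):
        """Update slot vectors by attending over a frame's feature vectors."""
        scores = slots @ features.T / np.sqrt(D)
        attn = np.exp(scores - scores.max(axis=-1, keepdims=True))
        attn /= attn.sum(axis=-1, keepdims=True)
        return attn @ features                             # corrected slot vectors

    predictor = rng.normal(scale=0.1, size=(D, D))         # models the transition of entity attributes

    frame1_feats = rng.normal(size=(100, D))               # first feature vectors
    frame2_feats = rng.normal(size=(100, D))               # second feature vectors

    first_slots = correct(rng.normal(size=(4, D)), frame1_feats)   # first slot vectors
    predicted_slots = first_slots @ predictor              # predicted slot vectors for the next frame
    second_slots = correct(predicted_slots, frame2_feats)  # second slot vectors
    output = second_slots.mean(axis=0)                     # an output based on the slot vectors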
  • Patent number: 11481585
    Abstract: Disclosed is a computer-implemented method for segmenting input data. In the method, a plurality of tags is generated; the input data is masked with the plurality of tags; a plurality of output reconstructions is generated by inputting the masked input data to one of the following: a denoising neural network or a variational autoencoder; a plurality of values representing the distance of each output reconstruction to the input data is determined; a plurality of updated versions of the input data is generated by applying at least one of the determined distance values to the input data; and updated output reconstructions are generated by inputting the updated versions of the input data to one of the networks. Also disclosed are a method for training the network and a processing unit.
    Type: Grant
    Filed: May 19, 2017
    Date of Patent: October 25, 2022
    Assignee: Canary Capital LLC
    Inventors: Harri Valpola, Klaus Greff
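    A hedged sketch of the iterative loop in the abstract above: the input is masked with a set of tags, each masked copy is reconstructed, distances between reconstructions and the input are computed, and those distances drive the next update of the masks. The smoothing "denoiser" and the softmax mask update are illustrative placeholders for the denoising network or variational autoencoder named in the abstract.

    import numpy as np

    rng = np.random.default_rng(0)
    x = rng.normal(size=(64,))                             # input data (e.g. a flattened image)
    K = 3                                                  # number of tags / segments

    def denoise(v):
        """Placeholder for the denoising network: lightly shrinks its input toward zero."""
        return 0.9 * v

    masks = np.full((K, x.size), 1.0 / K)                  # plurality of tags (soft masks)
    for _ in range(5):
        masked = masks * x                                 # input data masked with the tags
        recons = np.stack([denoise(m) for m in masked])    # plurality of output reconstructions
        dist = (recons - x) ** 2                           # distances of reconstructions to the input
        weights = np.exp(-dist)                            # closer reconstruction -> larger weight
        masks = weights / weights.sum(axis=0, keepdims=True)   # updated masking for the next pass

    segmentation = masks.argmax(axis=0)                    # per-element segment assignment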
  • Patent number: 10984320
    Abstract: A computer-based method includes receiving an input signal at a neuron in a computer-based neural network that includes a plurality of neuron layers, applying a first non-linear transform to the input signal at the neuron to produce a plain signal, and calculating a weighted sum of a first component of the input signal and the plain signal at the neuron. In a typical implementation, the first non-linear transform is a function of the first component of the input signal and at least a second component of the input signal.
    Type: Grant
    Filed: May 1, 2017
    Date of Patent: April 20, 2021
    Assignee: Nnaisense SA
    Inventors: Rupesh Kumar Srivastava, Klaus Greff
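    A minimal sketch of the gated layer the abstract above describes: a non-linear transform produces a "plain signal", a gate computed from the input decides how much of the plain signal versus the original input passes through, and the output is their weighted sum. The tanh/sigmoid choices and weight shapes are assumptions for illustration, not the patented formulation.

    import numpy as np

    rng = np.random.default_rng(0)
    D = 16
    W_h = rng.normal(scale=0.1, size=(D, D))               # weights of the non-linear transform
    W_t = rng.normal(scale=0.1, size=(D, D))               # weights of the gate

    def sigmoid(z):
        return 1.0 / (1.0 + np.exp(-z))

    def gated_layer(x):
        plain = np.tanh(x @ W_h)                           # plain signal from the non-linear transform
        gate = sigmoid(x @ W_t)                            # gate computed from components of the input
        return gate * plain + (1.0 - gate) * x             # weighted sum of input and plain signal

    x = rng.normal(size=(D,))
    y = gated_layer(x)                                     # output of one layer of the network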
  • Publication number: 20190220691
    Abstract: Disclosed is a computer-implemented method for segmenting input data. In the method, a plurality of tags is generated; the input data is masked with the plurality of tags; a plurality of output reconstructions is generated by inputting the masked input data to one of the following: a denoising neural network or a variational autoencoder; a plurality of values representing the distance of each output reconstruction to the input data is determined; a plurality of updated versions of the input data is generated by applying at least one of the determined distance values to the input data; and updated output reconstructions are generated by inputting the updated versions of the input data to one of the networks. Also disclosed are a method for training the network and a processing unit.
    Type: Application
    Filed: May 19, 2017
    Publication date: July 18, 2019
    Applicant: Curious AI Oy
    Inventors: Harri Valpola, Klaus Greff
  • Publication number: 20170316308
    Abstract: A computer-based method includes receiving an input signal at a neuron in a computer-based neural network that includes a plurality of neuron layers, applying a first non-linear transform to the input signal at the neuron to produce a plain signal, and calculating a weighted sum of a first component of the input signal and the plain signal at the neuron. In a typical implementation, the first non-linear transform is a function of the first component of the input signal and at least a second component of the input signal.
    Type: Application
    Filed: May 1, 2017
    Publication date: November 2, 2017
    Inventors: Rupesh Kumar Srivastava, Klaus Greff