Patents by Inventor Matthias Grundmann

Matthias Grundmann has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Real-Time Pose Estimation for Unseen Objects

Publication number: 20220044439

Abstract: Example embodiments allow for fast, efficient determination of bounding box vertices or other pose information for objects based on images of a scene that may contain the objects. An artificial neural network or other machine learning algorithm is used to generate, from an input image, a heat map and a number of pairs of displacement maps. The location of a peak within the heat map is then used to extract, from the displacement maps, the two-dimensional displacement, from the location of the peak within the image, of vertices of a bounding box that contains the object. This bounding box can then be used to determine the pose of the object within the scene. The artificial neural network can be configured to generate intermediate segmentation maps, coordinate maps, or other information about the shape of the object so as to improve the estimated bounding box.

Type: Application

Filed: August 9, 2020

Publication date: February 10, 2022

Inventors: Tingbo Hou, Matthias Grundmann, Liangkai Zhang, Jianing Wei, Adel Ahmadyan
Motion stills experience

Patent number: 11221737

Abstract: The technology disclosed herein includes a user interface for viewing and combining media items into a video. An example method includes presenting a user interface facilitating a creation of a video from a plurality of media items, wherein the user interface displays video content of the first and second media items in a first portion; receiving user input in the first portion of the user interface, wherein the user input comprises a selection of the first media item; updating the user interface to comprise a control element and a second portion, and adding the first media item to a set of selected media items, wherein the second portion displays image content of the set of selected media items and the control element enables a user to initiate the creation of the video; and creating the video based on video content of the set of selected media items.

Type: Grant

Filed: January 13, 2020

Date of Patent: January 11, 2022

Assignee: Google LLC

Inventors: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
Scalable real-time hand tracking

Patent number: 11182909

Abstract: Example aspects of the present disclosure are directed to computing systems and methods for hand tracking using a machine-learned system for palm detection and key-point localization of hand landmarks. In particular, example aspects of the present disclosure are directed to a multi-model hand tracking system that performs both palm detection and hand landmark detection. Given a sequence of image frames, for example, the hand tracking system can detect one or more palms depicted in each image frame. For each palm detected within an image frame, the machine-learned system can determine a plurality of hand landmark positions of a hand associated with the palm. The system can perform key-point localization to determine precise three-dimensional coordinates for the hand landmark positions. In this manner, the machine-learned system can accurately track a hand depicted in the sequence of images using the precise three-dimensional coordinates for the hand landmark positions.

Type: Grant

Filed: December 10, 2019

Date of Patent: November 23, 2021

Assignee: Google LLC

Inventors: Valentin Bazarevsky, Fan Zhang, Andrei Vakunov, Andrei Tkachenka, Matthias Grundmann
Surface geometry object model training and inference

Patent number: 11158122

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network model to predict mesh vertices corresponding to a three-dimensional surface geometry of an object depicted in an image.

Type: Grant

Filed: October 2, 2019

Date of Patent: October 26, 2021

Assignee: Google LLC

Inventors: Artsiom Ablavatski, Yury Kartynnik, Ivan Grishchenko, Matthias Grundmann
Collage of interesting moments in a video

Patent number: 11120835

Abstract: A computer-implemented method includes determining interesting moments in a video. The method further includes generating video segments based on the interesting moments, wherein each of the video segments includes at least one of the interesting moments from the video. The method further includes generating a collage from the video segments, where the collage includes at least two windows and wherein each window includes one of the video segments.

Type: Grant

Filed: December 17, 2018

Date of Patent: September 14, 2021

Assignee: Google LLC

Inventors: Sharadh Ramaswamy, Matthias Grundmann, Kenneth Conley
Scalable Real-Time Hand Tracking

Publication number: 20210174519

Abstract: Example aspects of the present disclosure are directed to computing systems and methods for hand tracking using a machine-learned system for palm detection and key-point localization of hand landmarks. In particular, example aspects of the present disclosure are directed to a multi-model hand tracking system that performs both palm detection and hand landmark detection. Given a sequence of image frames, for example, the hand tracking system can detect one or more palms depicted in each image frame. For each palm detected within an image frame, the machine-learned system can determine a plurality of hand landmark positions of a hand associated with the palm. The system can perform key-point localization to determine precise three-dimensional coordinates for the hand landmark positions. In this manner, the machine-learned system can accurately track a hand depicted in the sequence of images using the precise three-dimensional coordinates for the hand landmark positions.

Type: Application

Filed: December 10, 2019

Publication date: June 10, 2021

Inventors: Valentin Bazarevsky, Fan Zhang, Andrei Vakunov, Andrei Tkachenka, Matthias Grundmann
Efficient Convolutional Neural Networks and Techniques to Reduce Associated Computational Costs

Publication number: 20210133508

Abstract: A computing system is disclosed including a convolutional neural configured to receive an input that describes a facial image and generate a facial object recognition output that describes one or more facial feature locations with respect to the facial image. The convolutional neural network can include a plurality of convolutional blocks. At least one of the convolutional blocks can include one or more separable convolutional layers configured to apply a depthwise convolution and a pointwise convolution during processing of an input to generate an output. The depthwise convolution can be applied with a kernel size that is greater than 3×3. At least one of the convolutional blocks can include a residual shortcut connection from its input to its output.

Type: Application

Filed: October 30, 2019

Publication date: May 6, 2021

Inventors: Valentin Bazarevsky, Yury Kartynnik, Andrei Vakunov, Karthik Raveendran, Matthias Grundmann
SURFACE GEOMETRY OBJECT MODEL TRAINING AND INFERENCE

Publication number: 20210104096

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network model to predict mesh vertices corresponding to a three-dimensional surface geometry of an object depicted in an image.

Type: Application

Filed: October 2, 2019

Publication date: April 8, 2021

Inventors: Artsiom Ablavatski, Yury Kartynnik, Ivan Grishchenko, Matthias Grundmann
Methods, systems, and media for generating a summarized video with video thumbnails

Patent number: 10956749

Abstract: Methods, systems, and media for summarizing a video with video thumbnails are provided.

Type: Grant

Filed: March 11, 2019

Date of Patent: March 23, 2021

Assignee: Google LLC

Inventors: Matthias Grundmann, Alexandra Ivanna Hawkins, Sergey Ioffe
Calibration-Free Instant Motion Tracking for Augmented Reality

Publication number: 20200250852

Abstract: The present disclosure provides systems and methods for calibration-free instant motion tracking useful, for example, for rending virtual content in augmented reality settings. In particular, a computing system can iteratively augment image frames that depict a scene to insert virtual content at an anchor region within the scene, including situations in which the anchor region moves relative to the scene. To do so, the computing system can estimate, for each of a number of sequential image frames: a rotation of an image capture system that captures the image frames; and a translation of the anchor region relative to an image capture system, thereby providing sufficient information to determine where and at what orientation to render the virtual content within the image frame.

Type: Application

Filed: December 17, 2019

Publication date: August 6, 2020

Inventors: Jianing Wei, Matthias Grundmann
HYBRID PLACEMENT OF OBJECTS IN AN AUGMENTED REALITY ENVIRONMENT

Publication number: 20200211288

Abstract: In a general aspect, a method can include receiving data defining an augmented reality (AR) environment including a representation of a physical environment, and changing tracking of an AR object within the AR environment between region-tracking mode and plane-tracking mode.

Type: Application

Filed: October 7, 2019

Publication date: July 2, 2020

Inventors: Bryan Woods, Jianingwei Wei, Sundeep Vaddadi, Cheng Yang, Konstantine Tsotsos, Keith Schaefer, Leon Wong, Keir Banks Mierle, Matthias Grundmann
Motion stills experience

Patent number: 10534503

Abstract: Implementations disclose a user interface for viewing and combining media items into a video. A method includes presenting a user interface facilitating a creation of a video from a plurality of media items, the user interface comprising a first portion concurrently playing a first media item and a second media item of the plurality of media items; receiving user input indicating a selection of the first media item in the first portion of the user interface; in response to determining that the user input is of a first type, adding the first media item to a set of selected media items, and presenting the set of selected media items in a second portion of the user interface; and creating the video from the set of selected media items.

Type: Grant

Filed: June 21, 2016

Date of Patent: January 14, 2020

Assignee: GOOGLE LLC

Inventors: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
System and method for grouping related photographs

Patent number: 10514818

Abstract: A computer-implemented method, computer program product, and computing system is provided for interacting with images having similar content. In an embodiment, a method may include identifying a plurality of photographs as including a common characteristic. The method may also include generating a flipbook media item including the plurality of photographs. The method may further include associating one or more interactive control features with the flipbook media item.

Type: Grant

Filed: April 6, 2016

Date of Patent: December 24, 2019

Assignee: GOOGLE LLC

Inventors: Sergey Ioffe, Vivek Kwatra, Matthias Grundmann
GRAPHICAL IMAGE RETRIEVAL BASED ON EMOTIONAL STATE OF A USER OF A COMPUTING DEVICE

Publication number: 20190228031

Abstract: A computing device is described that includes a camera configured to capture an image of a user of the computing device, a memory configured to store the image of the user, at least one processor, and at least one module. The at least one module is operable by the at least one processor to obtain, from the memory, an indication of the image of the user of the computing device, determine, based on the image, a first emotion classification tag, and identify, based on the first emotion classification tag, at least one graphical image from a database of pre-classified images that has an emotional classification that is associated with the first emotion classification tag. The at least one module is further operable by the at least one processor to output, for display, the at least one graphical image.

Type: Application

Filed: January 28, 2019

Publication date: July 25, 2019

Applicant: Google LLC

Inventors: Matthias GRUNDMANN, Karthik RAVEENDRAN, Daniel Castro CHIN
METHODS, SYSTEMS, AND MEDIA FOR GENERATING A SUMMARIZED VIDEO WITH VIDEO THUMBNAILS

Publication number: 20190205654

Abstract: Methods, systems, and media for summarizing a video with video thumbnails are provided.

Type: Application

Filed: March 11, 2019

Publication date: July 4, 2019

Inventors: Matthias Grundmann, Alexandra Ivanna Hawkins, Sergey Ioffe
COLLAGE OF INTERESTING MOMENTS IN A VIDEO

Publication number: 20190189161

Abstract: A computer-implemented method includes determining interesting moments in a video. The method further includes generating video segments based on the interesting moments, wherein each of the video segments includes at least one of the interesting moments from the video. The method further includes generating a collage from the video segments, where the collage includes at least two windows and wherein each window includes one of the video segments.

Type: Application

Filed: December 17, 2018

Publication date: June 20, 2019

Applicant: Google LLC

Inventors: Sharadh RAMASWAMY, Matthias GRUNDMANN, Kenneth CONLEY
Tracking image regions

Patent number: 10262420

Abstract: Systems and methods are disclosed for tracking regions within a media item. A method includes identifying a region in a first frame of a media item using a first user specified position; calculating, based on the first user specified position and on tracking data, an estimated position of the region within a second frame of the media item and an estimated position of the region within a third frame of the media item; adjusting the estimated position of the region within the second frame to a second user specified position; blending, by a processing device, the estimated position within the third frame based on the second user specified position of the second frame to generate a blended position within the third frame; and storing, in a data store, the blended position within the third frame.

Type: Grant

Filed: May 3, 2017

Date of Patent: April 16, 2019

Assignee: GOOGLE LLC

Inventors: Amanda Conway, Matthias Grundmann, Christian Ingemar Falk
Methods, systems, and media for generating a summarized video with video thumbnails

Patent number: 10229326

Abstract: Methods, systems, and media for summarizing a video with video thumbnails are provided.

Type: Grant

Filed: September 7, 2018

Date of Patent: March 12, 2019

Assignee: Google LLC

Inventors: Matthias Grundmann, Alexandra Ivanna Hawkins, Sergey Ioffe
Graphical image retrieval based on emotional state of a user of a computing device

Patent number: 10191920

Abstract: A computing device is described that includes a camera configured to capture an image of a user of the computing device, a memory configured to store the image of the user, at least one processor, and at least one module. The at least one module is operable by the at least one processor to obtain, from the memory, an indication of the image of the user of the computing device, determine, based on the image, a first emotion classification tag, and identify, based on the first emotion classification tag, at least one graphical image from a database of pre-classified images that has an emotional classification that is associated with the first emotion classification tag. The at least one module is further operable by the at least one processor to output, for display, the at least one graphical image.

Type: Grant

Filed: August 24, 2015

Date of Patent: January 29, 2019

Assignee: Google LLC

Inventors: Matthias Grundmann, Karthik Raveendran, Daniel Castro Chin
IDENTIFYING INTERESTING PORTIONS OF VIDEOS

Publication number: 20190013047

Abstract: A plurality of videos is analyzed (in real time or after the videos are generated) to identify interesting portions of the videos. The interesting portions are identified based on one or more of the people depicted in the videos, the objects depicted in the videos, the motion of objects and/or people in the videos, and the locations where people depicted in the videos are looking. The interesting portions are combined to generate a content item.

Type: Application

Filed: March 31, 2015

Publication date: January 10, 2019

Inventors: Arthur Wait, Krishna Bharat, Caroline Rebecca Pantofaru, Christian Frueh, Matthias Grundmann, Jay Yagnik, Ryan Michael Hickman

prev 1 2 3 4 next