Patents by Inventor Matthias Grundmann

Matthias Grundmann has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20220044439
    Abstract: Example embodiments allow for fast, efficient determination of bounding box vertices or other pose information for objects based on images of a scene that may contain the objects. An artificial neural network or other machine learning algorithm is used to generate, from an input image, a heat map and a number of pairs of displacement maps. The location of a peak within the heat map is then used to extract, from the displacement maps, the two-dimensional displacement, from the location of the peak within the image, of vertices of a bounding box that contains the object. This bounding box can then be used to determine the pose of the object within the scene. The artificial neural network can be configured to generate intermediate segmentation maps, coordinate maps, or other information about the shape of the object so as to improve the estimated bounding box.
    Type: Application
    Filed: August 9, 2020
    Publication date: February 10, 2022
    Inventors: Tingbo Hou, Matthias Grundmann, Liangkai Zhang, Jianing Wei, Adel Ahmadyan
  • Patent number: 11221737
    Abstract: The technology disclosed herein includes a user interface for viewing and combining media items into a video. An example method includes presenting a user interface facilitating a creation of a video from a plurality of media items, wherein the user interface displays video content of the first and second media items in a first portion; receiving user input in the first portion of the user interface, wherein the user input comprises a selection of the first media item; updating the user interface to comprise a control element and a second portion, and adding the first media item to a set of selected media items, wherein the second portion displays image content of the set of selected media items and the control element enables a user to initiate the creation of the video; and creating the video based on video content of the set of selected media items.
    Type: Grant
    Filed: January 13, 2020
    Date of Patent: January 11, 2022
    Assignee: Google LLC
    Inventors: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
  • Patent number: 11182909
    Abstract: Example aspects of the present disclosure are directed to computing systems and methods for hand tracking using a machine-learned system for palm detection and key-point localization of hand landmarks. In particular, example aspects of the present disclosure are directed to a multi-model hand tracking system that performs both palm detection and hand landmark detection. Given a sequence of image frames, for example, the hand tracking system can detect one or more palms depicted in each image frame. For each palm detected within an image frame, the machine-learned system can determine a plurality of hand landmark positions of a hand associated with the palm. The system can perform key-point localization to determine precise three-dimensional coordinates for the hand landmark positions. In this manner, the machine-learned system can accurately track a hand depicted in the sequence of images using the precise three-dimensional coordinates for the hand landmark positions.
    Type: Grant
    Filed: December 10, 2019
    Date of Patent: November 23, 2021
    Assignee: Google LLC
    Inventors: Valentin Bazarevsky, Fan Zhang, Andrei Vakunov, Andrei Tkachenka, Matthias Grundmann
  • Patent number: 11158122
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network model to predict mesh vertices corresponding to a three-dimensional surface geometry of an object depicted in an image.
    Type: Grant
    Filed: October 2, 2019
    Date of Patent: October 26, 2021
    Assignee: Google LLC
    Inventors: Artsiom Ablavatski, Yury Kartynnik, Ivan Grishchenko, Matthias Grundmann
  • Patent number: 11120835
    Abstract: A computer-implemented method includes determining interesting moments in a video. The method further includes generating video segments based on the interesting moments, wherein each of the video segments includes at least one of the interesting moments from the video. The method further includes generating a collage from the video segments, where the collage includes at least two windows and wherein each window includes one of the video segments.
    Type: Grant
    Filed: December 17, 2018
    Date of Patent: September 14, 2021
    Assignee: Google LLC
    Inventors: Sharadh Ramaswamy, Matthias Grundmann, Kenneth Conley
  • Publication number: 20210174519
    Abstract: Example aspects of the present disclosure are directed to computing systems and methods for hand tracking using a machine-learned system for palm detection and key-point localization of hand landmarks. In particular, example aspects of the present disclosure are directed to a multi-model hand tracking system that performs both palm detection and hand landmark detection. Given a sequence of image frames, for example, the hand tracking system can detect one or more palms depicted in each image frame. For each palm detected within an image frame, the machine-learned system can determine a plurality of hand landmark positions of a hand associated with the palm. The system can perform key-point localization to determine precise three-dimensional coordinates for the hand landmark positions. In this manner, the machine-learned system can accurately track a hand depicted in the sequence of images using the precise three-dimensional coordinates for the hand landmark positions.
    Type: Application
    Filed: December 10, 2019
    Publication date: June 10, 2021
    Inventors: Valentin Bazarevsky, Fan Zhang, Andrei Vakunov, Andrei Tkachenka, Matthias Grundmann
  • Publication number: 20210133508
    Abstract: A computing system is disclosed including a convolutional neural configured to receive an input that describes a facial image and generate a facial object recognition output that describes one or more facial feature locations with respect to the facial image. The convolutional neural network can include a plurality of convolutional blocks. At least one of the convolutional blocks can include one or more separable convolutional layers configured to apply a depthwise convolution and a pointwise convolution during processing of an input to generate an output. The depthwise convolution can be applied with a kernel size that is greater than 3×3. At least one of the convolutional blocks can include a residual shortcut connection from its input to its output.
    Type: Application
    Filed: October 30, 2019
    Publication date: May 6, 2021
    Inventors: Valentin Bazarevsky, Yury Kartynnik, Andrei Vakunov, Karthik Raveendran, Matthias Grundmann
  • Publication number: 20210104096
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for training a neural network model to predict mesh vertices corresponding to a three-dimensional surface geometry of an object depicted in an image.
    Type: Application
    Filed: October 2, 2019
    Publication date: April 8, 2021
    Inventors: Artsiom Ablavatski, Yury Kartynnik, Ivan Grishchenko, Matthias Grundmann
  • Patent number: 10956749
    Abstract: Methods, systems, and media for summarizing a video with video thumbnails are provided.
    Type: Grant
    Filed: March 11, 2019
    Date of Patent: March 23, 2021
    Assignee: Google LLC
    Inventors: Matthias Grundmann, Alexandra Ivanna Hawkins, Sergey Ioffe
  • Publication number: 20200250852
    Abstract: The present disclosure provides systems and methods for calibration-free instant motion tracking useful, for example, for rending virtual content in augmented reality settings. In particular, a computing system can iteratively augment image frames that depict a scene to insert virtual content at an anchor region within the scene, including situations in which the anchor region moves relative to the scene. To do so, the computing system can estimate, for each of a number of sequential image frames: a rotation of an image capture system that captures the image frames; and a translation of the anchor region relative to an image capture system, thereby providing sufficient information to determine where and at what orientation to render the virtual content within the image frame.
    Type: Application
    Filed: December 17, 2019
    Publication date: August 6, 2020
    Inventors: Jianing Wei, Matthias Grundmann
  • Publication number: 20200211288
    Abstract: In a general aspect, a method can include receiving data defining an augmented reality (AR) environment including a representation of a physical environment, and changing tracking of an AR object within the AR environment between region-tracking mode and plane-tracking mode.
    Type: Application
    Filed: October 7, 2019
    Publication date: July 2, 2020
    Inventors: Bryan Woods, Jianingwei Wei, Sundeep Vaddadi, Cheng Yang, Konstantine Tsotsos, Keith Schaefer, Leon Wong, Keir Banks Mierle, Matthias Grundmann
  • Patent number: 10534503
    Abstract: Implementations disclose a user interface for viewing and combining media items into a video. A method includes presenting a user interface facilitating a creation of a video from a plurality of media items, the user interface comprising a first portion concurrently playing a first media item and a second media item of the plurality of media items; receiving user input indicating a selection of the first media item in the first portion of the user interface; in response to determining that the user input is of a first type, adding the first media item to a set of selected media items, and presenting the set of selected media items in a second portion of the user interface; and creating the video from the set of selected media items.
    Type: Grant
    Filed: June 21, 2016
    Date of Patent: January 14, 2020
    Assignee: GOOGLE LLC
    Inventors: Matthias Grundmann, Jokubas Zukerman, Marco Paglia, Kenneth Conley, Karthik Raveendran, Reed Morse
  • Patent number: 10514818
    Abstract: A computer-implemented method, computer program product, and computing system is provided for interacting with images having similar content. In an embodiment, a method may include identifying a plurality of photographs as including a common characteristic. The method may also include generating a flipbook media item including the plurality of photographs. The method may further include associating one or more interactive control features with the flipbook media item.
    Type: Grant
    Filed: April 6, 2016
    Date of Patent: December 24, 2019
    Assignee: GOOGLE LLC
    Inventors: Sergey Ioffe, Vivek Kwatra, Matthias Grundmann
  • Publication number: 20190228031
    Abstract: A computing device is described that includes a camera configured to capture an image of a user of the computing device, a memory configured to store the image of the user, at least one processor, and at least one module. The at least one module is operable by the at least one processor to obtain, from the memory, an indication of the image of the user of the computing device, determine, based on the image, a first emotion classification tag, and identify, based on the first emotion classification tag, at least one graphical image from a database of pre-classified images that has an emotional classification that is associated with the first emotion classification tag. The at least one module is further operable by the at least one processor to output, for display, the at least one graphical image.
    Type: Application
    Filed: January 28, 2019
    Publication date: July 25, 2019
    Applicant: Google LLC
    Inventors: Matthias GRUNDMANN, Karthik RAVEENDRAN, Daniel Castro CHIN
  • Publication number: 20190205654
    Abstract: Methods, systems, and media for summarizing a video with video thumbnails are provided.
    Type: Application
    Filed: March 11, 2019
    Publication date: July 4, 2019
    Inventors: Matthias Grundmann, Alexandra Ivanna Hawkins, Sergey Ioffe
  • Publication number: 20190189161
    Abstract: A computer-implemented method includes determining interesting moments in a video. The method further includes generating video segments based on the interesting moments, wherein each of the video segments includes at least one of the interesting moments from the video. The method further includes generating a collage from the video segments, where the collage includes at least two windows and wherein each window includes one of the video segments.
    Type: Application
    Filed: December 17, 2018
    Publication date: June 20, 2019
    Applicant: Google LLC
    Inventors: Sharadh RAMASWAMY, Matthias GRUNDMANN, Kenneth CONLEY
  • Patent number: 10262420
    Abstract: Systems and methods are disclosed for tracking regions within a media item. A method includes identifying a region in a first frame of a media item using a first user specified position; calculating, based on the first user specified position and on tracking data, an estimated position of the region within a second frame of the media item and an estimated position of the region within a third frame of the media item; adjusting the estimated position of the region within the second frame to a second user specified position; blending, by a processing device, the estimated position within the third frame based on the second user specified position of the second frame to generate a blended position within the third frame; and storing, in a data store, the blended position within the third frame.
    Type: Grant
    Filed: May 3, 2017
    Date of Patent: April 16, 2019
    Assignee: GOOGLE LLC
    Inventors: Amanda Conway, Matthias Grundmann, Christian Ingemar Falk
  • Patent number: 10229326
    Abstract: Methods, systems, and media for summarizing a video with video thumbnails are provided.
    Type: Grant
    Filed: September 7, 2018
    Date of Patent: March 12, 2019
    Assignee: Google LLC
    Inventors: Matthias Grundmann, Alexandra Ivanna Hawkins, Sergey Ioffe
  • Patent number: 10191920
    Abstract: A computing device is described that includes a camera configured to capture an image of a user of the computing device, a memory configured to store the image of the user, at least one processor, and at least one module. The at least one module is operable by the at least one processor to obtain, from the memory, an indication of the image of the user of the computing device, determine, based on the image, a first emotion classification tag, and identify, based on the first emotion classification tag, at least one graphical image from a database of pre-classified images that has an emotional classification that is associated with the first emotion classification tag. The at least one module is further operable by the at least one processor to output, for display, the at least one graphical image.
    Type: Grant
    Filed: August 24, 2015
    Date of Patent: January 29, 2019
    Assignee: Google LLC
    Inventors: Matthias Grundmann, Karthik Raveendran, Daniel Castro Chin
  • Publication number: 20190013047
    Abstract: A plurality of videos is analyzed (in real time or after the videos are generated) to identify interesting portions of the videos. The interesting portions are identified based on one or more of the people depicted in the videos, the objects depicted in the videos, the motion of objects and/or people in the videos, and the locations where people depicted in the videos are looking. The interesting portions are combined to generate a content item.
    Type: Application
    Filed: March 31, 2015
    Publication date: January 10, 2019
    Inventors: Arthur Wait, Krishna Bharat, Caroline Rebecca Pantofaru, Christian Frueh, Matthias Grundmann, Jay Yagnik, Ryan Michael Hickman