Patents by Inventor Toby Sharp
Toby Sharp has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Publication number: 20230316552Abstract: The techniques described herein disclose a system that is configured to detect and track the three-dimensional pose of an object (e.g., a head-mounted display device) in a color image using an accessible three-dimensional model of the object. The system uses the three-dimensional pose of the object to repair pixel depth values associated with a region (e.g., a surface) of the object that is composed of material that absorbs light emitted by a time-of-flight depth sensor to determine depth. Consequently, a color-depth image (e.g., a Red-Green-Blue-Depth image or RGB-D image) can be produced that does not include dark holes on and around the region of the object that is composed of material that absorbs light emitted by the time-of-flight depth sensor.Type: ApplicationFiled: April 4, 2022Publication date: October 5, 2023Inventors: JingJing SHEN, Erroll William WOOD, Toby SHARP, Ivan RAZUMENIC, Tadas BALTRUSAITIS, Julien Pascal Christophe VALENTIN, Predrag JOVANOVIC
-
Publication number: 20230281863Abstract: Keypoints are predicted in an image. Predictions are generated for each of the keypoints of an image as a 2D random variable, normally distributed with location (x, y) and standard deviation sigma. A neural network is trained to maximize a log-likelihood that samples from each of the predicted keypoints equal a ground truth. The trained neural network is used to predict keypoints of an image without generating a heatmap.Type: ApplicationFiled: June 28, 2022Publication date: September 7, 2023Inventors: Julien Pascal Christophe VALENTIN, Erroll William WOOD, Thomas Joseph CASHMAN, Martin de LA GORCE, Tadas BALTRUSAITIS, Daniel Stephen WILDE, Jingjing SHEN, Matthew Alastair JOHNSON, Charles Thomas HEWITT, Nikola MILOSAVLJEVIC, Stephan Joachim GARBIN, Toby SHARP, Ivan STOJILJKOVIC
-
Patent number: 10832163Abstract: Memory facilitation using directed acyclic graphs is described, for example, where a plurality of directed acyclic graphs are trained for gesture recognition from human skeletal data, or to estimate human body joint positions from depth images for gesture detection. In various examples directed acyclic graphs are grown during training using a training objective which takes into account both connection patterns between nodes and split function parameter values. For example, a layer of child nodes is grown and connected to a parent layer of nodes using an initialization strategy. In examples, various local search processes are used to find good combinations of connection patterns and split function parameters.Type: GrantFiled: October 28, 2016Date of Patent: November 10, 2020Assignee: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph Shotton, Toby Sharp, Pushmeet Kohli, Reinhard Sebastian Bernhard Nowozin, John Michael Winn, Antonio Criminisi
-
Patent number: 10311282Abstract: Region of interest detection in raw time of flight images is described. For example, a computing device receives at least one raw image captured for a single frame by a time of flight camera. The raw image depicts one or more objects in an environment of the time of flight camera (such as human hands, bodies or any other objects). The raw image is input to a trained region detector and in response one or more regions of interest in the raw image are received. A received region of interest comprises image elements of the raw image which are predicted to depict at least part of one of the objects. A depth computation logic computes depth from the one or more regions of interest of the raw image.Type: GrantFiled: September 11, 2017Date of Patent: June 4, 2019Assignee: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph Shotton, Cem Keskin, Christoph Rhemann, Toby Sharp, Duncan Paul Robertson, Pushmeet Kohli, Andrew William Fitzgibbon, Shahram Izadi
-
Patent number: 10186081Abstract: A tracker is described which comprises an input configured to receive captured sensor data depicting an object. The tracker has a processor configured to access a rigged, smooth-surface model of the object and to compute values of pose parameters of the model by calculating an optimization to fit the model to data related to the captured sensor data. Variables representing correspondences between the data and the model are included in the optimization jointly with the pose parameters.Type: GrantFiled: December 29, 2015Date of Patent: January 22, 2019Assignee: Microsoft Technology Licensing, LLCInventors: Jonathan James Taylor, Thomas Joseph Cashman, Andrew William Fitzgibbon, Toby Sharp, Jamie Daniel Joseph Shotton
-
Patent number: 10110881Abstract: Model fitting from raw time of flight image data is described, for example, to track position and orientation of a human hand or other entity. In various examples, raw image data depicting the entity is received from a time of flight camera. A 3D model of the entity is accessed and used to render, from the 3D model, simulations of raw time of flight image data depicting the entity in a specified pose/shape. The simulated raw image data and at least part of the received raw image data are compared and on the basis of the comparison, parameters of the entity are computed.Type: GrantFiled: October 30, 2014Date of Patent: October 23, 2018Assignee: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph Shotton, Toby Sharp, Jonathan James Taylor, Pushmeet Kohli, Shahram Izadi, Andrew William Fitzgibbon, Reinhard Sebastian Bernhard Nowozin
-
Patent number: 10037624Abstract: Examples describe an apparatus for calibrating a three dimensional (3D) mesh model of an articulated object. The articulated object is an instance of a specified object class. The apparatus comprises an input configured to receive captured sensor data depicting the object. The apparatus has a calibration engine configured to compute values of shape parameters of the 3D mesh model which indicate which member of the object class is depicted in the captured sensor data, in order to calibrate the 3D mesh model. The calibration engine is configured to compute the values of the shape parameters with an optimization process to find at least one potential local or global minimum of an energy function, the energy function expressing a degree of similarity between data rendered from the model and the received sensor data.Type: GrantFiled: December 29, 2015Date of Patent: July 31, 2018Assignee: Microsoft Technology Licensing, LLCInventors: Thomas Joseph Cashman, David Joseph New Tan, Jamie Daniel Joseph Shotton, Andrew William Fitzgibbon, Sameh Khamis, Jonathan James Taylor, Toby Sharp, Daniel Stefan Tarlow
-
Patent number: 9911032Abstract: Tracking hand or body pose from image data is described, for example, to control a game system, natural user interface or for augmented reality. In various examples a prediction engine takes a single frame of image data and predicts a distribution over a pose of a hand or body depicted in the image data. In examples, a stochastic optimizer has a pool of candidate poses of the hand or body which it iteratively refines, and samples from the predicted distribution are used to replace some candidate poses in the pool. In some examples a best candidate pose from the pool is selected as the current tracked pose and the selection processes uses a 3D model of the hand or body.Type: GrantFiled: January 4, 2017Date of Patent: March 6, 2018Assignee: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph Shotton, Cem Keskin, Jonathan Taylor, Toby Sharp, Shahram Izadi, Andrew William Fitzgibbon, Pushmeet Kohli, Duncan Paul Robertson
-
Publication number: 20170372126Abstract: Region of interest detection in raw time of flight images is described. For example, a computing device receives at least one raw image captured for a single frame by a time of flight camera. The raw image depicts one or more objects in an environment of the time of flight camera (such as human hands, bodies or any other objects). The raw image is input to a trained region detector and in response one or more regions of interest in the raw image are received. A received region of interest comprises image elements of the raw image which are predicted to depict at least part of one of the objects. A depth computation logic computes depth from the one or more regions of interest of the raw image.Type: ApplicationFiled: September 11, 2017Publication date: December 28, 2017Applicant: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph SHOTTON, Cem KESKIN, Christoph RHEMANN, Toby SHARP, Duncan Paul ROBERTSON, Pushmeet KOHLI, Andrew William FITZGIBBON, Shahram IZADI
-
Patent number: 9773155Abstract: Region of interest detection in raw time of flight images is described. For example, a computing device receives at least one raw image captured for a single frame by a time of flight camera. The raw image depicts one or more objects in an environment of the time of flight camera (such as human hands, bodies or any other objects). The raw image is input to a trained region detector and in response one or more regions of interest in the raw image are received. A received region of interest comprises image elements of the raw image which are predicted to depict at least part of one of the objects. A depth computation logic computes depth from the one or more regions of interest of the raw image.Type: GrantFiled: October 14, 2014Date of Patent: September 26, 2017Assignee: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph Shotton, Cem Keskin, Christoph Rhemann, Toby Sharp, Duncan Paul Robertson, Pushmeet Kohli, Andrew William Fitzgibbon, Shahram Izadi
-
Publication number: 20170186226Abstract: Examples describe an apparatus for calibrating a three dimensional (3D) mesh model of an articulated object. The articulated object is an instance of a specified object class. The apparatus comprises an input configured to receive captured sensor data depicting the object. The apparatus has a calibration engine configured to compute values of shape parameters of the 3D mesh model which indicate which member of the object class is depicted in the captured sensor data, in order to calibrate the 3D mesh model. The calibration engine is configured to compute the values of the shape parameters with an optimization process to find at least one potential local or global minimum of an energy function, the energy function expressing a degree of similarity between data rendered from the model and the received sensor data.Type: ApplicationFiled: December 29, 2015Publication date: June 29, 2017Inventors: Thomas Joseph CASHMAN, David Joseph New TAN, Jamie Daniel Joseph SHOTTON, Andrew William FITZGIBBON, Sameh KHAMIS, Jonathan James TAYLOR, Toby SHARP, Daniel Stefan TARLOW
-
Publication number: 20170186165Abstract: A tracker is described which comprises an input configured to receive captured sensor data depicting an object. The tracker has a processor configured to access a rigged, smooth-surface model of the object and to compute values of pose parameters of the model by calculating an optimization to fit the model to data related to the captured sensor data. Variables representing correspondences between the data and the model are included in the optimization jointly with the pose parameters.Type: ApplicationFiled: December 29, 2015Publication date: June 29, 2017Inventors: Jonathan James TAYLOR, Thomas Joseph CASHMAN, Andrew William FITZGIBBON, Toby SHARP, Jamie Daniel Joseph SHOTTON
-
Publication number: 20170147947Abstract: Memory facilitation using directed acyclic graphs is described, for example, where a plurality of directed acyclic graphs are trained for gesture recognition from human skeletal data, or to estimate human body joint positions from depth images for gesture detection. In various examples directed acyclic graphs are grown during training using a training objective which takes into account both connection patterns between nodes and split function parameter values. For example, a layer of child nodes is grown and connected to a parent layer of nodes using an initialization strategy. In examples, various local search processes are used to find good combinations of connection patterns and split function parameters.Type: ApplicationFiled: October 28, 2016Publication date: May 25, 2017Inventors: Jamie Daniel Joseph Shotton, Toby Sharp, Pushmeet Kohli, Reinhard Sebastian Bernhard Nowozin, John Michael Winn, Antonio Criminisi
-
Publication number: 20170116471Abstract: Tracking hand or body pose from image data is described, for example, to control a game system, natural user interface or for augmented reality. In various examples a prediction engine takes a single frame of image data and predicts a distribution over a pose of a hand or body depicted in the image data. In examples, a stochastic optimizer has a pool of candidate poses of the hand or body which it iteratively refines, and samples from the predicted distribution are used to replace some candidate poses in the pool. In some examples a best candidate pose from the pool is selected as the current tracked pose and the selection processes uses a 3D model of the hand or body.Type: ApplicationFiled: January 4, 2017Publication date: April 27, 2017Inventors: Jamie Daniel Joseph Shotton, Cem Keskin, Jonathan Taylor, Toby Sharp, Shahram Izadi, Andrew William Fitzgibbon, Pushmeet Kohli, Duncan Paul Robertson
-
Patent number: 9613298Abstract: Tracking using sensor data is described, for example, where a plurality of machine learning predictors are used to predict a plurality of complementary, or diverse, parameter values of a process describing how the sensor data arises. In various examples a selector selects which of the predicted values are to be used, for example, to control a computing device. In some examples the tracked parameter values are pose of a moving camera or pose of an object moving in the field of view of a static camera; in some examples the tracked parameter values are of a 3D model of a hand or other articulated or deformable entity. The machine learning predictors have been trained in series, with training examples being reweighted after training an individual predictor, to favor training examples on which the set of predictors already trained performs poorly.Type: GrantFiled: June 2, 2014Date of Patent: April 4, 2017Assignee: Microsoft Technology Licensing, LLCInventors: Abner Guzmán-Rivera, Pushmeet Kohli, Benjamin Michael Glocker, Jamie Daniel Joseph Shotton, Shahram Izadi, Toby Sharp, Andrew William Fitzgibbon
-
Patent number: 9557836Abstract: Depth image compression is described for example, to enable body-part centers of players of a game to be detected in real time from depth images or for other applications such as augmented reality, and human-computer interaction. In an embodiment, depth images which have associated body-part probabilities, are compressed using probability mass which is related to the depth of an image element and a probability of a body part for the image element. In various examples, compression of the depth images using probability mass enables body part center detection, by clustering output elements, to be speeded up. In some examples, the scale of the compression is selected according to a depth of a foreground region and in some cases different scales are used for different image regions. In some examples, certainties of the body-part centers are calculated using probability masses of clustered image elements.Type: GrantFiled: November 1, 2011Date of Patent: January 31, 2017Assignee: Microsoft Technology Licensing, LLCInventors: Toby Sharp, Jamie Daniel Joseph Shotton
-
Patent number: 9552070Abstract: Tracking hand or body pose from image data is described, for example, to control a game system, natural user interface or for augmented reality. In various examples a prediction engine takes a single frame of image data and predicts a distribution over a pose of a hand or body depicted in the image data. In examples, a stochastic optimizer has a pool of candidate poses of the hand or body which it iteratively refines, and samples from the predicted distribution are used to replace some candidate poses in the pool. In some examples a best candidate pose from the pool is selected as the current tracked pose and the selection processes uses a 3D model of the hand or body.Type: GrantFiled: September 23, 2014Date of Patent: January 24, 2017Assignee: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph Shotton, Cem Keskin, Jonathan James Taylor, Toby Sharp, Shahram Izadi, Andrew William Fitzgibbon, Pushmeet Kohli, Duncan Paul Robertson
-
Patent number: 9489639Abstract: Memory facilitation using directed acyclic graphs is described, for example, where a plurality of directed acyclic graphs are trained for gesture recognition from human skeletal data, or to estimate human body joint positions from depth images for gesture detection. In various examples directed acyclic graphs are grown during training using a training objective which takes into account both connection patterns between nodes and split function parameter values. For example, a layer of child nodes is grown and connected to a parent layer of nodes using an initialization strategy. In examples, various local search processes are used to find good combinations of connection patterns and split function parameters.Type: GrantFiled: November 13, 2013Date of Patent: November 8, 2016Assignee: Microsoft Technology Licensing, LLCInventors: Jamie Daniel Joseph Shotton, Toby Sharp, Pushmeet Kohli, Reinhard Sebastian Bernhard Nowozin, John Michael Winn, Antonio Criminisi
-
Publication number: 20160127715Abstract: Model fitting from raw time of flight image data is described, for example, to track position and orientation of a human hand or other entity. In various examples, raw image data depicting the entity is received from a time of flight camera. A 3D model of the entity is accessed and used to render, from the 3D model, simulations of raw time of flight image data depicting the entity in a specified pose/shape. The simulated raw image data and at least part of the received raw image data are compared and on the basis of the comparison, parameters of the entity are computed.Type: ApplicationFiled: October 30, 2014Publication date: May 5, 2016Inventors: Jamie Daniel Joseph Shotton, Toby Sharp, Jonathan James Taylor, Pushmeet Kohli, Shahram Izadi, Andrew William Fitzgibbon, Reinhard Sebastian Bernhard Nowozin
-
Publication number: 20160104031Abstract: Region of interest detection in raw time of flight images is described. For example, a computing device receives at least one raw image captured for a single frame by a time of flight camera. The raw image depicts one or more objects in an environment of the time of flight camera (such as human hands, bodies or any other objects). The raw image is input to a trained region detector and in response one or more regions of interest in the raw image are received. A received region of interest comprises image elements of the raw image which are predicted to depict at least part of one of the objects. A depth computation logic computes depth from the one or more regions of interest of the raw image.Type: ApplicationFiled: October 14, 2014Publication date: April 14, 2016Inventors: Jamie Daniel Joseph Shotton, Cem Keskin, Christoph Rhemann, Toby Sharp, Duncan Paul Robertson, Pushmeet Kohli, Andrew William Fitzgibbon, Shahram Izadi