Patents by Inventor Pushmeet Kohli

Pushmeet Kohli has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Tracking using sensor data

Patent number: 9613298

Abstract: Tracking using sensor data is described, for example, where a plurality of machine learning predictors are used to predict a plurality of complementary, or diverse, parameter values of a process describing how the sensor data arises. In various examples a selector selects which of the predicted values are to be used, for example, to control a computing device. In some examples the tracked parameter values are pose of a moving camera or pose of an object moving in the field of view of a static camera; in some examples the tracked parameter values are of a 3D model of a hand or other articulated or deformable entity. The machine learning predictors have been trained in series, with training examples being reweighted after training an individual predictor, to favor training examples on which the set of predictors already trained performs poorly.

Type: Grant

Filed: June 2, 2014

Date of Patent: April 4, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Abner Guzmán-Rivera, Pushmeet Kohli, Benjamin Michael Glocker, Jamie Daniel Joseph Shotton, Shahram Izadi, Toby Sharp, Andrew William Fitzgibbon
Tracking hand/body pose

Patent number: 9552070

Abstract: Tracking hand or body pose from image data is described, for example, to control a game system, natural user interface or for augmented reality. In various examples a prediction engine takes a single frame of image data and predicts a distribution over a pose of a hand or body depicted in the image data. In examples, a stochastic optimizer has a pool of candidate poses of the hand or body which it iteratively refines, and samples from the predicted distribution are used to replace some candidate poses in the pool. In some examples a best candidate pose from the pool is selected as the current tracked pose and the selection processes uses a 3D model of the hand or body.

Type: Grant

Filed: September 23, 2014

Date of Patent: January 24, 2017

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jamie Daniel Joseph Shotton, Cem Keskin, Jonathan James Taylor, Toby Sharp, Shahram Izadi, Andrew William Fitzgibbon, Pushmeet Kohli, Duncan Paul Robertson
GESTURE RECOGNITION TECHNIQUES

Publication number: 20160370867

Abstract: In one or more implementations, a static geometry model is generated, from one or more images of a physical environment captured using a camera, using one or more static objects to model corresponding one or more objects in the physical environment. Interaction of a dynamic object with at least one of the static objects is identified by analyzing at least one image and a gesture is recognized from the identified interaction of the dynamic object with the at least one of the static objects to initiate an operation of the computing device.

Type: Application

Filed: May 24, 2016

Publication date: December 22, 2016

Applicant: Microsoft Technology Licensing, LLC

Inventors: David Kim, Otmar D. Hilliges, Shahram Izadi, Patrick L. Olivier, Jamie Daniel Joseph Shotton, Pushmeet Kohli, David G. Molyneaux, Stephen E. Hodges, Andrew W. Fitzgibbon
Memory facilitation using directed acyclic graphs

Patent number: 9489639

Abstract: Memory facilitation using directed acyclic graphs is described, for example, where a plurality of directed acyclic graphs are trained for gesture recognition from human skeletal data, or to estimate human body joint positions from depth images for gesture detection. In various examples directed acyclic graphs are grown during training using a training objective which takes into account both connection patterns between nodes and split function parameter values. For example, a layer of child nodes is grown and connected to a parent layer of nodes using an initialization strategy. In examples, various local search processes are used to find good combinations of connection patterns and split function parameters.

Type: Grant

Filed: November 13, 2013

Date of Patent: November 8, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Jamie Daniel Joseph Shotton, Toby Sharp, Pushmeet Kohli, Reinhard Sebastian Bernhard Nowozin, John Michael Winn, Antonio Criminisi
WEBSITE ACCESS CONTROL

Publication number: 20160205109

Abstract: A method of controlling access to content such as web sites on the intranet or interne is described. For example, the method comprises receiving an address of content to be accessed and obtaining similarity of the address to previously labeled addresses of other content items. The similarity is based on co-occurrence of addresses of content items in records of browsing sessions from many consenting users. For example, a browsing session record comprises addresses of content items accessed by a user in a time period during which the user is actively accessing content. A co-occurrence of addresses of content items is the existence of the addresses in the same browsing session record. Access to the content is then controlled on the basis of the similarity.

Type: Application

Filed: January 13, 2015

Publication date: July 14, 2016

Inventors: Pushmeet Kohli, Yoram Bachrach, Filip Radlinski, Ulrich Paquet, Li Quan Khoo
Depth sensing using an infrared camera

Patent number: 9380224

Abstract: A method of sensing depth using an infrared camera. In an example method, an infrared image of a scene is received from an infrared camera. The infrared image is applied to a trained machine learning component which uses the intensity of image elements to assign all or some of the image elements a depth value which represents the distance between the surface depicted by the image element and the infrared camera. In various examples, the machine line component comprises one or more random decision forests.

Type: Grant

Filed: February 28, 2014

Date of Patent: June 28, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Cem Keskin, Sean Ryan Francesco Fanello, Shahram Izadi, Pushmeet Kohli, David Kim, David Sweeney, Jamie Daniel Joseph Shotton, Duncan Paul Robertson, Sing Bing Kang
Gesture recognition techniques

Patent number: 9372544

Abstract: In one or more implementations, a static geometry model is generated, from one or more images of a physical environment captured using a camera, using one or more static objects to model corresponding one or more objects in the physical environment. Interaction of a dynamic object with at least one of the static objects is identified by analyzing at least one image and a gesture is recognized from the identified interaction of the dynamic object with the at least one of the static objects to initiate an operation of the computing device.

Type: Grant

Filed: May 16, 2014

Date of Patent: June 21, 2016

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: David Kim, Otmar D. Hilliges, Shahram Izadi, Patrick L. Olivier, Jamie Daniel Joseph Shotton, Pushmeet Kohli, David G. Molyneaux, Stephen E. Hodges, Andrew W. Fitzgibbon
REDUCING INTERFERENCE BETWEEN MULTIPLE INFRA-RED DEPTH CAMERAS

Publication number: 20160163054

Abstract: Systems and methods for reducing interference between multiple infra-red depth cameras are described. In an embodiment, the system comprises multiple infra-red sources, each of which projects a structured light pattern into the environment. A controller is used to control the sources in order to reduce the interference caused by overlapping light patterns. Various methods are described including: cycling between the different sources, where the cycle used may be fixed or may change dynamically based on the scene detected using the cameras; setting the wavelength of each source so that overlapping patterns are at different wavelengths; moving source-camera pairs in independent motion patterns; and adjusting the shape of the projected light patterns to minimize overlap. These methods may also be combined in any way. In another embodiment, the system comprises a single source and a mirror system is used to cast the projected structured light pattern around the environment.

Type: Application

Filed: January 20, 2016

Publication date: June 9, 2016

Inventors: Shahram Izadi, David Molyneaux, Otmar Hilliges, David Kim, Jamie Daniel Joseph Shotton, Stephen Edward Hodges, David Alexander Butler, Andrew Fitzgibbon, Pushmeet Kohli
MODEL FITTING FROM RAW TIME-OF-FLIGHT IMAGES

Publication number: 20160127715

Abstract: Model fitting from raw time of flight image data is described, for example, to track position and orientation of a human hand or other entity. In various examples, raw image data depicting the entity is received from a time of flight camera. A 3D model of the entity is accessed and used to render, from the 3D model, simulations of raw time of flight image data depicting the entity in a specified pose/shape. The simulated raw image data and at least part of the received raw image data are compared and on the basis of the comparison, parameters of the entity are computed.

Type: Application

Filed: October 30, 2014

Publication date: May 5, 2016

Inventors: Jamie Daniel Joseph Shotton, Toby Sharp, Jonathan James Taylor, Pushmeet Kohli, Shahram Izadi, Andrew William Fitzgibbon, Reinhard Sebastian Bernhard Nowozin
DEPTH FROM TIME OF FLIGHT CAMERA

Publication number: 20160104031

Abstract: Region of interest detection in raw time of flight images is described. For example, a computing device receives at least one raw image captured for a single frame by a time of flight camera. The raw image depicts one or more objects in an environment of the time of flight camera (such as human hands, bodies or any other objects). The raw image is input to a trained region detector and in response one or more regions of interest in the raw image are received. A received region of interest comprises image elements of the raw image which are predicted to depict at least part of one of the objects. A depth computation logic computes depth from the one or more regions of interest of the raw image.

Type: Application

Filed: October 14, 2014

Publication date: April 14, 2016

Inventors: Jamie Daniel Joseph Shotton, Cem Keskin, Christoph Rhemann, Toby Sharp, Duncan Paul Robertson, Pushmeet Kohli, Andrew William Fitzgibbon, Shahram Izadi
INFERENCE ENGINE FOR EFFICIENT MACHINE LEARNING

Publication number: 20160104070

Abstract: An inference engine is described for efficient machine learning. For example, an inference engine executes a plurality of ordered steps to carry out inference on the basis of observed data. For each step, a plurality of inputs to the step are received. A predictor predicts an output of the step and computes uncertainty of the prediction. Either the predicted output or a known output is selected on the basis of the uncertainty. If the known output is selected, the known output is computed, (for example, using a resource intensive, accurate process). The predictor is retrained using the known output and the plurality of inputs of the step as training data. For example, computing the prediction is fast and efficient as compared with computing the known output.

Type: Application

Filed: October 14, 2014

Publication date: April 14, 2016

Inventors: Seyed Mohammadali Eslami, Daniel Stefan Tarlow, Pushmeet Kohli, John Winn
TRACKING HAND/BODY POSE

Publication number: 20160085310

Abstract: Tracking hand or body pose from image data is described, for example, to control a game system, natural user interface or for augmented reality. In various examples a prediction engine takes a single frame of image data and predicts a distribution over a pose of a hand or body depicted in the image data. In examples, a stochastic optimizer has a pool of candidate poses of the hand or body which it iteratively refines, and samples from the predicted distribution are used to replace some candidate poses in the pool. In some examples a best candidate pose from the pool is selected as the current tracked pose and the selection processes uses a 3D model of the hand or body.

Type: Application

Filed: September 23, 2014

Publication date: March 24, 2016

Inventors: Jamie Daniel Joseph Shotton, Cem Keskin, Jonathan James Taylor, Toby Sharp, Shahram Izadi, Andrew William Fitzgibbon, Pushmeet Kohli, Duncan Paul Robertson
Adaptive Task Assignment

Publication number: 20160034840

Abstract: Crowdsourcing using active learning is described, for example, to select pairs of tasks and groups of workers so that information gained about answers to the tasks in the pool is optimized. In various examples a machine learning system learns variables describing characteristics of communities of workers, characteristics of workers, task variables and uncertainty of these variables. In various examples, the machine learning system predicts task variables and uncertainty of the predicted task variables for possible combinations of communities of workers and tasks. In examples the predicted variables and uncertainty are used to calculate expected information gain of the possible combinations and to rank the possible combinations. In examples, the crowdsourcing system uses the expected information gain to allocate tasks to worker communities and observe the results; the results may then be used to update the machine learning system.

Type: Application

Filed: July 31, 2014

Publication date: February 4, 2016

Inventors: Matteo Venanzi, John Philip Guiver, Pushmeet Kohli
Real-time camera tracking using depth maps

Patent number: 9242171

Abstract: Real-time camera tracking using depth maps is described. In an embodiment depth map frames are captured by a mobile depth camera at over 20 frames per second and used to dynamically update in real-time a set of registration parameters which specify how the mobile depth camera has moved. In examples the real-time camera tracking output is used for computer game applications and robotics. In an example, an iterative closest point process is used with projective data association and a point-to-plane error metric in order to compute the updated registration parameters. In an example, a graphics processing unit (GPU) implementation is used to optimize the error metric in real-time. In some embodiments, a dense 3D model of the mobile camera environment is used.

Type: Grant

Filed: February 23, 2013

Date of Patent: January 26, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Richard Newcombe, Shahram Izadi, David Molyneaux, Otmar Hilliges, David Kim, Jamie Daniel Joseph Shotton, Pushmeet Kohli, Andrew Fitzgibbon, Stephen Edward Hodges, David Alexander Butler
Reducing interference between multiple infra-red depth cameras

Patent number: 9247238

Abstract: Systems and methods for reducing interference between multiple infra-red depth cameras are described. In an embodiment, the system comprises multiple infra-red sources, each of which projects a structured light pattern into the environment. A controller is used to control the sources in order to reduce the interference caused by overlapping light patterns. Various methods are described including: cycling between the different sources, where the cycle used may be fixed or may change dynamically based on the scene detected using the cameras; setting the wavelength of each source so that overlapping patterns are at different wavelengths; moving source-camera pairs in independent motion patterns; and adjusting the shape of the projected light patterns to minimize overlap. These methods may also be combined in any way. In another embodiment, the system comprises a single source and a mirror system is used to cast the projected structured light pattern around the environment.

Type: Grant

Filed: January 31, 2011

Date of Patent: January 26, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Shahram Izadi, David Molyneaux, Otmar Hilliges, David Kim, Jamie Daniel Joseph Shotton, Stephen Edward Hodges, David Alexander Butler, Andrew Fitzgibbon, Pushmeet Kohli
CONTOUR COMPLETION FOR AUGMENTING SURFACE RECONSTRUCTIONS

Publication number: 20160019711

Abstract: Surface reconstruction contour completion embodiments are described which provide dense reconstruction of a scene from images captured from one or more viewpoints. Both a room layout and the full extent of partially occluded objects in a room can be inferred using a Contour Completion Random Field model to augment a reconstruction volume. The augmented reconstruction volume can then be used by any surface reconstruction pipeline to show previously occluded objects and surfaces.

Type: Application

Filed: September 28, 2015

Publication date: January 21, 2016

Inventors: Lior Shapira, Ran Gal, Eyal Ofek, Pushmeet Kohli, Nathan Silberman
LAYOUT DESIGN USING LOCALLY SATISFIABLE PROPOSALS

Publication number: 20150356774

Abstract: A “Layout Optimizer” provides various real-time iterative constraint-satisfaction methodologies that use constraint-based frameworks to generate optimized layouts that map or embed virtual objects into environments. The term environment refers to combinations of environmental characteristics, including, but not limited to, 2D or 3D scene geometry or layout, scene colors, patterns, and/or textures, scene illumination, scene heat sources, fixed or moving people, objects or fluids, etc., any of which may evolve or change over time. A set of parameters are specified or selected for each object. Further, the environmental characteristics are determined automatically or specified by users. Relationships between objects and/or the environment derived from constraints associated with objects and the environment are then used to iteratively determine optimized self-consistent and scene-consistent object layouts.

Type: Application

Filed: June 9, 2014

Publication date: December 10, 2015

Inventors: Ran Gal, Pushmeet Kohli, Eyal Ofek, Lior Shapira
TRACKING USING SENSOR DATA

Publication number: 20150347846

Abstract: Tracking using sensor data is described, for example, where a plurality of machine learning predictors are used to predict a plurality of complementary, or diverse, parameter values of a process describing how the sensor data arises. In various examples a selector selects which of the predicted values are to be used, for example, to control a computing device. In some examples the tracked parameter values are pose of a moving camera or pose of an object moving in the field of view of a static camera; in some examples the tracked parameter values are of a 3D model of a hand or other articulated or deformable entity. The machine learning predictors have been trained in series, with training examples being reweighted after training an individual predictor, to favour training examples on which the set of predictors already trained performs poorly.

Type: Application

Filed: June 2, 2014

Publication date: December 3, 2015

Applicant: Microsoft Corporation

Inventors: Abner GUZMÁN-RIVERA, Pushmeet KOHLI, Benjamin Michael GLOCKER, Jamie Daniel Joseph SHOTTON, Shahram IZADI, Toby SHARP, Andrew William FITZGIBBON
Contour completion for augmenting surface reconstructions

Patent number: 9171403

Abstract: Surface reconstruction contour completion embodiments are described which provide dense reconstruction of a scene from images captured from one or more viewpoints. Both a room layout and the full extent of partially occluded objects in a room can be inferred using a Contour Completion Random Field model to augment a reconstruction volume. The augmented reconstruction volume can then be used by any surface reconstruction pipeline to show previously occluded objects and surfaces.

Type: Grant

Filed: February 13, 2014

Date of Patent: October 27, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Lior Shapira, Ran Gal, Eyal Ofek, Pushmeet Kohli, Nathan Silberman
NON-GREEDY MACHINE LEARNING FOR HIGH ACCURACY

Publication number: 20150302317

Abstract: Non-greedy machine learning for high accuracy is described, for example, where one or more random decision trees are trained for gesture recognition in order to control a computing-based device. In various examples, a random decision tree or directed acyclic graph (DAG) is grown using a greedy process and is then post-processed to recalculate, in a non-greedy process, leaf node parameters and split function parameters of internal nodes of the graph. In various examples the very large number of options to be assessed by the non-greedy process is reduced by using a constrained objective function. In examples the constrained objective function takes into account a binary code denoting decisions at split nodes of the tree or DAG. In examples, resulting trained decision trees are more compact and have improved generalization and accuracy.

Type: Application

Filed: April 22, 2014

Publication date: October 22, 2015

Applicant: Microsoft Corporation

Inventors: Mohammad Norouzi, Pushmeet Kohli

prev 1 2 3 4 5 6 next