Patents by Inventor Antonio Criminisi

Antonio Criminisi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 8638985
    Abstract: Techniques for human body pose estimation are disclosed herein. Images such as depth images, silhouette images, or volumetric images may be generated and pixels or voxels of the images may be identified. The techniques may process the pixels or voxels to determine a probability that each pixel or voxel is associated with a segment of a body captured in the image or to determine a three-dimensional representation for each pixel or voxel that is associated with a location on a canonical body. These probabilities or three-dimensional representations may then be utilized along with the images to construct a posed model of the body captured in the image.
    Type: Grant
    Filed: March 3, 2011
    Date of Patent: January 28, 2014
    Assignee: Microsoft Corporation
    Inventors: Jamie Daniel Joseph Shotton, Shahram Izadi, Otmar Hilliges, David Kim, David Geoffrey Molyneaux, Matthew Darius Cook, Pushmeet Kohli, Antonio Criminisi, Ross Brook Girshick, Andrew William Fitzgibbon
  • Patent number: 8625897
    Abstract: Foreground and background image segmentation is described. In an example, a seed region is selected in a foreground portion of an image, and a geodesic distance is calculated from each image element to the seed region. A subset of the image elements having a geodesic distance less than a threshold is determined, and this subset of image elements are labeled as foreground. In another example, an image element from an image showing at least a user, a foreground object in proximity to the user, and a background is applied to trained decision trees to obtain probabilities of the image element representing one of these items, and a corresponding classification assigned to the image element. This is repeated for each image element. Image elements classified as belonging to the user are labeled as foreground, and image elements classified as foreground objects or background are labeled as background.
    Type: Grant
    Filed: May 28, 2010
    Date of Patent: January 7, 2014
    Assignee: Microsoft Corporation
    Inventors: Antonio Criminisi, Jamie Daniel Joseph Shotton, Andrew Fitzgibbon, Toby Sharp, Matthew Darius Cook
  • Publication number: 20130346346
    Abstract: Semi-supervised random decision forests for machine learning are described, for example, for interactive image segmentation, medical image analysis, and many other applications. In examples, a random decision forest comprising a plurality of hierarchical data structures is trained using both unlabeled and labeled observations. In examples, a training objective is used which seeks to cluster the observations based on the labels and similarity of the observations. In an example, a transducer assigns labels to the unlabeled observations on the basis of the clusters and certainty information. In an example, an inducer forms a generic clustering function by counting examples of class labels at leaves of the trees in the forest. In an example, an active learning module identifies regions in a feature space from which the observations are drawn using the clusters and certainty information; new observations from the identified regions are used to train the random decision forest.
    Type: Application
    Filed: June 21, 2012
    Publication date: December 26, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Antonio Criminisi, Jamie Daniel Joseph Shotton
  • Publication number: 20130343619
    Abstract: Density estimation and/or manifold learning are described, for example, for computer vision, medical image analysis, text document clustering. In various embodiments a density forest is trained using unlabeled data to estimate the data distribution. In embodiments the density forest comprises a plurality of random decision trees each accumulating portions of the training data into clusters at their leaves. In embodiments probability distributions representing the clusters at each tree are aggregated to form a forest density which is an estimate of a probability density function from which the unlabeled data may be generated. A mapping engine may use the clusters at the leaves of the density forest to estimate a mapping function which maps the unlabeled data to a lower dimensional space whilst preserving relative distances or other relationships between the unlabeled data points. A sampling engine may use the density forest to randomly sample data from the forest density.
    Type: Application
    Filed: June 21, 2012
    Publication date: December 26, 2013
    Applicant: MICROSOFT CORPORATION
    Inventors: Antonio Criminisi, Jamie Daniel Joseph Shotton, Ender Konukoglu
  • Patent number: 8587670
    Abstract: An image capture device is described which is operable in any one of a number of capture modes. The device comprises a camera, a memory and a processor. The memory stores a plurality of sets of capture triggers, with each set of capture triggers being associated with one of the plurality of capture modes. The processor selects one of the plurality of capture modes, such that the device is operable in the selected capture mode. In the selected capture mode, an image is captured automatically when a capture trigger within the associated set of capture triggers is satisfied.
    Type: Grant
    Filed: November 16, 2006
    Date of Patent: November 19, 2013
    Assignee: Microsoft Corporation
    Inventors: Kenneth Wood, Stephen Hodges, Lyndsay Williams, James Srinivasan, Carsten Rother, Antonio Criminisi, John Chiloyan
  • Patent number: 8571263
    Abstract: Predicting joint positions is described, for example, to find joint positions of humans or animals (or parts thereof) in an image to control a computer game or for other applications. In an embodiment image elements of a depth image make joint position votes so that for example, an image element depicting part of a torso may vote for a position of a neck joint, a left knee joint and a right knee joint. A random decision forest may be trained to enable image elements to vote for the positions of one or more joints and the training process may use training images of bodies with specified joint positions. In an example a joint position vote is expressed as a vector representing a distance and a direction of a joint position from an image element making the vote. The random decision forest may be trained using a mixture of objectives.
    Type: Grant
    Filed: March 17, 2011
    Date of Patent: October 29, 2013
    Assignee: Microsoft Corporation
    Inventors: Jamie Daniel Joseph Shotton, Pushmeet Kohli, Ross Brook Girshick, Andrew Fitzgibbon, Antonio Criminisi
  • Patent number: 8558917
    Abstract: A method of transferring images from a first device to a second device and computer program code for performing this method is described. A connection characteristic for a connection between the first & second devices is determined and at least one image is selected from a plurality of images on the first device for transfer dependent upon both the connection characteristic and image selection criteria. The selected image(s) are then transferred over the connection from the first device to the second device.
    Type: Grant
    Filed: November 24, 2006
    Date of Patent: October 15, 2013
    Assignee: Microsoft Corporation
    Inventors: Kenneth Wood, Stephen Hodges, Lyndsay Williams, Mitch Goldberg, Carsten Rother, Antonio Criminisi, James Srinivasan
  • Publication number: 20130223690
    Abstract: Computing high dynamic range photographs is described for example, to enable high ranges of intensities to be represented in a single image. In various embodiments two or more photographs of the same scene taken at different exposure levels are combined in a way which takes into account intensity or other gradients in the images to form a high dynamic range image. In embodiments geodesic distances (which take into account intensity or other image gradients) are computed and used to form weights for a weighted aggregation of the photographs. In some embodiments a user configurable parameter is operable to control a degree of mixing of the photographs as the high dynamic range image is formed.
    Type: Application
    Filed: February 28, 2012
    Publication date: August 29, 2013
    Applicant: MICROSOFT CORPORATION
    Inventor: Antonio Criminisi
  • Patent number: 8514264
    Abstract: Existing remote workspace sharing systems are difficult to use. For example, changes made on a common work product by one user often appear abruptly on displays viewed by remote users. As a result the interaction is perceived as unnatural by the users and is often inefficient. Images of a display of a common work product are received from a camera at a first location. These images may also comprise information about objects between the display and the camera such as a user's hand editing a document on a tablet PC. These images are combined with images of the shared work product and displayed at remote locations. Advance information about remote user actions is then visible and facilitates collaborative mediation between users. Depth information may be used to influence the process of combining the images.
    Type: Grant
    Filed: February 27, 2012
    Date of Patent: August 20, 2013
    Assignee: Microsoft Corporation
    Inventors: Ankur Agarwal, Antonio Criminisi, William A. S. Buxton, Andrew Blake, Andrew William Fitzgibbon
  • Patent number: 8498481
    Abstract: Image segmentation using star-convexity constraints is described. In an example, user input specifies positions of one or more star centers in a foreground to be segmented from a background of an image. In embodiments, an energy function is used to express the problem of segmenting the image and that energy function incorporates a star-convexity constraint which limits the number of possible solutions. For example, the star-convexity constraint may be that, for any point p inside the foreground, all points on a shortest path (which may be geodesic or Euclidean) between the nearest star center and p also lie inside the foreground. In some examples continuous star centers such as lines are used. In embodiments a user may iteratively edit the star centers by adding brush strokes to the image in order to progressively change the star-convexity constraints and obtain an accurate segmentation.
    Type: Grant
    Filed: May 7, 2010
    Date of Patent: July 30, 2013
    Assignee: Microsoft Corporation
    Inventors: Andrew Blake, Varun Gulshan, Carsten Rother, Antonio Criminisi
  • Patent number: 8436852
    Abstract: Image editing which is consistent with geometry of a scene depicted in the image is described. In an embodiment a graphical user interface (GUI) is provided to enable a user to simply and quickly specify four corners of a rectangular frame drawn onto a source image using the GUI. In embodiments, the four corners are used to compute parameters of a virtual camera assumed to capture the image of the drawn frame. Embodiments of an image processing system are described which use the virtual camera parameters to control editing of the source image in ways consistent with the 3D geometry of the scene depicted in that image. In some embodiments out of bounds images are formed and/or realistic-looking shadows are synthesized. In examples, users are able to edit images and the virtual camera parameters are dynamically recomputed and used to update the edited image.
    Type: Grant
    Filed: February 9, 2009
    Date of Patent: May 7, 2013
    Assignee: Microsoft Corporation
    Inventors: Antonio Criminisi, Carsten Rother, Gavin Smyth, Amit Shesh
  • Patent number: 8437570
    Abstract: A method of geodesic image and video processing is proposed. In an embodiment, the method uses a geodesic distance transform to construct an image filter. The filter can be used in a variety of image editing operations such as segmentation, denoising, texture smoothing, image stitching and cartooning. In one embodiment, the method may be made efficient by utilizing parallelism of the algorithm to carry out processing steps on at least two processing cores concurrently. This efficiency may enable high-resolution images and video to be processed at ‘real time’ rates without the need for specialist hardware.
    Type: Grant
    Filed: May 23, 2008
    Date of Patent: May 7, 2013
    Assignee: Microsoft Corporation
    Inventors: Antonio Criminisi, Toby Sharp
  • Patent number: 8351654
    Abstract: Image processing using geodesic forests is described. In an example, a geodesic forest engine determines geodesic shortest-path distances between each image element and a seed region specified in the image in order to form a geodesic forest data structure. The geodesic distances take into account gradients in the image of a given image modality such as intensity, color, or other modality. In some embodiments, a 1D processing engine carries out 1D processing along the branches of trees in the geodesic forest data structure to form a processed image. For example, effects such as ink painting, edge-aware texture flattening, contrast-aware image editing, forming animations using geodesic forests and other effects are achieved using the geodesic forest data structure. In some embodiments the geodesic forest engine uses a four-part raster scan process to achieve real-time processing speeds and parallelization is possible in many of the embodiments.
    Type: Grant
    Filed: April 28, 2009
    Date of Patent: January 8, 2013
    Assignee: Microsoft Corporation
    Inventors: Antonio Criminisi, Toby Sharp
  • Patent number: 8351736
    Abstract: Methods and a processing device are provided for restoring pixels damaged by artifacts caused by dust, or other particles, entering a digital image capturing device. A user interface may be provided for a user to indicate an approximate location of an artifact appearing in a digital image. Dust attenuation may be estimated and an inverse transformation, based on the estimated dust attenuation, may be applied to damaged pixels in order to recover an estimate of the underlying digital image. One or many candidate source patch may be selected based on having smallest pixel distances, with respect to a target patch area. The damaged pixels included in the target patch area may be considered when calculating the pixel distance with respect to candidate source patches. RGB values of corresponding pixels of source patches may be used to restore the damaged pixels included in the target patch area.
    Type: Grant
    Filed: June 2, 2009
    Date of Patent: January 8, 2013
    Assignee: Microsoft Corporation
    Inventors: Denis Demandolx, Eric Paul Bennett, Antonio Criminisi, Vladimir Farbman, Steven James White
  • Publication number: 20120269407
    Abstract: Automatic organ localization is described. In an example, an organ in a medical image is localized using one or more trained regression trees. Each image element of the medical image is applied to the trained regression trees to compute probability distributions that relate to a distance from each image element to the organ. At least a subset of the probability distributions are selected and aggregated to calculate a localization estimate for the organ. In another example, the regression trees are trained using training images having a predefined organ location. At each node of the tree, test parameters are generated that determine which subsequent node each training image element is passed to. This is repeated until each image element reaches a leaf node of the tree. A probability distribution is generated and stored at each leaf node, based on the distance from the leaf node's image elements to the organ.
    Type: Application
    Filed: April 19, 2011
    Publication date: October 25, 2012
    Applicant: MICROSOFT CORPORATION
    Inventors: Antonio CRIMINISI, Jamie Daniel Joseph SHOTTON, Duncan Paul ROBERTSON, Sayan D. PATHAK, Steven James WHITE, Khan Mohammed SIDDIQUI
  • Patent number: 8290295
    Abstract: A system for multi-modal mapping of images is described. Embodiments are described where the image mapping system is used for visualizing high dynamic range images such as medical images, satellite images, high dynamic range photographs and the like and also for compressing such images. In examples, high bit-depth images are tone-mapped for display on equipment of lower bit-depth without loss of detail. In embodiments, the image mapping system computes statistics describing an input image and fits a multi-modal model to those statistics efficiently. In embodiments, the multi-modal model is a Gaussian mixture model and a plurality of sigmoid functions corresponding to the multi-modal model are obtained. In an embodiment the sigmoid functions are added to form a tone-mapping function which is used to transform a high bit-depth image such as 16 or 12 bits per pixel to a low bit-depth image such as 8 bits per pixel.
    Type: Grant
    Filed: March 3, 2009
    Date of Patent: October 16, 2012
    Assignee: Microsoft Corporation
    Inventors: Antonio Criminisi, Evgeny Salnikov, Toby Sharp
  • Publication number: 20120239174
    Abstract: Predicting joint positions is described, for example, to find joint positions of humans or animals (or parts thereof) in an image to control a computer game or for other applications. In an embodiment image elements of a depth image make joint position votes so that for example, an image element depicting part of a torso may vote for a position of a neck joint, a left knee joint and a right knee joint. A random decision forest may be trained to enable image elements to vote for the positions of one or more joints and the training process may use training images of bodies with specified joint positions. In an example a joint position vote is expressed as a vector representing a distance and a direction of a joint position from an image element making the vote. The random decision forest may be trained using a mixture of objectives.
    Type: Application
    Filed: March 17, 2011
    Publication date: September 20, 2012
    Applicant: Microsoft Corporation
    Inventors: Jamie Daniel Joseph Shotton, Pushmeet Kohli, Ross Brook Girshick, Andrew Fitzgibbon, Antonio Criminisi
  • Patent number: 8249397
    Abstract: A method of displaying digital images is described in which a display length indicator is received and digital images are accessed. A set of digital images are selected from the accessed digital images in accordance with the display length indicator and displayed in a predetermined order. The method may be performed by a computer program, which may be embodied on a computer readable medium.
    Type: Grant
    Filed: November 16, 2006
    Date of Patent: August 21, 2012
    Assignee: Microsoft Corporation
    Inventors: Kenneth Wood, Stephen Hodges, Lyndsay Williams, James Srinivasan, Carsten Rother, Antonio Criminisi
  • Publication number: 20120207359
    Abstract: Image registration is described. In an embodiment an image registration system executes automatic registration of images, for example medical images. In an example, semantic information is computed for each of the images to be registered comprising information about the types of objects in the images and the certainty of that information. In an example a mapping is found to register the images which takes into account the intensities of the image elements as well as the semantic information in a manner which is weighted by the certainty of that semantic information. For example, the semantic information is computed by estimating posterior distributions for the locations of anatomical structures by using a regression forest and transforming the posterior distributions into a probability map. In an example the mapping is found as a global point of inflection of an energy function, the energy function having a term related to the semantic information.
    Type: Application
    Filed: February 11, 2011
    Publication date: August 16, 2012
    Applicant: Microsoft Corporation
    Inventors: Ender Konukoglu, Sayan Pathak, Khan Mohammad Siddiqui, Antonio Criminisi, Steven White, Jamie Daniel Joseph Shotton, Duncan Paul Robertson
  • Publication number: 20120166462
    Abstract: The present discussion relates to automated image data processing and visualization. One example can facilitate generating a graphical user-interface (GUI) from image data that includes multiple semantically-labeled user-selectable anatomical structures. This example can receive a user selection of an individual semantically-labeled user-selectable anatomical structure. The example can locate a sub-set of the image data associated with the individual semantically-labeled user-selectable anatomical structure and can cause presentation of the sub-set of the image data on a subsequent GUI.
    Type: Application
    Filed: December 28, 2010
    Publication date: June 28, 2012
    Applicant: Microsoft Corporation
    Inventors: Sayan D. Pathak, Antonio Criminisi, Steven J. White, Liqun Fu, Khan M. Siddiqui, Toby Sharp, Ender Konukoglu, Bryan Dove, Michael T. Gillam