Patents by Inventor Xiaoou Tang
Xiaoou Tang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 8064697Abstract: Systems and methods perform Laplacian Principal Components Analysis (LPCA). In one implementation, an exemplary system receives multidimensional data and reduces dimensionality of the data by locally optimizing a scatter of each local sample of the data. The optimization includes summing weighted distances between low dimensional representations of the data and a mean. The weights of the distances can be determined by a coding length of each local data sample. The system can globally align the locally optimized weighted scatters of the local samples and provide a global projection matrix. The LPCA improves performance of such applications as face recognition and manifold learning.Type: GrantFiled: October 12, 2007Date of Patent: November 22, 2011Assignee: Microsoft CorporationInventors: Deli Zhao, Zhouchen Lin, Xiaoou Tang
-
Patent number: 8045800Abstract: Systems and methods of segmenting images are disclosed herein. The similarity of images in a set of images is compared. A group of images is selected from the set of images. The images in the group of images are selected based on compared similarities among the images. An informative image is selected from the group of images. User-defined semantic information of the informative image is received. The group of images as a graph is modeled as a graph. Each image in the group of images denotes a node in the graph. Edges of the graph denote a foreground relationship between images or a background relationship between images. One or more images in the group of images are automatically segmented by propagating the semantic information of the informative image to images in the group of images having a corresponding graph node that is related to a graph node corresponding to the informative image. Segmentation results can be refined according to user provided image semantics.Type: GrantFiled: February 4, 2008Date of Patent: October 25, 2011Assignee: Microsoft CorporationInventors: Xiaoou Tang, Qiong Yang
-
Patent number: 8026931Abstract: Digital video effects are described. In one aspect, a foreground object in a video stream is identified. The video stream comprises multiple image frames. The foreground object is modified by rendering a 3-dimensional (3-D) visual feature over the foreground object for presentation to a user in a modified video stream. Pose of the foreground object is tracked in 3-D space across respective ones of the image frames to identify when the foreground object changes position in respective ones of the image frames. Based on this pose tracking, aspect ratio of the 3-D visual feature is adaptively modified and rendered over the foreground object in corresponding image frames for presentation to the user in the modified video stream.Type: GrantFiled: August 28, 2006Date of Patent: September 27, 2011Assignee: Microsoft CorporationInventors: Jian Sun, Qiang Wang, Weiwei Zhang, Xiaoou Tang, Heung-Yeung Shum
-
Patent number: 8024152Abstract: Tensor linear Laplacian discrimination for feature extraction is disclosed. One embodiment comprises generating a contextual distance based sample weight and class weight, calculating a within-class scatter using the at least one sample weight and a between-class scatter for multiple classes of data samples in a sample set using the class weight, performing a mode-k matrix unfolding on scatters and generating at least one orthogonal projection matrix.Type: GrantFiled: September 23, 2008Date of Patent: September 20, 2011Assignee: Microsoft CorporationInventors: Wei Zhang, Zhouchen Lin, Xiaoou Tang
-
Patent number: 8014572Abstract: Systems and methods are described for a face annotation framework with partial clustering and interactive labeling. In one implementation, an exemplary system automatically groups some images of a collection of images into clusters, each cluster mainly including images that contain a person's face associated with that cluster. After an initial user-labeling of each cluster with the person's name or other label, in which the user may also delete/label images that do not belong in the cluster, the system iteratively proposes subsequent clusters for the user to label, proposing clusters of images that when labeled, produce a maximum information gain at each iteration and minimize the total number of user interactions for labeling the entire collection of images.Type: GrantFiled: June 8, 2007Date of Patent: September 6, 2011Assignee: Microsoft CorporationInventors: Rong Xiao, Fang Wen, Xiaoou Tang
-
Publication number: 20110206276Abstract: This disclosure describes an integrated framework for class-unsupervised object segmentation. The class-unsupervised object segmentation occurs by integrating top-down constraints and bottom-up constraints on object shapes using an algorithm in an integrated manner. The algorithm describes a relationship among object parts and superpixels. This process forms object shapes with object parts and oversegments pixel images into the superpixels, with the algorithm in conjunction with the constraints. This disclosure describes computing a mask map from a hybrid graph, segmenting the image into a foreground object and a background, and displaying the foreground object from the background.Type: ApplicationFiled: May 4, 2011Publication date: August 25, 2011Applicant: Microsoft CorporationInventors: Zhouchen Lin, Guangcan Liu, Xiaoou Tang
-
Patent number: 8000533Abstract: Systems and methods for space-time video montage are described. In one aspect, one or more arbitrary space-time volumes representing informative video portion(s) of at least one input video data sequence are identified. A video summary representing a montage of the at least one input video data sequence is generated for presentation to user from the one or more arbitrary space-time volumes.Type: GrantFiled: November 14, 2006Date of Patent: August 16, 2011Assignee: Microsoft CorporationInventors: Yasuyuki Matsushita, Hong-Wen Kang, Xiaoou Tang
-
Patent number: 7995841Abstract: This disclosure describes an integrated framework for class-unsupervised object segmentation. The class-unsupervised object segmentation occurs by integrating top-down constraints and bottom-up constraints on object shapes using an algorithm in an integrated manner. The algorithm describes a relationship among object parts and superpixels. This process forms object shapes with object parts and oversegments pixel images into the superpixels, with the algorithm in conjunction with the constraints. This disclosure describes computing a mask map from a hybrid graph, segmenting the image into a foreground object and a background, and displaying the foreground object from the background.Type: GrantFiled: September 24, 2007Date of Patent: August 9, 2011Assignee: Microsoft CorporationInventors: Zhouchen Lin, Guangcan Liu, Xiaoou Tang
-
Patent number: 7996343Abstract: Described is using semi-Riemannian geometry in supervised learning to learn a discriminant subspace for classification, e.g., labeled samples are used to learn the geometry of a semi-Riemannian submanifold. For a given sample, the K nearest classes of that sample are determined, along with the nearest samples that are in other classes, and the nearest samples in that sample's same class. The distances between these samples are computed, and used in computing a metric matrix. The metric matrix is used to compute a projection matrix that corresponds to the discriminant subspace. In online classification, as a new sample is received, it is projected into a feature space by use of the projection matrix and classified accordingly.Type: GrantFiled: September 30, 2008Date of Patent: August 9, 2011Assignee: Microsoft CorporationInventors: Deli Zhao, Zhouchen Lin, Xiaoou Tang
-
Patent number: 7991230Abstract: Exemplary systems and methods use micro-structure modeling of an image for extracting image features. The micro-structure in an image is modeled as a Markov Random Field, and the model parameters are learned from training images. Micro-patterns adaptively designed from the modeled micro-structure capture spatial contexts of the image. In one implementation, a series of micro-patterns based on the modeled micro-structure can be automatically designed for each block of the image, providing improved feature extraction and recognition because of adaptability to various images, various pixel attributes, and various sites within an image.Type: GrantFiled: August 22, 2006Date of Patent: August 2, 2011Assignee: Microsoft CorporationInventors: Qiong Yang, Dian Gong, Xiaoou Tang
-
Patent number: 7970727Abstract: A method for modeling data affinities and data structures. In one implementation, a contextual distance may be calculated between a selected data point in a data sample and a data point in a contextual set of the selected data point. The contextual set may include the selected data point and one or more data points in the neighborhood of the selected data point. The contextual distance may be the difference between the selected data point's contribution to the integrity of the geometric structure of the contextual set and the data point's contribution to the integrity of the geometric structure of the contextual set. The process may be repeated for each data point in the contextual set of the selected data point. The process may be repeated for each selected data point in the data sample. A digraph may be created using a plurality of contextual distances generated by the process.Type: GrantFiled: February 18, 2008Date of Patent: June 28, 2011Assignee: Microsoft CorporationInventors: Deli Zhao, Zhouchen Lin, Xiaoou Tang
-
Patent number: 7949621Abstract: An efficient, effective and at times superior object detection and/or recognition (ODR) function may be built from a set of Bayesian stumps. Bayesian stumps may be constructed for each feature and object class, and the ODR function may be constructed from the subset of Bayesian stumps that minimize Bayesian error for a particular object class. That is, Bayesian error may be utilized as a feature selection measure for the ODR function. Furthermore, Bayesian stumps may be efficiently implemented as lookup tables with entries corresponding to unequal intervals of feature histograms. Interval widths and entry values may be determined so as to minimize Bayesian error, yielding Bayesian stumps that are optimal in this respect.Type: GrantFiled: October 12, 2007Date of Patent: May 24, 2011Assignee: Microsoft CorporationInventors: Rong Xiao, Xiaoou Tang
-
Patent number: 7940985Abstract: Methods for detecting a salient object in an input image are described. For this, the salient object in an image may be defined using a set of local, regional, and global features including multi-scale contrast, center-surround histogram, and color spatial distribution. These features are optimally combined through conditional random field learning. The learned conditional random field is then used to locate the salient object in the image. The methods can also use image segmentation, where the salient object is separated from the image background.Type: GrantFiled: June 6, 2007Date of Patent: May 10, 2011Assignee: Microsoft CorporationInventors: Jian Sun, Tie Liu, Xiaoou Tang, Heung-Yeung Shum
-
Patent number: 7860347Abstract: A search includes comparing a query image provided by a user to a plurality of stored images of faces stored in a stored image database, and determining a similarity of the query image to the plurality of stored images. One or more resultant images of faces, selected from among the stored images, are displayed to the user based on the determined similarity of the stored images to the query image provided by the user. The resultant images are displayed based at least in part on one or more facial features.Type: GrantFiled: February 3, 2010Date of Patent: December 28, 2010Assignee: Microsoft CorporationInventors: Xiaoou Tang, Qiong Yang, David P. Vronay, Leizhong Zhang, Ta Bao
-
Publication number: 20100303367Abstract: A Poisson-quantization noise model for modeling noise in low-light conditions is described. In one aspect, image information is received. A Poisson-quantization noise model is then generated from a Poisson noise model and a quantization noise model. Poisson-quantization noise is then estimated in the image information using the Poisson-quantization noise model.Type: ApplicationFiled: May 3, 2010Publication date: December 2, 2010Applicant: Microsoft CorporationInventors: Yasuyuki Matsushita, Xiaoou Tang, Francois Alter
-
Patent number: 7817822Abstract: The present video tracking technique outputs a Maximum A Posterior (MAP) solution for a target object based on two object templates obtained from a start and an end keyframe of a whole state sequence. The technique first minimizes the whole state space of the sequence by generating a sparse set of local two-dimensional modes in each frame of the sequence. The two-dimensional modes are converted into three-dimensional points within a three-dimensional volume. The three-dimensional points are clustered using a spectral clustering technique where each cluster corresponds to a possible trajectory segment of the target object. If there is occlusion in the sequence, occlusion segments are generated so that an optimal trajectory of the target object can be obtained.Type: GrantFiled: April 27, 2006Date of Patent: October 19, 2010Assignee: Microsoft CorporationInventors: Jian Sun, Weiwei Zhang, Xiaoou Tang, Heung-Yeung Shum
-
Patent number: 7808532Abstract: A flash-based strategy is used to separate foreground information from background information within image information. In this strategy, a first image is taken without the use of flash. A second image is taken of the same subject matter with the use of flash. The foreground information in the flash image is illuminated by the flash to a much greater extent than the background information. Based on this property, the strategy applies processing to extract the foreground information from the background information. The strategy supplements the flash information by also taking into consideration motion information and color information.Type: GrantFiled: May 29, 2007Date of Patent: October 5, 2010Assignee: Microsoft CorporationInventors: Jian Sun, Jian Sun, Sing Bing Kang, Xiaoou Tang, Heung-Yeung Shum
-
Patent number: 7755619Abstract: Systems and methods perform automatic 3D face modeling. In one implementation, a brief video clip of a user's head turning from front to side provides enough input for automatically achieving a model that includes 2D feature matches, 3D head pose, 3D face shape, and facial textures. The video clip of the user may be of poor quality. In a two layer iterative method, the video clip is divided into segments. Flow-based feature estimation and model-based feature refinement are applied recursively to each segment. Then the feature estimation and refinement are iteratively applied across all the segments. The entire modeling method is automatic and the two layer iterative method provides speed and efficiency, especially when sparse bundle adjustment is applied to boost efficiency.Type: GrantFiled: August 17, 2006Date of Patent: July 13, 2010Assignee: Microsoft CorporationInventors: Qiang Wang, Heung-Yeung Shum, Xiaoou Tang
-
Publication number: 20100135584Abstract: A search includes comparing a query image provided by a user to a plurality of stored images of faces stored in a stored image database, and determining a similarity of the query image to the plurality of stored images. One or more resultant images of faces, selected from among the stored images, are displayed to the user based on the determined similarity of the stored images to the query image provided by the user. The resultant images are displayed based at least in part on one or more facial features.Type: ApplicationFiled: February 3, 2010Publication date: June 3, 2010Applicant: Microsoft CorporationInventors: Xiaoou Tang, Qiong Yang, Leizhong Zhang, Ta Bao, David P. Vronay
-
Patent number: 7720283Abstract: Exemplary systems and methods segment a foreground from a background image in a video sequence. In one implementation, a system refines a segmentation boundary between the foreground and the background image by attenuating background contrast while preserving contrast of the segmentation boundary itself, providing an accurate background cut of live video in real time. A substitute background may then be merged with the segmented foreground within the live video. The system can apply an adaptive background color mixture model to improve segmentation of foreground from background under various background changes, such as camera movement, illumination change, and movement of small objects in the background.Type: GrantFiled: August 31, 2006Date of Patent: May 18, 2010Assignee: Microsoft CorporationInventors: Jian Sun, Heung-Yeung Shum, Xiaoou Tang, Weiwei Zhang