Patents by Inventor Xiaoou Tang

Xiaoou Tang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20100121792
    Abstract: Directed graph embedding is described. In one implementation, a system explores the link structure of a directed graph and embeds the vertices of the directed graph into a vector space while preserving affinities that are present among vertices of the directed graph. Such an embedded vector space facilitates general data analysis of the information in the directed graph. Optimal embedding can be achieved by measuring local affinities among vertices via transition probabilities between the vertices, based on a stationary distribution of Markov random walks through the directed graph. For classifying linked web pages represented by a directed graph, the system can train a support vector machine (SVM) classifier, which can operate in a user-selectable number of dimensions.
    Type: Application
    Filed: January 7, 2008
    Publication date: May 13, 2010
    Inventors: Qiong Yang, Mo Chen, Xiaoou Tang
  • Patent number: 7711047
    Abstract: A Poisson-quantization noise model for modeling noise in low-light conditions is described. In one aspect, image information is received. A Poisson-quantization noise model is then generated from a Poisson noise model and a quantization noise model. Poisson-quantization noise is then estimated in the image information using the Poisson-quantization noise model.
    Type: Grant
    Filed: December 21, 2005
    Date of Patent: May 4, 2010
    Assignee: Microsoft Corporation
    Inventors: Yasuyuki Matsushita, Xiaoou Tang, Francois Alter
  • Patent number: 7692647
    Abstract: Real-time rendering of realistic rain is described. In one aspect, image samples of real rain and associated information are automatically modeled in real-time to generate synthetic rain particles in view of respective scene radiances of target video content frames. The synthetic rain particles are rendered in real-time using pre-computed radiance transfer with uniform random distribution across respective frames of the target video content.
    Type: Grant
    Filed: September 14, 2006
    Date of Patent: April 6, 2010
    Assignee: Microsoft Corporation
    Inventors: Zhouchen Lin, Lifeng Wang, Tian Fang, Xu Yang, Xuan Yu, Jian Wang, Xiaoou Tang
  • Publication number: 20100080450
    Abstract: Described is using semi-Riemannian geometry in supervised learning to learn a discriminant subspace for classification, e.g., labeled samples are used to learn the geometry of a semi-Riemannian submanifold. For a given sample, the K nearest classes of that sample are determined, along with the nearest samples that are in other classes, and the nearest samples in that sample's same class. The distances between these samples are computed, and used in computing a metric matrix. The metric matrix is used to compute a projection matrix that corresponds to the discriminant subspace. In online classification, as a new sample is received, it is projected into a feature space by use of the projection matrix and classified accordingly.
    Type: Application
    Filed: September 30, 2008
    Publication date: April 1, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Deli Zhao, Zhouchen Lin, Xiaoou Tang
  • Publication number: 20100076723
    Abstract: Tensor linear Laplacian discrimination for feature extraction is disclosed. One embodiment comprises generating a contextual distance based sample weight and class weight, calculating a within-class scatter using the at least one sample weight and a between-class scatter for multiple classes of data samples in a sample set using the class weight, performing a mode-k matrix unfolding on scatters and generating at least one orthogonal projection matrix.
    Type: Application
    Filed: September 23, 2008
    Publication date: March 25, 2010
    Applicant: Microsoft Corporation
    Inventors: Wei Zhang, Zhouchen Lin, Xiaoou Tang
  • Patent number: 7684651
    Abstract: A search includes comparing a query image provided by a user to a plurality of stored images of faces stored in a stored image database, and determining a similarity of the query image to the plurality of stored images. One or more resultant images of faces, selected from among the stored images, are displayed to the user based on the determined similarity of the stored images to the query image provided by the user. The resultant images are displayed based at least in part on one or more facial features.
    Type: Grant
    Filed: August 23, 2006
    Date of Patent: March 23, 2010
    Assignee: Microsoft Corporation
    Inventors: Xiaoou Tang, Qiong Yang, Leizhong Zhang, Ta Bao, David P. Vronay
  • Publication number: 20100067799
    Abstract: A “globally invariant Radon feature transform,” or “GIRFT,” generates feature descriptors that are both globally affine invariant and illumination invariant. These feature descriptors effectively handle intra-class variations resulting from geometric transformations and illumination changes to provide robust texture classification. In general, GIRFT considers images globally to extract global features that are less sensitive to large variations of material in local regions. Geometric affine transformation invariance and illumination invariance is achieved by converting original pixel represented images into Radon-pixel images by using a Radon Transform. Canonical projection of the Radon-pixel image into a quotient space is then performed using Radon-pixel pairs to produce affine invariant feature descriptors. Illumination invariance of the resulting feature descriptors is then achieved by defining an illumination invariant distance metric on the feature space of each feature descriptor.
    Type: Application
    Filed: September 17, 2008
    Publication date: March 18, 2010
    Applicant: MICROSOFT CORPORATION
    Inventors: Guangcan Liu, Zhouchen Lin, Xiaoou Tang
  • Patent number: 7668346
    Abstract: Methods and systems are provided for selecting features that will be used to recognize faces. Three-dimensional models are used to synthesize a database of virtual face images. The virtual face images cover wide appearance variations, different poses, different lighting conditions and expression changes. A joint boosting algorithm is used to identify discriminative features by selecting features from the plurality of virtual images such that the identified discriminative features are independent of the other images included in the database.
    Type: Grant
    Filed: March 21, 2006
    Date of Patent: February 23, 2010
    Assignee: Microsoft Corporation
    Inventors: Rong Xiao, Xiaoou Tang
  • Patent number: 7646894
    Abstract: A Bayesian competitive model integrated with a generative classifier for unspecific person verification is described. In one aspect, a competitive measure for verification of an unspecific person is calculated using a discriminative classifier. The discriminative classifier is based on a Bayesian competitive model that is adaptable to unknown new classes. The Bayesian competitive model is integrated with a generative verification in view of a set of confidence criteria to make a decision regarding verification of the unspecific person.
    Type: Grant
    Filed: February 14, 2006
    Date of Patent: January 12, 2010
    Assignee: Microsoft Corporation
    Inventors: Qiong Yang, Xiaoou Tang
  • Patent number: 7636098
    Abstract: Salience-preserving image fusion is described. In one aspect, multi-channel images are fused into a single image. The fusing operations are based on importance-weighted gradients. The importance weighted gradients are measured using respective salience maps for each channel in the multi-channel images.
    Type: Grant
    Filed: September 28, 2006
    Date of Patent: December 22, 2009
    Assignee: Microsoft Corporation
    Inventors: Qiong Yang, Chao Wang, Xiaoou Tang, Zhongfu Ye
  • Publication number: 20090313239
    Abstract: Described is a technology in which images initially ranked by some relevance estimate (e.g., according to text-based similarities) are re-ranked according to visual similarity with a user-selected image. A user-selected image is received and classified into an intention class, such as a scenery class, portrait class, and so forth. The intention class is used to determine how visual features of other images compare with visual features of the user-selected image. For example, the comparing operation may use different feature weighting depending on which intention class was determined for the user-selected image. The other images are re-ranked based upon their computed similarity to the user-selected image, and returned as query results. Retuning of the feature weights using actual user-provided relevance feedback is also described.
    Type: Application
    Filed: June 16, 2008
    Publication date: December 17, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Fang Wen, Xiaoou Tang
  • Publication number: 20090297046
    Abstract: An exemplary method for extracting discriminant feature of samples includes providing data for samples in a multidimensional space; based on the data, computing local similarities for the samples; mapping the local similarities to weights; based on the mapping, formulating an inter-class scatter matrix and an intra-class scatter matrix; and based on the matrices, maximizing the ratio of inter-class scatter to intra-class scatter for the samples to provide discriminate features of the samples. Such a method may be used for classifying samples, recognizing patterns, or other tasks. Various other methods, devices, system, etc., are also disclosed.
    Type: Application
    Filed: May 29, 2008
    Publication date: December 3, 2009
    Applicant: Microsoft Corporation
    Inventors: Deli Zhao, Zhouchen Lin, Rong Xiao, Xiaoou Tang
  • Patent number: 7620914
    Abstract: A system that provides binds or associates a clickable hyperlink with an object that appears in a video stream. The hyperlink may be sent in a separate stream from the video stream, and user interfaces are provided to a user to activate the hyperlink. Activation of the hyperlink may cause a redirection to an associated website. Furthermore, feedback may be provided as to the user's activity regarding interest and activation regarding particular hyperlinks in the video.
    Type: Grant
    Filed: May 22, 2006
    Date of Patent: November 17, 2009
    Assignee: Microsoft Corporation
    Inventors: Yin Li, Jian Sun, Li Li, Weiwei Zhang, Xiaoou Tang, Ying Li, Michael Hurt, Eric Picard
  • Patent number: 7609271
    Abstract: A strategy is described for producing an animated scene from multiple high resolution still images. The strategy involves: creating a graph based on an analysis of similarity among the plural still images; performing partial temporal order recovery to define a partial ordering among the plural still images; and extracting an output sequence from the plural still images using second-order Markov Chain analysis, using the partial ordering as a reference. The strategy can perform the above-described analysis with respect to multiple independent animated regions (IARs) within the still images. Further, the strategy can decompose any IAR with a significant amount of motion into multiple semi-independent animated regions (SIARs). The SIARs are defined to be weakly interdependent.
    Type: Grant
    Filed: June 30, 2006
    Date of Patent: October 27, 2009
    Assignee: Microsoft Corporation
    Inventors: Zhouchen Lin, Lifeng Wang, Yunbo Wang, Jian Wang, Xiaoou Tang
  • Publication number: 20090254539
    Abstract: A system performs user intention modeling for interactive image retrieval. In one implementation, the system uses a three stage iterative technique to retrieve images from a database without using any image tags or text descriptors. First, the user submits a query image and the system models the user's search intention and configures a customized search to retrieve relevant images. Then, the system extends a user interface for the user to designate visual features across the retrieved images. The designated visual features refine the intention model and reconfigure the search to retrieve images that match the remodeled intention. Third, the system extends another user interface through which the user can give natural feedback about the retrieved images. The three stages can be iterated to quickly assemble a set of images that accurately fulfills the user's search intention.
    Type: Application
    Filed: April 30, 2008
    Publication date: October 8, 2009
    Applicant: Microsoft Corporation
    Inventors: Fang Wen, Xiaoou Tang
  • Patent number: 7576755
    Abstract: Systems and methods provide picture collage systems and methods. In one implementation, a system determines a salient region in each of multiple images and develops a Bayesian model to maximize visibility of the salient regions in a collage that overlaps the images. The Bayesian model can also minimize blank spaces in the collage and normalize the percentage of each salient region that can be visibly displayed in the collage. Images are placed with diversified rotational orientation to provide a natural artistic collage appearance. A Markov Chain Monte Carlo technique is applied to the parameters of the Bayesian model to obtain image placement, orientation, and layering. The MCMC technique can combine optimization proposals that include local, global, and pairwise samplings from a distribution of state variables.
    Type: Grant
    Filed: February 13, 2007
    Date of Patent: August 18, 2009
    Assignee: Microsoft Corporation
    Inventors: Jian Sun, Xiaoou Tang, Heung-Yeung Shum
  • Publication number: 20090132213
    Abstract: A method for modeling data affinities and data structures. In one implementation, a contextual distance may be calculated between a selected data point in a data sample and a data point in a contextual set of the selected data point. The contextual set may include the selected data point and one or more data points in the neighborhood of the selected data point. The contextual distance may be the difference between the selected data point's contribution to the integrity of the geometric structure of the contextual set and the data point's contribution to the integrity of the geometric structure of the contextual set. The process may be repeated for each data point in the contextual set of the selected data point. The process may be repeated for each selected data point in the data sample. A digraph may be created using a plurality of contextual distances generated by the process.
    Type: Application
    Filed: February 18, 2008
    Publication date: May 21, 2009
    Applicant: MICROSOFT CORPORATION
    Inventors: Deli Zhao, Zhouchen Lin, Xiaoou Tang
  • Patent number: 7536030
    Abstract: Systems and methods are described for real-time Bayesian 3D pose tracking. In one implementation, exemplary systems and methods formulate key-frame based differential pose tracking in a probabilistic graphical model. An exemplary system receives live captured video as input and tracks a video object's 3D pose in real-time based on the graphical model. An exemplary Bayesian inter-frame motion inference technique simultaneously performs online point matching and pose estimation. This provides robust pose tracking because the relative pose estimate for a current frame is simultaneously estimated from two independent sources, from a key-frame pool and from the video frame preceding the current frame. Then, an exemplary online Bayesian frame fusion technique infers the current pose from the two independent sources, providing stable and drift-free tracking, even during agile motion, occlusion, scale change, and drastic illumination change of the tracked object.
    Type: Grant
    Filed: November 30, 2005
    Date of Patent: May 19, 2009
    Assignee: Microsoft Corporation
    Inventors: Qiang Wang, Weiwei Zhang, Xiaoou Tang, Heung-Yeung Shum
  • Publication number: 20090099990
    Abstract: An efficient, effective and at times superior object detection and/or recognition (ODR) function may be built from a set of Bayesian stumps. Bayesian stumps may be constructed for each feature and object class, and the ODR function may be constructed from the subset of Bayesian stumps that minimize Bayesian error for a particular object class. That is, Bayesian error may be utilized as a feature selection measure for the ODR function. Furthermore, Bayesian stumps may be efficiently implemented as lookup tables with entries corresponding to unequal intervals of feature histograms. Interval widths and entry values may be determined so as to minimize Bayesian error, yielding Bayesian stumps that are optimal in this respect.
    Type: Application
    Filed: October 12, 2007
    Publication date: April 16, 2009
    Applicant: Microsoft Corporation
    Inventors: Rong Xiao, Xiaoou Tang
  • Publication number: 20090097772
    Abstract: Systems and methods perform Laplacian Principal Components Analysis (LPCA). In one implementation, an exemplary system receives multidimensional data and reduces dimensionality of the data by locally optimizing a scatter of each local sample of the data. The optimization includes summing weighted distances between low dimensional representations of the data and a mean. The weights of the distances can be determined by a coding length of each local data sample. The system can globally align the locally optimized weighted scatters of the local samples and provide a global projection matrix. The LPCA improves performance of such applications as face recognition and manifold learning.
    Type: Application
    Filed: October 12, 2007
    Publication date: April 16, 2009
    Applicant: Microsoft Corporation
    Inventors: Deli Zhao, Zhouchen Lin, Xiaoou Tang