Patents by Inventor Xiaoou Tang

Xiaoou Tang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20090080774
    Abstract: This disclosure describes an integrated framework for class-unsupervised object segmentation. The class-unsupervised object segmentation occurs by integrating top-down constraints and bottom-up constraints on object shapes using an algorithm in an integrated manner. The algorithm describes a relationship among object parts and superpixels. This process forms object shapes with object parts and oversegments pixel images into the superpixels, with the algorithm in conjunction with the constraints. This disclosure describes computing a mask map from a hybrid graph, segmenting the image into a foreground object and a background, and displaying the foreground object from the background.
    Type: Application
    Filed: September 24, 2007
    Publication date: March 26, 2009
    Applicant: Microsoft Corporation
    Inventors: Zhouchen Lin, Guangcan Liu, Xiaoou Tang
  • Patent number: 7505606
    Abstract: Embodiments of the invention determine whether an image has been altered. Sets of patches are selected in the image, and corresponding inverse response functions are provided to a support vector machine (SVM). The support vector machine is trained with exemplary normal and abnormal inverse response functions. Once trained, the support vector machine analyzes inverse response functions corresponding to a suspected image. The support vector machine determines if the inverse response functions are normal or abnormal by analyzing a set of features. In one embodiment, features include measures for monotonic characteristics, fluctuation characteristics, and divergence characteristics of the red, green, and blue components of a tuple. Each tuple of inverse response functions is associated with a set of patches selected in the image.
    Type: Grant
    Filed: May 19, 2005
    Date of Patent: March 17, 2009
    Assignee: Microsoft Corporation
    Inventors: Zhouchen Lin, Rongrong Wang, Xiaoou Tang, Heung-Yeung Shum
  • Publication number: 20080304735
    Abstract: Systems and methods are described for learning visual object cutout from a single example. In one implementation, an exemplary system determines the color context near each block in a model image to create an appearance model. The system also learns color sequences that occur across visual edges in the model image to create an edge profile model. The exemplary system then infers segmentation boundaries in unknown images based on the appearance model and edge profile model. In one implementation, the exemplary system minimizes the energy in a graph-cut model where the appearance model is used for data energy and the edge profile is used to modulate edges. The system is not limited to images with nearly identical foregrounds or backgrounds. Some variations in scale, rotation, and viewpoint are allowed.
    Type: Application
    Filed: June 5, 2007
    Publication date: December 11, 2008
    Applicant: Microsoft Corporation
    Inventors: Qiong Yang, Fang Wen, Xiaoou Tang
  • Publication number: 20080304755
    Abstract: Systems and methods are described for a face annotation framework with partial clustering and interactive labeling. In one implementation, an exemplary system automatically groups some images of a collection of images into clusters, each cluster mainly including images that contain a person's face associated with that cluster. After an initial user-labeling of each cluster with the person's name or other label, in which the user may also delete/label images that do not belong in the cluster, the system iteratively proposes subsequent clusters for the user to label, proposing clusters of images that when labeled, produce a maximum information gain at each iteration and minimize the total number of user interactions for labeling the entire collection of images.
    Type: Application
    Filed: June 8, 2007
    Publication date: December 11, 2008
    Applicant: Microsoft Corporation
    Inventors: Rong Xiao, Fang Wen, Xiaoou Tang
  • Publication number: 20080304740
    Abstract: Methods for detecting a salient object in an input image are described. For this, the salient object in an image may be defined using a set of local, regional, and global features including multi-scale contrast, center-surround histogram, and color spatial distribution. These features are optimally combined through conditional random field learning. The learned conditional random field is then used to locate the salient object in the image. The methods can also use image segmentation, where the salient object is separated from the image background.
    Type: Application
    Filed: June 6, 2007
    Publication date: December 11, 2008
    Applicant: Microsoft Corporation
    Inventors: Jian Sun, Tie Liu, Xiaoou Tang, Heung-Yeung Shum
  • Publication number: 20080304743
    Abstract: Systems and methods of segmenting images are disclosed herein. The similarity of images in a set of images is compared. A group of images is selected from the set of images. The images in the group of images are selected based on compared similarities among the images. An informative image is selected from the group of images. User-defined semantic information of the informative image is received. The group of images as a graph is modeled as a graph. Each image in the group of images denotes a node in the graph. Edges of the graph denote a foreground relationship between images or a background relationship between images. One or more images in the group of images are automatically segmented by propagating the semantic information of the informative image to images in the group of images having a corresponding graph node that is related to a graph node corresponding to the informative image. Segmentation results can be refined according to user provided image semantics.
    Type: Application
    Filed: February 4, 2008
    Publication date: December 11, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Xiaoou Tang, Qiong Yang
  • Publication number: 20080298766
    Abstract: An interactive photo annotation method uses clustering based on facial similarities to improve annotation experience. The method uses a face recognition algorithm to extract facial features of a photo album and cluster the photos into multiple face groups based on facial similarity. The method annotates a face group collectively using annotations, such as name identifiers, in one operation. The method further allows merging and splitting of face groups. Special graphical user interfaces, such as displays in a group view area and a thumbnail area and drag-and-drop features, are used to further improve the annotation experience.
    Type: Application
    Filed: May 29, 2007
    Publication date: December 4, 2008
    Applicant: MICROSOFT CORPORATION
    Inventors: Fang Wen, Rong Xiao, Qiong Yang, Xiaoou Tang
  • Publication number: 20080297621
    Abstract: A flash-based strategy is used to separate foreground information from background information within image information. In this strategy, a first image is taken without the use of flash. A second image is taken of the same subject matter with the use of flash. The foreground information in the flash image is illuminated by the flash to a much greater extent than the background information. Based on this property, the strategy applies processing to extract the foreground information from the background information. The strategy supplements the flash information by also taking into consideration motion information and color information.
    Type: Application
    Filed: May 29, 2007
    Publication date: December 4, 2008
    Applicant: Microsoft Corporation
    Inventors: Jian Sun, Jian Sun, Sing Bing Kang, Xiaoou Tang, Heung-Yeung Shum
  • Patent number: 7439989
    Abstract: Systems and methods for detecting doctored JPEG images are described. In one aspect, a JPEG image is evaluated to determine if the JPEG image comprises double quantization effects of double quantized Discrete Cosine Transform coefficients. In response to results of these evaluation operations, the systems and methods determine whether the JPEG image has been doctored and identify any doctored portion.
    Type: Grant
    Filed: February 17, 2006
    Date of Patent: October 21, 2008
    Assignee: Microsoft Corporation
    Inventors: Zhouchen Lin, Junfeng He, Xiaoou Tang
  • Publication number: 20080193048
    Abstract: Systems and methods provide picture collage systems and methods. In one implementation, a system determines a salient region in each of multiple images and develops a Bayesian model to maximize visibility of the salient regions in a collage that overlaps the images. The Bayesian model can also minimize blank spaces in the collage and normalize the percentage of each salient region that can be visibly displayed in the collage. Images are placed with diversified rotational orientation to provide a natural artistic collage appearance. A Markov Chain Monte Carlo technique is applied to the parameters of the Bayesian model to obtain image placement, orientation, and layering. The MCMC technique can combine optimization proposals that include local, global, and pairwise samplings from a distribution of state variables.
    Type: Application
    Filed: February 13, 2007
    Publication date: August 14, 2008
    Applicant: Microsoft Corporation
    Inventors: Jian Sun, Xiaoou Tang, Heung-Yeung Shum
  • Publication number: 20080136820
    Abstract: Progressive cut interactive object segmentation is described. In one implementation, a system analyzes strokes input by the user during iterative image segmentation in order to model the user's intention for refining segmentation. In the user intention model, the color of each stroke indicates the user's expectation of pixel label change to foreground or background, the location of the stroke indicates the user's region of interest, and the position of the stroke relative to a previous segmentation boundary indicates a segmentation error that the user intends to refine. Overexpansion of pixel label change is controlled by penalizing change outside the user's region of interest while overshrinkage is controlled by modeling the image as an eroded graph. In each iteration, energy consisting of a color term, a contrast term, and a user intention term is minimized to obtain a segmentation map.
    Type: Application
    Filed: August 29, 2007
    Publication date: June 12, 2008
    Applicant: Microsoft Corporation
    Inventors: Qiong Yang, Chao Wang, Mo Chen, Xiaoou Tang, Zhongfu Ye
  • Patent number: 7382928
    Abstract: Disclosed is an automatic digital image enhancement method, including: determining the defects existing in the image based on pixel values, such as pixel luminance values and pixel color values, of the input digital image, and making corrections based on the determination results. The method can process images of digital photos, digital videos and make enhancement on exposure and color of the images. The method disclosed can also improve quickly and apparently the visual quality and appearance of images.
    Type: Grant
    Filed: August 6, 2004
    Date of Patent: June 3, 2008
    Assignee: The Chinese University of Hong Kong
    Inventors: Kam Fai Wong, Xiaoou Tang, Ying Fai Ho, Kwok Yiu Leung, Hoi Chung Chu
  • Publication number: 20080112642
    Abstract: Systems and methods for video completion by motion field transfer are described. In one aspect, a spatio-temporal target patch of an input video data sequence is filled in or replaced by motion field transfer from a spatio-temporal source patch of the input video data sequence. Color is propagated to corresponding portions of the spatio-temporal target patch by treating the transferred motion information as directed edges These motion field transfer and color propagation operations result in a video completed spatio-temporal target patch. The systems and methods present the video data sequence, which now includes the video completed spatio-temporal target patch, to user for viewing.
    Type: Application
    Filed: November 14, 2006
    Publication date: May 15, 2008
    Applicant: Microsoft Corporation
    Inventors: Yasuyuki Matsushita, Sing Bing Kang, Xiaoou Tang, Takaaki Shiratori
  • Publication number: 20080112684
    Abstract: Systems and methods for space-time video montage are described. In one aspect, one or more arbitrary space-time volumes representing informative video portion(s) of at least one input video data sequence are identified. A video summary representing a montage of the at least one input video data sequence is generated for presentation to user from the one or more arbitrary space-time volumes.
    Type: Application
    Filed: November 14, 2006
    Publication date: May 15, 2008
    Applicant: Microsoft Corporation
    Inventors: Yasuyuki Matsushita, Hong-Wen Kang, Xiaoou Tang
  • Publication number: 20080080787
    Abstract: Salience-preserving image fusion is described. In one aspect, multi-channel images are fused into a single image. The fusing operations are based on importance-weighted gradients. The importance weighted gradients are measured using respective salience maps for each channel in the multi-channel images.
    Type: Application
    Filed: September 28, 2006
    Publication date: April 3, 2008
    Applicant: Microsoft Corporation
    Inventors: Qiong Yang, Chao Wang, Xiaoou Tang, Zhongfu Ye
  • Publication number: 20080068386
    Abstract: Real-time rendering of realistic rain is described. In one aspect, image samples of real rain and associated information are automatically modeled in real-time to generate synthetic rain particles in view of respective scene radiances of target video content frames. The synthetic rain particles are rendered in real-time using pre-computed radiance transfer with uniform random distribution across respective frames of the target video content.
    Type: Application
    Filed: September 14, 2006
    Publication date: March 20, 2008
    Applicant: Microsoft Corporation
    Inventors: Zhouchen Lin, Lifeng Wang, Tian Fang, Xu Yang, Xuan Yu, Jian Wang, Xiaoou Tang
  • Publication number: 20080052312
    Abstract: A search includes comparing a query image provided by a user to a plurality of stored images of faces stored in a stored image database, and determining a similarity of the query image to the plurality of stored images. One or more resultant images of faces, selected from among the stored images, are displayed to the user based on the determined similarity of the stored images to the query image provided by the user. The resultant images are displayed based at least in part on one or more facial features.
    Type: Application
    Filed: August 23, 2006
    Publication date: February 28, 2008
    Applicant: Microsoft Corporation
    Inventors: Xiaoou Tang, Qiong Yang, Leizhong Zhang, Ta Bao, David P. Vronay
  • Publication number: 20080001950
    Abstract: A strategy is described for producing an animated scene from multiple high resolution still images. The strategy involves: creating a graph based on an analysis of similarity among the plural still images; performing partial temporal order recovery to define a partial ordering among the plural still images; and extracting an output sequence from the plural still images using second-order Markov Chain analysis, using the partial ordering as a reference. The strategy can perform the above-described analysis with respect to multiple independent animated regions (IARs) within the still images. Further, the strategy can decompose any IAR with a significant amount of motion into multiple semi-independent animated regions (SIARs). The SIARs are defined to be weakly interdependent.
    Type: Application
    Filed: June 30, 2006
    Publication date: January 3, 2008
    Applicant: Microsoft Corporation
    Inventors: Zhouchen Lin, Lifeng Wang, Yunbo Wang, Jian Wang, Xiaoou Tang
  • Publication number: 20070216675
    Abstract: Digital video effects are described. In one aspect, a foreground object in a video stream is identified. The video stream comprises multiple image frames. The foreground object is modified by rendering a 3-dimensional (3-D) visual feature over the foreground object for presentation to a user in a modified video stream. Pose of the foreground object is tracked in 3-D space across respective ones of the image frames to identify when the foreground object changes position in respective ones of the image frames. Based on this pose tracking, aspect ratio of the 3-D visual feature is adaptively modified and rendered over the foreground object in corresponding image frames for presentation to the user in the modified video stream.
    Type: Application
    Filed: August 28, 2006
    Publication date: September 20, 2007
    Applicant: Microsoft Corporation
    Inventors: Jian Sun, Qiang Wang, Weiwei Zhang, Xiaoou Tang, Heung-Yeung Shum
  • Publication number: 20070195106
    Abstract: Systems and methods for detecting doctored JPEG images are described. In one aspect, a JPEG image is evaluated to determine if the JPEG image comprises double quantization effects of double quantized Discrete Cosine Transform coefficients. In response to results of these evaluation operations, the systems and methods determine whether the JPEG image has been doctored and identify any doctored portion.
    Type: Application
    Filed: February 17, 2006
    Publication date: August 23, 2007
    Applicant: Microsoft Corporation
    Inventors: Zhouchen Lin, Junfeng He, Xiaoou Tang