Patents by Inventor Hong Jiang Zhang

Hong Jiang Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20060048634
    Abstract: A system that analyzes music to detect musical beats and to rectify beats that are out of sync with the actual beat phase of the music. The music analysis includes onset detection, tempo/meter estimation, and beat analysis, which includes the rectification of out-of-sync beats.
    Type: Application
    Filed: November 1, 2005
    Publication date: March 9, 2006
    Applicant: Microsoft Corporation
    Inventors: Lie Lu, Hong-Jiang Zhang
  • Publication number: 20060026524
    Abstract: Systems and methods for smart media content thumbnail extraction are described. In one aspect program metadata is generated from recorded video content. The program metadata includes one or more key-frames from one or more corresponding shots. An objectively representative key-frame is identified from among the key-frames as a function of shot duration and frequency of appearance of key-frame content across multiple shots. The objectively representative key-frame is an image frame representative of the recorded video content. A thumbnail is created from the objectively representative key-frame.
    Type: Application
    Filed: August 2, 2004
    Publication date: February 2, 2006
    Applicant: Microsoft Corporation
    Inventors: Yu-Fei Ma, Bin Lin, Zhike Kong, Xinli Zou, Wei-Ying Ma, Hong-Jiang Zhang
  • Patent number: 6975750
    Abstract: A system and method that includes a virtual human face generation technique which synthesizes images of a human face at a variety of poses. This is preferably accomplished using just a frontal and profile image of a specific subject. An automatic deformation technique is used to align the features of a generic 3-D graphic face model with the corresponding features of these pre-provided images of the subject. Specifically, a generic frontal face model is aligned with the frontal image and a generic profile face model is aligned with the profile image. The deformation procedure results in a single 3-D face model of the specific human face. It precisely reflects the geometric features of the specific subject. After that, subdivision spline surface construction and multi-direction texture mapping techniques are used to smooth the model and endow photometric detail to the specific 3-D geometric face model.
    Type: Grant
    Filed: December 1, 2000
    Date of Patent: December 13, 2005
    Assignee: Microsoft Corp.
    Inventors: Jie Yan, Hong-Jiang Zhang
  • Patent number: 6970860
    Abstract: A multimedia object retrieval and annotation system integrates an annotation process with object retrieval and relevance feedback processes. The annotation process annotates multimedia objects, such as digital images, with semantically relevant keywords. The annotation process is performed in background, hidden from the user, as the user conducts normal searches. The annotation process is “semi-automatic” in that it utilizes both keyword-based information retrieval and content-based image retrieval techniques to automatically search for multimedia objects, and then encourages users to provide feedback on the retrieved objects. The user identifies objects as either relevant or irrelevant to the query keywords and based on this feedback, the system automatically annotates the objects with semantically relevant keywords and/or updates associations between the keywords and objects. As the retrieval-feedback-annotation cycle is repeated, the annotation coverage and accuracy of future searches continues to improve.
    Type: Grant
    Filed: October 30, 2000
    Date of Patent: November 29, 2005
    Assignee: Microsoft Corporation
    Inventors: Wen-Yin Liu, Hong-Jiang Zhang
  • Patent number: 6965645
    Abstract: A system and process for video characterization that facilitates video classification and retrieval, as well as motion detection, applications. This involves characterizing a video sequence with a gray scale image having pixel levels that reflect the intensity of motion associated with a corresponding region in the sequence of video frames. The intensity of motion is defined using any of three characterizing processes: a perceived motion energy spectrum (PMES) characterizing process that represents object-based motion intensity over the sequence of frames, a spatio-temporal entropy (STE) characterizing process that represents the intensity of motion based on color variation at each pixel location, and a motion vector angle entropy (MVAE) characterizing process that represents the intensity of motion based on the variation of motion vector angles.
    Type: Grant
    Filed: September 25, 2001
    Date of Patent: November 15, 2005
    Assignee: Microsoft Corporation
    Inventors: Hong-Jiang Zhang, Yufei Ma
  • Publication number: 20050211072
    Abstract: A system and methods analyze music to detect musical beats and to rectify beats that are out of sync with the actual beat phase of the music. The music analysis includes onset detection, tempo/meter estimation, and beat analysis, which includes the rectification of out-of-sync beats.
    Type: Application
    Filed: March 25, 2004
    Publication date: September 29, 2005
    Inventors: Lie Lu, Hong-Jiang Zhang
  • Publication number: 20050211071
    Abstract: A system and methods use music features extracted from music to detect a music mood within a hierarchical mood detection framework. A two-dimensional mood model divides music into four moods which include contentment, depression, exuberance, and anxious/frantic. A mood detection algorithm uses a hierarchical mood detection framework to determine which of the four moods is associated with a music clip based on the extracted features. In a first tier of the hierarchical detection process, the algorithm determines one of two mood groups to which the music clip belongs. In a second tier of the hierarchical detection process, the algorithm then determines which mood from within the selected mood group is the appropriate, exact mood for the music clip. Benefits of the mood detection system include automatic detection of music mood which can be used as music metadata to manage music through music representation and classification.
    Type: Application
    Filed: March 25, 2004
    Publication date: September 29, 2005
    Inventors: Lie Lu, Hong-Jiang Zhang
  • Patent number: 6944319
    Abstract: A face recognition system and process for identifying a person depicted in an input image and their face pose. This system and process entails locating and extracting face regions belonging to known people from a set of model images, and determining the face pose for each of the face regions extracted. All the extracted face regions are preprocessed by normalizing, cropping, categorizing and finally abstracting them. More specifically, the images are normalized and cropped to show only a person's face, categorized according to the face pose of the depicted person's face by assigning them to one of a series of face pose ranges, and abstracted preferably via an eigenface approach.
    Type: Grant
    Filed: March 27, 2000
    Date of Patent: September 13, 2005
    Assignee: Microsoft Corporation
    Inventors: Fu Jie Huang, Hong-Jiang Zhang, Tsuhan Chen
  • Publication number: 20050165763
    Abstract: The disclosed subject matter improves iterative results of content-based image retrieval (CBIR) using a bigram model to correlate relevance feedback. Specifically, multiple images are received responsive to multiple image search sessions. Relevance feedback is used to determine whether the received images are semantically relevant. A respective semantic correlation between each of at least one pair of the images is then estimated using respective bigram frequencies. The bigram frequencies are based on multiple search sessions in which each image of a pair of images is semantically relevant.
    Type: Application
    Filed: February 11, 2005
    Publication date: July 28, 2005
    Applicant: Microsoft Corporation
    Inventors: Mingjing Li, Zheng Chen, Liu Wenyin, Hong-Jiang Zhang
  • Publication number: 20050154788
    Abstract: Methods and systems for generic adaptive multimedia content delivery are described. In one embodiment, a novel framework features an abstract content model and an abstract adaptive delivery decision engine. The abstract content model recognizes important aspects of contents while hiding their physical details from other parts of the framework. The decision engine then makes content adaptation plans based on the abstracted model of the contents and needs little knowledge of any physical details of the actual contents. Thus, under the same framework, adaptive delivery of generic contents is possible.
    Type: Application
    Filed: December 29, 2004
    Publication date: July 14, 2005
    Applicant: Microsoft Corporation
    Inventors: Yudong Yang, Hong-Jiang Zhang
  • Publication number: 20050154743
    Abstract: Methods and systems for generic adaptive multimedia content delivery are described. In one embodiment, a novel framework features an abstract content model and an abstract adaptive delivery decision engine. The abstract content model recognizes important aspects of contents while hiding their physical details from other parts of the framework. The decision engine then makes content adaptation plans based on the abstracted model of the contents and needs little knowledge of any physical details of the actual contents. Thus, under the same framework, adaptive delivery of generic contents is possible.
    Type: Application
    Filed: December 29, 2004
    Publication date: July 14, 2005
    Applicant: Microsoft Corporation
    Inventors: Yudong Yang, Hong-Jiang Zhang
  • Publication number: 20050147291
    Abstract: A face recognition system and process for identifying a person depicted in an input image and their face pose. This system and process entails locating and extracting face regions belonging to known people from a set of model images, and determining the face pose for each of the face regions extracted. All the extracted face regions are preprocessed by normalizing, cropping, categorizing and finally abstracting them. More specifically, the images are normalized and cropped to show only a person's face, categorized according to the face pose of the depicted person's face by assigning them to one of a series of face pose ranges, and abstracted preferably via an eigenface approach.
    Type: Application
    Filed: November 5, 2004
    Publication date: July 7, 2005
    Applicant: Microsoft Corporation
    Inventors: Fu Huang, Hong-Jiang Zhang, Tsuhan Chen
  • Publication number: 20050147170
    Abstract: A system and process for video characterization that facilitates video classification and retrieval, as well as motion detection, applications. This involves characterizing a video sequence with a gray scale image having pixel levels that reflect the intensity of motion associated with a corresponding region in the sequence of video frames. The intensity of motion is defined using any of three characterizing processes: a perceived motion energy spectrum (PMES) characterizing process that represents object-based motion intensity over the sequence of frames, a spatio-temporal entropy (STE) characterizing process that represents the intensity of motion based on color variation at each pixel location, and a motion vector angle entropy (MVAE) characterizing process that represents the intensity of motion based on the variation of motion vector angles.
    Type: Application
    Filed: February 11, 2005
    Publication date: July 7, 2005
    Applicant: Microsoft Corporation
    Inventors: Hong-Jiang Zhang, Yufei Ma
  • Publication number: 20050147292
    Abstract: A face recognition system and process for identifying a person depicted in an input image and their face pose. This system and process entails locating and extracting face regions belonging to known people from a set of model images, and determining the face pose for each of the face regions extracted. All the extracted face regions are preprocessed by normalizing, cropping, categorizing and finally abstracting them. More specifically, the images are normalized and cropped to show only a person's face, categorized according to the face pose of the depicted person's face by assigning them to one of a series of face pose ranges, and abstracted preferably via an eigenface approach.
    Type: Application
    Filed: November 5, 2004
    Publication date: July 7, 2005
    Applicant: Microsoft Corporation
    Inventors: Fu Jie Huang, Hong-Jiang Zhang, Tsuhan Chen
  • Publication number: 20050147280
    Abstract: A system and method that includes a virtual human face generation technique which synthesizes images of a human face at a variety of poses. This is preferably accomplished using just a frontal and profile image of a specific subject. An automatic deformation technique is used to align the features of a generic 3-D graphic face model with the corresponding features of these pre-provided images of the subject. Specifically, a generic frontal face model is aligned with the frontal image and a generic profile face model is aligned with the profile image. The deformation procedure results in a single 3-D face model of the specific human face. It precisely reflects the geometric features of the specific subject. After that, subdivision spline surface construction and multi-direction texture mapping techniques are used to smooth the model and endow photometric detail to the specific 3-D geometric face model.
    Type: Application
    Filed: February 7, 2005
    Publication date: July 7, 2005
    Applicant: Microsoft Corporation
    Inventors: Jie Yan, Hong-Jiang Zhang
  • Patent number: 6915025
    Abstract: The described arrangements and procedures identify an image's orientation by extracting features from peripheral portions of the image. The procedure evaluates the extracted features based on training image feature orientation classification models to identify the image's orientation.
    Type: Grant
    Filed: November 27, 2001
    Date of Patent: July 5, 2005
    Assignee: Microsoft Corporation
    Inventors: Yongmei Wang, Hong-Jiang Zhang
  • Publication number: 20050131951
    Abstract: An implementation of a technology, described herein, for facilitating accurate and efficient relevance-feedback, content-based image retrieval minimizes the number of iterations for user feedback regarding the semantic relevance of exemplary images while maximizing the resulting relevance of each iteration. One technique for accomplishing this is to use a Bayesian classifier to treat positive and negative feedback examples with different strategies. In addition, query refinement techniques are applied to pinpoint the users' intended queries with respect to their feedback. These techniques further enhance the accuracy and usability of relevance feedback. This abstract itself is not intended to limit the scope of this patent. The scope of the present invention is pointed out in the appended claims.
    Type: Application
    Filed: January 25, 2005
    Publication date: June 16, 2005
    Applicant: Microsoft Corporation
    Inventors: Hong-Jiang Zhang, Zhong Su, Xingquan Zhu
  • Publication number: 20050123886
    Abstract: Systems and methods are described that implement personalized karaoke, wherein a user's personal home video and photographs are used to form a background for the lyrics during a karaoke performance. An exemplary karaoke apparatus is configured to segment visual content to produce a plurality of sub-shots and to segment music to produce a plurality of music sub-clips. Having produced the visual content sub-shots and music sub-clips, the exemplary karaoke apparatus shortens some of the plurality of sub-shots to a length of a corresponding music sub-clip from within the plurality of music sub-clips. The plurality of sub-shots is then displayed as a background to lyrics associated with the music, thereby adding interest to a karaoke performance.
    Type: Application
    Filed: November 26, 2003
    Publication date: June 9, 2005
    Inventors: Xian-Sheng Hua, Lie Lu, Hong-Jiang Zhang
  • Patent number: 6901411
    Abstract: The disclosed subject matter improves iterative results of content-based image retrieval (CBIR) using a bigram model to correlate relevance feedback. Specifically, multiple images are received responsive to multiple image search sessions. Relevance feedback is used to determine whether the received images are semantically relevant. A respective semantic correlation between each of at least one pair of the images is then estimated using respective bigram frequencies. The bigram frequencies are based on multiple search sessions in which each image of a pair of images is semantically relevant.
    Type: Grant
    Filed: February 11, 2002
    Date of Patent: May 31, 2005
    Assignee: Microsoft Corporation
    Inventors: Mingjing Li, Zheng Chen, Liu Wenyin, Hong-Jiang Zhang
  • Publication number: 20050114325
    Abstract: A multimedia object retrieval and annotation system integrates an annotation process with object retrieval and relevance feedback processes. The annotation process annotates multimedia objects, such as digital images, with semantically relevant keywords. The annotation process is performed in background, hidden from the user, as the user conducts normal searches. The annotation process is “semi-automatic” in that it utilizes both keyword-based information retrieval and content-based image retrieval techniques to automatically search for multimedia objects, and then encourages users to provide feedback on the retrieved objects. The user identifies objects as either relevant or irrelevant to the query keywords and based on this feedback, the system automatically annotates the objects with semantically relevant keywords and/or updates associations between the keywords and objects. As the retrieval-feedback-annotation cycle is repeated, the annotation coverage and accuracy of future searches continues to improve.
    Type: Application
    Filed: October 20, 2004
    Publication date: May 26, 2005
    Applicant: Microsoft Corporation
    Inventors: Wen-Yin Liu, Hong-Jiang Zhang