Patents by Inventor Hong Jiang Zhang

Hong Jiang Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and methods for automatically editing a video

Publication number: 20040085341

Abstract: Systems and methods to automatically edit a video to generate a video summary are described. In one aspect, sub-shots are extracted from the video. Importance measures are calculated for at least a portion of the extracted sub-shots. Respective relative distributions for sub-shots having relatively higher importance measures as compared to importance measures of other sub-shots are determined. Based on the determined relative distributions, sub-shots that do not exhibit a uniform distribution with respect to other sub-shots in the particular ones are dropped. The remaining sub-shots are connected with respective transitions to generate the video summary.

Type: Application

Filed: November 1, 2002

Publication date: May 6, 2004

Inventors: Xian-Sheng Hua, Lie Lu, Yu-Fei Ma, Mingjing Li, Hong-Jiang Zhang
Systems and methods for generating a motion attention model

Publication number: 20040086046

Abstract: Systems and methods to generate a motion attention model of a video data sequence are described. In one aspect, a motion saliency map B is generated to precisely indicate motion attention areas for each frame in the video data sequence. The motion saliency maps are each based on intensity I, spatial coherence Cs, and temporal coherence Ct values. These values are extracted from each block or pixel in motion fields that are extracted from the video data sequence. Brightness values of detected motion attention areas in each frame are accumulated to generate, with respect to time, the motion attention model.

Type: Application

Filed: November 1, 2002

Publication date: May 6, 2004

Inventors: Yu-Fei Ma, Hong-Jiang Zhang
Systems and methods for generating a video summary

Publication number: 20040088723

Abstract: Systems and methods to generate a video summary of a video data sequence are described. In one aspect, key-frames of the video data sequence are identified independent of shot boundary detection. A static summary of shots in the video data sequence is then generated based on key-frame importance. For each shot in the static summary of shots, dynamic video skims are calculated. The video summary consists of the calculated dynamic video skims.

Type: Application

Filed: November 1, 2002

Publication date: May 6, 2004

Inventors: Yu-Fei Ma, Lie Lu, Hong-Jiang Zhang, Mingjing Li
Media segmentation system and related methods

Patent number: 6724933

Abstract: A method comprising receiving media content and analyzing one or more attributes of successive shots of the received media. Based, at least in part on the analysis of the one or more attributes, generating a correlation score for each of the successive shots, wherein scene segmentation is performed to group semantically cohesive shots.

Type: Grant

Filed: July 28, 2000

Date of Patent: April 20, 2004

Assignee: Microsoft Corporation

Inventors: Tong Lin, Hong-Jiang Zhang
Hierarchical scheme for blur detection in digital image using wavelet transform

Publication number: 20040066981

Abstract: Methods and apparatuses are provided for detecting blur within digital images using wavelet transform and/or Cepstrum analysis blur detection techniques that are able to detect motion blur and/or out-of-focus blur.

Type: Application

Filed: August 22, 2003

Publication date: April 8, 2004

Inventors: Mingjing Li, Hao Wu, Hong-Jiang Zhang
Pose-adaptive face detection system and process

Patent number: 6671391

Abstract: A face detection system and process capable of detecting a person depicted in an input image and identifying their face pose. Prepared training images are used to train a 2-stage classifier which includes a bank of Support Vector Machines (SVMs) as an initial pre-classifier layer, and a neural network forming a subsequent decision classifier layer. Once the SVMs and the neural network are trained, input image regions are prepared and input into the system. An output is produced from the neural network which indicates first whether the region under consideration depicts a face, and secondly, if a face is present, into what pose range the pose of the face falls.

Type: Grant

Filed: May 26, 2000

Date of Patent: December 30, 2003

Assignee: Microsoft Corp.

Inventors: Hong-Jiang Zhang, Ma Yong
Streaming methods and systems

Publication number: 20030195977

Abstract: Various embodiments provide methods and systems for streaming data that can facilitate streaming during bandwidth fluctuations in a manner that can enhance the user experience. In one aspect, a forward-shifting technique is utilized to buffer data that is to be streamed, e.g. an enhancement layer in a FGS stream. Various techniques can drop layers actively when bandwidth is constant. The saved bandwidth can then be used to pre-stream enhancement layer portions. In another aspect, a content-aware decision can be made as to how to drop enhancement layers when bandwidth decreases. During periods of decreasing bandwidth, if a video segment does not contain important content, the enhancement layers will be dropped to keep the forward-shifting of the enhancement layer unchanged. If the enhancement layer does contain important content, it will be transmitted later when bandwidth increases.

Type: Application

Filed: April 11, 2002

Publication date: October 16, 2003

Inventors: Tianming Liu, Hong-Jiang Zhang, Wei Qi
Statistical bigram correlation model for image retrieval

Publication number: 20030187844

Abstract: The disclosed subject matter improves iterative results of content-based image retrieval (CBIR) using a bigram model to correlate relevance feedback. Specifically, multiple images are received responsive to multiple image search sessions. Relevance feedback is used to determine whether the received images are semantically relevant. A respective semantic correlation between each of at least one pair of the images is then estimated using respective bigram frequencies. The bigram frequencies are based on multiple search sessions in which each image of a pair of images is semantically relevant.

Type: Application

Filed: February 11, 2002

Publication date: October 2, 2003

Inventors: Mingjing Li, Zheng Chen, Liu Wenyin, Hong-Jiang Zhang
System and method for producing a video skim

Publication number: 20030184579

Abstract: A video skim is assembled by identifying one or more key frames from a video shot. Certain lengths of frames to the left and right of the key frame are measured for visual content variety. Depending upon the measured visual content variety to the left and right of the key frame, the video skim is assembled that has L frames to the left of the key frame and R frames to the right of the key frame. Measuring the visual content variety to the left and right of the key frame, provides a video skim that incorporates the more salient features of a shot.

Type: Application

Filed: March 29, 2002

Publication date: October 2, 2003

Inventors: Hong-Jiang Zhang, Dong Zhang
Clustering web queries

Publication number: 20030144994

Abstract: The described subject matter provides systems and procedures to make query similarity determinations, wherein the queries are used in information retrieval operations. A same document and/or multiple similar documents are identified that have been selected by a user in response to multiple queries. Responsive to identifying the same document and/or the similar documents, a query cluster is generated that indicates that the queries used to obtain the same and/or similar documents. This is accomplished in a manner that is independent of whether individual ones of the queries are compositionally similar with respect to other ones of the queries.

Type: Application

Filed: October 12, 2001

Publication date: July 31, 2003

Inventors: Ji-Rong Wen, Jian-Yun Nie, Ming-Jing Li, Hong-Jiang Zhang
Image blur detection methods and arrangements

Publication number: 20030138163

Abstract: An apparatus is provided for detecting blur in an image. The apparatus includes an image generator that is configured to generate a plurality of corresponding different resolution images based on a base image. The plurality of corresponding different resolution images is provided to an edge detector. The edge detector detects edge transitions in each of the plurality of corresponding different resolution images and provides edge maps to an edge parameter comparator. The edge parameter comparator compares corresponding edge parameters as detected by the edge detector and provides a result map to a blur calculator. The blur calculator determines at least one blur parameter based on the result map and provides the blur parameter to a blur detector. The blur detector then determines if the base image is blurred based on a comparison of the blur parameter with at least one blur parameter threshold.

Type: Application

Filed: February 26, 2003

Publication date: July 24, 2003

Inventors: Xiangrong Chen, Hong-Jiang Zhang
Methods and systems for adaptive delivery of multimedia contents

Publication number: 20030110236

Abstract: Methods and systems for generic adaptive multimedia content delivery are described. In one embodiment, a novel framework features an abstract content model and an abstract adaptive delivery decision engine. The abstract content model recognizes important aspects of contents while hiding their physical details from other parts of the framework. The decision engine then makes content adaptation plans based on the abstracted model of the contents and needs little knowledge of any physical details of the actual contents. Thus, under the same framework, adaptive delivery of generic contents is possible.

Type: Application

Filed: November 26, 2001

Publication date: June 12, 2003

Inventors: Yudong Yang, Hong-Jiang Zhang
Media agent

Publication number: 20030105589

Abstract: The described arrangements and procedures provide an intelligent media agent to autonomously collect semantic multimedia data text descriptions on behalf of a user whenever and wherever the user accesses media content. The media agent analyzes these semantic multimedia data text descriptions in view of user behavior patterns and actions to assist the user in identifying multimedia content and related information that is appropriate to the context within which the user is operating or working. For instance, the media agent detects insertion of text and analyzes the inserted text. Based on the analysis, the agent predicts whether a user intends to access media content. If so, the agent retrieves information corresponding to media content from a media content source and presents the information to a user as a suggestion.

Type: Application

Filed: November 30, 2001

Publication date: June 5, 2003

Inventors: Wen-Yin Liu, Hong-Jiang Zhang, Zheng Chen
Function-based object model for use in website adaptation

Publication number: 20030101203

Abstract: By understanding a website author's intention through an analysis of the function of a website, website content can be adapted for presentation or rendering in a manner that more closely appreciates and respects the function behind the website. Various inventive systems and methods analyze a website's function so that its content can be adapted to different client environments, e.g. devices, network conditions, or user preferences. A novel function-based object model automatically identifies objects associated with a website, and analyzes those objects in terms of their functions. The function-based object model permits consistent, informed decisions to be made in the adaptation process, so that web content is displayed not only in an organized manner, but in a manner that reflects the author's intention.

Type: Application

Filed: June 26, 2001

Publication date: May 29, 2003

Inventors: Jin-Lin Chen, Yudong Yang, Hong-Jiang Zhang
Automatic image orientation detection based on classification of low-level image features

Publication number: 20030099395

Abstract: The described arrangements and procedures identify an image's orientation by extracting features from peripheral portions of the image. The procedure evaluates the extracted features based on training image feature orientation classification models to identify the image's orientation.

Type: Application

Filed: November 27, 2001

Publication date: May 29, 2003

Inventors: Yongmei Wang, Hong-Jiang Zhang
Content-based characterization of video frame sequences

Publication number: 20030086496

Abstract: A system and process for video characterization that facilitates video classification and retrieval, as well as motion detection, applications. This involves characterizing a video sequence with a gray scale image having pixel levels that reflect the intensity of motion associated with a corresponding region in the sequence of video frames. The intensity of motion is defined using any of three characterizing processes. Namely, a perceived motion energy spectrum (PMES) characterizing process that represents object-based motion intensity over the sequence of frames, a spatio-temporal entropy (STE) characterizing process that represents the intensity of motion based on color variation at each pixel location, a motion vector angle entropy (MVAE) characterizing process which represents the intensity of motion based on the variation of motion vector angles.

Type: Application

Filed: September 25, 2001

Publication date: May 8, 2003

Inventors: Hong-Jiang Zhang, Yufei Ma
Image blur detection methods and arrangements

Patent number: 6548800

Abstract: An apparatus is provided for detecting blur in an image. The apparatus includes an image generator that is configured to generate a plurality of corresponding different resolution images based on a base image. The plurality of corresponding different resolution images is provided to an edge detector. The edge detector detects edge transitions in each of the plurality of corresponding different resolution images and provides edge maps to an edge parameter comparator. The edge parameter comparator compares corresponding edge parameters as detected by the edge detector and provides a result map to a blur calculator. The blur calculator determines at least one blur parameter based on the result map and provides the blur parameter to a blur detector. The blur detector then determines if the base image is blurred based on a comparison of the blur parameter with at least one blur parameter threshold.

Type: Grant

Filed: April 9, 2001

Date of Patent: April 15, 2003

Assignee: Microsoft Corporation

Inventors: Xiangrong Chen, Hong-Jiang Zhang
System and related methods for analyzing compressed media content

Patent number: 6501794

Abstract: A method comprising receiving media data in a compressed, digital domain, analyzing motion vectors associated with the received media content while in the compressed digital domain and identifying one or more objects in the received media data based, at least in part, on the motion vector analysis.

Type: Grant

Filed: May 22, 2000

Date of Patent: December 31, 2002

Assignee: Microsoft Corporate

Inventors: Ruoyu Roy Wang, Hong-Jiang Zhang, Ya-Qin Zhang
Relevance maximizing, iteration minimizing, relevance-feedback, content-based image retrieval (CBIR)

Publication number: 20020174120

Abstract: An implementation of a technology, described herein, for relevance-feedback, content-based facilitating accurate and efficient image retrieval minimizes the number of iterations for user feedback regarding the semantic relevance of exemplary images while maximizing the resulting relevance of each iteration. One technique for accomplishing this is to use a Bayesian classifier to treat positive and negative feedback examples with different strategies. In addition, query refinement techniques are applied to pinpoint the users' intended queries with respect to their feedbacks. These techniques further enhance the accuracy and usability of relevance feedback. This abstract itself is not intended to limit the scope of this patent. The scope of the present invention is pointed out in the appending claims.

Type: Application

Filed: March 30, 2001

Publication date: November 21, 2002

Inventors: Hong-Jiang Zhang, Zhong Su, Xingquan Zhu
Media content search engine incorporating text content and user log mining

Publication number: 20020161747

Abstract: Text features corresponding to pieces of media content (e.g., images, audio, multimedia content, etc.) are extracted from media content sources. One or more text features (e.g., one or more words) for a piece of media content are extracted from text associated with the piece of media content and text feature vectors generated therefrom and used during subsequent searching. Additional low-level feature vectors may also be extracted from the piece of media content and used during the subsequent searching. Relevance feedback can also be received from a user(s) identifying the relevance of pieces of media content rendered to the user in response to his or her search request. The relevance feedback is logged and can be used in determining how to respond to subsequent search requests, such as by modifying feature vectors (e.g., text feature vectors) corresponding to the pieces of media content for which relevance feedback is received.

Type: Application

Filed: March 13, 2001

Publication date: October 31, 2002

Inventors: Mingjing Li, Hong-Jiang Zhang, Wen-Yin Liu, Zheng Chen

prev … 6 7 8 9 10 11 next