Patents by Inventor Hong Jiang Zhang

Hong Jiang Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Content-based characterization of video frame sequences

Patent number: 9779303

Abstract: A system and process for video characterization that facilitates video classification and retrieval, as well as motion detection, applications. This involves characterizing a video sequence with a gray scale image having pixel levels that reflect the intensity of motion associated with a corresponding region in the sequence of video frames. The intensity of motion is defined using any of three characterizing processes. Namely, a perceived motion energy spectrum (PMES) characterizing process that represents object-based motion intensity over the sequence of frames, a spatio-temporal entropy (STE) characterizing process that represents the intensity of motion based on color variation at each pixel location, a motion vector angle entropy (MVAE) characterizing process which represents the intensity of motion based on the variation of motion vector angles.

Type: Grant

Filed: September 2, 2004

Date of Patent: October 3, 2017

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Hong-Jiang Zhang, Yufei Ma
Small form factor web browsing

Patent number: 9483577

Abstract: A large web page is analyzed and partitioned into smaller sub-pages so that a user can navigate the web page on a small form factor device. The user can browse the sub-pages to find and read information in the content of the large web page. The partitioning can be performed at a web server, an edge server, at the small form factor device, or can be distributed across one or more such devices. The analysis leverages design habits of a web page author to extract a representation structure of an authored web page. The extracted representation structure includes high level structure using several markup language tag selection rules and low level structure using visual boundary detection in which visual units of the low level structure are provided by clustering markup language tags. User viewing habits can be learned to display favorite parts of a web page.

Type: Grant

Filed: September 12, 2011

Date of Patent: November 1, 2016

Assignee: Microsoft Technology Licensing, LLC

Inventors: Yu Chen, Hong-Jiang Zhang, Wei-Ying Ma, Ming-Yu Wang
Tag association with image regions

Patent number: 9047319

Abstract: A computing device configured to determine that one or more regions of an image are associated with a tag of the image is described herein. The computing device is further configured to determine one or more attribute tags describing at least one of the content or context of the one or more regions. Upon determining the attribute tags, the computing device associates the attribute tags with the tag to enable image searching based on the tag and attribute tags.

Type: Grant

Filed: December 17, 2010

Date of Patent: June 2, 2015

Assignee: Microsoft Technology Licensing, LLC

Inventors: Xian-Sheng Hua, Kuiyuan Yang, Meng Wang, Hong-Jiang Zhang
Head pose assessment methods and systems

Patent number: 8457358

Abstract: Improvements are provided to effectively assess a user's face and head pose such that a computer or like device can track the user's attention towards a display device(s). Then the region of the display or graphical user interface that the user is turned towards can be automatically selected without requiring the user to provide further inputs. A frontal face detector is applied to detect the user's frontal face and then key facial points such as left/right eye center, left/right mouth corner, nose tip, etc., are detected by component detectors. The system then tracks the user's head by an image tracker and determines yaw, tilt and roll angle and other pose information of the user's head through a coarse to fine process according to key facial points and/or confidence outputs by pose estimator.

Type: Grant

Filed: February 16, 2012

Date of Patent: June 4, 2013

Assignee: Microsoft Corporation

Inventors: Yuxiao Hu, Hong-Jiang Zhang, Mingjing Li, Lei Zhang
Robust large-scale visual codebook construction

Patent number: 8422802

Abstract: Techniques for construction of a visual codebook are described herein. Feature points may be extracted from large numbers of images. In one example, images providing N feature points may be used to construct a codebook of K words. The centers of each of K clusters of feature points may be initialized. In a looping or iterative manner, an assignment step assigns each feature point to a cluster and an update step locates a center of each cluster. The feature points may be assigned to a cluster based on a lesser of a distance to a center of a previously assigned cluster and a distance to a center derived by operation of an approximate nearest neighbor algorithm having aspects of randomization. The loop terminates when the feature points have sufficiently converged to their respective clusters. Centers of the clusters represent visual words, which may be used to construct the visual codebook.

Type: Grant

Filed: March 31, 2011

Date of Patent: April 16, 2013

Assignee: Microsoft Corporation

Inventors: Linjun Yang, Darui Li, Xian-Sheng Hua, Hong-Jiang Zhang
Robust Large-Scale Visual Codebook Construction

Publication number: 20120251007

Abstract: Techniques for construction of a visual codebook are described herein. Feature points may be extracted from large numbers of images. In one example, images providing N feature points may be used to construct a codebook of K words. The centers of each of K clusters of feature points may be initialized. In a looping or iterative manner, an assignment step assigns each feature point to a cluster and an update step locates a center of each cluster. The feature points may be assigned to a cluster based on a lesser of a distance to a center of a previously assigned cluster and a distance to a center derived by operation of an approximate nearest neighbor algorithm having aspects of randomization. The loop terminates when the feature points have sufficiently converged to their respective clusters. Centers of the clusters represent visual words, which may be used to construct the visual codebook.

Type: Application

Filed: March 31, 2011

Publication date: October 4, 2012

Applicant: Microsoft Corporation

Inventors: Linjun Yang, Darui Li, Xian-Sheng Hua, Hong-Jiang Zhang
Tag Association with Image Regions

Publication number: 20120158721

Abstract: A computing device configured to determine that one or more regions of an image are associated with a tag of the image is described herein. The computing device is further configured to determine one or more attribute tags describing at least one of the content or context of the one or more regions. Upon determining the attribute tags, the computing device associates the attribute tags with the tag to enable image searching based on the tag and attribute tags.

Type: Application

Filed: December 17, 2010

Publication date: June 21, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Xian-Sheng Hua, Kuiyuan Yang, Meng Wang, Hong-Jiang Zhang
Image Tag Refinement

Publication number: 20120158686

Abstract: A computing device configured to determine a subset of the tags associated with at least one image of a plurality of received, tagged images is described herein. The computing device performs the determining based on one or more measures of consistency of visual similarity between ones of the images with semantic similarity between tags of the ones of the images.

Type: Application

Filed: December 17, 2010

Publication date: June 21, 2012

Applicant: MICROSOFT CORPORATION

Inventors: Xian-Sheng Hua, Dong Liu, Meng Wang, Hong-Jiang Zhang
Head Pose Assessment Methods And Systems

Publication number: 20120139832

Abstract: Improvements are provided to effectively assess a user's face and head pose such that a computer or like device can track the user's attention towards a display device(s). Then the region of the display or graphical user interface that the user is turned towards can be automatically selected without requiring the user to provide further inputs. A frontal face detector is applied to detect the user's frontal face and then key facial points such as left/right eye center, left/right mouth corner, nose tip, etc., are detected by component detectors. The system then tracks the user's head by an image tracker and determines yaw, tilt and roll angle and other pose information of the user's head through a coarse to fine process according to key facial points and/or confidence outputs by pose estimator.

Type: Application

Filed: February 16, 2012

Publication date: June 7, 2012

Applicant: Microsoft Corporation

Inventors: Yuxiao HU, Lei Zhang, Mingjing Li, Hong-Jiang Zhang
Tag ranking

Patent number: 8175847

Abstract: Technologies for generating a boosted tag ranking for a media instance, the boosted tag ranking based on probabilistic relevance estimation computed by a probabilistic relevance estimator and tag correlation refining performed by a tag correlation refiner. Such boosted tag rankings may be used for search result ranking, tag recommendation, and group recommendation.

Type: Grant

Filed: March 31, 2009

Date of Patent: May 8, 2012

Assignee: Microsoft Corporation

Inventors: Hong-Jiang Zhang, Dong Liu, Meng Wang, Linjun Yang, Xian-Sheng Hua
Streaming methods and systems

Patent number: 8144769

Abstract: Various embodiments provide methods and systems for streaming data that can facilitate streaming during bandwidth fluctuations in a manner that can enhance the user experience. In one aspect, a forward-shifting technique is utilized to buffer data that is to be streamed, e.g. an enhancement layer in a FGS stream. Various techniques can drop layers actively when bandwidth is constant. The saved bandwidth can then be used to pre-stream enhancement layer portions. In another aspect, a content-aware decision can be made as to how to drop enhancement layers when bandwidth decreases. During periods of decreasing bandwidth, if a video segment does not contain important content, the enhancement layers will be dropped to keep the forward-shifting of the enhancement layer unchanged. If the enhancement layer does contain important content, it will be transmitted later when bandwidth increases.

Type: Grant

Filed: January 19, 2009

Date of Patent: March 27, 2012

Assignee: Microsoft Corporation

Inventors: Tianming Liu, Hong-Jiang Zhang, Wei Qi
Head pose assessment methods and systems

Patent number: 8135183

Abstract: Improvements are provided to effectively assess a user's face and head pose such that a computer or like device can track the user's attention towards a display device(s). Then the region of the display or graphical user interface that the user is turned towards can be automatically selected without requiring the user to provide further inputs. A frontal face detector is applied to detect the user's frontal face and then key facial points such as left/right eye center, left/right mouth corner, nose tip, etc., are detected by component detectors. The system then tracks the user's head by an image tracker and determines yaw, tilt and roll angle and other pose information of the user's head through a coarse to fine process according to key facial points and/or confidence outputs by pose estimator.

Type: Grant

Filed: November 5, 2010

Date of Patent: March 13, 2012

Assignee: Microsoft Corporation

Inventors: Yuxiao Hu, Lei Zhang, Mingjing Li, Hong-Jiang Zhang
Kernelized spatial-contextual image classification

Patent number: 8131086

Abstract: Kernelized spatial-contextual image classification is disclosed. One embodiment comprises generating a first spatial-contextual model to represent a first image, the first spatial-contextual model having a plurality of interconnected nodes arranged in a first pattern of connections with each node connected to at least one other node, generating a second spatial-contextual model to represent a second image using the first pattern of connections, and estimating the distance between corresponding nodes in the first spatial-contextual model and the second spatial-contextual model based on a relationship with adjacent connected nodes to determine a distance between the first image and the second image.

Type: Grant

Filed: September 24, 2008

Date of Patent: March 6, 2012

Assignee: Microsoft Corporation

Inventors: Xian-Sheng Hua, Guo-Jun Qi, Yong Rui, Hong-Jiang Zhang
Function-based object model for use in WebSite adaptation

Patent number: 8122345

Abstract: By understanding a website author's intention through an analysis of the function of a website, website content can be adapted for presentation or rendering in a manner that more closely appreciates and respects the function behind the website. Various inventive systems and methods analyze a website's function so that its content can be adapted to different client environments, e.g. devices, network conditions, or user preferences. A novel function-based object model automatically identifies objects associated with a website, and analyzes those objects in terms of their functions. The function-based object model permits consistent, informed decisions to be made in the adaptation process, so that web content is displayed not only in an organized manner, but in a manner that reflects the author's intention.

Type: Grant

Filed: November 4, 2008

Date of Patent: February 21, 2012

Assignee: Microsoft Corporation

Inventors: Jin-Lin Chen, Yudong Yang, Hong-Jiang Zhang
Generating a motion attention model

Patent number: 8098730

Abstract: Systems and methods to generate a motion attention model of a video data sequence are described. In one aspect, a motion saliency map B is generated to precisely indicate motion attention areas for each frame in the video data sequence. The motion saliency maps are each based on intensity I, spatial coherence Cs, and temporal coherence Ct values. These values are extracted from each block or pixel in motion fields that are extracted from the video data sequence. Brightness values of detected motion attention areas in each frame are accumulated to generate, with respect to time, the motion attention model.

Type: Grant

Filed: April 3, 2006

Date of Patent: January 17, 2012

Assignee: Microsoft Corporation

Inventors: Yu-Fei Ma, Hong-Jiang Zhang
Streaming methods and systems

Patent number: 8094719

Abstract: Various embodiments provide methods and systems for streaming data that can facilitate streaming during bandwidth fluctuations in a manner that can enhance the user experience. In one aspect, a forward-shifting technique is utilized to buffer data that is to be streamed, e.g. an enhancement layer in a FGS stream. Various techniques can drop layers actively when bandwidth is constant. The saved bandwidth can then be used to pre-stream enhancement layer portions. In another aspect, a content-aware decision can be made as to how to drop enhancement layers when bandwidth decreases. During periods of decreasing bandwidth, if a video segment does not contain important content, the enhancement layers will be dropped to keep the forward-shifting of the enhancement layer unchanged. If the enhancement layer does contain important content, it will be transmitted later when bandwidth increases.

Type: Grant

Filed: January 20, 2009

Date of Patent: January 10, 2012

Assignee: Microsoft Corporation

Inventors: Tianming Liu, Hong-Jiang Zhang, Wei Qi
Small Form Factor Web Browsing

Publication number: 20120005565

Abstract: A large web page is analyzed and partitioned into smaller sub-pages so that a user can navigate the web page on a small form factor device. The user can browse the sub-pages to find and read information in the content of the large web page. The partitioning can be performed at a web server, an edge server, at the small form factor device, or can be distributed across one or more such devices. The analysis leverages design habits of a web page author to extract a representation structure of an authored web page. The extracted representation structure includes high level structure using several markup language tag selection rules and low level structure using visual boundary detection in which visual units of the low level structure are provided by clustering markup language tags. User viewing habits can be learned to display favorite parts of a web page.

Type: Application

Filed: September 12, 2011

Publication date: January 5, 2012

Applicant: Microsoft Corporation

Inventors: Yu Chen, Wei-Ying Ma, Ming-Yu Wang, Hong Jiang Zhang
Multi-label active learning

Patent number: 8086549

Abstract: Multi-label active learning may entail training a classifier with a set of training samples having multiple labels per sample. In an example embodiment, a method includes accepting a set of training samples, with the set of training samples having multiple respective samples that are each respectively associated with multiple labels. The set of training samples is analyzed to select a sample-label pair responsive to at least one error parameter. The selected sample-label pair is then submitted to an oracle for labeling.

Type: Grant

Filed: December 17, 2007

Date of Patent: December 27, 2011

Assignee: Microsoft Corporation

Inventors: Guo-Jun Qi, Xian-Sheng Hua, Yong Rui, Hong-Jiang Zhang, Shipeng Li
Region detection

Patent number: 8086031

Abstract: Disclosed herein are systems methods and devices related to region detection of an image. Detected regions include pixels of a particular one or more colors without requiring faces within the image to be previously detected. Region detection may include receiving information that a flash was used to capture the image or that return light was detected in the image.

Type: Grant

Filed: May 13, 2009

Date of Patent: December 27, 2011

Assignee: Microsoft Corporation

Inventors: Tong-Xian Chen, Xiangrong Chen, John C. Platt, Jie Yan, Hong-Jiang Zhang
Small form factor web browsing

Patent number: 8020090

Abstract: A large web page is analyzed and partitioned into smaller sub-pages so that a user can navigate the web page on a small form factor device. The user can browse the sub-pages to find and read information in the content of the large web page. The partitioning can be performed at a web server, an edge server, at the small form factor device, or can be distributed across one or more such devices. The analysis leverages design habits of a web page author to extract a representation structure of an authored web page. The extracted representation structure includes high level structure using several markup language tag selection rules and low level structure using visual boundary detection in which visual units of the low level structure are provided by clustering markup language tags. User viewing habits can be learned to display favorite parts of a web page.

Type: Grant

Filed: August 18, 2006

Date of Patent: September 13, 2011

Assignee: Microsoft Corporation

Inventors: Yu Chen, Wei-Ying Ma, Ming-Yu Wang, Hong Jiang Zhang

1 2 3 4 5 … next