Patents by Inventor Cha Zhang

Cha Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

EFFICIENT IMAGE DISPLAYING

Publication number: 20090220165

Abstract: Efficient image display on a display screen (e.g., in terms of number, space, resolution, and/or distortion) is facilitated by implementing one or more specialized select and pack routines for images. That is, representative images are selected from an image database, based on desired resolution and distortion, then resized and packed into a display arrangement that enhances use of display screen space. This allows, for example, images to be sent to a user from an image database more quickly, with more desirable resolution, and less distortion than traditional display techniques.

Type: Application

Filed: February 29, 2008

Publication date: September 3, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Zicheng Liu, Ying Shan, Cha Zhang, Gang Hua, Zhengyou Zhang
MANAGEMENT OF SPLIT AUDIO/VIDEO STREAMS

Publication number: 20090172779

Abstract: Described herein is a method that includes receiving multiple requests for access to an exposed media object, wherein the exposed media object represents a live media stream that is being generated by a media source. The method also includes receiving data associated with each entity that provided a request, and determining, for each entity, whether the entities that provided the request are authorized to access the media stream based at least in part upon the received data and splitting the media stream into multiple media streams, wherein a number of media streams corresponds to a number of authorized entities. The method also includes automatically applying at least one policy to at least one of the split media streams based at least in part upon the received data.

Type: Application

Filed: January 2, 2008

Publication date: July 2, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Rajesh K. Hegde, Cha Zhang, Philip A. Chou, Zicheng Liu
Distributed hosting of web content using partial replication

Patent number: 7546342

Abstract: The mechanisms described herein are directed at hosting content of a web site on multiple computing devices. A relative importance for each file associated with the web site is calculated. This relative importance is used to calculate several subsets of the content which are distributed to several devices within a computer cluster, such as a server array, peer-to-peer network, and the like. The subsets may include coded messages created using an erasure coding scheme on packets containing portions of one or more files. Upon retrieving a file, a fixed number of distinct coded messages are retrieved from the devices based on the erasure coding scheme. The file is re-created with these distinct messages. Because multiple devices hold the content, the web site may be retrieved significantly faster and the reliability is increased without consuming a large amount of storage space or bandwidth of any one computing device.

Type: Grant

Filed: May 14, 2004

Date of Patent: June 9, 2009

Assignee: Microsoft Corporation

Inventors: Jin Li, Cha Zhang
LEARNING CLASSIFIERS USING COMBINED BOOSTING AND WEIGHT TRIMMING

Publication number: 20090018981

Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.

Type: Application

Filed: July 13, 2007

Publication date: January 15, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Cha Zhang, Paul Viola
MULTIPLE-INSTANCE PRUNING FOR LEARNING EFFICIENT CASCADE DETECTORS

Publication number: 20090018980

Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.

Type: Application

Filed: July 13, 2007

Publication date: January 15, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Cha Zhang, Paul Viola
HISTOGRAM-BASED CLASSIFIERS HAVING VARIABLE BIN SIZES

Publication number: 20090018985

Abstract: A “Classifier Trainer” trains a combination classifier for detecting specific objects in signals (e.g., faces in images, words in speech, patterns in signals, etc.). In one embodiment “multiple instance pruning” (MIP) is introduced for training weak classifiers or “features” of the combination classifier. Specifically, a trained combination classifier and associated final threshold for setting false positive/negative operating points are combined with learned intermediate rejection thresholds to construct the combination classifier. Rejection thresholds are learned using a pruning process which ensures that objects detected by the original combination classifier are also detected by the combination classifier, thereby guaranteeing the same detection rate on the training set after pruning. The only parameter required throughout training is a target detection rate for the final cascade system.

Type: Application

Filed: July 13, 2007

Publication date: January 15, 2009

Applicant: MICROSOFT CORPORATION

Inventors: Cha Zhang, Paul Viola
VIDEO NOISE REDUCTION

Publication number: 20080317371

Abstract: A video noise reduction technique is presented. Generally, the technique involves first decomposing each frame of the video into low-pass and high-pass frequency components. Then, for each frame of the video after the first frame, an estimate of a noise variance in the high pass component is obtained. The noise in the high pass component of each pixel of each frame is reduced using the noise variance estimate obtained for the frame under consideration, whenever there has been no substantial motion exhibited by the pixel since the last previous frame. Evidence of motion is determined by analyzing the high and low pass components.

Type: Application

Filed: June 19, 2007

Publication date: December 25, 2008

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Zhengyou Zhang, Zicheng Liu
Learning image enhancement

Publication number: 20080279467

Abstract: Image enhancement techniques are described to enhance an image in accordance with a set of training images. In an implementation, an image color tone map is generated for a facial region included in an image. The image color tone map may be normalized to a color tone map for a set of training images so that the image color tone map matches the map for the training images. The normalized color tone map may be applied to the image to enhance the in-question image. In further implementations, the procedure may be updated when the average color intensity in non-facial regions differs from an accumulated mean by a threshold amount.

Type: Application

Filed: May 10, 2007

Publication date: November 13, 2008

Applicant: Microsoft Corporation

Inventors: Zicheng Liu, Cha Zhang, Zhengyou Zhang
Enhanced Beamforming for Arrays of Directional Microphones

Publication number: 20080240463

Abstract: A novel enhanced beamforming technique that improves beamforming operations by incorporating a model for the directional gains of the sensors, such as microphones, and provides means of estimating these gains. The technique forms estimates of the relative magnitude responses of the sensors (e.g., microphones) based on the data received at the array and includes those in the beamforming computations.

Type: Application

Filed: March 29, 2007

Publication date: October 2, 2008

Applicant: Microsoft Corporation

Inventors: Dinei A. Florencio, Cha Zhang, Demba Elimane Ba
Fast Landmark Detection Using Regression Methods

Publication number: 20080187213

Abstract: A landmark detection technique that can quickly detect both objects of interest and landmarks within the objects in an image using regression methods. The present fast landmark detection scheme reuses existing feature values used for object detection (e.g., face detection) to find the landmarks in an object (e.g., the eyes and mouth of the face). Hence, the technique provides landmark detection functionality at almost no cost.

Type: Application

Filed: February 6, 2007

Publication date: August 7, 2008

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Paul Viola, Sang Min Oh
MULTI-SENSOR SOUND SOURCE LOCALIZATION

Publication number: 20080181430

Abstract: A multi-sensor sound source localization (SSL) technique is presented which provides a true maximum likelihood (ML) treatment for microphone arrays having more than one pair of audio sensors. Generally, this is accomplished by selecting a sound source location that results in a time of propagation from the sound source to the audio sensors of the array, which maximizes a likelihood of simultaneously producing audio sensor output signals inputted from all the sensors in the array. The likelihood includes a unique term that estimates an unknown audio sensor response to the source signal for each of the sensors in the array.

Type: Application

Filed: January 26, 2007

Publication date: July 31, 2008

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Dinei Florencio, Zhengyou Zhang
Identification Of People Using Multiple Types Of Input

Publication number: 20070297682

Abstract: Systems and methods for detecting people or speakers in an automated fashion are disclosed. A pool of features including more than one type of input (like audio input and video input) may be identified and used with a learning algorithm to generate a classifier that identifies people or speakers. The resulting classifier may be evaluated to detect people or speakers.

Type: Application

Filed: June 22, 2006

Publication date: December 27, 2007

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Paul A. Viola, Pei Yin, Ross G. Cutler, Xinding Sun, Yong Rui
BACKGROUND BLURRING FOR VIDEO CONFERENCING

Publication number: 20070286520

Abstract: Background blurring is an effective way to both preserve privacy and keep communication effective during video conferencing. The present image background blurring technique is a light weight real-time technique to perform background blurring using a fast background modeling procedure combined with an object (e.g., face) detector/tracker. A soft decision is made at each pixel whether it belongs to the foreground or the background based on multiple vision features. The classification results are mapped to a per-pixel blurring radius image to blur the background. In another embodiment, the image background blurring technique blurs the background of the image without using the object detector.

Type: Application

Filed: June 7, 2006

Publication date: December 13, 2007

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Li-wei He, Yong Rui
Image segmentation using spatial-color gaussian mixture models

Publication number: 20070237393

Abstract: A spatial-color Gaussian mixture model (SCGMM) image segmentation technique for segmenting images. The SCGMM image segmentation technique specifies foreground objects in the first frame of an image sequence, either manually or automatically. From the initial segmentation, the SCGMM segmentation system learns two spatial-color Gaussian mixture models (SCGMM) for the foreground and background objects. These models are built into a first-order Markov random field (MRF) energy function.

Type: Application

Filed: March 30, 2006

Publication date: October 11, 2007

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Michael Cohen, Yong Rui, Ting Yu
Combined digital and mechanical tracking of a person or object using a single video camera

Publication number: 20070120979

Abstract: A combined digital and mechanical tracking system and process for generating a video using a single digital video camera that tracks a person or object of interest moving in a scene is presented. This generally involves operating the camera at a higher resolution than is needed for the application, and cropping a sub-region out of the image captured that is output as the output video. The person or object being tracked is at least partially contained within the cropped sub-region. As the person or object moves within the field of view of the camera, the location of the cropped sub-region is also moved so as to keep the subject of interest within its boundaries. When the subject of interest moves to the boundary of the FOV of the camera, the camera is mechanically panned to keep the person or object inside its FOV.

Type: Application

Filed: November 21, 2005

Publication date: May 31, 2007

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Li-wei He, Yong Rui
Distributed presentations employing inputs from multiple video cameras located at multiple sites and customizable display screen configurations

Publication number: 20070118868

Abstract: A computer network-based distributed presentation system and process is presented that controls the display of one or more video streams output by multiple video cameras located across multiple presentation sites on display screens located at each presentation site. The distributed presentation system and process provides the ability for a user at a site to customize the screen configuration (i.e., what video streams are display at any one time and in what format) for that site via a two-layer display director module. In the design layer of the module, a user interface is provided for a user to specify display priorities dictating what video streams are to be displayed on the screen over time. These display priorities are then provided to the execution layer of the module which translates them into probabilistic timed automata and uses the automata to control what is displayed on the display screen.

Type: Application

Filed: November 23, 2005

Publication date: May 24, 2007

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Bin Yu, Yong Rui
Rebinning methods and arrangements for use in compressing image-based rendering (IBR) data

Patent number: 7110617

Abstract: Rebinning methods and arrangements are provided that significantly improve the 3D wavelet compression performance of the image based rendering data, such as, e.g., concentric mosaic image data. Through what is essentially a selective cutting and pasting process the image data is divided into stripes that are then used to form a set of multi-perspective panoramas. The rebinning process greatly improves the performance of the cross shot filtering, and thus improves the transform and coding efficiency of 3D wavelet codecs. While the region of support after rebinning may cease to be rectangular in some cases, a padding scheme and an arbitrary shape wavelet coder can be implemented to encode the result data volume of the smart rebinning. With an arbitrary shape wavelet codec, the rebinning outperforms MPEG-2 by 3.7 dB, outperforms direct 3D wavelet coder by 4.3 dB, and outperforms a reference block coder (RBC) by 3.2 dB on certain tested concentric mosaic image scenes.

Type: Grant

Filed: October 18, 2004

Date of Patent: September 19, 2006

Assignee: Microsoft Corporation

Inventors: Cha Zhang, Jin Li, Yunnan Wu
Rebinning methods and arrangements for use in compressing image-based rendering (IBR) data

Patent number: 7065260

Abstract: Rebinning methods and arrangements are provided that significantly improve the 3D wavelet compression performance of the image based rendering data, such as, e.g., concentric mosaic image data. Through what is essentially a selective cutting and pasting process the image data is divided into stripes that are then used to form a set of multi-perspective panoramas. The rebinning process greatly improves the performance of the cross shot filtering, and thus improves the transform and coding efficiency of 3D wavelet codecs. While the region of support after rebinning may cease to be rectangular in some cases, a padding scheme and an arbitrary shape wavelet coder can be implemented to encode the result data volume of the smart rebinning. With an arbitrary shape wavelet codec, the rebinning outperforms MPEG-2 by 3.7 dB, outperforms direct 3D wavelet coder by 4.3 dB, and outperforms a reference block coder (RBC) by 3.2 dB on certain tested concentric mosaic image scenes.

Type: Grant

Filed: October 13, 2004

Date of Patent: June 20, 2006

Assignee: Microsoft Corporation

Inventors: Cha Zhang, Jin Li, Yunnan Wu
Rebinning methods and arrangements for use in compressing image-based rendering (IBR) data

Patent number: 6996294

Abstract: Rebinning methods and arrangements are provided that significantly improve the 3D wavelet compression performance of the image based rendering data, such as, e.g., concentric mosaic image data. Through what is essentially a selective cutting and pasting process the image data is divided into stripes that are then used to form a set of multi-perspective panoramas. The rebinning process greatly improves the performance of the cross shot filtering, and thus improves the transform and coding efficiency of 3D wavelet codecs. While the region of support after rebinning may cease to be rectangular in some cases, a padding scheme and an arbitrary shape wavelet coder can be implemented to encode the result data volume of the smart rebinning. With an arbitrary shape wavelet codec, the rebinning outperforms MPEG-2 by 3.7 dB, outperforms direct 3D wavelet coder by 4.3 dB, and outperforms a reference block coder (RBC) by 3.2 dB on certain tested concentric mosaic image scenes.

Type: Grant

Filed: October 18, 2004

Date of Patent: February 7, 2006

Assignee: Microsoft Corporation

Inventors: Cha Zhang, Jin Li, Yunnan Wu
Methods and arrangements for handling concentric mosaic image data

Patent number: 6993074

Abstract: Methods and arrangements are provided for compressing, transporting and decompressing/rendering concentric mosaic image data. The methods and arrangements compress concentric mosaic image data using reference block coding (RBC) techniques. Such RBC techniques selectively divide each of the frames of the concentric mosaic image data into blocks, and then predictively encodes each of these blocks. Some of the blocks are independently encoded as anchor blocks. Each of the remaining blocks is encoded as a predicted block with motion compensation to the anchor frame. The resulting compressed data file includes indexing information that can be used to selectively, randomly access the compressed data during decompression/rendering. A bitstream can be selectively tailored to provide portions of the compressed data file, as needed, for example, during rendering of a particular user-selected view of the concentric mosaic image scene.

Type: Grant

Filed: March 19, 2001

Date of Patent: January 31, 2006

Assignee: Microsoft Corporation

Inventors: Cha Zhang, Jin Li

prev 1 2 3 4 5 6 next