Patents by Inventor Zhengyou Zhang

Zhengyou Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Lighting array control

Patent number: 7869705

Abstract: A subject captured by a camera may be affected by environmental lighting provided by nearby light sources and the sun or moon, which may cause underexposure or overexposure of the image or aesthetically displeasing color tones. Image processing and camera adjustments may mitigate some imaging problems with limited effect and introduce undesirable side effects. A lighting array may be devised to expose the subject to various types of light (e.g., white light comprising full spectrum illumination and red, green, and blue lights comprising partial spectrum illumination) to resolve lighting problems in a more effective manner. Moreover, the lighting array may be responsively controlled to adjust the subject image with respect to one or more target spectra specifying desirable colors for the subject image. The lighting array may be iteratively controlled, e.g. by a gradient descent algorithm, for incrementally adjusting parameters with respect to proximate target spectra for the image.

Type: Grant

Filed: January 21, 2008

Date of Patent: January 11, 2011

Assignee: Microsoft Corporation

Inventors: Zicheng Liu, Mingxuan Sun, Jingyu Qiu, Zhengyou Zhang, Michael J. Sinclair
BOOSTED FACE VERIFICATION

Publication number: 20100329517

Abstract: Techniques for face verification are described. Local binary pattern (LBP) features and boosting classifiers are used to verify faces in images. A boosted multi-task learning algorithm is used for face verification in images. Finally, boosted face verification is used to verify faces in videos.

Type: Application

Filed: June 26, 2009

Publication date: December 30, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Cha Zhang, Xiaogang Wang, Zhengyou Zhang
Adaptive Meeting Management

Publication number: 20100318399

Abstract: A template and/or knowledge associated with a synchronous meeting are obtained by a computing device. The computing device then adaptively manages the synchronous meeting based at least in part on the template and/or knowledge.

Type: Application

Filed: June 15, 2009

Publication date: December 16, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Jin Li, James E. Oker, Rajesh K. Hegde, Dinei Afonso Ferreira Florencio, Michel Pahud, Sharon K. Cunnington, Philip A. Chou, Zhengyou Zhang
Persistent spatial collaboration

Patent number: 7853886

Abstract: Persistent, spatial collaboration on the web supports a free-form, user-intuitive approach to a variety of projects and activities. Users can place differing object types at any time any where on a web page and/or the system can automatically, and with no user effort, affect object placement based on one or more meta data characteristics. A user can, in real-time, see changes made by another user to a web page, and, if desired, react accordingly, enabling true collaboration even if the various users are at remote locations. The flexibility of the methodology and system provides a platform for users to engage in projects and activities in a manner and environment suited to the users' mind sets, creativity, and natural proclivities.

Type: Grant

Filed: February 27, 2007

Date of Patent: December 14, 2010

Assignee: Microsoft Corporation

Inventors: Steven M. Drucker, Aamer Hydrie, Li-wei He, Rajesh K. Hegde, Zhengyou Zhang
System and method providing improved head motion estimations for animation

Patent number: 7853053

Abstract: The computer-readable media provides improved procedures to estimate head motion between two images of a face. Locations of a number of distinct facial features are determined in two images. The locations are converted into as a set of physical face parameters based on the symmetry of the identified distinct facial features. An estimation objective function is determined by: (a) estimating each of the set of physical parameters, (b) estimating a first head pose transform corresponding to the first image, and (c) estimating a second head pose transform corresponding to the second image. The motion is estimated between the two images based on the set of physical face parameters by multiplying each term of the estimation objective function by a weighted contribution factor based on the confidence of data corresponding to the estimation objective function.

Type: Grant

Filed: March 31, 2010

Date of Patent: December 14, 2010

Assignee: Microsoft Corporation

Inventors: Zicheng Liu, Zhengyou Zhang
SPATIALIZED AUDIO OVER HEADPHONES

Publication number: 20100303266

Abstract: A spatial element is added to communications, including over telephone conference calls heard through headphones or a stereo speaker setup. Functions are created to modify signals from different callers to create the illusion that the callers are speaking from different parts of the room.

Type: Application

Filed: May 26, 2009

Publication date: December 2, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Wei-ge Chen, Zhengyou Zhang
Participant positioning in multimedia conferencing

Patent number: 7840638

Abstract: A multimedia conference technique is disclosed that allows physically remote users to participate in an immersive telecollaborative environment by synchronizing multiple data, images and sounds. The multimedia conference implementation provides users with the perception of being in the same room visually as well as acoustically according to an orientation plan which reflects each remote user's position within the multimedia conference environment.

Type: Grant

Filed: June 27, 2008

Date of Patent: November 23, 2010

Assignee: Microsoft Corporation

Inventors: Zhengyou Zhang, Xuedong David Huang, Zicheng Liu, Cha Zhang, Philip A. Chou, Christian Huitema
VIDEO CAPTURE DEVICE PROVIDING MULTIPLE RESOLUTION VIDEO FEEDS

Publication number: 20100289904

Abstract: Systems are disclosed that provide improved transfer speed of video data from a video capture device to a computing device using multiple video feeds respectively comprising different resolutions. A high-resolution image sensor is used to convert light images into a high-resolution video data stream. A down sampler converts the high-resolution video data stream to a low-resolution video data stream, so that both a low-resolution data stream and a high-resolution data stream are available. While the low resolution-data stream can be sent to the computing device, a digital signal processor (DSP) processes the high-resolution video data stream in accordance with an input control signal that is comprised of desired high-resolution video stream parameters derived from the low-resolution video data stream.

Type: Application

Filed: May 15, 2009

Publication date: November 18, 2010

Applicant: Microsoft Corporation

Inventors: Cha Zhang, Zhengyou Zhang, Zicheng Liu, Wanghong Yuan, Christian Huitema
Calibration based beamforming, non-linear adaptive filtering, and multi-sensor headset

Patent number: 7813923

Abstract: A first set of signals from an array of one or more microphones, and a second signal from a reference microphone are used to calibrate a set of filter parameters such that the filter parameters minimize a difference between the second signal and a beamformer output signal that is based on the first set of signals. Once calibrated, the filter parameters are used to form a beamformer output signal that is filtered using a non-linear adaptive filter that is adapted based on portions of a signal that do not contain speech, as determined by a speech detection sensor.

Type: Grant

Filed: October 14, 2005

Date of Patent: October 12, 2010

Assignee: Microsoft Corporation

Inventors: Alejandro Acero, Michael L. Seltzer, Zhengyou Zhang, Zicheng Liu
AMBULATORY PRESENCE FEATURES

Publication number: 20100245536

Abstract: The claimed subject matter provides a system and/or a method that facilitates managing one or more devices utilized for communicating data within a telepresence session. A telepresence session can be initiated within a communication framework that includes two or more virtually represented users that communicate therein. A device can be utilized by at least one virtually represented user that enables communication within the telepresence session, the device includes at least one of an input to transmit a portion of a communication to the telepresence session or an output to receive a portion of a communication from the telepresence session. A detection component can adjust at least one of the input related to the device or the output related to the device based upon the identification of a cue, the cue is at least one of a movement detected, an event detected, or an ambient variation.

Type: Application

Filed: March 30, 2009

Publication date: September 30, 2010

Applicant: Microsoft Corporation

Inventors: Christian Huitema, William A.S. Buxton, Jonathan E. Paff, Zicheng Liu, Rajesh Kutpadi Hegde, Zhengyou Zhang, Kori Marie Quinn, Jin Li, Michel Pahud
SMART MEETING ROOM

Publication number: 20100228825

Abstract: The claimed subject matter provides a system and/or a method that facilitates enhancing the employment of a telepresence session. An automatic telepresence engine that can evaluate data associated with at least one of an attendee, a schedule for an attendee, or a portion of an electronic communication for an attendee. The automatic telepresence engine can identify at least one the following for a telepresence session based upon the evaluated data: a participant to include for the telepresence session, a portion of data related to a presentation within the telepresence session, a portion of data related to a meeting topic within the telepresence session, a device utilized by an attendee to communicate within the telepresence session. The automatic telepresence engine can initiate the telepresence session within a communication framework that includes two or more virtually represented users that communicate therein.

Type: Application

Filed: March 6, 2009

Publication date: September 9, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Rajesh Kutpadi Hegde, Xuedong David Huang, Sharon Kay Cunnington, Jin Li, Michel Pahud, Ryan M. Burkhardt, Kori Marie Quinn, Jayman Dalal, Zhengyou Zhang
2-D barcode recognition

Patent number: 7780084

Abstract: Systems and methods for 2-D barcode recognition are described. In one aspect, the systems and methods use a charge coupled camera capturing device to capture a digital image of a 3-D scene. The systems and methods evaluate the digital image to localize and segment a 2-D barcode from the digital image of the 3-D scene. The 2-D barcode is rectified to remove non-uniform lighting and correct any perspective distortion. The rectified 2-D barcode is divided into multiple uniform cells to generate a 2-D matrix array of symbols. A barcode processing application evaluates the 2-D matrix array of symbols to present data to the user.

Type: Grant

Filed: June 29, 2007

Date of Patent: August 24, 2010

Assignee: Microsoft Corporation

Inventors: Chunhui Zhang, Zhouchen Lin, Zhengyou Zhang, Shi Han
Multi-modal device capable of automated actions

Patent number: 7778632

Abstract: A multi-modal multi-lingual mobile device that facilitates intelligently automating an action. The device can automatically synchronize a user schedule based upon a user state, intention, preference and/or limitation. The device can employ sensors to automatically detect criteria by which to automatically implement an action. Moreover, the system can interrogate a user thus converging upon a user intention and/or preference. An analyzer component can intelligently evaluate the compiled criterion in order to automatically perform an action. The multi-modal multi-lingual mobile device can automatically facilitate identification of an individual. Other actions that are automatically performed can include modifying personal information manager data, translating languages into a language comprehendible to a user, etc. Implementation of these actions can be based at least in part upon an environmental factor, a conversation, a location factor and a temporal factor.

Type: Grant

Filed: October 28, 2005

Date of Patent: August 17, 2010

Assignee: Microsoft Corporation

Inventors: David J. Kurlander, David W. Williams, Yuan Kong, Zhengyou Zhang
AUDIO TRANSFORMS IN CONNECTION WITH MULTIPARTY COMMUNICATION

Publication number: 20100195812

Abstract: The claimed subject matter relates to an architecture that can preprocess audio portions of communications in order to enrich multiparty communication sessions or environments. In particular, the architecture can provide both a public channel for public communications that are received by substantially all connected parties and can further provide a private channel for private communications that are received by a selected subset of all connected parties. Most particularly, the architecture can apply an audio transform to communications that occur during the multiparty communication session based upon a target audience of the communication. By way of illustration, the architecture can apply a whisper transform to private communications, an emotion transform based upon relationships, an ambience or spatial transform based upon physical locations, or a pace transform based upon lack of presence.

Type: Application

Filed: February 5, 2009

Publication date: August 5, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Dinei A. Florencio, Alejandro Acero, William Buxton, Phillip A. Chou, Ross G. Cutler, Jason Garms, Christian Huitema, Kori M. Quinn, Daniel Allen Rosenfeld, Zhengyou Zhang
UNIVERSAL TRANSLATOR

Publication number: 20100198579

Abstract: The claimed subject matter provides a system and/or a method that facilitates communication within a telepresence session. A telepresence session can be initiated within a communication framework that includes two or more virtually represented users that communicate therein. The telepresence session can include at least one virtually represented user that communicates in a first language, the communication is at least one of a portion of audio, a portion of video, a portion of graphic, a gesture, or a portion of text. An interpreter component can evaluate the communication to translate an identified first language into a second language within the telepresence session, the translation is automatically provided to at least one virtually represented user within the telepresence.

Type: Application

Filed: February 4, 2009

Publication date: August 5, 2010

Applicant: MICROSOFT CORPORATION

Inventors: Sharon Kay Cunnington, Jin Li, Michel Pahud, Rajesh K. Hegde, Zhengyou Zhang
System and method for whiteboard and audio capture

Patent number: 7770116

Abstract: A system that captures both whiteboard content and audio signals of a meeting using a digital camera and a microphone. The system can be retrofit to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots. It also automatically produces a set of key frames representing all the written content on the whiteboard before each erasure. The whiteboard content serves as a visual index to efficiently browse the audio meeting. The system not only captures the whiteboard content, but also helps the users to view and manage the captured meeting content efficiently and securely.

Type: Grant

Filed: November 30, 2006

Date of Patent: August 3, 2010

Assignee: Microsoft Corp.

Inventors: Zhengyou Zhang, Ross Cutler, Zicheng Liu, Anoop Gupta, Li-wei He
System and Method Providing Improved Head Motion Estimations for Animation

Publication number: 20100189310

Abstract: The computer-readable media provides improved procedures to estimate head motion between two images of a face. Locations of a number of distinct facial features are determined in two images. The locations are converted into as a set of physical face parameters based on the symmetry of the identified distinct facial features. An estimation objective function is determined by: (a) estimating each of the set of physical parameters, (b) estimating a first head pose transform corresponding to the first image, and (c) estimating a second head pose transform corresponding to the second image. The motion is estimated between the two images based on the set of physical face parameters by multiplying each term of the estimation objective function by a weighted contribution factor based on the confidence of data corresponding to the estimation objective function.

Type: Application

Filed: March 31, 2010

Publication date: July 29, 2010

Applicant: Microsoft Corporation

Inventors: Zicheng Liu, Zhengyou Zhang
VISUAL FEEDBACK FOR NATURAL HEAD POSITIONING

Publication number: 20100149310

Abstract: A videoconferencing conferee may be provided with feedback on his or her location relative a local video camera by altering how remote videoconference video is displayed on a local videoconference display viewed by the conferee. The conferee's location may be tracked and the displayed remote video may be altered in accordance to the changing location of the conferee. The remote video may appear to move in directions mirroring movement of the conferee. This effect may be achieved by modeling the remote video as offset and behind a virtual portal corresponding to the display. The remote video may be displayed according to a view of the remote video through the virtual portal. As the conferee's position changes, the view through the portal changes, and the remote video changes accordingly.

Type: Application

Filed: December 17, 2008

Publication date: June 17, 2010

Applicant: Microsoft Corporation

Inventors: Zhengyou Zhang, Christian Huitema, Alejandro Acero
Computer input device with a self-contained camera

Patent number: 7714843

Abstract: A method and system for visually tracking a point of contact of an optical output from a computer input device includes an internal camera configured to visually track the point of contact of the optical output against a surface and an optical source to transmit the optical output from the computer input device. The camera also transmits the position of the point of contact as a computer input. In one form, the computer applies the position of the point of contact as an input to an application operating on the computer, such as a gaming application. In one form, the camera can visually track the movement of the computer input device along a surface.

Type: Grant

Filed: May 9, 2003

Date of Patent: May 11, 2010

Assignee: Microsoft Corporation

Inventors: Yuan Kong, Zhengyou Zhang
Segmentation of objects by minimizing global-local variational energy

Patent number: 7706610

Abstract: An “Image Segmenter” provides a variational energy formulation for segmentation of natural objects from images. In general, the Image Segmenter operates by adopting Gaussian mixture models (GMM) to capture the appearance variation of objects in one or more images. A global image data likelihood potential is then computed and combined with local region potentials to obtain a robust and accurate estimation of pixel foreground and background distributions. Iterative minimization of a “global-local energy function” is then accomplished by evolution of a foreground/background boundary curve by level set, and estimation of a foreground/background model by fixed-point iteration, termed “quasi-semi-supervised EM.” In various embodiments, this process is further improved by providing general object shape information for use in rectifying objects segmented from the image.

Type: Grant

Filed: November 29, 2005

Date of Patent: April 27, 2010

Assignee: Microsoft Corporation

Inventors: Zhengyou Zhang, Zicheng Liu, Gang Hua

prev … 8 9 10 11 12 13 14 15 16 … next