Patents by Inventor Zhengyou Zhang

Zhengyou Zhang has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20080215318
    Abstract: Recognition of events can be performed by accessing an audio signal having static and dynamic features. A value for the audio signal can be calculated by utilizing different weights for the static and dynamic features such that a frame of the audio signal can be associated with a particular event. A filter can also be used to aid in determining the event for the frame.
    Type: Application
    Filed: March 1, 2007
    Publication date: September 4, 2008
    Applicant: Microsoft Corporation
    Inventors: Zhengyou Zhang, Yuan Kong, Chao Huang, Frank Kao-Ping K. Soong
  • Publication number: 20080209327
    Abstract: Persistent, spatial collaboration on the web supports a free-form, user-intuitive approach to a variety of projects and activities. Users can place differing object types at any time, anywhere on a web page, and/or the system can automatically, and with no user effort, affect object placement based on one or more metadata characteristics. A user can, in real time, see changes made by another user to a web page and, if desired, react accordingly, enabling true collaboration even if the various users are at remote locations. The flexibility of the methodology and system provides a platform for users to engage in projects and activities in a manner and environment suited to the users' mindsets, creativity, and natural proclivities.
    Type: Application
    Filed: February 27, 2007
    Publication date: August 28, 2008
    Applicant: Microsoft Corporation
    Inventors: Steven M. Drucker, Aamer Hydrie, Li-wei He, Rajesh K. Hegde, Zhengyou Zhang
  • Publication number: 20080195389
    Abstract: A text-dependent speaker verification technique that uses a generic speaker-independent speech recognizer for robust speaker verification, and uses the acoustical model of a speaker-independent speech recognizer as a background model. Instead of using a likelihood ratio test (LRT) at the utterance level (e.g., the sentence level), which is typical of most speaker verification systems, the present text-dependent speaker verification technique uses a weighted sum of likelihood ratios at the sub-unit level (word, tri-phone, or phone) as well as at the utterance level.
    Type: Application
    Filed: February 12, 2007
    Publication date: August 14, 2008
    Applicant: Microsoft Corporation
    Inventors: Zhengyou Zhang, Amarnag Subramanya
  • Publication number: 20080181430
    Abstract: A multi-sensor sound source localization (SSL) technique is presented which provides a true maximum likelihood (ML) treatment for microphone arrays having more than one pair of audio sensors. Generally, this is accomplished by selecting a sound source location that results in a time of propagation from the sound source to the audio sensors of the array, which maximizes a likelihood of simultaneously producing audio sensor output signals inputted from all the sensors in the array. The likelihood includes a unique term that estimates an unknown audio sensor response to the source signal for each of the sensors in the array.
    Type: Application
    Filed: January 26, 2007
    Publication date: July 31, 2008
    Applicant: Microsoft Corporation
    Inventors: Cha Zhang, Dinei Florencio, Zhengyou Zhang
  • Patent number: 7406303
    Abstract: A synthesized alternative sensor signal is produced from an alternative sensor signal. The synthesized alternative sensor signal is computed using vocal tract resonances estimated based on the alternative sensor signal, and using a waveform synthesis technique that converts the estimated vocal tract resonance sequence into a spectral magnitude sequence. The synthesized alternative sensor signal and the alternative sensor signal are used to estimate a clean speech value.
    Type: Grant
    Filed: September 16, 2005
    Date of Patent: July 29, 2008
    Assignee: Microsoft Corporation
    Inventors: Li Deng, Zhengyou Zhang, Zicheng Liu, Amarnag Subramanya
  • Publication number: 20080170717
    Abstract: An energy-based technique to estimate the positions of people speaking from an ad hoc network of microphones. The present technique does not require accurate synchronization of the microphones. In addition, a technique to normalize the gains of the microphones based on people's speech is presented, which allows aggregation of various audio channels from the ad hoc microphone network into a single stream for audio conferencing. The technique is invariant to the speakers' volumes, thus making the system easy to deploy in practice.
    Type: Application
    Filed: January 16, 2007
    Publication date: July 17, 2008
    Applicant: Microsoft Corporation
    Inventors: Zicheng Liu, Zhengyou Zhang, Li-wei He, Philip A. Chou, Minghua Chen
  • Publication number: 20080168139
    Abstract: A framework for facilitating collaborative interaction between users of a web application is presented. The framework provides persistence and sharing mechanisms for arbitrary application-defined objects. The sharing mechanism is coupled with a consistency mechanism that keeps client states consistent even when users perform conflicting operations. This framework is application-independent in that the persistence, sharing between clients and maintaining a consistent state among the clients is accomplished without the need for specific knowledge of the design of the web application. Thus, web developers can focus on the logic of the application and the UI, instead of spending time designing and implementing collaborative capabilities.
    Type: Application
    Filed: January 6, 2007
    Publication date: July 10, 2008
    Applicant: Microsoft Corporation
    Inventors: Sasa Junuzovic, Rajesh Hegde, Li-Wei He, Zhengyou Zhang, Steven Drucker
  • Publication number: 20080154714
    Abstract: A technique for providing and receiving personalized e-coupons is presented. In general, the technique involves an e-coupon provider sending e-coupons to a user of a mobile communication device, such as a cellular telephone or PDA, which are personalized in various ways so as to make them attractive to the user. In one embodiment, the e-coupons are provided based on location information received from the mobile communication device. In another embodiment, the e-coupons are provided based on the user's purchasing history. The mobile communication device that receives e-coupons from the provider includes an e-coupon handler program to facilitate the procurement and receipt of the e-coupons. In general, the e-coupon handler receives e-coupons and displays them to the user on a display of the mobile communication device. The e-coupons could have been requested by the e-coupon handler, or pushed to it by the e-coupon provider.
    Type: Application
    Filed: December 21, 2006
    Publication date: June 26, 2008
    Applicant: Microsoft Corporation
    Inventors: Zicheng Liu, Philip A. Chou, Zhengyou Zhang
  • Patent number: 7383181
    Abstract: The present invention combines a conventional audio microphone with an additional speech sensor that provides a speech sensor signal based on an input. The speech sensor signal is generated based on an action undertaken by a speaker during speech, such as facial movement, bone vibration, throat vibration, throat impedance changes, etc. A speech detector component receives an input from the speech sensor and outputs a speech detection signal indicative of whether a user is speaking. The speech detector generates the speech detection signal based on the microphone signal and the speech sensor signal.
    Type: Grant
    Filed: July 29, 2003
    Date of Patent: June 3, 2008
    Assignee: Microsoft Corporation
    Inventors: Xuedong D. Huang, Zicheng Liu, Zhengyou Zhang, Michael J. Sinclair, Alejandro Acero
  • Publication number: 20080126282
    Abstract: A system that facilitates managing resources (e.g., functionality, services) based at least in part upon an established context. More particularly, a context determination component can be employed to establish a context by processing sensor inputs or learning/inferring a user action/preference. Once the context is established via the context determination component, a power/mode management component can be employed to activate and/or mask resources in accordance with the established context. The power and mode management of the device can extend the life of a power source (e.g., battery) and mask functionality in accordance with a user and/or device state.
    Type: Application
    Filed: January 15, 2008
    Publication date: May 29, 2008
    Applicant: Microsoft Corporation
    Inventors: Michael J. Sinclair, David W. Williams, Zhengyou Zhang, Zicheng Liu
  • Patent number: 7352386
    Abstract: A method and apparatus for recovering a three-dimensional (3D) scene from two-dimensional (2D) images. A sequence of images is divided into a number of smaller segments and a 3D reconstruction is performed on each segment individually. All the reconstructed segments are then combined together through an efficient bundle adjustment to complete the 3D reconstruction. Segmenting may be achieved by dividing the segments based on the number of feature points that are in each frame. The number of frames per segment is reduced by creating virtual key frames. The virtual key frames encode the 3D structure for each segment, but are only a small subset of the original frames in the segment. A final bundle adjustment is performed on the virtual key frames, rather than all of the original frames. Thus, the final bundle adjustment is two orders of magnitude faster than a conventional bundle adjustment.
    Type: Grant
    Filed: June 22, 1999
    Date of Patent: April 1, 2008
    Assignee: Microsoft Corporation
    Inventors: Heung-Yeung Shum, Zhengyou Zhang, Qifa Ke
  • Patent number: 7346504
    Abstract: A method and apparatus determine a channel response for an alternative sensor using an alternative sensor signal and an air conduction microphone signal. The channel response and a prior probability distribution for clean speech values are then used to estimate a clean speech value.
    Type: Grant
    Filed: June 20, 2005
    Date of Patent: March 18, 2008
    Assignee: Microsoft Corporation
    Inventors: Zicheng Liu, Alejandro Acero, Zhengyou Zhang
  • Patent number: 7342572
    Abstract: A system and method for turning a regular computer monitor screen into a touch screen using an ordinary camera. It includes an image-screen mapping procedure to correct for the non-flatness of the computer screen. It also includes a segmentation method to distinguish the foreground, for example an indicator such as a finger, from the background of a computer screen. Additionally, this system and method includes a robust technique of finding the tip point location of the indicator (such as the finger tip). The screen coordinates of the tip point are then used to control the position of the system indicator.
    Type: Grant
    Filed: April 30, 2004
    Date of Patent: March 11, 2008
    Assignee: Microsoft Corp.
    Inventors: Zhengyou Zhang, Ying Shan
  • Patent number: 7327899
    Abstract: A real-time approximately 360 degree image correction system and a method for alleviating distortion and perception problems in images captured by omni-directional cameras. In general, the real-time panoramic image correction method generates a warp table from pixel coordinates of a panoramic image and applies the warp table to the panoramic image to create a corrected panoramic image. The corrections are performed using a parametric class of warping functions that include Spatially Varying Uniform (SVU) scaling functions. The SVU scaling functions and scaling factors are used to perform vertical scaling and horizontal scaling on the panoramic image pixel coordinates. A horizontal distortion correction is performed using the SVU scaling functions at at least two different scaling factors. This processing generates a warp table that can be applied to the panoramic image to yield the corrected panoramic image.
    Type: Grant
    Filed: August 18, 2006
    Date of Patent: February 5, 2008
    Assignee: Microsoft Corp.
    Inventors: Zicheng Liu, Ross Cutler, Michael Cohen, Zhengyou Zhang
  • Patent number: 7319908
    Abstract: A system that facilitates managing resources (e.g., functionality, services) based at least in part upon an established context. More particularly, a context determination component can be employed to establish a context by processing sensor inputs or learning/inferring a user action/preference. Once the context is established via the context determination component, a power/mode management component can be employed to activate and/or mask resources in accordance with the established context. The power and mode management of the device can extend the life of a power source (e.g., battery) and mask functionality in accordance with a user and/or device state.
    Type: Grant
    Filed: October 28, 2005
    Date of Patent: January 15, 2008
    Assignee: Microsoft Corporation
    Inventors: Michael J. Sinclair, David W. Williams, Zhengyou Zhang, Zicheng Liu
  • Patent number: 7308115
    Abstract: An incremental motion estimation system and process for estimating the camera pose parameters associated with each image of a long image sequence. Unlike previous approaches, which rely on point matches across three or more views, the present system and process also includes those points shared only by two views. The problem is formulated as a series of localized bundle adjustments in such a way that the estimated camera motions in the whole sequence are consistent with each other. The inclusion of two-view matching points and the localized bundle adjustment approach yield more accurate estimates of the camera pose parameters for each image in the sequence than previous incremental techniques. The accuracy approaches that of global bundle adjustment techniques, with processing that is about 100 to 700 times faster than the global approaches.
    Type: Grant
    Filed: July 14, 2005
    Date of Patent: December 11, 2007
    Assignee: Microsoft Corporation
    Inventors: Zhengyou Zhang, Ying Shan
  • Patent number: 7302106
    Abstract: A system and method for compressing digital pen stroke data utilizing curve simplification. Digital pen stroke images (ink images) generate a relatively large amount of data to preserve the ink image generated on a device. Current ink compression algorithms utilize lossless compression algorithms that have limited success. The invention provides a lossy compression algorithm to reduce the amount of data required to store and transmit ink data. The invention utilizes a two-part algorithm to reduce and compress the number of data points representing the ink data. The invention also utilizes curve splines to reconstruct and smooth the lossy ink data image.
    Type: Grant
    Filed: May 19, 2003
    Date of Patent: November 27, 2007
    Assignee: Microsoft Corp.
    Inventors: Zicheng Liu, Henrique S. Malvar, Zhengyou Zhang
  • Patent number: 7301548
    Abstract: This invention is directed toward a system and method for scanning a scene or object such as a whiteboard, paper document or similar item. More specifically, the invention is directed toward a system and method for obtaining a high-resolution image of a whiteboard or other object with a low-resolution camera. The system and method of the invention captures either a set of snapshots with overlap or a continuous video sequence, and then stitches them automatically into a single high-resolution image. The stitched image can finally be exported to other image processing systems and methods for further enhancement.
    Type: Grant
    Filed: December 11, 2004
    Date of Patent: November 27, 2007
    Assignee: Microsoft Corp.
    Inventors: Zhengyou Zhang, Li-wei He
  • Publication number: 20070269105
    Abstract: A system that captures both whiteboard content and audio signals of a meeting using a digital camera and a microphone. The system can be retrofit to any existing whiteboard. It computes the time stamps of pen strokes on the whiteboard by analyzing the sequence of captured snapshots. It also automatically produces a set of key frames representing all the written content on the whiteboard before each erasure. The whiteboard content serves as a visual index to efficiently browse the audio meeting. The system not only captures the whiteboard content, but also helps the users to view and manage the captured meeting content efficiently and securely.
    Type: Application
    Filed: August 31, 2006
    Publication date: November 22, 2007
    Applicant: Microsoft Corporation
    Inventors: Zhengyou Zhang, Ross Cutler, Zicheng Liu, Anoop Gupta, Li-wei He
  • Patent number: 7283850
    Abstract: A mobile device includes an air conduction microphone and an alternative sensor that provides an alternative sensor signal indicative of speech. A communication interface permits the mobile device to communicate directly with other mobile devices.
    Type: Grant
    Filed: October 12, 2004
    Date of Patent: October 16, 2007
    Assignee: Microsoft Corporation
    Inventors: Randy Phyllis Granovetter, Michael J. Sinclair, Zhengyou Zhang, Zicheng Liu
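
Illustrative note on publication 20080195389: the abstract describes scoring a verification attempt with a weighted sum of likelihood ratios at the sub-unit level (word, tri-phone, or phone) as well as at the utterance level. The Python sketch below shows one way such a score could be formed. It is not taken from the application: the names UnitScore, verification_score, and accept, the use of log-likelihood ratios, the weight values, and the zero threshold are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass
class UnitScore:
    """Log-likelihoods of one sub-unit (word, tri-phone, or phone).

    All fields are hypothetical; the abstract does not define a data layout.
    """
    speaker_loglik: float     # log p(frames | claimed speaker's model)
    background_loglik: float  # log p(frames | speaker-independent background model)
    weight: float             # assumed per-unit weight

    @property
    def llr(self) -> float:
        # Log-likelihood ratio: positive values favor the claimed speaker.
        return self.speaker_loglik - self.background_loglik


def verification_score(
    units: Sequence[UnitScore],
    utt_speaker_loglik: float,
    utt_background_loglik: float,
    utt_weight: float,
) -> float:
    """Weighted sum of sub-unit ratios plus a weighted utterance-level ratio."""
    unit_term = sum(u.weight * u.llr for u in units)
    utt_term = utt_weight * (utt_speaker_loglik - utt_background_loglik)
    return unit_term + utt_term


def accept(score: float, threshold: float = 0.0) -> bool:
    """Accept the identity claim when the score exceeds an assumed threshold."""
    return score > threshold


# Example with made-up numbers: two sub-units and one utterance-level ratio.
units = [UnitScore(-10.0, -12.0, 0.5), UnitScore(-8.0, -7.0, 0.25)]
score = verification_score(units, -18.0, -19.0, utt_weight=1.0)
decision = accept(score)
```

In this sketch a sub-unit that matches the background model better than the speaker model (the second unit) lowers the total, while its weight controls how much it counts; how the weights and the threshold would actually be chosen is not stated in the abstract.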