Patents by Inventor John R. Hershey

John R. Hershey has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Method and apparatus for high resolution speech reconstruction

Patent number: 7596494

Abstract: A method and apparatus identify a clean speech signal from a noisy speech signal. The noisy speech signal is converted into frequency values in the frequency domain. The parameters of at least one posterior probability of at least one component of a clean signal value are then determined based on the frequency values. This determination is made without applying a frequency-based filter to the frequency values. The parameters of the posterior probability distribution are then used to estimate a set of frequency values for the clean speech signal. A clean speech signal is then constructed from the estimated set of frequency values.

Type: Grant

Filed: November 26, 2003

Date of Patent: September 29, 2009

Assignee: Microsoft Corporation

Inventors: Trausti Thor Kristjansson, John R. Hershey
Audio-visual control system

Patent number: 7518631

Abstract: A visual control system controls a controlled component. In one embodiment, the visual control system controls the controlled component based on a visual location of a user. In another embodiment, input from a visual perception device is used to provide focus control for an audio input device. In additional embodiments, the visual control system stops, starts or suppresses speech recognition or other audio functions when the direction of the sound detected by the audio input device is not coming from the user's visual location.

Type: Grant

Filed: June 28, 2005

Date of Patent: April 14, 2009

Assignee: Microsoft Corporation

Inventors: John R. Hershey, Zhengyou Zhang
Method and apparatus for scene learning and three-dimensional tracking using stereo video cameras

Patent number: 7486815

Abstract: A method and apparatus are provided for learning a model for the appearance of an object while tracking the position of the object in three dimensions. Under embodiments of the present invention, this is achieved by combining a particle filtering technique for tracking the object's position with an expectation-maximization technique for learning the appearance of the object. Two stereo cameras are used to generate data for the learning and tracking.

Type: Grant

Filed: February 20, 2004

Date of Patent: February 3, 2009

Assignee: Microsoft Corporation

Inventors: Trausti Kristjansson, Hagai Attias, John R. Hershey
Speech detection and enhancement using audio/video fusion

Patent number: 7269560

Abstract: A system and method facilitating speech detection and/or enhancement utilizing audio/video fusion is provided. The present invention fuses audio and video in a probabilistic generative model that implements cross-model, self-supervised learning, enabling rapid adaptation to audio visual data. The system can learn to detect and enhance speech in noise given only a short (e.g., 30 second) sequence of audio-visual data. In addition, it automatically learns to track the lips as they move around in the video.

Type: Grant

Filed: June 27, 2003

Date of Patent: September 11, 2007

Assignee: Microsoft Corporation

Inventors: John R. Hershey, Trausti Thor Kristjansson, Hagai Attias, Nebojsa Jojic
Speech detection and enhancement using audio/video fusion

Publication number: 20040267536

Abstract: A system and method facilitating speech detection and/or enhancement utilizing audio/video fusion is provided. The present invention fuses audio and video in a probabilistic generative model that implements cross-model, self-supervised learning, enabling rapid adaptation to audio visual data. The system can learn to detect and enhance speech in noise given only a short (e.g., 30 second) sequence of audio-visual data. In addition, it automatically learns to track the lips as they move around in the video.

Type: Application

Filed: June 27, 2003

Publication date: December 30, 2004

Inventors: John R. Hershey, Trausti Thor Kristjansson, Hagai Attias, Nebojsa Jojic

prev 1 2 3

Method and apparatus for high resolution speech reconstruction

Audio-visual control system

Method and apparatus for scene learning and three-dimensional tracking using stereo video cameras

Speech detection and enhancement using audio/video fusion

Speech detection and enhancement using audio/video fusion