Patents by Inventor John R. Hershey

John R. Hershey has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 7596494
    Abstract: A method and apparatus identify a clean speech signal from a noisy speech signal. The noisy speech signal is converted into frequency values in the frequency domain. The parameters of at least one posterior probability of at least one component of a clean signal value are then determined based on the frequency values. This determination is made without applying a frequency-based filter to the frequency values. The parameters of the posterior probability distribution are then used to estimate a set of frequency values for the clean speech signal. A clean speech signal is then constructed from the estimated set of frequency values.
    Type: Grant
    Filed: November 26, 2003
    Date of Patent: September 29, 2009
    Assignee: Microsoft Corporation
    Inventors: Trausti Thor Kristjansson, John R. Hershey
  • Patent number: 7518631
    Abstract: A visual control system controls a controlled component. In one embodiment, the visual control system controls the controlled component based on a visual location of a user. In another embodiment, input from a visual perception device is used to provide focus control for an audio input device. In additional embodiments, the visual control system stops, starts or suppresses speech recognition or other audio functions when the direction of the sound detected by the audio input device is not coming from the user's visual location.
    Type: Grant
    Filed: June 28, 2005
    Date of Patent: April 14, 2009
    Assignee: Microsoft Corporation
    Inventors: John R. Hershey, Zhengyou Zhang
  • Patent number: 7486815
    Abstract: A method and apparatus are provided for learning a model for the appearance of an object while tracking the position of the object in three dimensions. Under embodiments of the present invention, this is achieved by combining a particle filtering technique for tracking the object's position with an expectation-maximization technique for learning the appearance of the object. Two stereo cameras are used to generate data for the learning and tracking.
    Type: Grant
    Filed: February 20, 2004
    Date of Patent: February 3, 2009
    Assignee: Microsoft Corporation
    Inventors: Trausti Kristjansson, Hagai Attias, John R. Hershey
  • Patent number: 7269560
    Abstract: A system and method facilitating speech detection and/or enhancement utilizing audio/video fusion is provided. The present invention fuses audio and video in a probabilistic generative model that implements cross-model, self-supervised learning, enabling rapid adaptation to audio-visual data. The system can learn to detect and enhance speech in noise given only a short (e.g., 30-second) sequence of audio-visual data. In addition, it automatically learns to track the lips as they move around in the video.
    Type: Grant
    Filed: June 27, 2003
    Date of Patent: September 11, 2007
    Assignee: Microsoft Corporation
    Inventors: John R. Hershey, Trausti Thor Kristjansson, Hagai Attias, Nebojsa Jojic
  • Publication number: 20040267536
    Abstract: A system and method facilitating speech detection and/or enhancement utilizing audio/video fusion is provided. The present invention fuses audio and video in a probabilistic generative model that implements cross-model, self-supervised learning, enabling rapid adaptation to audio-visual data. The system can learn to detect and enhance speech in noise given only a short (e.g., 30-second) sequence of audio-visual data. In addition, it automatically learns to track the lips as they move around in the video.
    Type: Application
    Filed: June 27, 2003
    Publication date: December 30, 2004
    Inventors: John R. Hershey, Trausti Thor Kristjansson, Hagai Attias, Nebojsa Jojic