Patents by Inventor Phil Hetherington

Phil Hetherington has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Responsive automatic gain control

Patent number: 11164592

Abstract: A system that performs automatic gain control (AGC) using different decay rates. The system may select a slow decay rate to track a loudness level within speech (e.g., within an utterance), improving audio quality and maintaining dynamic range for an individual voice, while selecting a fast decay rate to track the loudness level after a gap of silence (e.g., no voice activity detected for a duration of time) or during large level changes (e.g., actual speech loudness is lower than estimated speech loudness for a duration of time). This improves the accuracy of the loudness estimate and therefore the responsiveness of the automatic gain control, resulting in an improved user experience.

Type: Grant

Filed: May 9, 2019

Date of Patent: November 2, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Biqing Wu, Phil Hetherington, Carlo Murgia, Rong Hu
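
The two-rate idea in the abstract above can be illustrated with a minimal Python sketch. It is not the patented implementation: the function name, the per-frame dB inputs, and every parameter (slow_decay, fast_decay, gap_frames, low_frames, margin_db, settle_db) are assumptions chosen for illustration only.

```python
def track_loudness(levels_db, voiced, slow_decay=0.05, fast_decay=0.5,
                   gap_frames=50, low_frames=20, margin_db=6.0, settle_db=1.0):
    """Track a speech loudness estimate (dB) with a slow and a fast decay rate.

    levels_db: per-frame measured level in dB (hypothetical input format).
    voiced:    per-frame voice-activity flags.
    All thresholds and rates are illustrative assumptions.
    """
    estimate = None      # current loudness estimate in dB
    silence_run = 0      # consecutive frames without voice activity
    low_run = 0          # consecutive voiced frames well below the estimate
    fast_mode = False
    history = []
    for level, is_voice in zip(levels_db, voiced):
        if not is_voice:
            silence_run += 1
            history.append(estimate)   # hold the estimate during silence
            continue
        if estimate is None:
            estimate = level
        # Condition 1: speech resumes after a long gap of silence.
        if silence_run >= gap_frames:
            fast_mode = True
        silence_run = 0
        # Condition 2: speech stays well below the estimate for a while.
        low_run = low_run + 1 if level < estimate - margin_db else 0
        if low_run >= low_frames:
            fast_mode = True
        if level >= estimate:
            estimate = level           # follow rising levels immediately
        else:
            rate = fast_decay if fast_mode else slow_decay
            estimate += rate * (level - estimate)
        # Return to the slow rate once the estimate has nearly caught up.
        if estimate - level < settle_db:
            fast_mode = False
        history.append(estimate)
    return history
```

In a gain-control loop, the gain for each frame would then typically be derived from the difference between a target level and this estimate; that step is omitted here.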
Method and apparatus for suppressing wind noise

Patent number: 9373340

Abstract: The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.

Type: Grant

Filed: January 25, 2011

Date of Patent: June 21, 2016

Assignee: 2236008 Ontario, Inc.

Inventors: Phil Hetherington, Xueman Li, Pierre Zakarauskas
Speech end-pointer

Patent number: 8554564

Abstract: A rule-based end-pointer isolates spoken utterances contained within an audio stream from background noise and non-speech transients. The rule-based end-pointer includes a plurality of rules to determine the beginning and/or end of a spoken utterance based on various speech characteristics. The rules may analyze an audio stream or a portion of an audio stream based upon an event, a combination of events, the duration of an event, or a duration relative to an event. The rules may be manually or dynamically customized depending upon factors that may include characteristics of the audio stream itself, an expected response contained within the audio stream, or environmental conditions.

Type: Grant

Filed: April 25, 2012

Date of Patent: October 8, 2013

Assignee: QNX Software Systems Limited

Inventors: Phil Hetherington, Alex Escott
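
As a rough illustration of rules based on the duration of an event, the Python sketch below marks the start of an utterance after a run of speech-like frames and its end after a sufficiently long pause. The two rules, the function name, and the parameters min_speech and max_gap are hypothetical; the abstract does not specify which rules the end-pointer uses.

```python
def find_utterance(is_speech, min_speech=3, max_gap=5):
    """Return (start, end) frame indices of the first utterance, or None.

    is_speech:  per-frame flags from some speech detector (assumed input).
    min_speech: frames of continuous speech required to open an utterance,
                so that short transients are ignored (illustrative rule).
    max_gap:    longest pause, in frames, tolerated inside an utterance
                (illustrative rule).
    """
    start = None
    last_speech = None
    speech_run = 0
    silence_run = 0
    for i, flag in enumerate(is_speech):
        if start is None:
            # Rule 1: a long enough run of speech marks the beginning.
            speech_run = speech_run + 1 if flag else 0
            if speech_run >= min_speech:
                start = i - min_speech + 1
                last_speech = i
        elif flag:
            last_speech = i
            silence_run = 0
        else:
            # Rule 2: a pause longer than max_gap marks the end.
            silence_run += 1
            if silence_run > max_gap:
                return (start, last_speech)
    if start is None:
        return None
    return (start, last_speech)
```

Making min_speech and max_gap arguments mirrors the abstract's point that rules may be customized, for example by allowing longer pauses when a long response is expected.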
Noise reduction with integrated tonal noise reduction

Patent number: 8489396

Abstract: The system provides a technique for suppressing or eliminating tonal noise in an input signal. The system operates on the input signal at a plurality of frequency bins and uses information generated at a prior bin to assist in calculating values at subsequent bins. The system first identifies peaks in a signal and then determines if the peaks are from tonal effects. This can be done by comparing the estimated background noise of a current bin to the smoothed background noise of the same bin. The smoothed background noise can be calculated using an asymmetric IIR filter. When the ratio of the current background noise estimate to the currently calculated smoothed background noise is far greater than 1, tonal noise is assumed. When tonal noise is found, a number of suppression techniques can be applied to reduce the tonal noise, including gain suppression with fixed floor factor, an adaptive floor factor gain suppression technique, and a random phase technique.

Type: Grant

Filed: December 20, 2007

Date of Patent: July 16, 2013

Assignee: QNX Software Systems Limited

Inventors: Phil A. Hetherington, Xueman Li
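
The detection step described in the abstract above can be sketched in Python as follows. This is a simplified reading, not the patented method: it assumes the asymmetric IIR filter is a one-pole smoother run across frequency bins that rises slowly and falls quickly, and the names rise, fall, and ratio_threshold are illustrative assumptions.

```python
def find_tonal_bins(noise_est, rise=0.1, fall=0.9, ratio_threshold=4.0):
    """Return indices of bins whose noise estimate looks tonal.

    noise_est: non-empty list of per-bin background noise power estimates.
    rise/fall: smoothing coefficients used when the estimate is above or
               below the smoothed value (assumed values).
    ratio_threshold: stand-in for "far greater than 1" (assumed value).
    """
    tonal = []
    smoothed = noise_est[0]
    for k, current in enumerate(noise_est):
        # Asymmetric one-pole IIR: follow increases slowly so a narrow
        # peak stands out, and follow decreases quickly to recover.
        coeff = rise if current > smoothed else fall
        smoothed += coeff * (current - smoothed)
        # Compare the current estimate with the value just smoothed.
        if current > ratio_threshold * max(smoothed, 1e-12):
            tonal.append(k)
    return tonal
```

Because the smoothed value carries over from one bin to the next, each decision reuses information generated at the prior bin, which is the property the abstract highlights. The suppression techniques it lists (fixed floor, adaptive floor, random phase) are not shown.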
Spectro-temporal varying approach for speech enhancement

Patent number: 8352257

Abstract: The present system proposes a technique, called the spectro-temporal varying technique, to compute the suppression gain. This method is motivated by the perceptual properties of the human auditory system; specifically, that the human ear has higher frequency resolution in the lower frequency band and less frequency resolution in the higher frequencies, and also that the important speech information in the high frequencies consists of consonants, which usually have a random-noise spectral shape. A second property of the human auditory system is that the human ear has lower temporal resolution in the lower frequencies and higher temporal resolution in the higher frequencies. Based on that, the system uses a spectro-temporal varying method which introduces the concept of frequency-smoothing by modifying the estimation of the a posteriori SNR. In addition, the system also makes the a priori SNR time-smoothing factor depend on frequency.

Type: Grant

Filed: December 20, 2007

Date of Patent: January 8, 2013

Assignee: QNX Software Systems Limited

Inventors: Phil A. Hetherington, Xueman Li
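
A small Python sketch can make the two ideas in the abstract above concrete: smoothing the a posteriori SNR over a frequency window that widens with frequency, and using an a priori SNR time-smoothing factor that shrinks with frequency. The window shape, the linear interpolation of the smoothing factor, the simple recursive update, and all names and constants are assumptions; the abstract does not give the actual formulas.

```python
def smooth_posteriori_snr(gamma, max_half_width=3):
    """Average the a posteriori SNR over a window that widens with bin index.

    Low bins keep fine frequency resolution (little or no smoothing);
    high bins are averaged over up to 2 * max_half_width + 1 neighbours.
    """
    n = len(gamma)
    smoothed = []
    for k in range(n):
        half = round(max_half_width * k / (n - 1)) if n > 1 else 0
        lo, hi = max(0, k - half), min(n, k + half + 1)
        smoothed.append(sum(gamma[lo:hi]) / (hi - lo))
    return smoothed


def time_smoothing_factor(k, n_bins, alpha_low=0.98, alpha_high=0.90):
    """Time-smoothing factor: heavy at low bins, lighter at high bins."""
    frac = k / (n_bins - 1) if n_bins > 1 else 0.0
    return alpha_low + frac * (alpha_high - alpha_low)


def update_priori_snr(prev_xi, gamma):
    """One frame of a recursive a priori SNR update (simplified assumption)."""
    gamma_s = smooth_posteriori_snr(gamma)
    n = len(gamma)
    xi = []
    for k in range(n):
        alpha = time_smoothing_factor(k, n)
        instantaneous = max(gamma_s[k] - 1.0, 0.0)
        xi.append(alpha * prev_xi[k] + (1.0 - alpha) * instantaneous)
    return xi
```

With these choices, high-frequency bins react faster in time but are averaged over more neighbouring bins, which matches the resolution trade-off the abstract attributes to the human ear.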
Speech End-Pointer

Publication number: 20120265530

Abstract: A rule-based end-pointer isolates spoken utterances contained within an audio stream from background noise and non-speech transients. The rule-based end-pointer includes a plurality of rules to determine the beginning and/or end of a spoken utterance based on various speech characteristics. The rules may analyze an audio stream or a portion of an audio stream based upon an event, a combination of events, the duration of an event, or a duration relative to an event. The rules may be manually or dynamically customized depending upon factors that may include characteristics of the audio stream itself, an expected response contained within the audio stream, or environmental conditions.

Type: Application

Filed: April 25, 2012

Publication date: October 18, 2012

Inventors: Phil Hetherington, Alex Escott
Speech end-pointer

Patent number: 8170875

Abstract: A rule-based end-pointer isolates spoken utterances contained within an audio stream from background noise and non-speech transients. The rule-based end-pointer includes a plurality of rules to determine the beginning and/or end of a spoken utterance based on various speech characteristics. The rules may analyze an audio stream or a portion of an audio stream based upon an event, a combination of events, the duration of an event, or a duration relative to an event. The rules may be manually or dynamically customized depending upon factors that may include characteristics of the audio stream itself, an expected response contained within the audio stream, or environmental conditions.

Type: Grant

Filed: June 15, 2005

Date of Patent: May 1, 2012

Assignee: QNX Software Systems Limited

Inventors: Phil Hetherington, Alex Escott
Method and Apparatus for Suppressing Wind Noise

Publication number: 20110123044

Abstract: The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.

Type: Application

Filed: January 25, 2011

Publication date: May 26, 2011

Inventors: Phil Hetherington, Xueman Li, Pierre Zakarauskas
Karaoke system

Patent number: 7928307

Abstract: The system describes a karaoke system that enhances the experience of singing along with music, but without the need to display the lyrics. The system includes a combination of a vocal track reducer and an echo canceller, decision logic for determining when a person is talking or singing (double-talk detector) and a method for “ducking” (i.e., attenuating) the vocal track when the singing is detected. No special CD or DVD with lyric tracks is required, making the system capable of working with CD, mp3, AM, FM, HD radio, satellite radio signals, or any other suitable content source. The result is that any content source may potentially be used as a karaoke soundtrack without any pre-modification.

Type: Grant

Filed: November 3, 2008

Date of Patent: April 19, 2011

Assignee: QNX Software Systems Co.

Inventors: Phil A. Hetherington, Shree Paranjpe
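
The "ducking" step mentioned in the abstract above can be sketched in Python as a smoothed gain applied to the vocal track whenever singing is detected. The detector itself, the echo canceller, and the vocal track reducer are out of scope here; the function name and the parameters duck_gain and smooth are assumptions for illustration.

```python
def duck_vocal_track(vocal, singing, duck_gain=0.2, smooth=0.5):
    """Attenuate the vocal track while singing is detected.

    vocal:     samples of the separated vocal track (assumed input).
    singing:   per-sample flags from a singing/talking detector (assumed).
    duck_gain: gain applied to the vocal track while the user sings.
    smooth:    fraction of the remaining distance to the target gain that
               is covered on each sample, which avoids abrupt level jumps.
    """
    gain = 1.0
    out = []
    for sample, is_singing in zip(vocal, singing):
        target = duck_gain if is_singing else 1.0
        gain += smooth * (target - gain)
        out.append(sample * gain)
    return out
```

When the flags clear, the same smoothing brings the gain back toward 1.0, so the original vocal returns gradually once the user stops singing.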
Wind noise suppression system

Patent number: 7885420

Abstract: The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.

Type: Grant

Filed: April 10, 2003

Date of Patent: February 8, 2011

Assignee: QNX Software Systems Co.

Inventors: Phil Hetherington, Xueman Li, Pierre Zakarauskas
Karaoke system

Publication number: 20100107856

Abstract: The system describes a karaoke system that enhances the experience of singing along with music, but without the need to display the lyrics. The system includes a combination of a vocal track reducer and an echo canceller, decision logic for determining when a person is talking or singing (double-talk detector) and a method for “ducking” (i.e., attenuating) the vocal track when the singing is detected. No special CD or DVD with lyric tracks is required, making the system capable of working with CD, mp3, AM, FM, HD radio, satellite radio signals, or any other suitable content source. The result is that any content source may potentially be used as a karaoke soundtrack without any pre-modification.

Type: Application

Filed: November 3, 2008

Publication date: May 6, 2010

Applicant: QNX SOFTWARE SYSTEMS (WAVEMAKERS), INC.

Inventors: Phil A. Hetherington, Shree Paranjpe
Spectro-temporal varying approach for speech enhancement

Publication number: 20080167866

Abstract: The present system proposes a technique, called the spectro-temporal varying technique, to compute the suppression gain. This method is motivated by the perceptual properties of the human auditory system; specifically, that the human ear has higher frequency resolution in the lower frequency band and less frequency resolution in the higher frequencies, and also that the important speech information in the high frequencies consists of consonants, which usually have a random-noise spectral shape. A second property of the human auditory system is that the human ear has lower temporal resolution in the lower frequencies and higher temporal resolution in the higher frequencies. Based on that, the system uses a spectro-temporal varying method which introduces the concept of frequency-smoothing by modifying the estimation of the a posteriori SNR. In addition, the system also makes the a priori SNR time-smoothing factor depend on frequency.

Type: Application

Filed: December 20, 2007

Publication date: July 10, 2008

Applicant: HARMAN INTERNATIONAL INDUSTRIES, INC.

Inventors: Phil A. Hetherington, Xueman Li
Noise reduction with integrated tonal noise reduction

Publication number: 20080167870

Abstract: The system provides a technique for suppressing or eliminating tonal noise in an input signal. The system operates on the input signal at a plurality of frequency bins and uses information generated at a prior bin to assist in calculating values at subsequent bins. The system first identifies peaks in a signal and then determines if the peaks are from tonal effects. This can be done by comparing the estimated background noise of a current bin to the smoothed background noise of the same bin. The smoothed background noise can be calculated using an asymmetric IIR filter. When the ratio of the current background noise estimate to the currently calculated smoothed background noise is far greater than 1, tonal noise is assumed. When tonal noise is found, a number of suppression techniques can be applied to reduce the tonal noise, including gain suppression with fixed floor factor, an adaptive floor factor gain suppression technique, and a random phase technique.

Type: Application

Filed: December 20, 2007

Publication date: July 10, 2008

Applicant: HARMAN INTERNATIONAL INDUSTRIES, INC.

Inventors: Phil A. Hetherington, Xueman Li
Speech end-pointer

Publication number: 20060287859

Abstract: A rule-based end-pointer isolates spoken utterances contained within an audio stream from background noise and non-speech transients. The rule-based end-pointer includes a plurality of rules to determine the beginning and/or end of a spoken utterance based on various speech characteristics. The rules may analyze an audio stream or a portion of an audio stream based upon an event, a combination of events, the duration of an event, or a duration relative to an event. The rules may be manually or dynamically customized depending upon factors that may include characteristics of the audio stream itself, an expected response contained within the audio stream, or environmental conditions.

Type: Application

Filed: June 15, 2005

Publication date: December 21, 2006

Inventors: Phil Hetherington, Alex Escott
Method and apparatus for suppressing wind noise

Publication number: 20040165736

Abstract: The invention includes a method, apparatus, and computer program to selectively suppress wind noise while preserving narrow-band signals in acoustic data. Sound from one or several microphones is digitized into binary data. A time-frequency transform is applied to the data to produce a series of spectra. The spectra are analyzed to detect the presence of wind noise and narrow band signals. Wind noise is selectively suppressed while preserving the narrow band signals. The narrow band signal is interpolated through the times and frequencies when it is masked by the wind noise. A time series is then synthesized from the signal spectral estimate that can be listened to. This invention overcomes prior art limitations that require more than one microphone and an independent measurement of wind speed. Its application results in good-quality speech from data severely degraded by wind noise.

Type: Application

Filed: April 10, 2003

Publication date: August 26, 2004

Inventors: Phil Hetherington, Xueman Li, Pierre Zakarauskas