Enhancement Of Intelligibility Of Clean Or Coded Speech (epo) Patents (Class 704/E21.009)

E Subclasses

Enhancement of diverse speech (epo) (Class 704/E21.01)

Bandwidth extension taking place at the receiving side, e.g., generation of low- or high-frequency components, regeneration of spectral holes, etc. (epo) (Class 704/E21.011)

System and method for non-destructively normalizing loudness of audio signals within portable devices

Patent number: 11948592

Abstract: Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.

Type: Grant

Filed: April 20, 2023

Date of Patent: April 2, 2024

Assignees: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventors: Jeffrey Riedmiller, Harald Mundt, Michael Schug, Martin Wolters
Adaptive exponential moving average filter

Patent number: 11888713

Abstract: A method includes establishing digital communication between a first user device and a second user device using a first codec. The method also includes selecting, based on an input signal representing an estimated unfiltered available bandwidth for the digital communication satisfying a first filter selection threshold, a first filter of two or more filters, and filtering the input signal using the first filter. The method further includes determining that the filtered input signal satisfies a first channel bandwidth threshold and, in response to determining that the filtered input signal satisfies the channel bandwidth threshold, selecting a second codec different from the first codec for further digital communication between the first user device and the second user device.

Type: Grant

Filed: November 1, 2022

Date of Patent: January 30, 2024

Assignee: Google LLC

Inventors: Michael Horowitz, Philip Eliasson
Electronic device and method of controlling speech recognition by electronic device

Patent number: 11854527

Abstract: An electronic device for adjusting a speech output rate (speech rate) of speech output data.

Type: Grant

Filed: August 11, 2021

Date of Patent: December 26, 2023

Assignee: SAMSUNG ELECTRONICS CO., LTD.

Inventor: Piotr Marcinkiewicz
Voice transmission compensation apparatus, voice transmission compensation method and program

Patent number: 11806213

Abstract: A speech transmission compensation apparatus that assists discrimination of speech heard by a user, includes: one or more computers each including a memory and a processor configured to: accept input of a speech signal, detect a specific type of sound in the speech signal, analyze an acoustic characteristic of the specific type of sound in the speech signal and output the acoustic characteristic; accept input of the acoustic characteristic being output by the memory and the processor, generate a vibration signal of a duration corresponding to the acoustic characteristic and output the vibration signal; and accept input of the vibration signal being output by the memory and the processor and provide the user with vibration for the duration on the basis of the vibration signal.

Type: Grant

Filed: April 30, 2020

Date of Patent: November 7, 2023

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Asuka Ono, Momoko Nakatani, Ai Nakane, Yoko Ishii
Audio encoding/decoding apparatus and method using vector quantized residual error feature

Patent number: 11804230

Abstract: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.

Type: Grant

Filed: April 1, 2022

Date of Patent: October 31, 2023

Assignees: Electronics and Telecommunications Research Institute, Gwangju Institute of Science and Technology

Inventors: Inseon Jang, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Woo-taek Lim, Jongwon Shin, Youngju Cheon, Sangwook Han, Soojoong Hwang
Concept for coding mode switching compensation

Patent number: 11600283

Abstract: A codec allowing for switching between different coding modes is improved by, responsive to a switching instance, performing temporal smoothing and/or blending at a respective transition.

Type: Grant

Filed: June 29, 2020

Date of Patent: March 7, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Martin Dietz, Eleni Fotopoulou, Jérémie Lecomte, Markus Multrus, Benjamin Schubert
Harmonic transposition in an audio coding method and system

Patent number: 11594234

Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.

Type: Grant

Filed: September 27, 2022

Date of Patent: February 28, 2023

Assignee: Dolby International AB

Inventors: Per Ekstrand, Lars Villemoes
Voice recognition method using artificial intelligence and apparatus thereof

Patent number: 11568853

Abstract: Disclosed is a voice recognition method and apparatus using artificial intelligence. A voice recognition method using artificial intelligence may include: generating a utterance by receiving a voice command of a user; obtaining a user's intention by analyzing the generated utterance; deriving an urgency level of the user on the basis of the generated utterance and prestored user information; generating a first response in association with the user's intention; obtaining main vocabularies included in the first response; generating a second response by using the main vocabularies and the urgency level of the user; determining a speech rate of the second response on the basis of the urgency level of the user; and outputting the second response according to the speech rate by synthesizing the second response to a voice signal.

Type: Grant

Filed: July 29, 2020

Date of Patent: January 31, 2023

Assignee: LG ELECTRONICS INC.

Inventor: Jonghoon Chae
Method of denoising a noisy signal including speech and noise components

Patent number: 7953596

Abstract: A method of analyzing time coherence in the noisy signal including the steps of: a) determining a reference signal from the noisy signal by applying treatment (10, 18) to the noisy signal that is suitable for attenuating speech components more strongly than the noise component, in particular by an adaptive recursive predictive algorithm of the LMS type; b) determining (24) a probability of speech being present/absent on the basis of the respective energy levels in the spectral domain of the noisy signal and of the reference signal; and c) deriving (26) a denoised estimate of the speech signal from the noise signal as a function of the probability of the speech being present/absent as determined in this way.

Type: Grant

Filed: February 26, 2007

Date of Patent: May 31, 2011

Assignee: PARROT Societe Anonyme

Inventor: Guillaume Pinto
Noise reducer, noise reducing method, and recording medium

Patent number: 7941315

Abstract: Accepting the speech having the noise superimposed thereon and converting it into a signal on a time axis of the speech, an amplitude component of a speech for each predetermined frequency band of the converted signal on the frequency axis is calculated. Calculating a noise reduction coefficient, the noise component is reduced by multiplying the signal on the frequency axis of the original signal by the calculated noise reduction coefficient. By estimating the target value of the remaining noise for each frequency band, a signal on a frequency axis in which a signal corresponding to a frequency band of which target value estimated by the noise target value is larger than the value of the amplitude component of the signal on the frequency axis of which noise component is reduced is corrected to a signal corresponding to the target value is restored, into a signal on a time axis.

Type: Grant

Filed: March 22, 2006

Date of Patent: May 10, 2011

Assignee: Fujitsu Limited

Inventor: Naoshi Matsuo
Speech Intelligibility Improvement Method and Apparatus

Publication number: 20110015922

Abstract: Prevalence detection is advantageously applied to the result of specific spectral discrimination to adaptively determine prevalent frequencies existing within an audio signal containing speech. Prevalent frequencies in this audio signal so isolated are attenuated in a highly selective manner, thus reducing the masking potential of pervasive resonances and obfuscative energy within the speech itself over low energy language-imparting speech elements.

Type: Application

Filed: July 20, 2010

Publication date: January 20, 2011

Inventor: Larry Joseph Kirn
Audio spectral noise reduction method and apparatus

Patent number: 7742914

Abstract: A method of reducing noise in an audio signal, comprising the steps of: using a furrow filter to select spectral components that are narrow in frequency but relatively broad in time; using a bar filter to select spectral components that are broad in frequency but relatively narrow in time; analyzing the relative energy distribution between the output of the furrow and bar filters to determine the optimal proportion of spectral components for the output signal; and reconstructing the audio signal to generate the output signal. A second pair of time-frequency filters may be used to further improve intelligibility of the output signal. The temporal relationship between the furrow filter output and the bar filter output may be monitored so that the fricative components are allowed primarily at boundaries between intervals with no voiced signal present and intervals with voice components. A noise reduction system for an audio signal.

Type: Grant

Filed: March 7, 2005

Date of Patent: June 22, 2010

Inventors: Daniel A. Kosek, Robert Crawford Maher
AUDIO SIGNAL ENCODING METHOD AND APPARATUS

Publication number: 20090024396

Abstract: An audio signal encoding method and apparatus for efficiently encoding an audio signal in an interval having many birth sinusoids and enabling tracking of sinusoidal signals in the next interval, and a computer readable recording medium having embodied thereon a computer program for executing the audio signal encoding method are provided. According to the method and apparatus, by applying transform coding instead of parametric coding to a frame having many birth sinusoids, the sinusoids are encoded, thereby reducing the number of bits required for the encoding and enabling efficient coding. Also, when transform coding is applied to a frame of a predetermined interval, an inverse transform of the transform coding is applied to the encoded data in order to decode the data, and then sinusoids are extracted from the decoded data, thereby enabling tracking of sinusoids of the next frame.

Type: Application

Filed: February 8, 2008

Publication date: January 22, 2009

Applicant: Samsung Electronics Co., Ltd.

Inventors: Chul-woo Lee, Geon-hyoung Lee, Jae-one Oh, Jong-hoon Jeong, Nam-suk Lee
SYSTEM AND METHOD OF CODING SOUND SIGNALS USING SOUND ENHANCMENT

Publication number: 20080215339

Abstract: A system and method of processing sound signals are disclosed. In one embodiment, a speech coder applies a first sound signal enhancement process to a first part of a sound signal and applies a second sound signal enhancement process to a second part of the sound signal. The sound signal is then coded using the enhanced first part of the sound signal and the enhanced first part of the sound signal and the enhanced sound part of the sound signal. Examples of the portions of the sound signal that are separately processed include an excitation signal component and a spectral component of the sound signal.

Type: Application

Filed: May 8, 2008

Publication date: September 4, 2008

Applicant: AT&T Corp.

Inventors: Anthony J Accardi, Richard Vandervoort Cox
SYSTEMS, METHODS, AND APARATUS FOR DYNAMIC NORMALIZATION TO REDUCE LOSS IN PRECISION FOR LOW-LEVEL SIGNALS

Publication number: 20080162126

Abstract: A normalization factor for a current frame of a signal may be determined. The normalization factor may depend on an amplitude of the current frame of the signal. The normalization factor may also depend on non-linear values of states after one or more operations were performed on a previous frame of a normalized signal. The current frame of the signal may be normalized based on the normalization factor that is determined. The states' normalization factor may be adjusted based on the normalization factor that is determined.

Type: Application

Filed: January 30, 2008

Publication date: July 3, 2008

Applicant: QUALCOMM INCORPORATED

Inventors: Vivek Rajendran, Ananthapadmanabhan A. Kandhadai
Method, medium and apparatus enhancing a bass signal using an auditory property

Publication number: 20080091416

Abstract: Provided are a method, medium and apparatus for enhancing an acoustic signal using an auditory property. An acoustic signal is enhanced by generating a plurality of harmonic signals based on a predetermined acoustic signal frequency, selecting harmonic signals, which exist in an area masked by the predetermined harmonic signal, from among the generated plurality of harmonic signals, and outputting harmonic signals remaining after excluding the selected harmonic signals from the generated plurality of harmonic signals. The enhancement results in a bass signal of good sound quality and having a low distortion ratio, without changing the structure of a micro speaker.

Type: Application

Filed: June 22, 2007

Publication date: April 17, 2008

Applicant: SAMSUNG ELECTRONICS CO., LTD.

Inventors: Jung-ho Kim, Sang-wook Kim, Young-tae Kim, Sang-chul Ko
MULTI-CHANNEL PERIODIC SIGNAL ENHANCEMENT SYSTEM

Publication number: 20080019537

Abstract: A multi-channel signal enhancement system reinforces signal content and improves the signal-to-noise ratio of a multi-channel signal. The system detects, tracks, and reinforces non-stationary periodic signal components of a multi-channel signal. The periodic signal components of the signal may represent vowel sounds or other voiced sounds. The system may detect, track, or attenuate quasi-stationary signal components in the multi-channel signal.

Type: Application

Filed: August 31, 2007

Publication date: January 24, 2008

Inventors: Rajeev Nongpiur, Phillip Hetherington
Method for generating an individual hearing device program

Publication number: 20070299654

Abstract: Sound quality and speech comprehensibility are to be improved for hearing device wearers when watching television. Provision is made for this purpose to record acoustic signals at a recording site on a data medium. The acoustic signals are recorded simultaneously with a first microphone as a first recording. The data medium is played back on an individual playback device in an individual environment. Here the signal played back from the data medium is re-recorded as a second recording with a second microphone. The two recordings are connected to each other, subtracted in particular, and the result is used to adjust a hearing device program. It is thus possible to take into account individual acoustic environmental conditions in the hearing device program.

Type: Application

Filed: June 7, 2007

Publication date: December 27, 2007

Inventors: Ulrich Giese, Esfandiar Grafenberg