Enhancement Of Intelligibility Of Clean Or Coded Speech (epo) Patents (Class 704/E21.009)
  • Patent number: 11948592
    Abstract: Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.
    Type: Grant
    Filed: April 20, 2023
    Date of Patent: April 2, 2024
    Assignees: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey Riedmiller, Harald Mundt, Michael Schug, Martin Wolters
  • Patent number: 11888713
    Abstract: A method includes establishing digital communication between a first user device and a second user device using a first codec. The method also includes selecting, based on an input signal representing an estimated unfiltered available bandwidth for the digital communication satisfying a first filter selection threshold, a first filter of two or more filters, and filtering the input signal using the first filter. The method further includes determining that the filtered input signal satisfies a first channel bandwidth threshold and, in response to determining that the filtered input signal satisfies the channel bandwidth threshold, selecting a second codec different from the first codec for further digital communication between the first user device and the second user device.
    Type: Grant
    Filed: November 1, 2022
    Date of Patent: January 30, 2024
    Assignee: Google LLC
    Inventors: Michael Horowitz, Philip Eliasson
  • Patent number: 11854527
    Abstract: An electronic device for adjusting a speech output rate (speech rate) of speech output data.
    Type: Grant
    Filed: August 11, 2021
    Date of Patent: December 26, 2023
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventor: Piotr Marcinkiewicz
  • Patent number: 11806213
    Abstract: A speech transmission compensation apparatus that assists discrimination of speech heard by a user, includes: one or more computers each including a memory and a processor configured to: accept input of a speech signal, detect a specific type of sound in the speech signal, analyze an acoustic characteristic of the specific type of sound in the speech signal and output the acoustic characteristic; accept input of the acoustic characteristic being output by the memory and the processor, generate a vibration signal of a duration corresponding to the acoustic characteristic and output the vibration signal; and accept input of the vibration signal being output by the memory and the processor and provide the user with vibration for the duration on the basis of the vibration signal.
    Type: Grant
    Filed: April 30, 2020
    Date of Patent: November 7, 2023
    Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
    Inventors: Asuka Ono, Momoko Nakatani, Ai Nakane, Yoko Ishii
  • Patent number: 11804230
    Abstract: An audio encoding/decoding apparatus and method using vector quantized residual error features are disclosed. An audio signal encoding method includes outputting a bitstream of a main codec by encoding an original signal, decoding the bitstream of the main codec, determining a residual error feature vector from a feature vector of a decoded signal and a feature vector of the original signal, and outputting a bitstream of additional information by encoding the residual error feature vector.
    Type: Grant
    Filed: April 1, 2022
    Date of Patent: October 31, 2023
    Assignees: Electronics and Telecommunications Research Institute, Gwangju Institute of Science and Technology
    Inventors: Inseon Jang, Seung Kwon Beack, Jongmo Sung, Tae Jin Lee, Woo-taek Lim, Jongwon Shin, Youngju Cheon, Sangwook Han, Soojoong Hwang
  • Patent number: 11600283
    Abstract: A codec allowing for switching between different coding modes is improved by, responsive to a switching instance, performing temporal smoothing and/or blending at a respective transition.
    Type: Grant
    Filed: June 29, 2020
    Date of Patent: March 7, 2023
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Martin Dietz, Eleni Fotopoulou, Jérémie Lecomte, Markus Multrus, Benjamin Schubert
  • Patent number: 11594234
    Abstract: The present invention relates to transposing signals in time and/or frequency and in particular to coding of audio signals. More particular, the present invention relates to high frequency reconstruction (HFR) methods including a frequency domain harmonic transposer. A method and system for generating a transposed output signal from an input signal using a transposition factor T is described. The system comprises an analysis window of length La, extracting a frame of the input signal, and an analysis transformation unit of order M transforming the samples into M complex coefficients. M is a function of the transposition factor T. The system further comprises a nonlinear processing unit altering the phase of the complex coefficients by using the transposition factor T, a synthesis transformation unit of order M transforming the altered coefficients into M altered samples, and a synthesis window of length Ls, generating a frame of the output signal.
    Type: Grant
    Filed: September 27, 2022
    Date of Patent: February 28, 2023
    Assignee: Dolby International AB
    Inventors: Per Ekstrand, Lars Villemoes
  • Patent number: 11568853
    Abstract: Disclosed is a voice recognition method and apparatus using artificial intelligence. A voice recognition method using artificial intelligence may include: generating a utterance by receiving a voice command of a user; obtaining a user's intention by analyzing the generated utterance; deriving an urgency level of the user on the basis of the generated utterance and prestored user information; generating a first response in association with the user's intention; obtaining main vocabularies included in the first response; generating a second response by using the main vocabularies and the urgency level of the user; determining a speech rate of the second response on the basis of the urgency level of the user; and outputting the second response according to the speech rate by synthesizing the second response to a voice signal.
    Type: Grant
    Filed: July 29, 2020
    Date of Patent: January 31, 2023
    Assignee: LG ELECTRONICS INC.
    Inventor: Jonghoon Chae
  • Patent number: 7953596
    Abstract: A method of analyzing time coherence in the noisy signal including the steps of: a) determining a reference signal from the noisy signal by applying treatment (10, 18) to the noisy signal that is suitable for attenuating speech components more strongly than the noise component, in particular by an adaptive recursive predictive algorithm of the LMS type; b) determining (24) a probability of speech being present/absent on the basis of the respective energy levels in the spectral domain of the noisy signal and of the reference signal; and c) deriving (26) a denoised estimate of the speech signal from the noise signal as a function of the probability of the speech being present/absent as determined in this way.
    Type: Grant
    Filed: February 26, 2007
    Date of Patent: May 31, 2011
    Assignee: PARROT Societe Anonyme
    Inventor: Guillaume Pinto
  • Patent number: 7941315
    Abstract: Accepting the speech having the noise superimposed thereon and converting it into a signal on a time axis of the speech, an amplitude component of a speech for each predetermined frequency band of the converted signal on the frequency axis is calculated. Calculating a noise reduction coefficient, the noise component is reduced by multiplying the signal on the frequency axis of the original signal by the calculated noise reduction coefficient. By estimating the target value of the remaining noise for each frequency band, a signal on a frequency axis in which a signal corresponding to a frequency band of which target value estimated by the noise target value is larger than the value of the amplitude component of the signal on the frequency axis of which noise component is reduced is corrected to a signal corresponding to the target value is restored, into a signal on a time axis.
    Type: Grant
    Filed: March 22, 2006
    Date of Patent: May 10, 2011
    Assignee: Fujitsu Limited
    Inventor: Naoshi Matsuo
  • Publication number: 20110015922
    Abstract: Prevalence detection is advantageously applied to the result of specific spectral discrimination to adaptively determine prevalent frequencies existing within an audio signal containing speech. Prevalent frequencies in this audio signal so isolated are attenuated in a highly selective manner, thus reducing the masking potential of pervasive resonances and obfuscative energy within the speech itself over low energy language-imparting speech elements.
    Type: Application
    Filed: July 20, 2010
    Publication date: January 20, 2011
    Inventor: Larry Joseph Kirn
  • Patent number: 7742914
    Abstract: A method of reducing noise in an audio signal, comprising the steps of: using a furrow filter to select spectral components that are narrow in frequency but relatively broad in time; using a bar filter to select spectral components that are broad in frequency but relatively narrow in time; analyzing the relative energy distribution between the output of the furrow and bar filters to determine the optimal proportion of spectral components for the output signal; and reconstructing the audio signal to generate the output signal. A second pair of time-frequency filters may be used to further improve intelligibility of the output signal. The temporal relationship between the furrow filter output and the bar filter output may be monitored so that the fricative components are allowed primarily at boundaries between intervals with no voiced signal present and intervals with voice components. A noise reduction system for an audio signal.
    Type: Grant
    Filed: March 7, 2005
    Date of Patent: June 22, 2010
    Inventors: Daniel A. Kosek, Robert Crawford Maher
  • Publication number: 20090024396
    Abstract: An audio signal encoding method and apparatus for efficiently encoding an audio signal in an interval having many birth sinusoids and enabling tracking of sinusoidal signals in the next interval, and a computer readable recording medium having embodied thereon a computer program for executing the audio signal encoding method are provided. According to the method and apparatus, by applying transform coding instead of parametric coding to a frame having many birth sinusoids, the sinusoids are encoded, thereby reducing the number of bits required for the encoding and enabling efficient coding. Also, when transform coding is applied to a frame of a predetermined interval, an inverse transform of the transform coding is applied to the encoded data in order to decode the data, and then sinusoids are extracted from the decoded data, thereby enabling tracking of sinusoids of the next frame.
    Type: Application
    Filed: February 8, 2008
    Publication date: January 22, 2009
    Applicant: Samsung Electronics Co., Ltd.
    Inventors: Chul-woo Lee, Geon-hyoung Lee, Jae-one Oh, Jong-hoon Jeong, Nam-suk Lee
  • Publication number: 20080215339
    Abstract: A system and method of processing sound signals are disclosed. In one embodiment, a speech coder applies a first sound signal enhancement process to a first part of a sound signal and applies a second sound signal enhancement process to a second part of the sound signal. The sound signal is then coded using the enhanced first part of the sound signal and the enhanced first part of the sound signal and the enhanced sound part of the sound signal. Examples of the portions of the sound signal that are separately processed include an excitation signal component and a spectral component of the sound signal.
    Type: Application
    Filed: May 8, 2008
    Publication date: September 4, 2008
    Applicant: AT&T Corp.
    Inventors: Anthony J Accardi, Richard Vandervoort Cox
  • Publication number: 20080162126
    Abstract: A normalization factor for a current frame of a signal may be determined. The normalization factor may depend on an amplitude of the current frame of the signal. The normalization factor may also depend on non-linear values of states after one or more operations were performed on a previous frame of a normalized signal. The current frame of the signal may be normalized based on the normalization factor that is determined. The states' normalization factor may be adjusted based on the normalization factor that is determined.
    Type: Application
    Filed: January 30, 2008
    Publication date: July 3, 2008
    Applicant: QUALCOMM INCORPORATED
    Inventors: Vivek Rajendran, Ananthapadmanabhan A. Kandhadai
  • Publication number: 20080091416
    Abstract: Provided are a method, medium and apparatus for enhancing an acoustic signal using an auditory property. An acoustic signal is enhanced by generating a plurality of harmonic signals based on a predetermined acoustic signal frequency, selecting harmonic signals, which exist in an area masked by the predetermined harmonic signal, from among the generated plurality of harmonic signals, and outputting harmonic signals remaining after excluding the selected harmonic signals from the generated plurality of harmonic signals. The enhancement results in a bass signal of good sound quality and having a low distortion ratio, without changing the structure of a micro speaker.
    Type: Application
    Filed: June 22, 2007
    Publication date: April 17, 2008
    Applicant: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Jung-ho Kim, Sang-wook Kim, Young-tae Kim, Sang-chul Ko
  • Publication number: 20080019537
    Abstract: A multi-channel signal enhancement system reinforces signal content and improves the signal-to-noise ratio of a multi-channel signal. The system detects, tracks, and reinforces non-stationary periodic signal components of a multi-channel signal. The periodic signal components of the signal may represent vowel sounds or other voiced sounds. The system may detect, track, or attenuate quasi-stationary signal components in the multi-channel signal.
    Type: Application
    Filed: August 31, 2007
    Publication date: January 24, 2008
    Inventors: Rajeev Nongpiur, Phillip Hetherington
  • Publication number: 20070299654
    Abstract: Sound quality and speech comprehensibility are to be improved for hearing device wearers when watching television. Provision is made for this purpose to record acoustic signals at a recording site on a data medium. The acoustic signals are recorded simultaneously with a first microphone as a first recording. The data medium is played back on an individual playback device in an individual environment. Here the signal played back from the data medium is re-recorded as a second recording with a second microphone. The two recordings are connected to each other, subtracted in particular, and the result is used to adjust a hearing device program. It is thus possible to take into account individual acoustic environmental conditions in the hearing device program.
    Type: Application
    Filed: June 7, 2007
    Publication date: December 27, 2007
    Inventors: Ulrich Giese, Esfandiar Grafenberg