Vocoders Using Multiple Modes (epo) Patents (Class 704/E19.041)

E Subclasses

Using sound class specific coding, hybrid encoders, object-based coding (epo) (Class 704/E19.042)

Mode decision, i.e., based on audio signal content versus external parameter (epo) (Class 704/E19.043)

Variable rate or variable quality codecs, e.g., scalable representation encoding, etc. (epo) (Class 704/E19.044)

Device and method for machine-learning based noise suppression

Patent number: 12230289

Abstract: A device, system, and method for machine-learning based noise suppression is provided. The device (and/or system) comprises a microphone, an output device, a noise suppression engine and a machine-learning noise suppression engine. The machine-learning noise suppression engine receives audio data from the noise suppression engine or the microphone, applies machine learning algorithms to the audio data to generate machine-learning based noise suppression parameters, and provides the parameters to the noise suppression engine. The noise suppression engine receives the audio data from the microphone and, prior to receiving the parameters, applies non-machine-learning based noise suppression to the audio data to generate noise-suppressed audio data, and provides the noise-suppressed audio data to the output device.

Type: Grant

Filed: August 29, 2022

Date of Patent: February 18, 2025

Assignee: MOTOROLA SOLUTIONS, INC.

Inventor: Jesus F. Corretjer
Switching between stereo coding modes in a multichannel sound codec

Patent number: 12205598

Abstract: A method and device for encoding a stereo sound signal comprise stereo encoders using stereo modes operating in time domain (TD), in frequency domain (FD) or in modified discrete Fourier transform (MDCT) domain. A controller controls switching between the TD, FD and MDCT stereo modes. Upon switching from one stereo mode to the other, the switching controller may (a) recalculate at least one length of down-processed/mixed signal in a current frame of the stereo sound signal, (b) reconstruct a down-processed/mixed signal and also other signals related to the other stereo mode in the current frame, (c) adapt data structures and/or memories for coding the stereo sound signal in the current frame using the other stereo mode, and/or (d) alter a TD stereo channel down-mixing to maintain a correct phase of left and right channels of the stereo sound signal. Corresponding stereo sound signal decoding method and device are described.

Type: Grant

Filed: February 1, 2021

Date of Patent: January 21, 2025

Assignee: VOICEAGE CORPORATION

Inventor: Vaclav Eksler
Apparatus for encoding and decoding of integrated speech and audio

Patent number: 12205599

Abstract: Provided is an encoding apparatus for integrally encoding and decoding a speech signal and a audio signal, and may include: an input signal analyzer to analyze a characteristic of an input signal; a stereo encoder to down mix the input signal to a mono signal when the input signal is a stereo signal, and to extract stereo sound image information; a frequency band expander to expand a frequency band of the input signal; a sampling rate converter to convert a sampling rate; a speech signal encoder to encode the input signal using a speech encoding module when the input signal is a speech characteristics signal; a audio signal encoder to encode the input signal using a audio encoding module when the input signal is a audio characteristic signal; and a bitstream generator to generate a bitstream.

Type: Grant

Filed: June 21, 2023

Date of Patent: January 21, 2025

Assignees: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE, KWANGWOON UNIVERSITY INDUSTRY-ACADEMIC COLLABORATION FOUNDATION

Inventors: Tae Jin Lee, Seung-Kwon Baek, Min Je Kim, Dae Young Jang, Jeongil Seo, Kyeongok Kang, Jin-Woo Hong, Hochong Park, Young-Cheol Park
Using corrections, of predicted textual segments of spoken utterances, for training of on-device speech recognition model

Patent number: 12183321

Abstract: Processor(s) of a client device can: receive audio data that captures a spoken utterance of a user of the client device; process, using an on-device speech recognition model, the audio data to generate a predicted textual segment that is a prediction of the spoken utterance; cause at least part of the predicted textual segment to be rendered (e.g., visually and/or audibly); receive further user interface input that is a correction of the predicted textual segment to an alternate textual segment; and generate a gradient based on comparing at least part of the predicted output to ground truth output that corresponds to the alternate textual segment. The gradient is used, by processor(s) of the client device, to update weights of the on-device speech recognition model and/or is transmitted to a remote system for use in remote updating of global weights of a global speech recognition model.

Type: Grant

Filed: October 5, 2023

Date of Patent: December 31, 2024

Assignee: GOOGLE LLC

Inventors: Françoise Beaufays, Johan Schalkwyk, Giovanni Motta
Signal processing device, method, and program

Patent number: 12170092

Abstract: The present technology relates to a signal processing device, a method, and a program that can obtain a signal with higher sound quality. The signal processing device includes: a calculation unit that calculates a parameter for generating a difference signal corresponding to an input compressed sound source signal on the basis of a prediction coefficient and the input compressed sound source signal, the prediction coefficient being obtained by learning using, as training data, a difference signal between an original sound signal and a learning compressed sound source signal obtained by compressing and coding the original sound signal; a difference signal generation unit that generates the difference signal on the basis of the parameter and the input compressed sound source signal; and a synthesis unit that synthesizes the generated difference signal and the input compressed sound source signal. The present technology can be applied to a signal processing device.

Type: Grant

Filed: February 20, 2020

Date of Patent: December 17, 2024

Assignee: Sony Group Corporation

Inventor: Takao Fukui
Audio coding method and apparatus

Patent number: 12136430

Abstract: A method comprises determining a first modification weight according to linear spectral frequency (LSF) differences of the current frame and LSF differences of a previous frame of the current frame when a signal characteristic of the current frame meets a preset modification condition, modifying the linear predictive parameter of the current frame according to the determined first modification weight, and coding the current frame according to the modified linear predictive parameter.

Type: Grant

Filed: August 27, 2021

Date of Patent: November 5, 2024

Assignee: Top Quality Telephony, LLC

Inventors: Zexin Liu, Bin Wang, Lei Miao
Sound field adjustment

Patent number: 12126982

Abstract: A device includes one or more processors configured to obtain sound information from an audio source. The one or more processors are further configured to select, based on a latency criterion associated with a playback device, a compression mode in which a representation of the sound information is compressed prior to transmission to the playback device or a bypass mode in which the representation of the sound information is not compressed prior to transmission to the playback device. The one or more processors are further configured to generate audio data that includes, based on the selected one of the compression mode or the bypass mode, a compressed representation of the sound information or an uncompressed representation of the sound information. The one or more processors are also configured to send the audio data as streaming data, via wireless transmission, to the playback device.

Type: Grant

Filed: June 28, 2021

Date of Patent: October 22, 2024

Assignee: QUALCOMM Incorporated

Inventors: Isaac Garcia Munoz, Nils Gunther Peters, Vinay Melkote Krishnaprasad, Andre Schevciw
Dialog detector

Patent number: 12118987

Abstract: The present application relates to a method of extracting audio features in a dialog detector in response to an input audio signal, the method comprising dividing the input audio signal into a plurality of frames, extracting frame audio features from each frame, determining a set of context windows, each context window including a number of frames surrounding a current frame, deriving, for each context window, a relevant context audio feature for the current frame based on the frame audio features of the frames in each respective context, and concatenating each context audio feature to form a combined feature vector to represent the current frame. The context windows with the different length can improve the response speed and improve robustness.

Type: Grant

Filed: April 13, 2020

Date of Patent: October 15, 2024

Inventors: Lie Lu, Xin Liu
Linear prediction analysis device, method, program, and storage medium

Patent number: 11972768

Abstract: An autocorrelation calculation unit 21 calculates an autocorrelation RO(i) from an input signal. A prediction coefficient calculation unit 23 performs linear prediction analysis by using a modified autocorrelation R?O(i) obtained by multiplying a coefficient wO(i) by the autocorrelation RO(i). It is assumed here, for each order i of some orders i at least, that the coefficient wO(i) corresponding to the order i is in a monotonically increasing relationship with an increase in a value that is negatively correlated with a fundamental frequency of the input signal of the current frame or a past frame.

Type: Grant

Filed: October 21, 2022

Date of Patent: April 30, 2024

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Yutaka Kamamoto, Takehiro Moriya, Noboru Harada
Apparatus and method for improved signal fade out in different domains during error concealment

Patent number: 11776551

Abstract: An apparatus for decoding an audio signal is provided, having a receiving interface, configured to receive a first frame having a first audio signal portion of the audio signal, and configured to receive a second frame having a second audio signal portion of the audio signal; a noise level tracing unit, wherein the noise level tracing unit is configured to determine noise level information depending on at least one of the first audio signal portion and the second audio signal portion; a first reconstruction unit for reconstructing, in a first reconstruction domain, a third audio signal portion of the audio signal depending on the noise level information; a transform unit for transforming the noise level information to a second reconstruction domain; and a second reconstruction unit for reconstructing, in the second reconstruction domain, a fourth audio signal portion of the audio signal depending on the noise level information.

Type: Grant

Filed: December 14, 2020

Date of Patent: October 3, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Michael Schnabel, Markovic Goran, Ralph Sperschneider, Jérémie Lecomte, Christian Helmrich
Method and apparatus for detecting correctness of pitch period

Patent number: 11741980

Abstract: A method and an apparatus for detecting correctness of a pitch period, where the method for detecting correctness of a pitch period includes determining, according to an initial pitch period of an input signal in a time domain, a pitch frequency bin of the input signal, where the initial pitch period is obtained by performing open-loop detection on the input signal, determining, based on an amplitude spectrum of the input signal in a frequency domain, a pitch period correctness decision parameter, associated with the pitch frequency bin, of the input signal, and determining correctness of the initial pitch period according to the pitch period correctness decision parameter.

Type: Grant

Filed: April 16, 2021

Date of Patent: August 29, 2023

Assignee: HUAWEI TECHNOLOGIES CO., LTD.

Inventors: Fengyan Qi, Lei Miao
Method, apparatus and systems for audio decoding and encoding

Patent number: 11676622

Abstract: An audio processing system (100) accepts an audio bitstream having one of a plurality of predefined audio frame rates. The system comprises a front-end component (110), which receives a variable number of quantized spectral components, corresponding to one audio frame in any of the predefined audio frame rates, and performs an inverse quantization according to predetermined, frequency-dependent quantization levels. The front-end component may be agnostic of the audio frame rate. The audio processing system further comprises a frequency-domain processing stage (120) and a sample rate converter (130), which provide a reconstructed audio signal sampled at a target sampling frequency independent of the audio frame rate. By its frame-rate adaptability, the system can be configured to operate frame-synchronously in parallel with a video processing system that accepts plural video frame rates.

Type: Grant

Filed: June 10, 2021

Date of Patent: June 13, 2023

Assignee: DOLBY INTERNATIONAL AB

Inventors: Heiko Purnhagen, Kristofer Kjoerling, Alexander Stahlmann, Jens Popp, Karl Jonas Roeden
Voice activity detection using a soft decision mechanism

Patent number: 11670325

Abstract: Voice activity detection (VAD) is an enabling technology for a variety of speech based applications. Herein disclosed is a robust VAD algorithm that is also language independent. Rather than classifying short segments of the audio as either “speech” or “silence”, the VAD as disclosed herein employees a soft-decision mechanism. The VAD outputs a speech-presence probability, which is based on a variety of characteristics.

Type: Grant

Filed: May 21, 2020

Date of Patent: June 6, 2023

Assignee: Verint Systems Ltd.

Inventor: Ron Wein
Speech processing method and device thereof

Patent number: 11587573

Abstract: The disclosure provides a speech processing method and a device thereof. The method includes: acquiring a speech sampling signal frame in a mixed-excitation linear prediction (MELP) speech coding system and estimating signal quality of the speech sampling signal frame; determining, based on the signal quality, a specific linear prediction coding (LPC) order used by an LPC circuit; controlling the LPC circuit to convert the speech sampling signal frame into a line spectrum pair parameter based on the specific LPC order; replacing a speech signal spectrum of the speech sampling signal frame with the line spectrum pair parameter to generate a predicted speech signal; and performing a speech coding operation and a signal synthesizing operation of the MELP speech coding system based on the predicted speech signal.

Type: Grant

Filed: November 28, 2019

Date of Patent: February 21, 2023

Assignee: Acer Incorporated

Inventors: Chao-Lun Chen, An-Cheng Lee, Li-Wei Huang
Decompression engine for decompressing compressed input data that includes multiple streams of data

Patent number: 11561797

Abstract: An electronic device that includes a decompression engine that includes N decoders and a decompressor decompresses compressed input data that includes N streams of data. Upon receiving a command to decompress compressed input data, the decompression engine causes each of the N decoders to decode a respective one of the N streams from the compressed input data separately and substantially in parallel with others of the N decoders. Each decoder outputs a stream of decoded data of a respective type for generating commands associated with a compression standard for decompressing the compressed input data. The decompressor next generates, from the streams of decoded data output by the N decoders, commands for decompressing the data using the compression standard to recreate the original data. The decompressor next executes the commands to recreate the original data and stores the original data in a memory or provides the original data to another entity.

Type: Grant

Filed: August 19, 2019

Date of Patent: January 24, 2023

Assignee: ATI Technologies ULC

Inventor: Vinay Patel
Linear prediction analysis device, method, program, and storage medium

Patent number: 11532315

Abstract: An autocorrelation calculation unit 21 calculates an autocorrelation RO(i) from an input signal. A prediction coefficient calculation unit 23 performs linear prediction analysis by using a modified autocorrelation R?O(i) obtained by multiplying a coefficient wO(i) by the autocorrelation RO(i). It is assumed here, for each order i of some orders i at least, that the coefficient wO(i) corresponding to the order i is in a monotonically increasing relationship with an increase in a value that is negatively correlated with a fundamental frequency of the input signal of the current frame or a past frame.

Type: Grant

Filed: December 14, 2020

Date of Patent: December 20, 2022

Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION

Inventors: Yutaka Kamamoto, Takehiro Moriya, Noboru Harada
Method and an apparatus for processing speech, audio, and speech/audio signal using mode information

Patent number: 8781843

Abstract: A method of processing a signal, which includes receiving at least one of a first signal and a second signal, receiving mode information, and decoding the at least one of the first signal and the second signal using at least one of a first coding scheme and a second coding scheme according to the mode information. Further, the mode information is information for indicating that a prescribed mode corresponds to which one of at least three modes.

Type: Grant

Filed: October 15, 2008

Date of Patent: July 15, 2014

Assignee: Intellectual Discovery Co., Ltd.

Inventors: Hyen-O Oh, Hong Goo Kang, Chang Heon Lee, Sang Wook Shin, Yang Won Jung
Method and apparatus for packing and decoding audio and other data

Patent number: 7848929

Abstract: A method and apparatus for compressing digital data, particularly audio and other data, in a way that the packing method used can be automatically detected and decoded at the receiving station. The audio signal is divided into compression packets consisting of four word pairs of left and right words. The first word pair in each compression packet is tagged with an identifier to indicate the start of a new compression packet, and is provided with configuration information which, over an entire compression block of 48 compression packets, constructs a 48-bit word specifying the manner in which the compressed audio and other data is packed.

Type: Grant

Filed: February 6, 2001

Date of Patent: December 7, 2010

Assignee: Harris Systems Limited

Inventor: David Duncan
Method and Apparatus for Estimating High-Band Energy in a Bandwidth Extension System

Publication number: 20090198498

Abstract: A method (100) includes receiving (101) an input digital audio signal comprising a narrow-band signal. The input digital audio signal is processed (102) to generate a processed digital audio signal. A high-band energy level corresponding to the input digital audio signal is estimated (103) based on a transition-band of the processed digital audio signal within a predetermined upper frequency range of a narrow-band bandwidth. A high-band digital audio signal is generated (104) based on the high-band energy level and an estimated high-band spectrum corresponding to the high-band energy level.

Type: Application

Filed: February 1, 2008

Publication date: August 6, 2009

Applicant: MOTOROLA, INC.

Inventors: Tenkasi V. Ramabadran, Mark A. Jasiuk
Speech coding system and method using bi-directional mirror-image predicted pulses

Publication number: 20090043574

Abstract: There is provided a method of decoding speech data generated from a speech signal. The method comprises receiving the speech data having at least one main pulse in a subframe of the speech data; generating a first predicted pulse, based on the at least one main pulse, on one side of the main pulse in the subframe of the speech data, wherein the first predicted pulse has a lower gain than the main pulse; generating a second predicted pulse, as a mirror image of the first predicted pulse on a reverse time scale, on the other side of the main pulse in the subframe of the speech data; reconstructing the speech signal using the at least one main pulse, the first predicted pulse and the second predicted pulse.

Type: Application

Filed: September 23, 2008

Publication date: February 12, 2009

Applicants: Mindspeed Technologies, Inc.

Inventors: Yang Gao, Adil Benyassine, Jes Thyssen, Eyal Shlomot, Huan-yu Su
METHOD, MEDIUM, AND APPARATUS TO CLASSIFY FOR AUDIO SIGNAL, AND METHOD, MEDIUM AND APPARATUS TO ENCODE AND/OR DECODE FOR AUDIO SIGNAL USING THE SAME

Publication number: 20080162121

Abstract: Provided are a classifying method and apparatus for an audio signal, and an encoding/decoding method and apparatus for an audio signal using the classifying method and apparatus. In the classification method, an audio signal is classified by adaptively adjusting a classification threshold for a frame of the audio signal that is to be classified according to a long-term feature of the audio signal, thereby improving a hit rate of signal classification, suppressing frequent mode switching per frame, improving noise tolerance, and providing smooth reconstruction of the audio signal.

Type: Application

Filed: December 27, 2007

Publication date: July 3, 2008

Applicant: Samsung Electronics Co., Ltd

Inventors: Chang-yong Son, Eun-mi Oh, Ki-hyun Choo, Jung-hoe Kim