Frequency Patents (Class 704/205)

Specialized information (Class 704/206)

Pitch (Class 704/207)

Voiced or unvoiced (Class 704/208)

Audio coding and decoding method and apparatus, medium, and electronic device

Patent number: 12347445

Abstract: An electronic device performs sub-band decomposition on a to-be-coded audio to obtain a to-be-coded low frequency signal corresponding to a low frequency band and a to-be-coded high frequency signal corresponding to a high frequency band. The device performs compression coding on the to-be-coded low frequency signal to obtain low frequency coded data of the to-be-coded low frequency signal. The device determines high frequency prediction information according to the to-be-coded low frequency signal. The device performs feature extraction on the to-be-coded high frequency signal to obtain high frequency feature information. The device determines high frequency compensation information of the to-be-coded high frequency signal according to a difference between the high frequency feature information and the high frequency prediction information.

Type: Grant

Filed: May 9, 2022

Date of Patent: July 1, 2025

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventor: Junbin Liang
Low bitrate audio encoding/decoding scheme having cascaded switches

Patent number: 12334086

Abstract: An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.

Type: Grant

Filed: November 7, 2024

Date of Patent: June 17, 2025

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Bernhard Grill, Roch Lefebvre, Bruno Bessette, Jimmy Lapierre, Philippe Gournay, Redwan Salami, Stefan Bayer, Guillaume Fuchs, Stefan Geyersberger, Ralf Geiger, Johannes Hilpert, Ulrich Kraemer, Jérémie Lecomte, Markus Multrus, Max Neuendorf, Harald Popp, Nikolaus Rettelbach
Audio denoising method and system

Patent number: 12335698

Abstract: In an audio denoising method and system provided in the present disclosure, a gain coefficient corresponding to each frequency unit may be generated based on a parameter related to a frequency by using a frequency of an audio signal as a unit, and gain processing is performed on each frequency unit separately by using the gain coefficient. The gain coefficient corresponding to a frequency unit including more valid audio signals may be larger, and a gain coefficient corresponding to a frequency unit including fewer valid audio signals may be smaller, so that more audio signals corresponding to frequency parts including more valid audio signals are preserved, while less audio signals corresponding to frequency parts including fewer valid audio signals are preserved. In this way, fidelity and intelligibility of an audio signal are improved while quality of the audio signal is improved and noise is reduced.

Type: Grant

Filed: April 14, 2023

Date of Patent: June 17, 2025

Assignee: SHENZHEN SHOKZ CO., LTD.

Inventors: Jinbo Zheng, Meilin Zhou, Fengyun Liao, Xin Qi
Downmixer and method of downmixing

Patent number: 12230281

Abstract: A downmixer for downmixing a multi-channel signal having at least two channels, includes: a weighting value estimator for estimating band-wise weighting values for the at least two channels; a spectral weighter for weighting spectral domain representations of the at least two channels using the band-wise weighting values; a converter for converting weighted spectral domain representations of the at least two channels into time representations of the at least two channels; and a mixer for mixing the time representations of the at least two channels to obtain a downmix signal.

Type: Grant

Filed: August 12, 2021

Date of Patent: February 18, 2025

Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.

Inventors: Franz Reutelhuber, Bernd Edler, Eleni Fotopoulou, Markus Multrus, Pallavi Maben, Sascha Disch
High-resolution sound source map obtaining and analyzing method and system using artificial intelligence neural network

Patent number: 12222430

Abstract: A method and system for generating a target map for training a neural network and obtaining a sound source map regardless of the maximum number of sound sources, having a short computation time for inference, high spatial resolution and high sound source accuracy. The method includes generating grids each having a spacing within a given range at positions where sound sources are present in order to form a sound source map, calculating a result value for each of coordinates of the grids so that the result value is a local maximum at the position of a sound source and the result value decreases depending on the distance from the sound source, arranging the result values at positions on matrices corresponding to the respective coordinates of the grids, and generating a target map having an image form by using the result values arranged in on the matrices.

Type: Grant

Filed: May 27, 2021

Date of Patent: February 11, 2025

Assignees: KOREA RESEARCH INSTITUTE OF STANDARDS AND SCIENCE, POSTECH RESEARCH AND BUSINESS DEVELPMENT FOUNDATION

Inventors: Ji Ho Chang, Seung Chul Lee, Soo Young Lee
Method for selecting output wave beam of microphone array

Patent number: 12223976

Abstract: A method for estimating a direction of arrival of sound signals from a microphone array, comprising: receiving sound signals from the microphone array, and performing beamforming on the sound signals to obtain wave beams and corresponding wave beam output signals; performing the following operation on each wave beam: converting the wave beam output signal of a current wave beam to frequency domain from time domain to obtain a frequency spectrum vector and a power spectrum vector; calculating comprehensive voice signal energy of the current wave beam, wherein the comprehensive voice signal energy is the product of comprehensive energy indicating the energy level of the wave beam output signal and a comprehensive voice existence probability indicating an existence probability of voice in the wave beam output signal; and selecting the wave beam with a maximal comprehensive voice signal energy value as the output wave beam.

Type: Grant

Filed: November 12, 2020

Date of Patent: February 11, 2025

Assignee: ESPRESSIF SYSTEMS (SHANGHAI) CO., LTD.

Inventor: Yang Zhao
Training generative adversarial networks to upsample audio

Patent number: 12159645

Abstract: Introduced here are approaches to training and then employing computer-implemented models designed to upsample discrete audio signals to higher sampling rates. Assume, for example, that a media production platform obtains a first discrete signal at a relatively low sampling rate. The relatively low sampling frequency may make the first discrete audio signal unsuitable for inclusion in media compilations, so the media production platform may attempt to improve its quality through upsampling. To accomplish this, the media production platform can apply a transform to the first discrete signal to produce a first magnitude spectrogram. Then, the media production platform can apply a computer-implemented model to the first magnitude spectrogram to produce a second magnitude spectrogram. Thereafter, the media production platform can apply an inverse transform to the second magnitude spectrogram to create a second discrete signal that has a higher sampling rate than the first discrete audio signal.

Type: Grant

Filed: September 17, 2021

Date of Patent: December 3, 2024

Assignee: Descript, Inc.

Inventors: Rithesh Kumar, Kundan Kumar
Multi-channel speech compression system and method

Patent number: 12114147

Abstract: A method, computer program product, and computing system for generating a plurality of acoustic relative transfer functions between a plurality of audio acquisition devices of an audio recording system based upon, at least in part, one or more of a predefined speech processing application and a predefined acoustic environment. An acoustic relative transfer function codebook may be generated using the plurality of acoustic relative transfer functions. One or more channels from the plurality of audio acquisition devices of the audio recording system may be encoded using the acoustic relative transfer function codebook.

Type: Grant

Filed: February 11, 2022

Date of Patent: October 8, 2024

Assignee: Microsoft Technology Licensing, LLC

Inventors: Dushyant Sharma, Patrick A. Naylor, Uwe Helmut Jost
Accompaniment classification method and apparatus

Patent number: 12093314

Abstract: An accompaniment classification method and apparatus is provided. The method includes the following. A first type of audio features of a target accompaniment is obtained (S301, S401). Data normalization is performed on each kind of audio features in the first type of audio features of the target accompaniment to obtain a first feature-set of the target accompaniment and the first feature-set is input into a first classification model for processing (S302, S402). A first probability value output by the first classification model for the first feature-set is obtained (S303, S403). An accompaniment category of the target accompaniment is determined to be a first category of accompaniments when the first probability value is greater than a first classification threshold (S404). The accompaniment category of the target accompaniment is determined to be other categories of accompaniments when the first probability value is less than or equal to the first classification threshold.

Type: Grant

Filed: May 19, 2022

Date of Patent: September 17, 2024

Assignee: Tencent Music Entertainment Technology (Shenzhen) Co., Ltd.

Inventor: Dong Xu
Joint estimation of acoustic parameters from single-microphone speech

Patent number: 12087319

Abstract: Embodiments described herein provide for end-to-end joint determination of degradation parameter scores for certain types of degradation. Degradation parameters include degradation describing additive noise and multiplicative noise such as Signal-to-Noise Ratio (SNR), reverberation time (T60), and Direct-to-Reverberant Ratio (DRR). Various neural network architectures are described such that the inherent interplay between the degradation parameters is considered in both the degradation parameter score and degradation score determination. The neural network architectures are trained according to computer generated audio datasets.

Type: Grant

Filed: October 23, 2020

Date of Patent: September 10, 2024

Assignee: Pindrop Security, Inc.

Inventors: David Looney, Nikolay Gaubitch
Data processing method based on simultaneous interpretation, computer device, and storage medium

Patent number: 12087290

Abstract: A data processing method based on simultaneous interpretation, applied to a server in a simultaneous interpretation system, including: obtaining audio transmitted by a simultaneous interpretation device; processing the audio by using a simultaneous interpretation model to obtain an initial text; transmitting the initial text to a user terminal; receiving a modified text fed back by the user terminal, the modified text being obtained after the user terminal modifies the initial text; and updating the simultaneous interpretation model according to the initial text and the modified text.

Type: Grant

Filed: July 28, 2020

Date of Patent: September 10, 2024

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Jingliang Bai, Caisheng Ouyang, Haikang Liu, Lianwu Chen, Qi Chen, Yulu Zhang, Min Luo, Dan Su
Resource allocation for uplink transmissions in unlicensed spectrum

Patent number: 12075432

Abstract: Systems and methods related to partial interlace frequency domain resource allocations for uplink transmissions from a wireless device are disclosed. In some embodiments, a method performed by a wireless device comprises receiving a resource allocation for an uplink transmission that allocates resources in one or more partially allocated interlaces and performing an uplink transmission on the allocated resources in the one or more partially allocated interlaces in accordance with the resource allocation. In this manner, a low-complexity approach to support flexible frequency domain resource allocation is provided. In addition, using this approach, an interlace may be shared by two or more wireless devices, which may increase spectral efficiency and reduce latency.

Type: Grant

Filed: January 7, 2020

Date of Patent: August 27, 2024

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Tai Do, Joao Vieira, Stephen Grant
Apparatus and methods for watermarking using starting phase modulation

Patent number: 12041258

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for watermarking using starting phase modulation. An example apparatus includes at least one memory, machine readable instructions, and processor circuitry to execute the machine readable instructions to at least access a media signal, access a watermark symbol to be encoded into the media signal, determine bit values of the watermark symbol, determine a common starting phase value for watermark components of the watermark symbol, the common starting phase value to represent at least one of the bit values of the watermark symbol, and embed the watermark components into the media signal based on the common starting phase value.

Type: Grant

Filed: June 24, 2022

Date of Patent: July 16, 2024

Assignee: The Nielsen Company (US), LLC

Inventors: Alexander Topchy, Vladimir Kuznetsov, Jeremey M. Davis
Resource allocation for uplink transmissions in unlicensed spectrum

Patent number: 12010699

Abstract: Systems and methods related to partial interlace frequency domain resource allocations for uplink transmissions from a wireless device are disclosed. In some embodiments, a method performed by a wireless device comprises receiving a resource allocation for an uplink transmission that allocates resources in one or more partially allocated interlaces and performing an uplink transmission on the allocated resources in the one or more partially allocated interlaces in accordance with the resource allocation. In this manner, a low-complexity approach to support flexible frequency domain resource allocation is provided. In addition, using this approach, an interlace may be shared by two or more wireless devices, which may increase spectral efficiency and reduce latency.

Type: Grant

Filed: January 7, 2020

Date of Patent: June 11, 2024

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Tai Do, Joao Vieira, Stephen Grant
Apparatus and method for encoding or decoding an audio signal with intelligent gap filling in the spectral domain

Patent number: 11922956

Abstract: An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating every constructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.

Type: Grant

Filed: March 3, 2022

Date of Patent: March 5, 2024

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Sascha Disch, Frederik Nagel, Ralf Geiger, Balaji Nagendran Thoshkahna, Konstantin Schmidt, Stefan Bayer, Christian Neukam, Bernd Edler, Christian Helmrich
System and method for podcast repetitive content detection

Patent number: 11922967

Abstract: In one aspect, a method includes detecting a fingerprint match between query fingerprint data representing at least one audio segment within podcast content and reference fingerprint data representing known repetitive content within other podcast content, detecting a feature match between a set of audio features across multiple time-windows of the podcast content, and detecting a text match between at least one query text sentences from a transcript of the podcast content and reference text sentences, the reference text sentences comprising text sentences from the known repetitive content within the other podcast content. The method also includes responsive to the detections, generating sets of labels identifying potential repetitive content within the podcast content. The method also includes selecting, from the sets of labels, a consolidated set of labels identifying segments of repetitive content within the podcast content, and responsive to selecting the consolidated set of labels, performing an action.

Type: Grant

Filed: December 10, 2020

Date of Patent: March 5, 2024

Assignee: Gracenote, Inc.

Inventors: Amanmeet Garg, Aneesh Vartakavi
Phase reconstruction in a speech decoder

Patent number: 11817107

Abstract: Innovations in phase quantization during speech encoding and phase reconstruction during speech decoding are described. For example, to encode a set of phase values, a speech encoder omits higher-frequency phase values and/or represents at least some of the phase values as a weighted sum of basis functions. Or, as another example, to decode a set of phase values, a speech decoder reconstructs at least some of the phase values using a weighted sum of basis functions and/or reconstructs lower-frequency phase values then uses at least some of the lower-frequency phase values to synthesize higher-frequency phase values. In many cases, the innovations improve the performance of a speech codec in low bitrate scenarios, even when encoded data is delivered over a network that suffers from insufficient bandwidth or transmission quality problems.

Type: Grant

Filed: July 27, 2022

Date of Patent: November 14, 2023

Assignee: Microsoft Technology Licensing, LLC

Inventors: Soren Skak Jensen, Sriram Srinivasan, Koen Bernard Vos
Measurement method and measurement apparatus

Patent number: 11812230

Abstract: A measurement method includes generating a plurality of second measurement signals by disposing a plurality of first measurement signals corresponding to each of the plurality of speakers in respective different time zones on a time axis, generating a plurality of third measurement signals by copying a portion of a back end of each of the plurality of second measurement signals and adding the portion to a front end of each of the plurality of second measurement signals, outputting sounds according to each of the plurality of third measurement signals from each of the plurality of speakers, collecting the sounds with a microphone, and calculating a plurality of impulse responses corresponding to the plurality of first measurement signals, based on the collected sound signal collected with the microphone and the plurality of third measurement signals.

Type: Grant

Filed: March 23, 2022

Date of Patent: November 7, 2023

Assignee: Yamaha Corporation

Inventor: Ryo Matsuda
Audio encoding device, method and program, and audio decoding device, method and program

Patent number: 11756556

Abstract: An audio packet error concealment system includes an encoding unit for encoding an audio signal consisting of a plurality of frames, and an auxiliary information encoding unit for estimating and encoding auxiliary information about a temporal change of power of the audio signal. The auxiliary information is used in packet loss concealment in decoding of the audio signal. The auxiliary information about the temporal change of power may contain a parameter that functionally approximates a plurality of powers of subframes shorter than one frame, or may contain information about a vector obtained by vector quantization of a plurality of powers of subframes shorter than one frame.

Type: Grant

Filed: March 23, 2022

Date of Patent: September 12, 2023

Assignees: NTT DOCOMO, INC., JPMORGAN CHASE BANK, N.A., AS COLLATERAL AGENT

Inventors: Kimitaka Tsutsumi, Kei Kikuiri
Audio system for elevator

Patent number: 11751001

Abstract: An audio system for an elevator includes two or more speaker cabinets arranged inside a suspended ceiling fixed to a ceiling board of a car of the elevator, an input device to which sound content radiated to an inside of the car from each of the two or more speaker cabinets are input, and a sound field control device configured to conduct phase control and reverberation time control for the sound content and thereby cause a sound wave based on the sound content to be radiated from the speaker cabinet to the inside of the car. Each of the speaker cabinets includes a casing arranged inside the suspended ceiling, and a speaker unit arranged inside the casing and having a radiation surface that radiates the sound wave.

Type: Grant

Filed: March 13, 2020

Date of Patent: September 5, 2023

Assignee: Mitsubishi Electric Corporation

Inventors: Susumu Fujiwara, Keigo Taruishi, Masami Aikawa
Keyword detection method and related apparatus

Patent number: 11749262

Abstract: A keyword detection method includes: obtaining an enhanced speech signal of a to-be-detected speech signal, the enhanced speech signal corresponding to a target speech speed; performing speed adjustment on the enhanced speech signal to obtain a first speed-adjusted speech signal having a first speech speed, the first speech speed being different from the target speech speed; obtaining a first speech feature signal according to the first speed-adjusted speech signal; obtaining a detection result according to a first keyword detection result corresponding to the first speech feature signal, the detection result indicating whether a target keyword exists in the to-be-detected speech signal; and performing an operation corresponding to the target keyword in response to determining that the target keyword exists according to the detection result.

Type: Grant

Filed: June 10, 2021

Date of Patent: September 5, 2023

Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED

Inventors: Yi Gao, Ian Ernan Liu, Min Luo
Online target-speech extraction method based on auxiliary function for robust automatic speech recognition

Patent number: 11694707

Abstract: A target speech signal extraction method for robust speech recognition includes: initializing a steering vector for a target speech source and an adaptive vector, setting a real output channel of the target speech source as an output by the adaptive vector, initializing adaptive vectors for a noise and setting a dummy channel as an output by the adaptive vectors for the noise; setting a cost function for minimizing dependency between a real output for the target speech source and a dummy output for the noise; setting an auxiliary function to the cost function, and updating the adaptive vector for the target speech source and the adaptive vectors for the noise by using the auxiliary function and the steering vector; estimating the target speech signal by using the adaptive vector thereby extracting the target speech signal from the input signals; and updating the steering vector for the target speech source.

Type: Grant

Filed: March 29, 2021

Date of Patent: July 4, 2023

Assignee: INDUSTRY-UNIVERSITY COOPERATION FOUNDATION SOGANG UNIVERSITY

Inventors: Hyung Min Park, Uihyeop Shin
Audio decoding device and method with decoding branches for decoding audio signal encoded in a plurality of domains

Patent number: 11682404

Abstract: An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.

Type: Grant

Filed: September 20, 2022

Date of Patent: June 20, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Bernhard Grill, Roch Lefebvre, Bruno Bessette, Jimmy Lapierre, Philippe Gournay, Redwan Salami, Stefan Bayer, Guillaume Fuchs, Stefan Geyersberger, Ralf Geiger, Johannes Hilpert, Ulrich Kraemer, Jérémie Lecomte, Markus Multrus, Max Neuendorf, Harald Popp, Nikolaus Rettelbach
Audio decoding device and method with decoding branches for decoding audio signal encoded in a plurality of domains

Patent number: 11676611

Abstract: An audio encoder has a first information sink oriented encoding branch such as a spectral domain encoding branch, a second information source or SNR oriented encoding branch such as an LPC-domain encoding branch, and a switch for switching between the first and second encoding branches, the second encoding branch having a converter into a specific domain different from the spectral domain such as an LPC analysis stage generating an excitation signal, and the second encoding branch having a specific domain coding branch such as LPC domain processing branch, and a specific spectral domain coding branch such as LPC spectral domain processing branch, and an additional switch for switching between the specific domain coding branch and the specific spectral domain coding branch. An audio decoder has a first domain decoder, a second domain decoder, and a third domain decoder as well as two cascaded switches for switching between the decoders.

Type: Grant

Filed: September 20, 2022

Date of Patent: June 13, 2023

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Bernhard Grill, Roch Lefebvre, Bruno Bessette, Jimmy Lapierre, Philippe Gournay, Redwan Salami, Stefan Bayer, Guillaume Fuchs, Stefan Geyersberger, Ralf Geiger, Johannes Hilpert, Ulrich Kraemer, Jérémie Lecomte, Markus Multrus, Max Neuendorf, Harald Popp, Nikolaus Rettelbach
Autonomously motile device with noise suppression

Patent number: 11646009

Abstract: A device capable of autonomous motion may move in an environment and may receive audio data from a microphone. A model may be trained to process the audio data to determine mask data, which may be used to mask noise in the audio data. Training data for the model may be normalized before training, and different loss functions may be used for different types of training data.

Type: Grant

Filed: June 16, 2020

Date of Patent: May 9, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Amit Singh Chhetri, Navin Chatlani
Techniques for wake-up word recognition and related systems and methods

Patent number: 11600269

Abstract: A system for detection of at least one designated wake-up word for at least one speech-enabled application. The system comprises at least one microphone; and at least one computer hardware processor configured to perform: receiving an acoustic signal generated by the at least one microphone at least in part as a result of receiving an utterance spoken by a speaker; obtaining information indicative of the speaker's identity; interpreting the acoustic signal at least in part by determining, using the information indicative of the speaker's identity and automated speech recognition, whether the utterance spoken by the speaker includes the at least one designated wake-up word; and interacting with the speaker based, at least in part, on results of the interpreting.

Type: Grant

Filed: June 15, 2016

Date of Patent: March 7, 2023

Assignee: Cerence Operating Company

Inventors: Meik Pfeffinger, Timo Matheja, Tobias Herbig, Tim Haulick
System and method for processing audio data into a plurality of frequency components

Patent number: 11562758

Abstract: An encoder operable to filter audio signals into a plurality of frequency band components, generate quantized digital components for each band, identify a potential for pre-echo events within the generated quantized digital components, generate an approximate signal by decoding the quantized digital components using inverse pulse code modulation, generate an error signal by comparing the approximate signal with the sampled audio signal, and process the error signal and quantized digital components. The encoder operable to process the error signal by processing delayed audio signals and Q band values, determining the potential for pre-echo events from the Q band values, and determining scale factors and MDCT block sizes for the potential for pre-echo events.

Type: Grant

Filed: March 29, 2022

Date of Patent: January 24, 2023

Assignee: IMMERSION NETWORKS, INC.

Inventors: James David Johnston, Stephen Daniel White, King Wei Hor, Barry M. Genova
Method and device for spectral expansion for an audio signal

Patent number: 11551704

Abstract: A method and device for automatically increasing the spectral bandwidth of an audio signal including generating a “mapping” (or “prediction”) matrix based on the analysis of a reference wideband signal and a reference narrowband signal, the mapping matrix being a transformation matrix to predict high frequency energy from a low frequency energy envelope, generating an energy envelope analysis of an input narrowband audio signal, generating a resynthesized noise signal by processing a random noise signal with the mapping matrix and the envelope analysis, high-pass filtering the resynthesized noise signal, and summing the high-pass filtered resynthesized noise signal with the original an input narrowband audio signal. Other embodiments are disclosed.

Type: Grant

Filed: February 28, 2020

Date of Patent: January 10, 2023

Assignee: Staton Techiya, LLC

Inventors: John Usher, Dan Ellis
Voice signal enhancing method and device

Patent number: 11538487

Abstract: The disclosure discloses a voice signal enhancing method and device, which divide a voice signal at the present scene into multiple frame signals based on a preset time interval; feed multiple frame signals into a trained neural network based on a preset step size, perform convolution operations on multiple frame signals through skip-connected convolutional layers to obtain multiple enhanced frame signals; superpose each enhanced frame signal according to the time domain of each enhanced frame signal to obtain an enhanced voice signal. Compared with the prior art, the present disclosure automatically enhances voice signals through the neural network without manual interference, so the effects and the application scenes of voice enhancement is not necessary to be limited by the preset method and method designers, thereby reducing the occurrence frequency of signal distortion and extra noises, which in turn improves the effects of the voice signal enhancement.

Type: Grant

Filed: March 31, 2020

Date of Patent: December 27, 2022

Assignee: YEALINK (XIAMEN) NETWORK TECHNOLOGY CO., LTD.

Inventors: Wanjian Feng, Lianchang Zhang, Jiantao Liu
Deep learning segmentation of audio using magnitude spectrogram

Patent number: 11521630

Abstract: A method, system, and computer readable medium for decomposing an audio signal into different isolated sources. The techniques and mechanisms convert an audio signal into K input spectrogram fragments. The fragments are sent into a deep neural network to isolate for different sources. The isolated fragments are then combined to form full isolated source audio signals.

Type: Grant

Filed: October 2, 2020

Date of Patent: December 6, 2022

Assignee: AUDIOSHAKE, INC.

Inventor: Luke Miner
Speech synthesis statistical model training device, speech synthesis statistical model training method, and computer program product

Patent number: 11423874

Abstract: A speech synthesis model training device includes one or more hardware processors configured to perform the following. Storing, in a speech corpus storing unit, speech data, and pitch mark information and context information of the speech data. From the speech data, analyzing acoustic feature parameters at each pitch mark timing in pitch mark information. From the acoustic feature parameters analyzed, training a statistical model which has a plurality of states and which includes an output distribution of acoustic feature parameters including pitch feature parameters and a duration distribution based on timing parameters.

Type: Grant

Filed: July 29, 2020

Date of Patent: August 23, 2022

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Masatsune Tamura, Masahiro Morita
Media channel identification with video multi-match detection and disambiguation based on audio fingerprint

Patent number: 11412296

Abstract: Disclosed are methods and systems to help disambiguate channel identification in a scenario where a video fingerprint of media content matches multiple reference video fingerprints corresponding respectively with multiple different channels. Given such a multi-match situation, an entity could disambiguate based on an audio component of the media content, such as by further determining that an audio fingerprint of the media content at issue matches an audio fingerprint of just one of the multiple channels, thereby establishing that that is the channel on which the media content being rendered by the media presentation device is arriving.

Type: Grant

Filed: June 30, 2021

Date of Patent: August 9, 2022

Assignee: Roku, Inc.

Inventors: Chung Won Seo, Youngmoo Kwon, Jaehyung Lee
Signaling of synchronization block patterns

Patent number: 11405875

Abstract: The present disclosure describes various examples of a method, an apparatus, and a computer readable medium for signaling synchronization block patterns in wireless communications (e.g., 5th Generation New Radio (5G NR)). For example, one of the methods described may include receiving, by a user equipment (UE), a message including information of a configuration. The configuration includes at least a group of repetitions of one or more synchronization signal (SS) blocks in an SS burst set, and the repetitions of the one or more SS blocks are configured into at least two groups. The method may further include determining, by the UE, which group of the at least two groups to search for during a synchronous neighbor cell search based on the information and at least one condition at the UE.

Type: Grant

Filed: April 10, 2020

Date of Patent: August 2, 2022

Assignee: QUALCOMM Incorporated

Inventors: Sony Akkarakaran, Tao Luo
Method for controlling plurality of voice recognizing devices and electronic device supporting the same

Patent number: 11398230

Abstract: An electronic device includes a display, a microphone, a memory, a communication circuit, and a processor. The processor is configured to display a user interface for adjusting voice recognition sensitivity of each of a plurality of voice recognizing devices configured to start a voice recognition service in response to the same start utterance, through the display, to transmit a value of the changed sensitivity to at least part of the plurality of voice recognizing devices when the voice recognition sensitivity is changed through the user interface, to transmit a signal for waiting to receive a first utterance of a user, to the plurality of voice recognizing devices, to receive utterance information corresponding to the first utterance from the plurality of voice recognizing devices, and to update the user interface based on the utterance information.

Type: Grant

Filed: October 1, 2019

Date of Patent: July 26, 2022

Assignee: Samsung Electronics Co., Ltd.

Inventors: Sungwoon Jang, Sangki Kang, Namkoo Lee, Euisuk Chung
Apparatuses and methods for creating noise environment noisy data and eliminating noise

Patent number: 11393443

Abstract: A data generating apparatus for generating noise environment noisy data is disclosed. The data generating apparatus according to the present application comprises a signal conversion unit configured to convert each of a noisy signal obtained in real environment and an original sound signal for the noisy signal into a noisy signal spectrum and an original sound signal spectrum in a short-time frequency domain; and a noisy signal generation training unit configured to train deep neural network to output the noisy signal spectrum corresponding to each short-time using the original sound signal spectrum as an input.

Type: Grant

Filed: May 29, 2020

Date of Patent: July 19, 2022

Assignee: Agency for Defense Development

Inventors: Hong Kook Kim, Jung Hyuk Lee, Seung Ho Choi, Deokgyu Yun
Signal processing device having multiple acoustic-electric transducers

Patent number: 11373671

Abstract: The present disclosure relates to a device for processing an audio signal. The device may include a first acoustic-electric transducer and a second acoustic-electric transducer. The first acoustic-electric transducer may have a first frequency response, and may be configured to detect the audio signal and generate a first sub-band signal according to the detected audio signal. The second acoustic-electric transducer may have a second frequency response, the second frequency response being different from the first frequency response. The second acoustic-electric transducer may be configured to detect the audio signal and generate a second sub-band signal according to the detected audio signal.

Type: Grant

Filed: March 18, 2020

Date of Patent: June 28, 2022

Assignee: SHENZHEN SHOKZ CO., LTD.

Inventors: Xin Qi, Lei Zhang
Apparatus and methods for watermarking using starting phase modulation

Patent number: 11375224

Abstract: Methods, apparatus, systems and articles of manufacture are disclosed for watermarking using starting phase modulation. An example apparatus includes memory, and processor circuitry to execute instructions to at least determine a first analyzed phase value for a watermark component of a watermarked media signal at a first time, determine a sum of differences for analyzed phase values with respect to a first one of a plurality of possible starting phase values, the analyzed phase values associated with the watermarked media signal, the analyzed phase values including the first analyzed phase value, in response to the sum of differences satisfying a threshold, decode a first data value corresponding to the first one of the possible starting phase values, and determine a watermark payload based on the first data value.

Type: Grant

Filed: July 6, 2020

Date of Patent: June 28, 2022

Assignee: THE NIELSEN COMPANY (US), LLC

Inventors: Alexander Topchy, Vladimir Kuznetsov, Jeremey M. Davis
Echo and near-end crosstalk cancellation system

Patent number: 11362702

Abstract: An echo and near-end cross-talk (NEXT) cancellation system includes a time-domain processing module and a frequency-domain processing module. The time-domain processing module is configured to receive an unprocessed signal after an analog-to-digital conversion, remove at least one time-domain dominant component of interference from the unprocessed signal, and accordingly cancel a time-domain processed signal. The frequency-domain processing module is connected to the time-domain processing module, and configured to receive the time-domain processed signal, cancel at least one frequency-domain component of the interference from the unprocessed signal, and accordingly generate a processed signal.

Type: Grant

Filed: May 31, 2019

Date of Patent: June 14, 2022

Assignee: AIROHA TECHNOLOGY (SUZHOU) LIMITED

Inventors: Chia-Lung Wu, Dong-Ming Chuang
Deep learning segmentation of audio using magnitude spectrogram

Patent number: 11355134

Abstract: A method, system, and computer readable medium for decomposing an audio signal into different isolated sources. The techniques and mechanisms convert an audio signal into K input spectrogram fragments. The fragments are sent into a deep neural network to isolate for different sources. The isolated fragments are then combined to form full isolated source audio signals.

Type: Grant

Filed: October 2, 2020

Date of Patent: June 7, 2022

Assignee: AUDIOSHAKE, INC.

Inventor: Luke Miner
Bandwidth extension method, bandwidth extension apparatus, program, integrated circuit, and audio decoding apparatus

Patent number: 11341977

Abstract: To provide a bandwidth extension method which allows reduction of computation amount in bandwidth extension and suppression of deterioration of quality in the bandwidth to be extended. In the bandwidth extension method: a low frequency bandwidth signal is transformed into a QMF domain to generate a first low frequency QMF spectrum; pitch-shifted signals are generated by applying different shifting factors on the low frequency bandwidth signal; a high frequency QMF spectrum is generated by time-stretching the pitch-shifted signals in the QMF domain; the high frequency QMF spectrum is modified; and the modified high frequency QMF spectrum is combined with the first low frequency QMF spectrum.

Type: Grant

Filed: December 30, 2019

Date of Patent: May 24, 2022

Assignee: PANASONIC INTELLECTUAL PROPERTY CORPORATION OF AMERICA

Inventors: Tomokazu Ishikawa, Takeshi Norimatsu, Huan Zhou, Kok Seng Chong, Haishan Zhong
Signal processing device having multiple acoustic-electric transducers

Patent number: 11335358

Abstract: The present disclosure relates to a device for processing an audio signal. The device may include a first acoustic-electric transducer and a second acoustic-electric transducer. The first acoustic-electric transducer may have a first frequency response, and may be configured to detect the audio signal and generate a first sub-band signal according to the detected audio signal. The second acoustic-electric transducer may have a second frequency response, the second frequency response being different from the first frequency response. The second acoustic-electric transducer may be configured to detect the audio signal and generate a second sub-band signal according to the detected audio signal.

Type: Grant

Filed: March 18, 2020

Date of Patent: May 17, 2022

Assignee: SHENZHEN SHOKZ CO., LTD.

Inventors: Xin Qi, Lei Zhang
Use of cost maps and convergence maps for localization and mapping

Patent number: 11312382

Abstract: A method for ascertaining features in an environment of at least one mobile unit for implementation of a localization and/or mapping by a control unit. In the course of the method, sensor measurement data of the environment are received, the sensor measurement data received are transformed by an alignment algorithm into a cost function and a cost map is generated with the aid of the cost function, a convergence map is generated based on the alignment algorithm. At least one feature is extracted from the cost map and/or the convergence map and stored, the at least one feature being provided in order to optimize a localization and/or mapping. A control unit, a computer program, and a machine-readable storage medium are also described.

Type: Grant

Filed: October 20, 2020

Date of Patent: April 26, 2022

Assignee: Robert Bosch GmbH

Inventors: Philipp Rasp, Carsten Hasberg, Muhammad Sheraz Khan
Acoustic signal processing with neural network using amplitude, phase, and frequency

Patent number: 11282505

Abstract: According to one embodiment, a signal generation device includes one or more processors. The processors convert an acoustic signal and output amplitude and phase at a plurality of frequencies. The processors, for each of a plurality of nodes of a hidden layer included in a neural network that treats the amplitude and the phase as input, obtain frequency based on a plurality of weights used in arithmetic operation of the node. The processors generate an acoustic signal based on the plurality of obtained frequencies and based on amplitude and phase corresponding to each of the plurality of nodes.

Type: Grant

Filed: March 8, 2019

Date of Patent: March 22, 2022

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Daichi Hayakawa, Takehiko Kagoshima, Hiroshi Fujimura
Decoding device, encoding device, decoding method, and encoding method

Patent number: 11257506

Abstract: A decoding device includes: a separating unit separating first encoded data, a spectrum including a low-band spectrum of audio signals having been encoded, and second encoded data, a high-band spectrum of a higher band having been encoded, based on the first encoded data; a first decoding unit decoding the first encoded data and generating a first decoded spectrum; a first amplitude normalizer dividing amplitude of the first decoded spectrum into sub-bands, normalizing the spectrum of each sub-band by the largest amplitude of the first decoded spectrum within each sub-band, and generating a normalized spectrum; an addition unit adding noise spectrum to the normalized spectrum and generating a noise-added normalized spectrum; a second decoding unit decoding the second encoded data using the noise-added normalized spectrum, and generating a second noise-added spectrum; and a converter performing time-frequency conversion regarding a spectrum coupled based on the first decoded spectrum and second noise-added spe

Type: Grant

Filed: January 24, 2020

Date of Patent: February 22, 2022

Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.

Inventors: Takuya Kawashima, Hiroyuki Ehara
End node spectrogram compression for machine learning speech recognition

Patent number: 11227614

Abstract: A system and method of recording and transmitting compressed audio signals over a network is disclosed. The end node device first converts the audio signal to a spectrogram, which is commonly used by machine learning algorithms to perform speech recognition. The end node device then compresses the spectrogram prior to transmission. In certain embodiments, the compression is performed using Discrete Cosine Transforms (DCT). Furthermore, in some embodiments, the DCT is performed on the difference between two columns of the spectrogram. Further, in some embodiments, a function that replaces values below a predetermined threshold with zeroes in the Encoded Spectrogram is utilized. These functions may be performed in hardware or software.

Type: Grant

Filed: June 11, 2020

Date of Patent: January 18, 2022

Assignee: Silicon Laboratories Inc.

Inventors: Antonio Torrini, Sebastian Ahmed
Audio upmixer operable in prediction or non-prediction mode

Patent number: 11217259

Abstract: The invention provides methods and devices for outputting a stereo audio signal having a left channel and a right channel. The apparatus includes a demultiplexer, decoder, and upmixer. The upmixer is configured operate either in a prediction mode or a non-prediction mode based on a parameter encoded in the audio bitstream.

Type: Grant

Filed: July 16, 2020

Date of Patent: January 4, 2022

Assignee: Dolby International AB

Inventors: Heiko Purnhagen, Pontus Carlsson, Lars Villemoes
Classification of piping and instrumental diagram information using machine-learning

Patent number: 11195007

Abstract: Systems and methods for identifying patterns of symbols in standardized system diagrams are disclosed. Disclosed implementations obtain or synthetically generate a symbol recognition training data set including multiple training images, generate a symbol recognition model based on the symbol recognition training data set, obtain an image comprising a pattern of symbols, group symbols into process loops based on the logical relationships captured by process loop identification algorithm, apply a character classification model to image contours to identify the characters and group characters into tags via hierarchical clustering, and store the identified tags, symbols and identified process loops in a relational database.

Type: Grant

Filed: April 5, 2019

Date of Patent: December 7, 2021

Assignee: CHEVRON U.S.A. INC.

Inventors: Paul Duke, Shuxing Cheng
Speech processing device, speech processing method, and computer program product

Patent number: 11170756

Abstract: A speech processing device of an embodiment includes a spectrum parameter calculation unit, a phase spectrum calculation unit, a group delay spectrum calculation unit, a band group delay parameter calculation unit, and a band group delay compensation parameter calculation unit. The spectrum parameter calculation unit calculates a spectrum parameter. The phase spectrum calculation unit calculates a first phase spectrum. The group delay spectrum calculation unit calculates a group delay spectrum from the first phase spectrum based on a frequency component of the first phase spectrum. The band group delay parameter calculation unit calculates a band group delay parameter in a predetermined frequency band from a group delay spectrum. The band group delay compensation parameter calculation unit calculates a band group delay compensation parameter to compensate a difference between a second phase spectrum reconstructed from the band group delay parameter and the first phase spectrum.

Type: Grant

Filed: April 7, 2020

Date of Patent: November 9, 2021

Assignee: KABUSHIKI KAISHA TOSHIBA

Inventors: Masatsune Tamura, Masahiro Morita
Detecting distortion in spread spectrum signals

Patent number: 11146307

Abstract: The invention relates to a method, a circuit, and an apparatus for detecting distortion in spread spectrum signals. An edge in a spread spectrum clock signal is identified based on a reference clock signal. The edge data is then provided to a set of counters which are incremented corresponding to an identified edge. Each bit of a respective output of the counters are provided to a respective OR gate of a set of OR gates. An OR gate from the set of OR gates corresponding to a selected bit then outputs an indication of whether distortion exists in the spread spectrum clock signal.

Type: Grant

Filed: April 13, 2020

Date of Patent: October 12, 2021

Assignee: INTERNATIONAL BUSINESS MACHINES CORPORATION

Inventors: John Borkenhagen, Christopher Steffen, Grant P. Kesselring
Human auditory system modeling with masking energy adaptation

Patent number: 11145317

Abstract: A method for generating a psychoacoustic model from an audio signal transforms a block of samples of an audio signal into a frequency spectrum comprising frequency components. From this frequency spectrum, it derives group masking energies. These group masking energies each correspond to a group of neighboring frequency components in the frequency spectrum. For a group of frequency components, the method allocates the group masking energy to the frequency components in the group in proportion to energy of the frequency components within the group to provide adapted mask energies for the frequency components within the group, the adapted mask energies providing masking thresholds for the psychoacoustic model of the audio signal.

Type: Grant

Filed: August 6, 2018

Date of Patent: October 12, 2021

Assignee: Digimarc Corporation

Inventors: Aparna R. Gurijala, Shankar Thagadur Shivappa, Ravi K. Sharma, Brett A. Bradley

1 2 3 4 5 … next