Audio Signal Bandwidth Compression Or Expansion Patents (Class 704/500)
  • Patent number: 11354355
    Abstract: An apparatus, method, and computer-readable storage medium that generate at least a cue point in a musical piece. The method includes generating a beat grid representing the musical piece, determining values for the beat grid, the values corresponding to an audio feature of the musical piece, and each value representing an entire duration of each beat in the beat grid of the musical piece, calculating a score for the audio feature at each of a plurality of positions in the beat grid of the musical piece, using some or all of the determined values, and generating the cue point at a particular position of the plurality of positions, based on the calculated scores.
    Type: Grant
    Filed: July 10, 2019
    Date of Patent: June 7, 2022
    Assignee: MIXED IN KEY LLC
    Inventor: Martin Douglas
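    Illustration (not taken from the patent): the abstract describes one value per beat, a score at each beat-grid position, and a cue point chosen from those scores. A minimal sketch of that flow follows. The scoring rule (mean of the following beats minus mean of the preceding beats), the `window` parameter, and both function names are assumptions made for this example, not the claimed method.

```python
from typing import List, Sequence


def score_positions(beat_values: Sequence[float], window: int = 4) -> List[float]:
    """Score every beat position in the grid.

    Assumed scoring rule: mean feature value over the next `window` beats
    minus the mean over the previous `window` beats, so a jump in the
    feature (for example, energy) produces a high score.
    """
    scores = []
    for i in range(len(beat_values)):
        before = beat_values[max(0, i - window):i]
        after = beat_values[i:i + window]
        # The first beat has no history; treat its "before" mean as 0.0.
        mean_before = sum(before) / len(before) if before else 0.0
        mean_after = sum(after) / len(after)
        scores.append(mean_after - mean_before)
    return scores


def pick_cue_point(beat_values: Sequence[float], window: int = 4) -> int:
    """Return the beat index with the highest score."""
    scores = score_positions(beat_values, window)
    return max(range(len(scores)), key=scores.__getitem__)


# One value per beat: four quiet beats followed by four loud beats.
beat_energy = [0.1, 0.1, 0.1, 0.1, 0.9, 0.9, 0.9, 0.9]
cue_beat = pick_cue_point(beat_energy)
```

    With this input the highest score falls on the first loud beat (index 4).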
  • Patent number: 11348598
    Abstract: Systems, methods, and devices for monitoring industrial equipment using audio are described herein. One system includes two computing devices. The first computing device can receive, from an audio sensor, audio sensed during operation of industrial equipment, extract a plurality of features from the audio, determine whether any portion of the audio is anomalous, and send, upon determining a portion of the audio is anomalous, the anomalous portion of the audio to the second, remotely located, computing device. The second computing device can provide the anomalous portion of the audio to a user to determine whether the anomalous portion of the audio corresponds to a fault occurring in the equipment, and receive, from the user upon determining the anomalous portion of the audio corresponds to a fault occurring in the equipment, input indicating the anomalous portion of the audio corresponds to the fault to learn fault patterns in the equipment.
    Type: Grant
    Filed: October 22, 2020
    Date of Patent: May 31, 2022
    Assignee: HONEYWELL INTERNATIONAL, INC.
    Inventors: Ramprasad Yelchuru, Shreyan Chowdhury, Pradyumna Sampath
  • Patent number: 11328734
    Abstract: An encoding method for a multi-channel audio signal, an encoding apparatus for performing the encoding method, and a decoding method for a multi-channel audio signal and a decoding apparatus for performing the decoding method are disclosed. A method and apparatus of bypassing an MPEG Surround (MPS) standard operation and using an arbitrary tree when a number of audio signals of N channels exceeds a channel number defined in an MPS standard, is disclosed.
    Type: Grant
    Filed: January 6, 2020
    Date of Patent: May 10, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Jeong Il Seo, Jong Mo Sung, Tae Jin Lee, Jin Soo Choi
  • Patent number: 11325407
    Abstract: A method is provided for extending the frequency band of an audio signal during a decoding or improvement process. The method includes obtaining the decoded signal in a first frequency band, referred to as a low band. Tonal components and a surround signal are extracted from the low-band signal, and the tonal components and the surround signal are combined by adaptive mixing using energy-level control factors to obtain an audio signal, referred to as a combined signal. The low-band decoded signal before the extraction step or the combined signal after the combination step is extended over at least one second frequency band which is higher than the first frequency band. Also provided are a frequency-band extension device which implements the described method and a decoder including a device of this type.
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: May 10, 2022
    Assignee: Koninklijke Philips N.V.
    Inventors: Magdalena Kaniewska, Stephane Ragot
  • Patent number: 11322164
    Abstract: The present document describes a method (400) for encoding a soundfield representation (SR) input signal (101, 301) describing a soundfield at a reference position, wherein the SR input signal (101, 301) comprises a plurality of channels for a plurality of different directivity patterns of the soundfield at the reference position. The method (400) comprises extracting (401) one or more audio objects (103, 303) from the SR input signal (101, 301). Furthermore, the method (400) comprises determining (402) a residual signal (102, 302) based on the SR input signal (101, 301) and based on the one or more audio objects (103, 303). The method (400) also comprises performing joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302). In addition, the method (400) comprises generating (403) a bitstream (701) based on data generated in the context of joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302).
    Type: Grant
    Filed: January 17, 2019
    Date of Patent: May 3, 2022
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Kristofer Kjoerling, David S. McGrath, Heiko Purnhagen, Mark R. P. Thomas
  • Patent number: 11315582
    Abstract: A method for recovering audio signals, a terminal and a storage medium are provided. The method includes: buffering an audio signal sampled at a preset number of sampling points each time, and then performing frequency spectrum analysis on the sampled audio signal by FFT; when it is determined that the audio signal is compressed, filtering a frequency point; recovering high-frequency signals based on audio signals before the frequency point; and performing phase recovery on the high-frequency signals. Thus, compressed high-frequency signals in the audio signals may be recovered.
    Type: Grant
    Filed: November 27, 2018
    Date of Patent: April 26, 2022
    Assignee: GUANGZHOU KUGOU COMPUTER TECHNOLOGY CO., LTD.
    Inventors: Jiaze Liu, Yufei Wang
  • Patent number: 11316490
    Abstract: An audio signal processing device comprises: a receiver for receiving an input audio signal; a processor for generating loudness metadata corresponding to the input audio signal; and an outputter for transmitting the loudness metadata generated by the processor. The processor is configured to acquire loudness information analyzed from input content, acquire loudness information about the input audio signal by measuring the loudness of the input audio signal, generate the loudness metadata by converting the loudness information, and transmit, through the outputter, the generated loudness metadata to an output device for outputting the input audio signal.
    Type: Grant
    Filed: March 12, 2020
    Date of Patent: April 26, 2022
    Assignee: GAUDIO LAB, INC.
    Inventors: Sangbae Chon, Kyutae Park, Younghoon Kwon, Hyunoh Oh, Jeonghun Seo, Taegyu Lee, Hyunjoo Chung
  • Patent number: 11308977
    Abstract: According to an embodiment, the above-described specification discloses an electronic device comprising at least one processor configured to: receive a first audio signal and a second audio signal; detect a spectral envelope signal from the first audio signal and extract a feature point from the second audio signal; extend a high-band of the second audio signal based on the spectral envelope signal from the first audio signal and the feature point from the second audio signal to generate a high-band extension signal; and mix the high-band extension signal and the first audio signal, thereby resulting in a synthesized signal.
    Type: Grant
    Filed: January 3, 2020
    Date of Patent: April 19, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Hangil Moon, Aran Cha, Hwan Shim, Gunwoo Lee, Kyuhan Kim
  • Patent number: 11297398
    Abstract: The disclosed embodiments relate to media devices implementing new television standards, such as ATSC 3.0, which includes audio/video essence and metadata/signaling. The disclosed embodiments include techniques for creating a signaling file that contains metadata and signaling data about the broadcast service being presented. The signaling file may include URLs that can be used to access signaling information for supplementary signaling and content. The signaling also contains a mapping between a first timing system and a second timing system. The first timing system may comprise the watermark timeline represented in intervals and the second timing system may comprise a DASH presentation time.
    Type: Grant
    Filed: June 21, 2018
    Date of Patent: April 5, 2022
    Assignee: Verance Corporation
    Inventors: Jian Zhao, Joseph M. Winograd, Rade Petrovic, Julia Ann Kenyon
  • Patent number: 11297533
    Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.
    Type: Grant
    Filed: June 19, 2020
    Date of Patent: April 5, 2022
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Per Ekstrand
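    Illustration (not taken from the patent): non-uniform quantization places reconstruction levels more densely where precision matters, so fewer indices (and bits) are needed than with a uniform grid of the finest step. The sketch below assumes a parameter in the range 0 to 1 and a hypothetical level table `LEVELS`; the table, the range, and the function names are made up for this example.

```python
from bisect import bisect_left

# Hypothetical reconstruction levels: dense near 0, sparse near 1.
# Seven levels fit in 3 bits; a uniform grid with the finest step used
# here (0.05) would need 21 levels, i.e. 5 bits.
LEVELS = [0.0, 0.05, 0.1, 0.2, 0.4, 0.7, 1.0]


def quantize(value: float) -> int:
    """Return the index of the level nearest to `value` (clamped to the table)."""
    pos = bisect_left(LEVELS, value)
    if pos == 0:
        return 0
    if pos == len(LEVELS):
        return len(LEVELS) - 1
    lower, upper = LEVELS[pos - 1], LEVELS[pos]
    # Ties go to the lower level.
    return pos - 1 if value - lower <= upper - value else pos


def dequantize(index: int) -> float:
    """Map a transmitted index back to its reconstruction level."""
    return LEVELS[index]


small = dequantize(quantize(0.06))  # lands on a fine step near 0
large = dequantize(quantize(0.9))   # lands on a coarse step near 1
```

    The decoder only needs the same table to reconstruct the parameter from the index.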
  • Patent number: 11289104
    Abstract: An apparatus for decoding an encoded audio signal includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating a reconstructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.
    Type: Grant
    Filed: February 26, 2019
    Date of Patent: March 29, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Frederik Nagel, Ralf Geiger, Balaji Nagendran Thoshkahna, Konstantin Schmidt, Stefan Bayer, Christian Neukam, Bernd Edler, Christian Helmrich
  • Patent number: 11270709
    Abstract: There are provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
    Type: Grant
    Filed: November 22, 2017
    Date of Patent: March 8, 2022
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Kristofer Kjoerling, Toni Hirvonen, Lars Villemoes, Dirk Jeroen Breebaart
  • Patent number: 11270711
    Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.
    Type: Grant
    Filed: May 6, 2020
    Date of Patent: March 8, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
  • Patent number: 11271678
    Abstract: An encoding method for a wireless local area network includes encoding a packet via a cyclic coding scheme into a first amount of multiple first blocks; processing each of the first amount of the first blocks into a second amount of multiple second blocks; and encoding the second amount of the second blocks via a block coding scheme.
    Type: Grant
    Filed: July 2, 2020
    Date of Patent: March 8, 2022
    Assignee: Realtek Semiconductor Corp.
    Inventors: John Timothy Coffey, Der-Zheng Liu
  • Patent number: 11264042
    Abstract: An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.
    Type: Grant
    Filed: November 21, 2019
    Date of Patent: March 1, 2022
    Inventors: Sascha Disch, Ralf Geiger, Andreas Niedermeier, Matthias Neusinger, Konstantin Schmidt, Stephan Wilde, Benjamin Schubert, Christian Neukam
  • Patent number: 11264038
    Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
    Type: Grant
    Filed: February 20, 2020
    Date of Patent: March 1, 2022
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Lars Villemoes, Pontus Carlsson
  • Patent number: 11264017
    Abstract: Systems and methods include a plurality of audio input components configured to generate a plurality of audio input signals, and a logic device configured to receive the plurality of audio input signals, determine whether the plurality of audio signals comprise target audio associated with an audio source, estimate a relative location of the audio source with respect to the plurality of audio input components based on the plurality of audio signals and a determination of whether the plurality of audio signals comprise the target audio, and process the plurality of audio signals to generate an audio output signal by enhancing the target audio based on the estimated relative location. The logic device is further configured to use relative transfer-based covariance to construct directional covariance matrix aligned across frequency bands and find a direction that minimizes beam power subject to distortionless criteria.
    Type: Grant
    Filed: June 12, 2020
    Date of Patent: March 1, 2022
    Assignee: SYNAPTICS INCORPORATED
    Inventors: Alireza Masnadi-Shirazi, Francesco Nesta
  • Patent number: 11250863
    Abstract: The techniques disclosed herein provide apparatuses and related methods for the communication of spatial audio and related metadata. In some implementations, a source provides prerecorded spatial audio that has embedded metadata. A computing device processes the prerecorded spatial audio to generate an audio codec that is segmented to include a first section of audio data and a second section that includes metadata extracted from the prerecorded spatial audio. The generated audio codec may be received by a device that includes an encoder. The encoder may process the generated audio codec to generate audio data that includes the metadata.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: February 15, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Brian C. McDowell, Philip Andrew Edry, Ziyad Ibrahim, Robert Norman Heitkamp, Steven Wilssens
  • Patent number: 11244674
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for suppressing hotword triggers when detecting a hotword in recorded media are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio corresponding to playback of an item of media content. The actions further include determining, by the computing device, that the audio includes an utterance of a predefined hotword and that the audio includes an audio watermark. The actions further include analyzing, by the computing device, the audio watermark. The actions further include based on analyzing the audio watermark, determining, by the computing device, whether to perform speech recognition on a portion of the audio following the predefined hotword.
    Type: Grant
    Filed: August 9, 2019
    Date of Patent: February 8, 2022
    Assignee: Google LLC
    Inventor: Ricardo Antonio Garcia
  • Patent number: 11238875
    Abstract: This disclosure provides an encoding method, a decoding method, an encoding apparatus, and a decoding apparatus for a stereo signal. The encoding method includes: performing interpolation processing based on the inter-channel time difference in the current frame and an inter-channel time difference in a previous frame of the current frame; performing time-domain downmixing processing on the stereo signal after the delay alignment in the current frame, to obtain a primary-channel signal and a secondary-channel signal in the current frame; and quantizing the inter-channel time difference after the interpolation processing in the current frame, the primary channel signal and the secondary channel signal.
    Type: Grant
    Filed: January 24, 2020
    Date of Patent: February 1, 2022
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Eyal Shlomot, Haiting Li, Bin Wang
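    Illustration (not taken from the patent): the abstract combines the inter-channel time difference (ITD) of the current and previous frames by interpolation and then downmixes the aligned stereo signal into a primary and a secondary channel. The sketch below assumes linear interpolation with a hypothetical weight `alpha` and a plain sum/difference downmix; both choices and all names are assumptions for this example.

```python
from typing import List, Sequence, Tuple


def interpolate_itd(itd_current: float, itd_previous: float, alpha: float = 0.5) -> float:
    """Linearly interpolate between the previous and current frame ITD.

    `alpha` is a hypothetical weight: 1.0 keeps only the current frame,
    0.0 keeps only the previous frame.
    """
    if not 0.0 <= alpha <= 1.0:
        raise ValueError("alpha must lie in [0, 1]")
    return alpha * itd_current + (1.0 - alpha) * itd_previous


def downmix(left: Sequence[float], right: Sequence[float]) -> Tuple[List[float], List[float]]:
    """Assumed time-domain downmix: primary = (L + R) / 2, secondary = (L - R) / 2."""
    primary = [(l + r) / 2 for l, r in zip(left, right)]
    secondary = [(l - r) / 2 for l, r in zip(left, right)]
    return primary, secondary


# ITD values in samples for two consecutive frames.
itd = interpolate_itd(itd_current=8.0, itd_previous=4.0, alpha=0.5)
primary, secondary = downmix([1.0, 0.5], [1.0, -0.5])
```

    Delay alignment and quantization, which the abstract also mentions, are left out of this sketch.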
  • Patent number: 11232803
    Abstract: An encoding device according to the disclosure includes a first encoding unit that generates a first encoded signal in which a low-band signal having a frequency lower than or equal to a predetermined frequency from a voice or audio input signal is encoded, and a low-band decoded signal; a second encoding unit that encodes, on the basis of the low-band decoded signal, a high-band signal having a band higher than that of the low-band signal to generate a high-band encoded signal; and a first multiplexing unit that multiplexes the first encoded signal and the high-band encoded signal to generate and output an encoded signal. The second encoding unit calculates an energy ratio between a high-band noise component, which is a noise component of the high-band signal, and a high-band non-tonal component of a high-band decoded signal generated from the low-band decoded signal and outputs the ratio as the high-band encoded signal.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: January 25, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Srikanth Nagisetty, Zong Xian Liu, Hiroyuki Ehara
  • Patent number: 11232801
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Grant
    Filed: July 24, 2020
    Date of Patent: January 25, 2022
    Assignee: Dolby International AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 11232802
    Abstract: Methods for converting, encoding, decoding and transcoding an acoustic field, more particularly a first-order Ambisonics three-dimensional acoustic field.
    Type: Grant
    Filed: September 28, 2017
    Date of Patent: January 25, 2022
    Assignee: Coronal Encoding S.A.S.
    Inventors: Benjamin Bernard, Francois Becker
  • Patent number: 11227615
    Abstract: A sound processing apparatus includes: a conversion unit that samples predetermined audio data with a first quantization bit rate and a second quantization bit rate greater than the first quantization bit rate and outputs the respective results as first digital audio data and second digital audio data; a frequency analysis unit that performs frequency analysis on each output from the conversion unit; and a determination unit that performs predetermined determination processing on the basis of analysis results by the frequency analysis unit.
    Type: Grant
    Filed: July 25, 2018
    Date of Patent: January 18, 2022
    Assignee: SONY CORPORATION
    Inventors: Tatsuya Koizumi, Taiji Yoshida
  • Patent number: 11227616
    Abstract: Audio encoder for encoding audio input data to obtain audio output data includes an input interface for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel including audio data of a channel and audio data of at least one object; a core encoder for core encoding core encoder input data; and a metadata compressor for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes.
    Type: Grant
    Filed: February 15, 2019
    Date of Patent: January 18, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Alexander Adami, Christian Borss, Sascha Dick, Christian Ertel, Simone Neukam, Juergen Herre, Johannes Hilpert, Andreas Hoelzer, Michael Kratschmer, Fabian Kuech, Achim Kuntz, Adrian Murtaza, Jan Plogsties, Andreas Silzle, Hanne Stenzel
  • Patent number: 11228856
    Abstract: A method for generating loudspeaker signals associated with a target screen size is disclosed. The method includes receiving a bit stream containing encoded higher order ambisonics signals, the encoded higher order ambisonics signals describing a sound field associated with a production screen size. The method further includes decoding the encoded higher order ambisonics signals to obtain a first set of decoded higher order ambisonics signals representing dominant components of the sound field and a second set of decoded higher order ambisonics signals representing ambient components of the sound field. The method also includes combining the first set of decoded higher order ambisonics signals and the second set of decoded higher order ambisonics signals to produce a combined set of decoded higher order ambisonics signals.
    Type: Grant
    Filed: August 26, 2020
    Date of Patent: January 18, 2022
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Peter Jax, Johannes Boehm, William Redmann
  • Patent number: 11223925
    Abstract: An apparatus configured to: based on (i) captured spatial audio content of a scene comprising audio that is associated with information indicative of at least a direction in the scene from which said audio was captured; and (ii) visual focus information comprising information indicative of at least a first part of the scene on which corresponding captured visual imagery of the scene is focused for presentation to a user; provide for presentation of the captured spatial audio content to accompany the captured visual imagery, the captured spatial audio content presented as spatial audio, the spatial audio content provided for presentation with a spatial audio focus selectively applied to audio captured from a second part of the scene different to the first part, the spatial audio focus comprising an audio-modifying effect to increase the audibility of the audio having a direction corresponding to the second part.
    Type: Grant
    Filed: December 11, 2018
    Date of Patent: January 11, 2022
    Assignee: Nokia Technologies Oy
    Inventors: Lasse Laaksonen, Arto Lehtiniemi, Toni Mäkinen, Mikko Heikkinen
  • Patent number: 11217261
    Abstract: In methods and apparatus and non-transitory memory units for encoding/decoding audio signal information, the encoder side may determine if a signal frame is useful for long term post filtering (LTPF) and/or packet loss concealment (PLC) and may encode information in accordance with the results of the determination, and the decoder side may apply the LTPF and/or PLC in accordance with the information obtained from the encoder.
    Type: Grant
    Filed: May 6, 2020
    Date of Patent: January 4, 2022
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Adrian Tomasek, Manfred Lutzky, Conrad Benndorf
  • Patent number: 11218754
    Abstract: Audio video synchronization and alignment or alignment of audio to some other external clock are rendered more effective or easier by treating fragment grid and frame grid as independent values, but, nevertheless, for each fragment the frame grid is aligned to the respective fragment's beginning. A loss in compression effectiveness may be kept low by appropriately selecting the fragment size. On the other hand, the alignment of the frame grid with respect to the fragments' beginnings allows for an easy and fragment-synchronized way of handling the fragments in connection with, for example, parallel audio video streaming, bitrate adaptive streaming or the like.
    Type: Grant
    Filed: February 7, 2020
    Date of Patent: January 4, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Bernd Czelhan, Harald Fuchs, Ingo Hofmann, Herbert Thoma, Stephan Schreiner
  • Patent number: 11176951
    Abstract: A method for processing a monophonic signal in a 3D audio decoder, including binauralization processing of decoded signals intended to be delivered spatially by a headset. The method is such that, on detection, in a datastream representative of the monophonic signal, of an indication of non-binauralization processing, which indication is associated with spatial delivery position information, the decoded monophonic signal is directed to a stereophonic rendering engine, which takes into account the position information to construct two delivery channels that are directly processed via a direct mixing that sums these two channels with a binauralized signal output from the binauralization processing, in order to be delivered via the headset. A decoder device that implements the processing method is also provided.
    Type: Grant
    Filed: December 7, 2018
    Date of Patent: November 16, 2021
    Assignee: ORANGE
    Inventor: Gregory Pallone
  • Patent number: 11170792
    Abstract: To suitably regulate sound pressure of object content on a receiving side. An audio stream including coded data of a predetermined number of pieces of object content is generated. A container of a predetermined format including the audio stream is transmitted. Information indicating a range within which sound pressure is allowed to increase and decrease for each piece of object content is inserted into a layer of the audio stream and/or a layer of the container. On a receiving side, sound pressure of each piece of object content increases and decreases within the allowable range based on the information.
    Type: Grant
    Filed: December 16, 2019
    Date of Patent: November 9, 2021
    Assignee: SONY CORPORATION
    Inventors: Ikuo Tsukagoshi, Toru Chinen
  • Patent number: 11172293
    Abstract: A low power voice processing system that includes a plurality of non-audio sensors, at least one microphone system, and a plurality of audio modules, at least some of which can be configured in selected modes. A context determination module is connected to the plurality of audio modules, and further connected to receive input from the plurality of non-audio sensors and the at least one microphone system. The context determination module acts to determine use context for the voice processing system and at least in part selects mode operation of at least some of the plurality of audio modules.
    Type: Grant
    Filed: July 10, 2019
    Date of Patent: November 9, 2021
    Assignee: AMBIQ MICRO, INC.
    Inventors: Arpit Shah, Scott McLean Hanson, Stephen Howard Nease
  • Patent number: 11170795
    Abstract: The invention provides a concept for combined dynamic range compression and guided clipping prevention for audio devices. An audio decoder for decoding an audio bitstream and a metadata bitstream related to the audio bitstream according to the concept includes an audio processing chain including a plurality of adjustment stages including a dynamic range control stage for adjusting a dynamic range of the audio output signal and a guided clipping prevention stage for preventing clipping of the audio output signal; and a metadata decoder configured to receive the metadata bitstream and to extract dynamic range control gain sequences and guided clipping prevention gain sequences from the metadata bitstream, at least a part of the dynamic range control gain sequences being supplied to the dynamic range control stage, and at least a part of the guided clipping prevention gain sequences being supplied to the guided clipping prevention stage.
    Type: Grant
    Filed: April 22, 2016
    Date of Patent: November 9, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Fabian Kuech, Christian Uhle, Michael Kratschmer, Bernhard Neugebauer, Michael Meier, Arne Borsum
  • Patent number: 11151176
    Abstract: A method for processing text information is provided, the method including: obtaining text information and extracting text characters contained in the text information; extracting, from the text characters, target characters satisfying a predetermined rule; calculating a filtering index of the text information according to the target characters; and when the filtering index meets a predetermined condition, executing an operation corresponding to the predetermined condition on the text information. In addition, an apparatus for processing text information is further provided. The method and apparatus for processing text information can improve the accuracy and efficiency of filtering out junk text information.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: October 19, 2021
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Shengrong Chen
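The filtering steps in this abstract (extract characters, select target characters by a rule, compute an index, act on a condition) can be sketched as follows. The rule, the index definition, and the threshold are all assumptions made for illustration: here a target character is any digit or punctuation symbol, the index is the fraction of non-whitespace characters that are targets, and `handle` is a hypothetical name.

```python
import re

# Hypothetical "predetermined rule": digits and punctuation/symbols are targets.
TARGET_RULE = re.compile(r"[^\w\s]|\d")


def filtering_index(text: str) -> float:
    """Fraction of non-whitespace characters that satisfy TARGET_RULE."""
    chars = [c for c in text if not c.isspace()]
    if not chars:
        return 0.0
    targets = [c for c in chars if TARGET_RULE.fullmatch(c)]
    return len(targets) / len(chars)


def handle(text: str, threshold: float = 0.4) -> str:
    """Return the operation for the text: 'filter' if the index meets
    the (assumed) threshold condition, otherwise 'keep'."""
    return "filter" if filtering_index(text) >= threshold else "keep"
```

A message such as `"W1N $$$ 9999!!!"` consists mostly of target characters and would be filtered under these assumptions, while ordinary prose would be kept.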
  • Patent number: 11130066
    Abstract: System and method that handles the inconsistent processing of interactive media (e.g. digital games, music visualization software, video playback software, animation software, etc.) and the generally consistent timeline delay to enable synchronization of messages and events with the playback of potentially variable-rate timelines.
    Type: Grant
    Filed: August 29, 2016
    Date of Patent: September 28, 2021
    Assignee: Sonic Bloom, LLC
    Inventor: Eric D. Robinson
  • Patent number: 11133015
    Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
    Type: Grant
    Filed: November 5, 2018
    Date of Patent: September 28, 2021
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Woo-taek Lim, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
  • Patent number: 11113287
    Abstract: A system may include at least one processor. The at least one processor may receive data from a plurality of independent data sources. The data from each respective data source is received at a rate determined by the respective data source. The at least one processor may further write the received data to at least one data store at a rate independent of the respective rates at which data from the plurality of independent data sources is received. A method and computer-readable medium are also disclosed.
    Type: Grant
    Filed: May 11, 2017
    Date of Patent: September 7, 2021
    Assignee: Teradata US, Inc.
    Inventors: Louis B. Landry, Ilsun A. Park, Oliver Ratzesberger
  • Patent number: 11115731
    Abstract: The present disclosure relates to sampling wireless signals received at a receiver. Subsets of the digital samples are multiplied by a Pseudo-Noise (PN) code to generate tone signals. A pattern of tone signals may indicate the presence of a repeating preamble. This may be used to locate a payload that follows the preamble. A tone signal of a payload may be decoded by multiplying a subset of the digital samples with the PN code to generate a tone signal. The tone signal may be transformed into the frequency domain to identify a frequency component having an energy level above a threshold. Using the frequency component, the payload is decoded. By structuring a packet as a series of PN-code modulated tone signals, packets received from different sources may be differentiated even when they arrive at overlapping points of time. This allows for a larger transmitting capacity in a network.
    Type: Grant
    Filed: May 12, 2020
    Date of Patent: September 7, 2021
    Assignee: Amazon Technologies, Inc.
    Inventor: Benjamin Wild
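The despreading step described here, multiplying samples by the PN code to recover a tone and then looking for a strong frequency component, can be illustrated with a small sketch. It assumes chips of +1 or -1, one chip per sample, and a complex tone that falls exactly on a DFT bin; `make_pn_code`, `spread_tone`, and `detect_tone` are hypothetical names, and the seeded random sequence merely stands in for whatever PN code the patent uses.

```python
import cmath
import math
import random
from typing import List, Optional, Sequence


def make_pn_code(length: int, seed: int = 1) -> List[int]:
    """Stand-in PN code: a reproducible pseudo-random sequence of +1/-1 chips."""
    rng = random.Random(seed)
    return [rng.choice((-1, 1)) for _ in range(length)]


def spread_tone(bin_index: int, code: Sequence[int]) -> List[complex]:
    """Transmit side: a complex tone at the given DFT bin, multiplied by the code."""
    n = len(code)
    return [chip * cmath.exp(2j * math.pi * bin_index * k / n)
            for k, chip in enumerate(code)]


def detect_tone(samples: Sequence[complex], code: Sequence[int],
                threshold: float) -> Optional[int]:
    """Receive side: despread, take a DFT, return the strongest bin
    if its energy reaches the threshold, otherwise None."""
    n = len(code)
    # Chips are +1/-1, so multiplying by the same code again undoes the spreading.
    tone = [s * chip for s, chip in zip(samples, code)]
    best_bin, best_energy = None, 0.0
    for b in range(n):
        acc = sum(x * cmath.exp(-2j * math.pi * b * k / n)
                  for k, x in enumerate(tone))
        energy = abs(acc) ** 2 / n
        if energy > best_energy:
            best_bin, best_energy = b, energy
    return best_bin if best_energy >= threshold else None
```

With a matching code the energy collapses into a single bin; with a different code it stays spread across bins, which is the property that lets overlapping packets from different sources be told apart.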
  • Patent number: 11089425
    Abstract: The present invention pertains to an audio playback method and an audio playback apparatus in a 6DoF environment. The audio playback method of the present invention is characterised by comprising: a decoding step of decoding a received audio signal, and outputting the decoded audio signal and metadata; a modelling step of receiving input of position information of a user, checking whether the position of the user has changed from a previous position, and if the position of the user has changed, modelling binaural rendering data so as to correspond to the changed position of the user; and a rendering step of binaural-rendering the decoded audio signal using the modelled rendering data, and outputting the same as a two-channel audio signal.
    Type: Grant
    Filed: November 14, 2017
    Date of Patent: August 10, 2021
    Assignee: LG Electronics Inc.
    Inventors: Tung Chin Lee, Sejin Oh
  • Patent number: 11080010
    Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with audio data, and analyzing the metadata to determine whether loudness parameters for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameters based on the one or more characteristics.
    Type: Grant
    Filed: June 1, 2020
    Date of Patent: August 3, 2021
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Jeffrey Riedmiller, Scott Gregory Norcross, Karl Jonas Roeden
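The branch in this abstract (use the loudness parameter carried in the metadata if present for the device group, otherwise derive one from the group's characteristics) can be sketched as a lookup with a fallback. The metadata layout, the `kind` trait, and the target values below are hypothetical and chosen only for illustration; they are not taken from the patent or from any Dolby specification.

```python
from typing import Mapping

# Hypothetical fallback targets (in LUFS) keyed by a device characteristic.
FALLBACK_TARGETS = {"portable": -16.0, "home_theater": -27.0}
DEFAULT_TARGET = -24.0


def resolve_loudness(metadata: Mapping[str, Mapping[str, float]],
                     group: str,
                     traits: Mapping[str, str]) -> float:
    """Return the loudness parameter for a playback device group.

    If the bitstream metadata carries a value for the group, use it;
    otherwise derive one from the group's characteristics.
    """
    params = metadata.get("loudness", {})
    if group in params:
        return params[group]
    # Parameter absent: fall back to an assumed per-kind target.
    return FALLBACK_TARGETS.get(traits.get("kind", ""), DEFAULT_TARGET)
```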
  • Patent number: 11064069
    Abstract: The present invention provides an IP telephone and the like capable of preventing a loss of the beginning part of a talk. A communication apparatus according to the present invention includes: a memory (110) configured to store communication data with a communication destination; and a control unit (120) configured to transmit and receive the communication data to and from the communication destination in each first duration and store the transmitted and received communication data in the memory (110), then output, after a communication session with an external recording apparatus that records the communication data is established, the communication data stored in the memory (110) to the external recording apparatus in each second duration that is shorter than the first duration.
    Type: Grant
    Filed: November 30, 2018
    Date of Patent: July 13, 2021
    Assignee: NEC Platforms, Ltd.
    Inventor: Kazuhiro Maeda
  • Patent number: 11056101
    Abstract: A method for training hotword detection includes receiving a training input audio sequence including a sequence of input frames that define a hotword that initiates a wake-up process on a device. The method also includes feeding the training input audio sequence into an encoder and a decoder of a memorized neural network. Each of the encoder and the decoder of the memorized neural network include sequentially-stacked single value decomposition filter (SVDF) layers. The method further includes generating a logit at each of the encoder and the decoder based on the training input audio sequence. For each of the encoder and the decoder, the method includes smoothing each respective logit generated from the training input audio sequence, determining a max pooling loss from a probability distribution based on each respective logit, and optimizing the encoder and the decoder based on all max pooling losses associated with the training input audio sequence.
    Type: Grant
    Filed: December 10, 2019
    Date of Patent: July 6, 2021
    Assignee: Google LLC
    Inventors: Raziel Alvarez Guevara, Hyun Jin Park, Patrick Violette
  • Patent number: 11056125
    Abstract: A gain adjustment apparatus for use in decoding of audio that has been encoded with separate gain and shape representations includes an accuracy meter configured to estimate an accuracy measure of the shape representation, and to determine a gain correction based on the estimated accuracy measure. An envelope adjuster further included in the apparatus is configured to adjust the gain representation based on the determined gain correction.
    Type: Grant
    Filed: September 10, 2019
    Date of Patent: July 6, 2021
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Erik Norvell, Volodya Grancharov
  • Patent number: 11049507
    Abstract: Methods, apparatus and articles of manufacture to identify sources of network streaming services are disclosed. An example method includes receiving a first audio signal that represents a decompressed second audio signal, identifying, from the first audio signal, a parameter of an audio compression configuration used to form the decompressed second audio signal, and identifying a source of the decompressed second audio signal based on the identified audio compression configuration.
    Type: Grant
    Filed: January 2, 2019
    Date of Patent: June 29, 2021
    Assignee: Gracenote, Inc.
    Inventors: Zafar Rafii, Markus Cremer, Bongjun Kim
  • Patent number: 11049508
    Abstract: An audio encoder for encoding an audio signal has: a first encoding processor for encoding a first audio signal portion in a frequency domain, having: a time frequency converter for converting the first audio signal portion into a frequency domain representation; an analyzer for analyzing the frequency domain representation to determine first spectral portions to be encoded with a first spectral resolution and second regions to be encoded with a second resolution; and a spectral encoder for encoding the first spectral portions with the first spectral resolution and encoding the second portions with the second resolution; a second encoding processor for encoding a second different audio signal portion in the time domain; a controller for analyzing and determining, which portion of the audio signal is the first audio signal portion encoded in the frequency domain and which portion is the second audio signal portion encoded in the time domain; and an encoded signal former for forming an encoded audio signal havi
    Type: Grant
    Filed: February 26, 2019
    Date of Patent: June 29, 2021
    Inventors: Sascha Disch, Martin Dietz, Markus Multrus, Guillaume Fuchs, Emmanuel Ravelli, Matthias Neusinger, Markus Schnell, Benjamin Schubert, Bernhard Grill
  • Patent number: 11049506
    Abstract: An apparatus for decoding an encoded audio signal, includes: a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions being spectral prediction residual values; a frequency regenerator for generating a reconstructed second spectral portion using a first spectral portion of the first set of first spectral portions, wherein the reconstructed second spectral portion additionally includes spectral prediction residual values; and an inverse prediction filter for performing an inverse prediction over frequency using the spectral residual values for the first set of first spectral portions and the reconstructed second spectral portion using prediction filter information included in the encoded audio signal.
    Type: Grant
    Filed: May 20, 2019
    Date of Patent: June 29, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Frederik Nagel, Ralf Geiger, Balaji Nagendran Thoshkahna, Konstantin Schmidt, Stefan Bayer, Christian Neukam, Bernd Edler, Christian Helmrich
  • Patent number: 11031019
    Abstract: The application relates to HFR (High Frequency Reconstruction/Regeneration) of audio signals. In particular, the application relates to a method and system for performing HFR of audio signals having large variations in energy level across the low frequency range which is used to reconstruct the high frequencies of the audio signal. A system configured to generate a plurality of high frequency subband signals covering a high frequency interval from a plurality of low frequency subband signals is described.
    Type: Grant
    Filed: March 27, 2019
    Date of Patent: June 8, 2021
    Assignee: Dolby International AB
    Inventor: Kristofer Kjoerling
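One way to picture generating high frequency subbands from low frequency subbands whose energies vary widely is a copy-up with per-band energy flattening. This is a sketch under stated assumptions, not the patented method: the abstract does not say how the energy variation is handled, so the flattening rule (scale every source band to the mean low-band energy) and the name `flatten_and_copy_up` are illustrative only.

```python
import math
from typing import List, Sequence


def flatten_and_copy_up(low_bands: Sequence[Sequence[float]],
                        num_high: int) -> List[List[float]]:
    """Build num_high subband signals by copying low subbands cyclically,
    scaling each copy so that all copies share the same mean energy."""
    energies = [sum(x * x for x in band) / len(band) for band in low_bands]
    target = sum(energies) / len(energies)  # assumed flattening target
    high = []
    for i in range(num_high):
        src = i % len(low_bands)
        energy = energies[src]
        # Silent source bands stay silent instead of dividing by zero.
        gain = math.sqrt(target / energy) if energy > 0 else 0.0
        high.append([gain * x for x in low_bands[src]])
    return high
```

Without the flattening gain, a strong low band copied next to a weak one would leave the same large energy step in the reconstructed high range.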
  • Patent number: 11031022
    Abstract: Noise filling of a spectrum of an audio signal is improved in quality with respect to the noise filled spectrum so that the reproduction of the noise filled audio signal is less annoying, by performing the noise filling in a manner dependent on a tonality of the audio signal.
    Type: Grant
    Filed: July 26, 2019
    Date of Patent: June 8, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Marc Gayer, Christian Helmrich, Goran Markovic, Maria Luis Valero
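Tonality-dependent noise filling can be sketched as follows: spectral lines quantized to zero are replaced with noise, and the noise amplitude shrinks as the signal becomes more tonal. The linear scaling rule, the tonality range of 0 to 1, and the function name `noise_fill` are assumptions for illustration; the abstract only states that the filling depends on tonality.

```python
import random
from typing import List, Sequence


def noise_fill(quantized: Sequence[float], noise_level: float,
               tonality: float, seed: int = 0) -> List[float]:
    """Fill zero-quantized spectral lines with uniform noise.

    tonality is clamped to [0, 1]; 1.0 means fully tonal, in which
    case (under this assumed rule) no noise is inserted at all.
    """
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    t = min(1.0, max(0.0, tonality))
    scale = noise_level * (1.0 - t)
    return [q if q != 0 else rng.uniform(-scale, scale) for q in quantized]
```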
  • Patent number: 11011181
    Abstract: An encoder for encoding a parametric spectral representation (ƒ) of auto-regressive coefficients that partially represent an audio signal. The encoder includes a low-frequency encoder configured to quantize elements of a part of the parametric spectral representation that correspond to a low-frequency part of the audio signal. It also includes a high-frequency encoder configured to encode a high-frequency part (ƒH) of the parametric spectral representation (ƒ) by weighted averaging based on the quantized elements (ƒ̂L) flipped around a quantized mirroring frequency (ƒ̂m), which separates the low-frequency part from the high-frequency part, and a frequency grid determined from a frequency grid codebook in a closed-loop search procedure. Described are also a corresponding decoder, corresponding encoding/decoding methods and UEs including such an encoder/decoder.
    Type: Grant
    Filed: March 27, 2020
    Date of Patent: May 18, 2021
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Sigurdur Sverrisson
  • Patent number: 11012802
    Abstract: A computing system that facilitates decoding a spherical harmonics (SH) representation of a three-dimensional sound signal to a binaural sound signal is described herein. The computing system generates a binaural sound signal based upon the SH representation, a tapering window function that is selected based on an SH encoding order of the SH representation, and a coloration compensation filter that incorporates the tapering window function. The computing system causes the binaural sound signal to be played over at least two speakers.
    Type: Grant
    Filed: July 2, 2019
    Date of Patent: May 18, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Christoph Felix Hold, Hannes Gamper, Ville Topias Pulkki, Nikunj Raghuvanshi, Ivan Jelev Tashev