Audio Signal Bandwidth Compression Or Expansion Patents (Class 704/500)
  • Patent number: 11398240
    Abstract: Systems and methods are presented for cross-fading (or other multiple clip processing) of information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Multiple clip processing can be accomplished at a client end according to directions sent from a service provider that specify a combination of (i) the clips involved; (ii) the device on which the cross-fade or other processing is to occur and its parameters; and (iii) the service provider system. For example, a consumer device with only one decoder, can utilize that decoder (typically hardware) to decompress one or more elements that are involved in a cross-fade at faster than real time, thus pre-fetching the next element(s) to be played in the cross-fade at the end of the currently being played element.
    Type: Grant
    Filed: June 9, 2020
    Date of Patent: July 26, 2022
    Assignee: Sirius XM Radio Inc.
    Inventors: Raymond Lowe, Christopher Ward, Charles W. Christine
  • Patent number: 11393480
    Abstract: An inter-channel phase difference (IPD) parameter extraction method and apparatus, where the extraction method includes obtaining a parameter obtaining an information extraction manner for a current frame of a multi-channel signal, obtaining an IPD parameter extraction manner for the current frame based on the parameter obtaining the information extraction manner, where the obtained IPD parameter extraction manner is one of at least two preset IPD parameter extraction manners, and obtaining an IPD parameter of the current frame based on the obtained IPD parameter extraction manner for the current frame.
    Type: Grant
    Filed: November 27, 2018
    Date of Patent: July 19, 2022
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Xingtao Zhang, Haiting Li, Zexin Liu, Lei Miao
  • Patent number: 11392640
    Abstract: Methods, apparatus, systems and articles of manufacture are disclosed to identify media that has been pitch shifted, time shifted, and/or resampled. An example method includes: generating, by executing an instruction with a processor, a fingerprint from an audio signal; transmitting the fingerprint and adjusting instructions to a central facility to facilitate a query, the adjusting instructions identifying at least one of a pitch shift or a time shift; and receiving a response including an identifier for the audio signal and information corresponding to how the audio signal was adjusted; storing information indicative of the identifier and the information into a database.
    Type: Grant
    Filed: November 27, 2019
    Date of Patent: July 19, 2022
    Assignee: Gracenote, Inc.
    Inventors: Robert Coover, Matthew James Wilkinson, Jeffrey Scott, Yongju Hong
  • Patent number: 11393144
    Abstract: A system and a method for rendering an image including an image gateway arranged to receive content data representative of one or more objects and style data representative of one or more desired rendering styles; and an image manipulation engine arranged to process the content data and the style data to render a target image, wherein the target image includes a representation of the one or more objects and includes the one or more desired rendering styles.
    Type: Grant
    Filed: April 11, 2019
    Date of Patent: July 19, 2022
    Assignee: City University of Hong Kong
    Inventors: Qi Liu, Jeremiah B. Palmerston, Qi She, Ho Man Chan
  • Patent number: 11386907
    Abstract: A multi-channel signal encoding method includes determining a downmixed signal of a first channel signal and a second channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter, and quantizing the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.
    Type: Grant
    Filed: September 27, 2019
    Date of Patent: July 12, 2022
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Zexin Liu, Lei Miao
  • Patent number: 11380338
    Abstract: Provided are a signal processing method and apparatus for enhancing sound quality. The signal processing method performed by a signal transmitting apparatus includes determining, based on a plurality of parameters, a valid bandwidth so as to encode an input signal; performing pre-processing on the input signal, based on the valid bandwidth; and encoding the pre-processed input signal, based on the valid bandwidth, and the signal processing method performed by a signal receiving apparatus includes decoding a bitstream or a packet received via a transmission channel; determining a valid bandwidth, based on a plurality of parameters used in the decoding; and performing post-processing on a decoded signal, based on the valid bandwidth.
    Type: Grant
    Filed: September 29, 2020
    Date of Patent: July 5, 2022
    Assignee: SAMSUNG ELECTRONICS CO., LTD.
    Inventors: Ho-sang Sung, Holly Francois, Ki-hyun Choo, Eun-mi Oh, Kyung-hun Jung, Yaodu Wei
  • Patent number: 11374663
    Abstract: A method includes receiving a set of time domain samples representing audio captured using one or more microphones, and generating, from the time domain samples, a spectrum comprising a set of frequency domain coefficients, each coefficient representing a frequency bin corresponding to a range of frequencies. The method also includes adjusting the spectrum to generate a smoothed spectrum, wherein generating the smoothed spectrum includes determining that a magnitude of a first frequency domain coefficient is less than a threshold, and in response, replacing the first frequency domain coefficient by a value computed as a function of a plurality of frequency domain coefficients that include the first frequency domain coefficient. The method further includes generating an audio signal based on time domain samples computed using the smoothed spectrum.
    Type: Grant
    Filed: November 21, 2019
    Date of Patent: June 28, 2022
    Assignee: Bose Corporation
    Inventors: Cristian Marius Hera, Elie Bou Daher
  • Patent number: 11367453
    Abstract: An apparatus for generating an error concealment signal, includes: an LPC representation generator for generating a replacement LPC representation; a gain calculator for calculating a gain information from the LPC representations; a compensator for compensating a gain influence of the replacement LPC representation using the gain information; and an LPC synthesizer for filtering codebook information using the replacement LPC representation to obtain the error concealment signal, wherein the compensator is configured for weighting the codebook information or an LPC synthesis output signal.
    Type: Grant
    Filed: July 8, 2020
    Date of Patent: June 21, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Michael Schnabel, Jérémie Lecomte, Ralph Sperschneider, Manuel Jander
  • Patent number: 11367454
    Abstract: An apparatus for encoding directional audio coding parameters having diffuseness parameters and direction parameters, has: a parameter quantizer for quantizing the diffuseness parameters and the direction parameters; a parameter encoder for encoding quantized diffuseness parameters and quantized direction parameters; and an output interface for generating an encoded parameter representation having information on encoded diffuseness parameters and encoded direction parameters.
    Type: Grant
    Filed: May 6, 2020
    Date of Patent: June 21, 2022
    Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
    Inventors: Guillaume Fuchs, Jürgen Herre, Fabian Küch, Stefan Döhla, Markus Multrus, Oliver Thiergart, Oliver Wübbolt, Florin Ghido, Stefan Bayer, Wolfgang Jaegers
  • Patent number: 11363398
    Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.
    Type: Grant
    Filed: December 10, 2015
    Date of Patent: June 14, 2022
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos
  • Patent number: 11354355
    Abstract: An apparatus, method, and computer-readable storage medium that generate at least a cue point in a musical piece. The method includes generating a beat grid representing the musical piece, determining values for the beat grid, the values corresponding to an audio feature of the musical piece, and each value representing an entire duration of each beat in the beat grid of the musical piece, calculating a score for the audio feature at each of a plurality of positions in the beat grid of the musical piece, using some or all of the determined values, and generating the cue point at a particular position of the plurality of positions, based on the calculated scores.
    Type: Grant
    Filed: July 10, 2019
    Date of Patent: June 7, 2022
    Assignee: MIXED IN KEY LLC
    Inventor: Martin Douglas
  • Patent number: 11348598
    Abstract: Systems, methods, and devices for monitoring industrial equipment using audio are described herein. One system includes two computing devices. The first computing device can receive, from an audio sensor, audio sensed during operation of industrial equipment, extract a plurality of features from the audio, determine whether any portion of the audio is anomalous, and send, upon determining a portion of the audio is anomalous, the anomalous portion of the audio to the second, remotely located, computing device. The second computing device can provide the anomalous portion of the audio to a user to determine whether the anomalous portion of the audio corresponds to a fault occurring in the equipment, and receive, from the user upon determining the anomalous portion of the audio corresponds to a fault occurring in the equipment, input indicating the anomalous portion of the audio corresponds to the fault to learn fault patterns in the equipment.
    Type: Grant
    Filed: October 22, 2020
    Date of Patent: May 31, 2022
    Assignee: HONEYWELL INTERNATIONA, INC.
    Inventors: Ramprasad Yelchuru, Shreyan Chowdhury, Pradyumna Sampath
  • Patent number: 11325407
    Abstract: A method is provided for extending the frequency band of an audio signal during a decoding or improvement process. The method includes obtaining the decoded signal in a first frequency band, referred to as a low band. Tonal components and a surround signal are extracted from the signal from the low-band signal, and the tonal components and the surround signal are combined by adaptive mixing using energy-level control factors to obtain an audio signal, referred to as a combined signal. The low-band decoded signal before the extraction step or the combined signal after the combination step are extended over at least one second frequency band which is higher than the first frequency band. Also proved are a frequency-band extension device which implements the described method and a decoder including a device of this type.
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: May 10, 2022
    Assignee: Koninklijke Philips N.V.
    Inventors: Magdalena Kaniewska, Stephane Ragot
  • Patent number: 11328734
    Abstract: An encoding method for a multi-channel audio signal, an encoding apparatus for performing the encoding method, and a decoding method for a multi-channel audio signal and a decoding apparatus for performing the decoding method are disclosed. A method and apparatus of bypassing an MPEG Surround (MPS) standard operation and using an arbitrary tree when a number of audio signals of N channels exceeds a channel number defined in an MPS standard, is disclosed.
    Type: Grant
    Filed: January 6, 2020
    Date of Patent: May 10, 2022
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Jeong Il Seo, Jong Mo Sung, Tae Jin Lee, Jin Soo Choi
  • Patent number: 11322164
    Abstract: The present document describes a method (400) for encoding a soundfield representation (SR) input signal (101, 301) describing a soundfield at a reference position, wherein the SR input signal (101, 301) comprises a plurality of channels for a plurality of different directivity patterns of the soundfield at the reference position. The method (400) comprises extracting (401) one or more audio objects (103, 303) from the SR input signal (101, 301). Furthermore, the method (400) comprises determining (402) a residual signal (102, 302) based on the SR input signal (101, 301) and based on the one or more audio objects (103, 303). The method (400) also comprises performing joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302). In addition, the method (400) comprises generating (403) a bitstream (701) based on data generated in the context of joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302).
    Type: Grant
    Filed: January 17, 2019
    Date of Patent: May 3, 2022
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Kristofer Kjoerling, David S. McGrath, Heiko Purnhagen, Mark R. P. Thomas
  • Patent number: 11315582
    Abstract: A method for recovering audio signals, a terminal and a storage medium are provided. The method includes: buffering an audio signal sampled at a preset number of sampling points each time, and then performing frequency spectrum analysis on the sampled audio signal by FFT; when it is determined that the audio signal is compressed, filtering a frequency point; recovering high-frequency signals based on audio signals before the frequency point; and performing phase recovery on the high-frequency signals. Thus, compressed high-frequency signals in the audio signals may be recovered.
    Type: Grant
    Filed: November 27, 2018
    Date of Patent: April 26, 2022
    Assignee: GUANGZHOU KUGOU COMPUTER TECHNOLOGY CO., LTD.
    Inventors: Jiaze Liu, Yufei Wang
  • Patent number: 11316490
    Abstract: An audio signal processing device comprises: a receiver for receiving an input audio signal; a processor for generating loudness metadata corresponding to the input audio signal; and an outputter for transmitting the loudness metadata generated by the processor. The processor is configured to acquire loudness information analyzed from input content, acquires loudness information about the input audio signal by measuring the loudness of the input audio signal, generates the loudness metadata by converting the loudness information, and transmits, through the outputter, the generated loudness metadata to an output device for outputting the input audio signal.
    Type: Grant
    Filed: March 12, 2020
    Date of Patent: April 26, 2022
    Assignee: GAUDIO LAB, INC.
    Inventors: Sangbae Chon, Kyutae Park, Younghoon Kwon, Hyunoh Oh, Jeonghun Seo, Taegyu Lee, Hyunjoo Chung
  • Patent number: 11308977
    Abstract: According to an embodiment, the above-described specification discloses an electronic device comprises at least one processor configured to: receive a first audio signal and a second audio signal; detect a spectral envelope signal from the first audio signal and extract a feature point from the second audio signal; extend a high-band of the second audio signal based on the spectral envelope signal from the first audio signal and the feature point from the second audio signal to generate a high-band extension signal; and mix the high-band extension signal and the first audio signal, thereby resulting in a synthesized signal.
    Type: Grant
    Filed: January 3, 2020
    Date of Patent: April 19, 2022
    Assignee: Samsung Electronics Co., Ltd.
    Inventors: Hangil Moon, Aran Cha, Hwan Shim, Gunwoo Lee, Kyuhan Kim
  • Patent number: 11297398
    Abstract: The disclosed embodiments relate to media devices implementing new television standards, such as ATSC 3.0, which includes audio/video essence and metadata/signaling. The disclosed embodiments include techniques for creating a signaling file that contains metadata and signaling data about the broadcast service being presented. The signaling file may include URLs that can be used to access signaling information for supplementary signaling and content. The signaling also contains a mapping between a first timing system and a second timing system. The first timing system may comprise the watermark timeline represented in interval s and the second timing system may comprise a DASH presentation time.
    Type: Grant
    Filed: June 21, 2018
    Date of Patent: April 5, 2022
    Assignee: Verance Corporation
    Inventors: Jian Zhao, Joseph M. Winograd, Rade Petrovic, Julia Ann Kenyon
  • Patent number: 11297533
    Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.
    Type: Grant
    Filed: June 19, 2020
    Date of Patent: April 5, 2022
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Per Ekstrand
  • Patent number: 11289104
    Abstract: An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating every constructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.
    Type: Grant
    Filed: February 26, 2019
    Date of Patent: March 29, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Sascha Disch, Frederik Nagel, Ralf Geiger, Balaji Nagendran Thoshkahna, Konstantin Schmidt, Stefan Bayer, Christian Neukam, Bernd Edler, Christian Helmrich
  • Patent number: 11270711
    Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.
    Type: Grant
    Filed: May 6, 2020
    Date of Patent: March 8, 2022
    Assignee: Qualcomm Incorproated
    Inventors: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
  • Patent number: 11270709
    Abstract: There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M?N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
    Type: Grant
    Filed: November 22, 2017
    Date of Patent: March 8, 2022
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Kristofer Kjoerling, Toni Hirvonen, Lars Villemoes, Dirk Jeroen Breebaart
  • Patent number: 11271678
    Abstract: An encoding method for a wireless local area network includes encoding a packet via a cyclic coding scheme into a first amount of multiple first blocks; processing each of the first amount of the first blocks into a second amount of multiple second blocks; and encoding the second amount of the second blocks via a block coding scheme.
    Type: Grant
    Filed: July 2, 2020
    Date of Patent: March 8, 2022
    Assignee: Realtek Semiconductor Corp.
    Inventors: John Timothy Coffey, Der-Zheng Liu
  • Patent number: 11264042
    Abstract: An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.
    Type: Grant
    Filed: November 21, 2019
    Date of Patent: March 1, 2022
    Inventors: Sascha Disch, Ralf Geiger, Andreas Niedermeier, Matthias Neusinger, Konstantin Schmidt, Stephan Wilde, Benjamin Schubert, Christian Neukam
  • Patent number: 11264017
    Abstract: Systems and methods include a plurality of audio input components configured to generate a plurality of audio input signals, and a logic device configured to receive the plurality of audio input signals, determine whether the plurality of audio signals comprise target audio associated with an audio source, estimate a relative location of the audio source with respect to the plurality of audio input components based on the plurality of audio signals and a determination of whether the plurality of audio signals comprise the target audio, and process the plurality of audio signals to generate an audio output signal by enhancing the target audio based on the estimated relative location. The logic device is further configured to use relative transfer-based covariance to construct directional covariance matrix aligned across frequency bands and find a direction that minimizes beam power subject to distortionless criteria.
    Type: Grant
    Filed: June 12, 2020
    Date of Patent: March 1, 2022
    Assignee: SYNAPTICS INCORPORATED
    Inventors: Alireza Masnadi-Shirazi, Francesco Nesta
  • Patent number: 11264038
    Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.
    Type: Grant
    Filed: February 20, 2020
    Date of Patent: March 1, 2022
    Assignee: Dolby International AB
    Inventors: Heiko Purnhagen, Lars Villemoes, Pontus Carlsson
  • Patent number: 11250863
    Abstract: The techniques disclosed herein provide apparatuses and related methods for the communication of spatial audio and related metadata. In some implementations, a source provides prerecorded spatial audio that has embedded metadata. A computing device processes the prerecorded spatial audio to generate an audio codec that is segmented to include a first section of audio data and a second section that includes metadata extracted from the prerecorded spatial audio. The generated audio codec may be received by a device that includes an encoder. The encoder may process the generated audio codec to generate audio data that includes the metadata.
    Type: Grant
    Filed: December 17, 2019
    Date of Patent: February 15, 2022
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Brian C. McDowell, Philip Andrew Edry, Ziyad Ibrahim, Robert Norman Heitkamp, Steven Wilssens
  • Patent number: 11244674
    Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for suppressing hotword triggers when detecting a hotword in recorded media are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio corresponding to playback of an item of media content. The actions further include determining, by the computing device, that the audio includes an utterance of a predefined hotword and that the audio includes an audio watermark. The actions further include analyzing, by the computing device, the audio watermark. The actions further include based on analyzing the audio watermark, determining, by the computing device, whether to perform speech recognition on a portion of the audio following the predefined hotword.
    Type: Grant
    Filed: August 9, 2019
    Date of Patent: February 8, 2022
    Assignee: Google LLC
    Inventor: Ricardo Antonio Garcia
  • Patent number: 11238875
    Abstract: This disclosure provides an encoding method, a decoding method, an encoding apparatus, and a decoding apparatus for a stereo signal. The encoding method includes: performing interpolation processing based on the inter-channel time difference in the current frame and an inter-channel time difference in a previous frame of the current frame; performing time-domain downmixing processing on the stereo signal after the delay alignment in the current frame, to obtain a primary-channel signal and a secondary-channel signal in the current frame; and quantizing the inter-channel time difference after the interpolation processing in the current frame, the primary channel signal and the secondary channel signal.
    Type: Grant
    Filed: January 24, 2020
    Date of Patent: February 1, 2022
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Eyal Shlomot, Halting Li, Bin Wang
  • Patent number: 11232802
    Abstract: Methods for converting, encoding, decoding and transcoding an acoustic field, more particularly a first-order Ambisonics three-dimensional acoustic field.
    Type: Grant
    Filed: September 28, 2017
    Date of Patent: January 25, 2022
    Assignee: Coronal Encoding S.A.S.
    Inventors: Benjamin Bernard, Francois Becker
  • Patent number: 11232803
    Abstract: An encoding device according to the disclosure includes a first encoding unit that generates a first encoded signal in which a low-band signal having a frequency lower than or equal to a predetermined frequency from a voice or audio input signal is encoded, and a low-band decoded signal; a second encoding unit that encodes, on the basis of the low-band decoded signal, a high-band signal having a band higher than that of the low-band signal to generate a high-band encoded signal; and a first multiplexing unit that multiplexes the first encoded signal and the high-band encoded signal to generate and output an encoded signal. The second encoding unit calculates an energy ratio between a high-band noise component, which is a noise component of the high-band signal, and a high-band non-tonal component of a high-band decoded signal generated from the low-band decoded signal and outputs the ratio as the high-band encoded signal.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: January 25, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Srikanth Nagisetty, Zong Xian Liu, Hiroyuki Ehara
  • Patent number: 11232801
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Grant
    Filed: July 24, 2020
    Date of Patent: January 25, 2022
    Assignee: Dolby International AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 11228856
    Abstract: A method for generating loudspeaker signals associated with a target screen size is disclosed. The method includes receiving a bit stream containing encoded higher order ambisonics signals, the encoded higher order ambisonics signals describing a sound field associated with a production screen size. The method further includes decoding the encoded higher order ambisonics signals to obtain a first set of decoded higher order ambisonics signals representing dominant components of the sound field and a second set of decoded higher order ambisonics signals representing ambient components of the sound field. The method also includes combining the first set of decoded higher order ambisonics signals and the second set of decoded higher order ambisonics signals to produce a combined set of decoded higher order ambisonics signals.
    Type: Grant
    Filed: August 26, 2020
    Date of Patent: January 18, 2022
    Assignee: Dolby Laboratories Licensing Corporation
    Inventors: Peter Jax, Johannes Boehm, William Redmann
  • Patent number: 11227615
    Abstract: A sound processing apparatus includes: a conversion unit that samples predetermined audio data with a first quantization bit rate and a second quantization bit rate greater than the first quantization bit rate and outputs the respective results as first digital audio data and second digital audio data; a frequency analysis unit that performs frequency analysis on each output from the conversion unit; and a determination unit that performs predetermined determination processing on the basis of analysis results by the frequency analysis unit.
    Type: Grant
    Filed: July 25, 2018
    Date of Patent: January 18, 2022
    Assignee: SONY CORPORATION
    Inventors: Tatsuya Koizumi, Taiji Yoshida
  • Patent number: 11227616
    Abstract: Audio encoder for encoding audio input data to obtain audio output data includes an input interface for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel including audio data of a channel and audio data of at least one object; a core encoder for core encoding core encoder input data; and a metadata compressor for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes.
    Type: Grant
    Filed: February 15, 2019
    Date of Patent: January 18, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Alexander Adami, Christian Borss, Sascha Dick, Christian Ertel, Simone Neukam, Juergen Herre, Johannes Hilpert, Andreas Hoelzer, Michael Kratschmer, Fabian Kuech, Achim Kuntz, Adrian Murtaza, Jan Plogsties, Andreas Silzle, Hanne Stenzel
  • Patent number: 11223925
    Abstract: An apparatus configured to: based on (i) captured spatial audio content of a scene comprising audio that is associated with information indicative of at least a direction in the scene from which said audio was captured; and (ii) visual focus information comprising information indicative of at least a first part of the scene on which corresponding captured visual imagery of the scene is focused for presentation to a user; provide for presentation of the captured spatial audio content to accompany the captured visual imagery, the captured spatial audio content presented as spatial audio, the spatial audio content provided for presentation with a spatial audio focus selectively applied to audio captured from a second part of the scene different to the first part, the spatial audio focus comprising an audio-modifying effect to increase the audibility of the audio having a direction corresponding to the second part.
    Type: Grant
    Filed: December 11, 2018
    Date of Patent: January 11, 2022
    Assignee: Nokia Technologies Oy
    Inventors: Lasse Laaksonen, Arto Lehtiniemi, Toni Mäkinen, Mikko Heikkinen
  • Patent number: 11217261
    Abstract: In methods and apparatus and non-transitory memory units for encoding/decoding audio signal information, the encoder side may determine if a signal frame is useful for long term post filtering and/or packet lost concealment and may encode information in accordance to the results of the determination, and the decoder side may apply the LTPF and/or PLC in accordance to the information obtained from the encoder.
    Type: Grant
    Filed: May 6, 2020
    Date of Patent: January 4, 2022
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Adrian Tomasek, Manfred Lutzky, Conrad Benndorf
  • Patent number: 11218754
    Abstract: Audio video synchronization and alignment or alignment of audio to some other external clock are rendered more effective or easier by treating fragment grid and frame grid as independent values, but, nevertheless, for each fragment the frame grid is aligned to the respective fragment's beginning. A compression effectiveness lost may be kept low when appropriately selecting the fragment size. On the other hand, the alignment of the frame grid with respect to the fragments' beginnings allows for an easy and fragment-synchronized way of handling the fragments in connection with, for example, parallel audio video streaming, bitrate adaptive streaming or the like.
    Type: Grant
    Filed: February 7, 2020
    Date of Patent: January 4, 2022
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Bernd Czelhan, Harald Fuchs, Ingo Hofmann, Herbert Thoma, Stephan Schreiner
  • Patent number: 11176951
    Abstract: A method for processing a monophonic signal in a 3D audio decoder, including processing binauralizing decoded signals intended to be delivered spatially by a headset. The method is such that, on detection, in a datastream representative of the monophonic signal, of an indication of non-binauralization processing, which indication is associated with spatial delivery position information, the decoded monophonic signal is directed to a stereophonic rendering engine, which takes into account the position information to construct two delivery channels that are directly processed via a direct mixing that sums these two channels with a binauralized signal output from the binauralization processing, in order to be delivered via the headset. A decoder device that implements the processing method is also provided.
    Type: Grant
    Filed: December 7, 2018
    Date of Patent: November 16, 2021
    Assignee: ORANGE
    Inventor: Gregory Pallone
  • Patent number: 11172293
    Abstract: A low power voice processing system that includes a plurality of non-audio sensors, at least one microphone system, and a plurality of audio modules, at least some of which can be configured in selected modes. A context determination module is connected to the plurality of audio modules, and further connected to receive input from the plurality of non-audio sensors and the at least one microphone system. The context determination module acts to determine use context for the voice processing system and at least in part selects mode operation of at least some of the plurality of audio modules.
    Type: Grant
    Filed: July 10, 2019
    Date of Patent: November 9, 2021
    Assignee: AMBIQ MICRO, INC.
    Inventors: Arpit Shah, Scott McLean Hanson, Stephen Howard Nease
  • Patent number: 11170795
    Abstract: The invention provides a concept for combined dynamic range compression and guided clipping prevention for audio devices. An audio decoder for decoding an audio bitstream and a metadata bitstream related to the audio bitstream according to the concept includes an audio processing chain including a plurality of adjustment stages including a dynamic range control stage for adjusting a dynamic range of the audio output signal and a guided clipping prevention stage for preventing clipping of the audio output signal; and a metadata decoder configured to receive the metadata bitstream and to extract dynamic range control gain sequences and guided clipping prevention gain sequences from the metadata bitstream, at least a part of the dynamic range control gain sequences being supplied to the dynamic range control stage, and at least a part of the guided clipping prevention gain sequences being supplied to the guided clipping prevention stage.
    Type: Grant
    Filed: April 22, 2016
    Date of Patent: November 9, 2021
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Fabian Kuech, Christian Uhle, Michael Kratschmer, Bernhard Neugebauer, Michael Meier, Arne Borsum
  • Patent number: 11170792
    Abstract: To suitably regulate sound pressure of object content on a receiving side. An audio stream including coded data of a predetermined number of pieces of object content is generated. A container of a predetermined format including the audio stream is transmitted. Information indicating a range within which sound pressure is allowed to increase and decrease for each piece of object content is inserted into a layer of the audio stream and/or a layer of the container. On a receiving side, sound pressure of each piece of object content increases and decreases within the allowable range based on the information.
    Type: Grant
    Filed: December 16, 2019
    Date of Patent: November 9, 2021
    Assignee: SONY CORPORATION
    Inventors: Ikuo Tsukagoshi, Toru Chinen
  • Patent number: 11151176
    Abstract: A method for processing text information is provided, the method including: obtaining text information and extracting text characters contained in the text information; extracting, from the text characters, target characters satisfying a predetermined rule; calculating a filtering index of the text information according to the target characters; and when the filtering index meets a predetermined condition, executing an operation corresponding to the predetermined condition on the text information. In addition, an apparatus for processing text information is further provided. The method and apparatus for processing text information can improve the accuracy and efficiency of filtering out junk text information.
    Type: Grant
    Filed: June 28, 2019
    Date of Patent: October 19, 2021
    Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITED
    Inventor: Shengrong Chen
  • Patent number: 11130066
    Abstract: System and method that handles the inconsistent processing of interactive media (e.g. digital games, music visualization software, video playback software, animation software, etc.) and the generally consistent timeline delay to enable synchronization of messages and events with the playback of potentially variable-rate timelines.
    Type: Grant
    Filed: August 29, 2016
    Date of Patent: September 28, 2021
    Assignee: Sonic Bloom, LLC
    Inventor: Eric D. Robinson
  • Patent number: 11133015
    Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.
    Type: Grant
    Filed: November 5, 2018
    Date of Patent: September 28, 2021
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Seung Kwon Beack, Woo-taek Lim, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
  • Patent number: 11113287
    Abstract: A system may include at least one processor. The at least one processor may receive data from a plurality of independent data sources. The data from each respective data source is received at a rate determined by the respective data source. The at least one processor may further write the received data to at least one data store at a rate independent of the respective rates at which data from the plurality of independent data sources is received. A method and computer-readable medium are also disclosed.
    Type: Grant
    Filed: May 11, 2017
    Date of Patent: September 7, 2021
    Assignee: Teradata US, Inc.
    Inventors: Louis B. Landry, Ilsun A. Park, Oliver Ratzesberger
  • Patent number: 11115731
    Abstract: The present disclosure relates to sampling wireless signals received at a receiver. Subsets of the digital samples are multiplied by a Pseudo-Noise (PN) code to generate tone signals. A pattern of tone signals may indicate the presence of a repeating preamble. This may be used to locate a payload that follows the preamble. A tone signal of a payload may be decoded by multiplying a subset of the digital samples with the PN code to generate a tone signal. The tone signal may be transformed into the frequency domain to identify a frequency component having an energy level above a threshold. Using the frequency component, the payload is decoded. By structuring a packet as a series of PN-code modulated tone signals, packets received from different sources may be differentiated even when they arrive at overlapping points of time. This allows for a larger transmitting capacity in a network.
    Type: Grant
    Filed: May 12, 2020
    Date of Patent: September 7, 2021
    Assignee: Amazon Technologies, Inc.
    Inventor: Benjamin Wild
  • Patent number: 11089425
    Abstract: The present invention pertains to an audio playback method and an audio playback apparatus in a 6DoF environment. The audio playback method of the present invention is characterised by comprising: a decoding step of decoding a received audio signal, and outputting the decoded audio signal and metadata; a modelling step of receiving input of position information of a user, checking whether the position of the user has changed from a previous position, and if the position of the user has changed, modelling binaural rendering data so as to correspond to the changed position of the user; and a rendering step of binaural-rendering the decoded audio signal using the modelled rendering data, and outputting the same as a two-channel audio signal.
    Type: Grant
    Filed: November 14, 2017
    Date of Patent: August 10, 2021
    Assignee: LG Electronics Inc.
    Inventors: Tung Chin Lee, Sejin Oh
  • Patent number: 11080010
    Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether a loudness parameter for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameter based on the one or more characteristics.
    Type: Grant
    Filed: June 1, 2020
    Date of Patent: August 3, 2021
    Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
    Inventors: Jeffrey Riedmiller, Scott Gregory Norcross, Karl Jonas Roeden