Audio Signal Bandwidth Compression Or Expansion Patents (Class 704/500)
-
Patent number: 11398240Abstract: Systems and methods are presented for cross-fading (or other multiple clip processing) of information streams on a user or client device, such as a telephone, tablet, computer or MP3 player, or any consumer device with audio playback. Multiple clip processing can be accomplished at a client end according to directions sent from a service provider that specify a combination of (i) the clips involved; (ii) the device on which the cross-fade or other processing is to occur and its parameters; and (iii) the service provider system. For example, a consumer device with only one decoder, can utilize that decoder (typically hardware) to decompress one or more elements that are involved in a cross-fade at faster than real time, thus pre-fetching the next element(s) to be played in the cross-fade at the end of the currently being played element.Type: GrantFiled: June 9, 2020Date of Patent: July 26, 2022Assignee: Sirius XM Radio Inc.Inventors: Raymond Lowe, Christopher Ward, Charles W. Christine
-
Patent number: 11393480Abstract: An inter-channel phase difference (IPD) parameter extraction method and apparatus, where the extraction method includes obtaining a parameter obtaining an information extraction manner for a current frame of a multi-channel signal, obtaining an IPD parameter extraction manner for the current frame based on the parameter obtaining the information extraction manner, where the obtained IPD parameter extraction manner is one of at least two preset IPD parameter extraction manners, and obtaining an IPD parameter of the current frame based on the obtained IPD parameter extraction manner for the current frame.Type: GrantFiled: November 27, 2018Date of Patent: July 19, 2022Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Xingtao Zhang, Haiting Li, Zexin Liu, Lei Miao
-
Patent number: 11392640Abstract: Methods, apparatus, systems and articles of manufacture are disclosed to identify media that has been pitch shifted, time shifted, and/or resampled. An example method includes: generating, by executing an instruction with a processor, a fingerprint from an audio signal; transmitting the fingerprint and adjusting instructions to a central facility to facilitate a query, the adjusting instructions identifying at least one of a pitch shift or a time shift; and receiving a response including an identifier for the audio signal and information corresponding to how the audio signal was adjusted; storing information indicative of the identifier and the information into a database.Type: GrantFiled: November 27, 2019Date of Patent: July 19, 2022Assignee: Gracenote, Inc.Inventors: Robert Coover, Matthew James Wilkinson, Jeffrey Scott, Yongju Hong
-
Patent number: 11393144Abstract: A system and a method for rendering an image including an image gateway arranged to receive content data representative of one or more objects and style data representative of one or more desired rendering styles; and an image manipulation engine arranged to process the content data and the style data to render a target image, wherein the target image includes a representation of the one or more objects and includes the one or more desired rendering styles.Type: GrantFiled: April 11, 2019Date of Patent: July 19, 2022Assignee: City University of Hong KongInventors: Qi Liu, Jeremiah B. Palmerston, Qi She, Ho Man Chan
-
Patent number: 11386907Abstract: A multi-channel signal encoding method includes determining a downmixed signal of a first channel signal and a second channel signal, an initial reverberation gain parameter of the first channel signal and the second channel signal, determining a target reverberation gain parameter of the first channel signal and the second channel signal based on a correlation between the first channel signal and the downmixed signal, a correlation between the second channel signal and the downmixed signal, and the initial reverberation gain parameter, and quantizing the first channel signal and the second channel signal based on the downmixed signal and the target reverberation gain parameter, and writing a quantized first channel signal and a quantized second channel signal into a bitstream.Type: GrantFiled: September 27, 2019Date of Patent: July 12, 2022Assignee: HUAWEI TECHNOLOGIES CO., LTD.Inventors: Zexin Liu, Lei Miao
-
Patent number: 11380338Abstract: Provided are a signal processing method and apparatus for enhancing sound quality. The signal processing method performed by a signal transmitting apparatus includes determining, based on a plurality of parameters, a valid bandwidth so as to encode an input signal; performing pre-processing on the input signal, based on the valid bandwidth; and encoding the pre-processed input signal, based on the valid bandwidth, and the signal processing method performed by a signal receiving apparatus includes decoding a bitstream or a packet received via a transmission channel; determining a valid bandwidth, based on a plurality of parameters used in the decoding; and performing post-processing on a decoded signal, based on the valid bandwidth.Type: GrantFiled: September 29, 2020Date of Patent: July 5, 2022Assignee: SAMSUNG ELECTRONICS CO., LTD.Inventors: Ho-sang Sung, Holly Francois, Ki-hyun Choo, Eun-mi Oh, Kyung-hun Jung, Yaodu Wei
-
Patent number: 11374663Abstract: A method includes receiving a set of time domain samples representing audio captured using one or more microphones, and generating, from the time domain samples, a spectrum comprising a set of frequency domain coefficients, each coefficient representing a frequency bin corresponding to a range of frequencies. The method also includes adjusting the spectrum to generate a smoothed spectrum, wherein generating the smoothed spectrum includes determining that a magnitude of a first frequency domain coefficient is less than a threshold, and in response, replacing the first frequency domain coefficient by a value computed as a function of a plurality of frequency domain coefficients that include the first frequency domain coefficient. The method further includes generating an audio signal based on time domain samples computed using the smoothed spectrum.Type: GrantFiled: November 21, 2019Date of Patent: June 28, 2022Assignee: Bose CorporationInventors: Cristian Marius Hera, Elie Bou Daher
-
Patent number: 11367453Abstract: An apparatus for generating an error concealment signal, includes: an LPC representation generator for generating a replacement LPC representation; a gain calculator for calculating a gain information from the LPC representations; a compensator for compensating a gain influence of the replacement LPC representation using the gain information; and an LPC synthesizer for filtering codebook information using the replacement LPC representation to obtain the error concealment signal, wherein the compensator is configured for weighting the codebook information or an LPC synthesis output signal.Type: GrantFiled: July 8, 2020Date of Patent: June 21, 2022Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Michael Schnabel, Jérémie Lecomte, Ralph Sperschneider, Manuel Jander
-
Patent number: 11367454Abstract: An apparatus for encoding directional audio coding parameters having diffuseness parameters and direction parameters, has: a parameter quantizer for quantizing the diffuseness parameters and the direction parameters; a parameter encoder for encoding quantized diffuseness parameters and quantized direction parameters; and an output interface for generating an encoded parameter representation having information on encoded diffuseness parameters and encoded direction parameters.Type: GrantFiled: May 6, 2020Date of Patent: June 21, 2022Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.Inventors: Guillaume Fuchs, Jürgen Herre, Fabian Küch, Stefan Döhla, Markus Multrus, Oliver Thiergart, Oliver Wübbolt, Florin Ghido, Stefan Bayer, Wolfgang Jaegers
-
Patent number: 11363398Abstract: Example embodiments disclosed herein relate to audio object clustering. A method for metadata-preserved audio object clustering is disclosed. The method comprises classifying a plurality of audio objects into a number of categories based on information to be preserved in metadata associated with the plurality of audio objects. The method further comprises assigning a predetermined number of clusters to the categories and allocating an audio object in each of the categories to at least one of the clusters according to the assigning. Corresponding system and computer program product are also disclosed.Type: GrantFiled: December 10, 2015Date of Patent: June 14, 2022Assignee: Dolby Laboratories Licensing CorporationInventors: Lianwu Chen, Lie Lu, Nicolas R. Tsingos
-
Patent number: 11354355Abstract: An apparatus, method, and computer-readable storage medium that generate at least a cue point in a musical piece. The method includes generating a beat grid representing the musical piece, determining values for the beat grid, the values corresponding to an audio feature of the musical piece, and each value representing an entire duration of each beat in the beat grid of the musical piece, calculating a score for the audio feature at each of a plurality of positions in the beat grid of the musical piece, using some or all of the determined values, and generating the cue point at a particular position of the plurality of positions, based on the calculated scores.Type: GrantFiled: July 10, 2019Date of Patent: June 7, 2022Assignee: MIXED IN KEY LLCInventor: Martin Douglas
-
Patent number: 11348598Abstract: Systems, methods, and devices for monitoring industrial equipment using audio are described herein. One system includes two computing devices. The first computing device can receive, from an audio sensor, audio sensed during operation of industrial equipment, extract a plurality of features from the audio, determine whether any portion of the audio is anomalous, and send, upon determining a portion of the audio is anomalous, the anomalous portion of the audio to the second, remotely located, computing device. The second computing device can provide the anomalous portion of the audio to a user to determine whether the anomalous portion of the audio corresponds to a fault occurring in the equipment, and receive, from the user upon determining the anomalous portion of the audio corresponds to a fault occurring in the equipment, input indicating the anomalous portion of the audio corresponds to the fault to learn fault patterns in the equipment.Type: GrantFiled: October 22, 2020Date of Patent: May 31, 2022Assignee: HONEYWELL INTERNATIONA, INC.Inventors: Ramprasad Yelchuru, Shreyan Chowdhury, Pradyumna Sampath
-
Patent number: 11325407Abstract: A method is provided for extending the frequency band of an audio signal during a decoding or improvement process. The method includes obtaining the decoded signal in a first frequency band, referred to as a low band. Tonal components and a surround signal are extracted from the signal from the low-band signal, and the tonal components and the surround signal are combined by adaptive mixing using energy-level control factors to obtain an audio signal, referred to as a combined signal. The low-band decoded signal before the extraction step or the combined signal after the combination step are extended over at least one second frequency band which is higher than the first frequency band. Also proved are a frequency-band extension device which implements the described method and a decoder including a device of this type.Type: GrantFiled: July 27, 2020Date of Patent: May 10, 2022Assignee: Koninklijke Philips N.V.Inventors: Magdalena Kaniewska, Stephane Ragot
-
Patent number: 11328734Abstract: An encoding method for a multi-channel audio signal, an encoding apparatus for performing the encoding method, and a decoding method for a multi-channel audio signal and a decoding apparatus for performing the decoding method are disclosed. A method and apparatus of bypassing an MPEG Surround (MPS) standard operation and using an arbitrary tree when a number of audio signals of N channels exceeds a channel number defined in an MPS standard, is disclosed.Type: GrantFiled: January 6, 2020Date of Patent: May 10, 2022Assignee: Electronics and Telecommunications Research InstituteInventors: Seung Kwon Beack, Jeong Il Seo, Jong Mo Sung, Tae Jin Lee, Jin Soo Choi
-
Patent number: 11322164Abstract: The present document describes a method (400) for encoding a soundfield representation (SR) input signal (101, 301) describing a soundfield at a reference position, wherein the SR input signal (101, 301) comprises a plurality of channels for a plurality of different directivity patterns of the soundfield at the reference position. The method (400) comprises extracting (401) one or more audio objects (103, 303) from the SR input signal (101, 301). Furthermore, the method (400) comprises determining (402) a residual signal (102, 302) based on the SR input signal (101, 301) and based on the one or more audio objects (103, 303). The method (400) also comprises performing joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302). In addition, the method (400) comprises generating (403) a bitstream (701) based on data generated in the context of joint coding of the one or more audio objects (103, 303) and/or the residual signal (102, 302).Type: GrantFiled: January 17, 2019Date of Patent: May 3, 2022Assignees: Dolby Laboratories Licensing Corporation, Dolby International ABInventors: Kristofer Kjoerling, David S. McGrath, Heiko Purnhagen, Mark R. P. Thomas
-
Patent number: 11315582Abstract: A method for recovering audio signals, a terminal and a storage medium are provided. The method includes: buffering an audio signal sampled at a preset number of sampling points each time, and then performing frequency spectrum analysis on the sampled audio signal by FFT; when it is determined that the audio signal is compressed, filtering a frequency point; recovering high-frequency signals based on audio signals before the frequency point; and performing phase recovery on the high-frequency signals. Thus, compressed high-frequency signals in the audio signals may be recovered.Type: GrantFiled: November 27, 2018Date of Patent: April 26, 2022Assignee: GUANGZHOU KUGOU COMPUTER TECHNOLOGY CO., LTD.Inventors: Jiaze Liu, Yufei Wang
-
Patent number: 11316490Abstract: An audio signal processing device comprises: a receiver for receiving an input audio signal; a processor for generating loudness metadata corresponding to the input audio signal; and an outputter for transmitting the loudness metadata generated by the processor. The processor is configured to acquire loudness information analyzed from input content, acquires loudness information about the input audio signal by measuring the loudness of the input audio signal, generates the loudness metadata by converting the loudness information, and transmits, through the outputter, the generated loudness metadata to an output device for outputting the input audio signal.Type: GrantFiled: March 12, 2020Date of Patent: April 26, 2022Assignee: GAUDIO LAB, INC.Inventors: Sangbae Chon, Kyutae Park, Younghoon Kwon, Hyunoh Oh, Jeonghun Seo, Taegyu Lee, Hyunjoo Chung
-
Patent number: 11308977Abstract: According to an embodiment, the above-described specification discloses an electronic device comprises at least one processor configured to: receive a first audio signal and a second audio signal; detect a spectral envelope signal from the first audio signal and extract a feature point from the second audio signal; extend a high-band of the second audio signal based on the spectral envelope signal from the first audio signal and the feature point from the second audio signal to generate a high-band extension signal; and mix the high-band extension signal and the first audio signal, thereby resulting in a synthesized signal.Type: GrantFiled: January 3, 2020Date of Patent: April 19, 2022Assignee: Samsung Electronics Co., Ltd.Inventors: Hangil Moon, Aran Cha, Hwan Shim, Gunwoo Lee, Kyuhan Kim
-
Patent number: 11297398Abstract: The disclosed embodiments relate to media devices implementing new television standards, such as ATSC 3.0, which includes audio/video essence and metadata/signaling. The disclosed embodiments include techniques for creating a signaling file that contains metadata and signaling data about the broadcast service being presented. The signaling file may include URLs that can be used to access signaling information for supplementary signaling and content. The signaling also contains a mapping between a first timing system and a second timing system. The first timing system may comprise the watermark timeline represented in interval s and the second timing system may comprise a DASH presentation time.Type: GrantFiled: June 21, 2018Date of Patent: April 5, 2022Assignee: Verance CorporationInventors: Jian Zhao, Joseph M. Winograd, Rade Petrovic, Julia Ann Kenyon
-
Patent number: 11297533Abstract: The present disclosure provides methods, devices and computer program products for non-uniform quantization of parameters. The disclosure further relates to a method and apparatus for reconstructing an audio object in an audio decoding system taking the non-uniformly quantized parameters into account. According to the disclosure, such an approach renders it possible to reduce bit consumption without substantially reducing the quality of the reconstructed audio object.Type: GrantFiled: June 19, 2020Date of Patent: April 5, 2022Assignee: Dolby International ABInventors: Heiko Purnhagen, Per Ekstrand
-
Patent number: 11289104Abstract: An apparatus for decoding an encoded audio signal, includes a spectral domain audio decoder for generating a first decoded representation of a first set of first spectral portions, the decoded representation having a first spectral resolution; a parametric decoder for generating a second decoded representation of a second set of second spectral portions having a second spectral resolution being lower than the first spectral resolution; a frequency regenerator for regenerating every constructed second spectral portion having the first spectral resolution using a first spectral portion and spectral envelope information for the second spectral portion; and a spectrum time converter for converting the first decoded representation and the reconstructed second spectral portion into a time representation.Type: GrantFiled: February 26, 2019Date of Patent: March 29, 2022Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Sascha Disch, Frederik Nagel, Ralf Geiger, Balaji Nagendran Thoshkahna, Konstantin Schmidt, Stefan Bayer, Christian Neukam, Bernd Edler, Christian Helmrich
-
Patent number: 11270711Abstract: In general, techniques are described by which to provide priority information for higher order ambisonic (HOA) audio data. A device comprising a memory and a processor may perform the techniques. The memory stores HOA coefficients of the HOA audio data, the HOA coefficients representative of a soundfield. The processor may decompose the HOA coefficients into a sound component and a corresponding spatial component, the corresponding spatial component defining shape, width, and directions of the sound component, and the corresponding spatial component defined in a spherical harmonic domain. The processor may also determine, based on one or more of the sound component and the corresponding spatial component, priority information indicative of a priority of the sound component relative to other sound components of the soundfield, and specify, in a data object representative of a compressed version of the HOA audio data, the sound component and the priority information.Type: GrantFiled: May 6, 2020Date of Patent: March 8, 2022Assignee: Qualcomm IncorproatedInventors: Moo Young Kim, Nils Günther Peters, Shankar Thagadur Shivappa, Dipanjan Sen
-
Patent number: 11270709Abstract: There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M?N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.Type: GrantFiled: November 22, 2017Date of Patent: March 8, 2022Assignee: Dolby International ABInventors: Heiko Purnhagen, Kristofer Kjoerling, Toni Hirvonen, Lars Villemoes, Dirk Jeroen Breebaart
-
Patent number: 11271678Abstract: An encoding method for a wireless local area network includes encoding a packet via a cyclic coding scheme into a first amount of multiple first blocks; processing each of the first amount of the first blocks into a second amount of multiple second blocks; and encoding the second amount of the second blocks via a block coding scheme.Type: GrantFiled: July 2, 2020Date of Patent: March 8, 2022Assignee: Realtek Semiconductor Corp.Inventors: John Timothy Coffey, Der-Zheng Liu
-
Patent number: 11264042Abstract: An apparatus for generating an enhanced signal from an input signal, wherein the enhanced signal has spectral values for an enhancement spectral region, the spectral values for the enhancement spectral regions not being contained in the input signal, includes a mapper for mapping a source spectral region of the input signal to a target region in the enhancement spectral region, the source spectral region including a noise-filling region; and a noise filler configured for generating first noise values for the noise-filling region in the source spectral region of the input signal and for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from the first noise values or for generating second noise values for a noise region in the target region, wherein the second noise values are decorrelated from first noise values in the source region.Type: GrantFiled: November 21, 2019Date of Patent: March 1, 2022Inventors: Sascha Disch, Ralf Geiger, Andreas Niedermeier, Matthias Neusinger, Konstantin Schmidt, Stephan Wilde, Benjamin Schubert, Christian Neukam
-
Patent number: 11264017Abstract: Systems and methods include a plurality of audio input components configured to generate a plurality of audio input signals, and a logic device configured to receive the plurality of audio input signals, determine whether the plurality of audio signals comprise target audio associated with an audio source, estimate a relative location of the audio source with respect to the plurality of audio input components based on the plurality of audio signals and a determination of whether the plurality of audio signals comprise the target audio, and process the plurality of audio signals to generate an audio output signal by enhancing the target audio based on the estimated relative location. The logic device is further configured to use relative transfer-based covariance to construct directional covariance matrix aligned across frequency bands and find a direction that minimizes beam power subject to distortionless criteria.Type: GrantFiled: June 12, 2020Date of Patent: March 1, 2022Assignee: SYNAPTICS INCORPORATEDInventors: Alireza Masnadi-Shirazi, Francesco Nesta
-
Patent number: 11264038Abstract: The invention provides methods and devices for stereo encoding and decoding using complex prediction in the frequency domain. In one embodiment, a decoding method, for obtaining an output stereo signal from an input stereo signal encoded by complex prediction coding and comprising first frequency-domain representations of two input channels, comprises the upmixing steps of: (i) computing a second frequency-domain representation of a first input channel; and (ii) computing an output channel on the basis of the first and second frequency-domain representations of the first input channel, the first frequency-domain representation of the second input channel and a complex prediction coefficient. The upmixing can be suspended responsive to control data.Type: GrantFiled: February 20, 2020Date of Patent: March 1, 2022Assignee: Dolby International ABInventors: Heiko Purnhagen, Lars Villemoes, Pontus Carlsson
-
Patent number: 11250863Abstract: The techniques disclosed herein provide apparatuses and related methods for the communication of spatial audio and related metadata. In some implementations, a source provides prerecorded spatial audio that has embedded metadata. A computing device processes the prerecorded spatial audio to generate an audio codec that is segmented to include a first section of audio data and a second section that includes metadata extracted from the prerecorded spatial audio. The generated audio codec may be received by a device that includes an encoder. The encoder may process the generated audio codec to generate audio data that includes the metadata.Type: GrantFiled: December 17, 2019Date of Patent: February 15, 2022Assignee: Microsoft Technology Licensing, LLCInventors: Brian C. McDowell, Philip Andrew Edry, Ziyad Ibrahim, Robert Norman Heitkamp, Steven Wilssens
-
Patent number: 11244674Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for suppressing hotword triggers when detecting a hotword in recorded media are disclosed. In one aspect, a method includes the actions of receiving, by a computing device, audio corresponding to playback of an item of media content. The actions further include determining, by the computing device, that the audio includes an utterance of a predefined hotword and that the audio includes an audio watermark. The actions further include analyzing, by the computing device, the audio watermark. The actions further include based on analyzing the audio watermark, determining, by the computing device, whether to perform speech recognition on a portion of the audio following the predefined hotword.Type: GrantFiled: August 9, 2019Date of Patent: February 8, 2022Assignee: Google LLCInventor: Ricardo Antonio Garcia
-
Patent number: 11238875Abstract: This disclosure provides an encoding method, a decoding method, an encoding apparatus, and a decoding apparatus for a stereo signal. The encoding method includes: performing interpolation processing based on the inter-channel time difference in the current frame and an inter-channel time difference in a previous frame of the current frame; performing time-domain downmixing processing on the stereo signal after the delay alignment in the current frame, to obtain a primary-channel signal and a secondary-channel signal in the current frame; and quantizing the inter-channel time difference after the interpolation processing in the current frame, the primary channel signal and the secondary channel signal.Type: GrantFiled: January 24, 2020Date of Patent: February 1, 2022Assignee: Huawei Technologies Co., Ltd.Inventors: Eyal Shlomot, Halting Li, Bin Wang
-
Patent number: 11232802Abstract: Methods for converting, encoding, decoding and transcoding an acoustic field, more particularly a first-order Ambisonics three-dimensional acoustic field.Type: GrantFiled: September 28, 2017Date of Patent: January 25, 2022Assignee: Coronal Encoding S.A.S.Inventors: Benjamin Bernard, Francois Becker
-
Patent number: 11232803Abstract: An encoding device according to the disclosure includes a first encoding unit that generates a first encoded signal in which a low-band signal having a frequency lower than or equal to a predetermined frequency from a voice or audio input signal is encoded, and a low-band decoded signal; a second encoding unit that encodes, on the basis of the low-band decoded signal, a high-band signal having a band higher than that of the low-band signal to generate a high-band encoded signal; and a first multiplexing unit that multiplexes the first encoded signal and the high-band encoded signal to generate and output an encoded signal. The second encoding unit calculates an energy ratio between a high-band noise component, which is a noise component of the high-band signal, and a high-band non-tonal component of a high-band decoded signal generated from the low-band decoded signal and outputs the ratio as the high-band encoded signal.Type: GrantFiled: March 7, 2019Date of Patent: January 25, 2022Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Srikanth Nagisetty, Zong Xian Liu, Hiroyuki Ehara
-
Patent number: 11232801Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.Type: GrantFiled: July 24, 2020Date of Patent: January 25, 2022Assignee: Dolby International ABInventors: Sven Kordon, Alexander Krueger
-
Patent number: 11228856Abstract: A method for generating loudspeaker signals associated with a target screen size is disclosed. The method includes receiving a bit stream containing encoded higher order ambisonics signals, the encoded higher order ambisonics signals describing a sound field associated with a production screen size. The method further includes decoding the encoded higher order ambisonics signals to obtain a first set of decoded higher order ambisonics signals representing dominant components of the sound field and a second set of decoded higher order ambisonics signals representing ambient components of the sound field. The method also includes combining the first set of decoded higher order ambisonics signals and the second set of decoded higher order ambisonics signals to produce a combined set of decoded higher order ambisonics signals.Type: GrantFiled: August 26, 2020Date of Patent: January 18, 2022Assignee: Dolby Laboratories Licensing CorporationInventors: Peter Jax, Johannes Boehm, William Redmann
-
Patent number: 11227615Abstract: A sound processing apparatus includes: a conversion unit that samples predetermined audio data with a first quantization bit rate and a second quantization bit rate greater than the first quantization bit rate and outputs the respective results as first digital audio data and second digital audio data; a frequency analysis unit that performs frequency analysis on each output from the conversion unit; and a determination unit that performs predetermined determination processing on the basis of analysis results by the frequency analysis unit.Type: GrantFiled: July 25, 2018Date of Patent: January 18, 2022Assignee: SONY CORPORATIONInventors: Tatsuya Koizumi, Taiji Yoshida
-
Patent number: 11227616Abstract: Audio encoder for encoding audio input data to obtain audio output data includes an input interface for receiving a plurality of audio channels, a plurality of audio objects and metadata related to one or more of the plurality of audio objects; a mixer for mixing the plurality of objects and the plurality of channels to obtain a plurality of pre-mixed channels, each pre-mixed channel including audio data of a channel and audio data of at least one object; a core encoder for core encoding core encoder input data; and a metadata compressor for compressing the metadata related to the one or more of the plurality of audio objects, wherein the audio encoder is configured to operate in at least one mode of the group of two modes.Type: GrantFiled: February 15, 2019Date of Patent: January 18, 2022Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Alexander Adami, Christian Borss, Sascha Dick, Christian Ertel, Simone Neukam, Juergen Herre, Johannes Hilpert, Andreas Hoelzer, Michael Kratschmer, Fabian Kuech, Achim Kuntz, Adrian Murtaza, Jan Plogsties, Andreas Silzle, Hanne Stenzel
-
Patent number: 11223925Abstract: An apparatus configured to: based on (i) captured spatial audio content of a scene comprising audio that is associated with information indicative of at least a direction in the scene from which said audio was captured; and (ii) visual focus information comprising information indicative of at least a first part of the scene on which corresponding captured visual imagery of the scene is focused for presentation to a user; provide for presentation of the captured spatial audio content to accompany the captured visual imagery, the captured spatial audio content presented as spatial audio, the spatial audio content provided for presentation with a spatial audio focus selectively applied to audio captured from a second part of the scene different to the first part, the spatial audio focus comprising an audio-modifying effect to increase the audibility of the audio having a direction corresponding to the second part.Type: GrantFiled: December 11, 2018Date of Patent: January 11, 2022Assignee: Nokia Technologies OyInventors: Lasse Laaksonen, Arto Lehtiniemi, Toni Mäkinen, Mikko Heikkinen
-
Patent number: 11217261Abstract: In methods and apparatus and non-transitory memory units for encoding/decoding audio signal information, the encoder side may determine if a signal frame is useful for long term post filtering and/or packet lost concealment and may encode information in accordance to the results of the determination, and the decoder side may apply the LTPF and/or PLC in accordance to the information obtained from the encoder.Type: GrantFiled: May 6, 2020Date of Patent: January 4, 2022Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.Inventors: Emmanuel Ravelli, Adrian Tomasek, Manfred Lutzky, Conrad Benndorf
-
Patent number: 11218754Abstract: Audio video synchronization and alignment or alignment of audio to some other external clock are rendered more effective or easier by treating fragment grid and frame grid as independent values, but, nevertheless, for each fragment the frame grid is aligned to the respective fragment's beginning. A compression effectiveness lost may be kept low when appropriately selecting the fragment size. On the other hand, the alignment of the frame grid with respect to the fragments' beginnings allows for an easy and fragment-synchronized way of handling the fragments in connection with, for example, parallel audio video streaming, bitrate adaptive streaming or the like.Type: GrantFiled: February 7, 2020Date of Patent: January 4, 2022Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Bernd Czelhan, Harald Fuchs, Ingo Hofmann, Herbert Thoma, Stephan Schreiner
-
Patent number: 11176951Abstract: A method for processing a monophonic signal in a 3D audio decoder, including processing binauralizing decoded signals intended to be delivered spatially by a headset. The method is such that, on detection, in a datastream representative of the monophonic signal, of an indication of non-binauralization processing, which indication is associated with spatial delivery position information, the decoded monophonic signal is directed to a stereophonic rendering engine, which takes into account the position information to construct two delivery channels that are directly processed via a direct mixing that sums these two channels with a binauralized signal output from the binauralization processing, in order to be delivered via the headset. A decoder device that implements the processing method is also provided.Type: GrantFiled: December 7, 2018Date of Patent: November 16, 2021Assignee: ORANGEInventor: Gregory Pallone
-
Patent number: 11172293Abstract: A low power voice processing system that includes a plurality of non-audio sensors, at least one microphone system, and a plurality of audio modules, at least some of which can be configured in selected modes. A context determination module is connected to the plurality of audio modules, and further connected to receive input from the plurality of non-audio sensors and the at least one microphone system. The context determination module acts to determine use context for the voice processing system and at least in part selects mode operation of at least some of the plurality of audio modules.Type: GrantFiled: July 10, 2019Date of Patent: November 9, 2021Assignee: AMBIQ MICRO, INC.Inventors: Arpit Shah, Scott McLean Hanson, Stephen Howard Nease
-
Patent number: 11170795Abstract: The invention provides a concept for combined dynamic range compression and guided clipping prevention for audio devices. An audio decoder for decoding an audio bitstream and a metadata bitstream related to the audio bitstream according to the concept includes an audio processing chain including a plurality of adjustment stages including a dynamic range control stage for adjusting a dynamic range of the audio output signal and a guided clipping prevention stage for preventing clipping of the audio output signal; and a metadata decoder configured to receive the metadata bitstream and to extract dynamic range control gain sequences and guided clipping prevention gain sequences from the metadata bitstream, at least a part of the dynamic range control gain sequences being supplied to the dynamic range control stage, and at least a part of the guided clipping prevention gain sequences being supplied to the guided clipping prevention stage.Type: GrantFiled: April 22, 2016Date of Patent: November 9, 2021Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.Inventors: Fabian Kuech, Christian Uhle, Michael Kratschmer, Bernhard Neugebauer, Michael Meier, Arne Borsum
-
Patent number: 11170792Abstract: To suitably regulate sound pressure of object content on a receiving side. An audio stream including coded data of a predetermined number of pieces of object content is generated. A container of a predetermined format including the audio stream is transmitted. Information indicating a range within which sound pressure is allowed to increase and decrease for each piece of object content is inserted into a layer of the audio stream and/or a layer of the container. On a receiving side, sound pressure of each piece of object content increases and decreases within the allowable range based on the information.Type: GrantFiled: December 16, 2019Date of Patent: November 9, 2021Assignee: SONY CORPORATIONInventors: Ikuo Tsukagoshi, Toru Chinen
-
Patent number: 11151176Abstract: A method for processing text information is provided, the method including: obtaining text information and extracting text characters contained in the text information; extracting, from the text characters, target characters satisfying a predetermined rule; calculating a filtering index of the text information according to the target characters; and when the filtering index meets a predetermined condition, executing an operation corresponding to the predetermined condition on the text information. In addition, an apparatus for processing text information is further provided. The method and apparatus for processing text information can improve the accuracy and efficiency of filtering out junk text information.Type: GrantFiled: June 28, 2019Date of Patent: October 19, 2021Assignee: TENCENT TECHNOLOGY (SHENZHEN) COMPANY LIMITEDInventor: Shengrong Chen
-
Patent number: 11130066Abstract: System and method that handles the inconsistent processing of interactive media (e.g. digital games, music visualization software, video playback software, animation software, etc.) and the generally consistent timeline delay to enable synchronization of messages and events with the playback of potentially variable-rate timelines.Type: GrantFiled: August 29, 2016Date of Patent: September 28, 2021Assignee: Sonic Bloom, LLCInventor: Eric D. Robinson
-
Patent number: 11133015Abstract: A method of predicting a channel parameter of an original signal from a downmix signal is disclosed. The method may include generating an input feature map to be used to predict a channel parameter of the original signal based on a downmix signal of an original signal, determining an output feature map including a predicted parameter to be used to predict the channel parameter by applying the input feature map to a neural network, generating a label map including information associated with the channel parameter of the original signal, and predicting the channel parameter of the original signal by comparing the output feature map and the label map.Type: GrantFiled: November 5, 2018Date of Patent: September 28, 2021Assignee: Electronics and Telecommunications Research InstituteInventors: Seung Kwon Beack, Woo-taek Lim, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Hui Yong Kim
-
Patent number: 11113287Abstract: A system may include at least one processor. The at least one processor may receive data from a plurality of independent data sources. The data from each respective data source is received at a rate determined by the respective data source. The at least one processor may further write the received data to at least one data store at a rate independent of the respective rates at which data from the plurality of independent data sources is received. A method and computer-readable medium are also disclosed.Type: GrantFiled: May 11, 2017Date of Patent: September 7, 2021Assignee: Teradata US, Inc.Inventors: Louis B. Landry, Ilsun A. Park, Oliver Ratzesberger
-
Patent number: 11115731Abstract: The present disclosure relates to sampling wireless signals received at a receiver. Subsets of the digital samples are multiplied by a Pseudo-Noise (PN) code to generate tone signals. A pattern of tone signals may indicate the presence of a repeating preamble. This may be used to locate a payload that follows the preamble. A tone signal of a payload may be decoded by multiplying a subset of the digital samples with the PN code to generate a tone signal. The tone signal may be transformed into the frequency domain to identify a frequency component having an energy level above a threshold. Using the frequency component, the payload is decoded. By structuring a packet as a series of PN-code modulated tone signals, packets received from different sources may be differentiated even when they arrive at overlapping points of time. This allows for a larger transmitting capacity in a network.Type: GrantFiled: May 12, 2020Date of Patent: September 7, 2021Assignee: Amazon Technologies, Inc.Inventor: Benjamin Wild
-
Patent number: 11089425Abstract: The present invention pertains to an audio playback method and an audio playback apparatus in a 6DoF environment. The audio playback method of the present invention is characterised by comprising: a decoding step of decoding a received audio signal, and outputting the decoded audio signal and metadata; a modelling step of receiving input of position information of a user, checking whether the position of the user has changed from a previous position, and if the position of the user has changed, modelling binaural rendering data so as to correspond to the changed position of the user; and a rendering step of binaural-rendering the decoded audio signal using the modelled rendering data, and outputting the same as a two-channel audio signal.Type: GrantFiled: November 14, 2017Date of Patent: August 10, 2021Assignee: LG Electronics Inc.Inventors: Tung Chin Lee, Sejin Oh
-
Patent number: 11080010Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with the audio data, and analyzing the metadata to determine whether a loudness parameter for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameter based on the one or more characteristics.Type: GrantFiled: June 1, 2020Date of Patent: August 3, 2021Assignees: Dolby Laboratories Licensing Corporation, Dolby International ABInventors: Jeffrey Riedmiller, Scott Gregory Norcross, Karl Jonas Roeden