Audio Signal Bandwidth Compression Or Expansion Patents (Class 704/500)
  • Patent number: 12380902
    Abstract: A method comprises: vector quantizing input vectors representative of audio into an original sequence including indices of codewords of a codebook; generating candidate sequences including the indices of the codewords of the codebook by evaluating, for each candidate sequence, transition costs for transitions between the indices based on (i) transition probabilities of the transitions, and (ii) distances between the codewords represented by the indices and the input vectors that correspond to the indices; determining a preferred candidate sequence of the candidate sequences to replace the original sequence based on the transition costs for each candidate sequence; and transmitting the preferred candidate sequence in place of the original sequence.
    Type: Grant
    Filed: December 14, 2023
    Date of Patent: August 5, 2025
    Assignee: CISCO TECHNOLOGY, INC.
    Inventors: Marcin Ciolek, Michal Sulewski, Raul A. Casas, Samer Lutfi Hijazi, Mihailo Kolundzija
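As an illustration of the abstract for patent 12380902, the search over candidate index sequences can be pictured as a Viterbi-style shortest-path search. This is only a sketch under assumptions that are not taken from the patent: the cost of a transition is taken to be `-log(probability) + weight * squared distance`, and the names `best_index_sequence`, `trans_prob`, and `weight` are hypothetical.

```python
import math


def neg_log(p):
    """Cost of a transition with probability p (infinite if p is zero)."""
    return -math.log(p) if p > 0 else math.inf


def sq_dist(a, b):
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def best_index_sequence(inputs, codebook, trans_prob, weight=1.0):
    """Return the index sequence with the lowest total assumed cost.

    inputs:     list of input vectors (one per frame)
    codebook:   list of codeword vectors
    trans_prob: trans_prob[i][j] = assumed probability of index i -> j
    weight:     hypothetical trade-off between distortion and transition cost
    """
    n = len(codebook)
    # cost[j] = cheapest cost of any sequence that ends at codeword j
    cost = [weight * sq_dist(codebook[j], inputs[0]) for j in range(n)]
    back = []  # back[t][j] = best predecessor of j at frame t + 1
    for x in inputs[1:]:
        new_cost, ptr = [], []
        for j in range(n):
            i = min(range(n), key=lambda k: cost[k] + neg_log(trans_prob[k][j]))
            step = neg_log(trans_prob[i][j]) + weight * sq_dist(codebook[j], x)
            new_cost.append(cost[i] + step)
            ptr.append(i)
        cost = new_cost
        back.append(ptr)
    # Trace back from the cheapest final state.
    j = min(range(n), key=lambda k: cost[k])
    path = [j]
    for ptr in reversed(back):
        j = ptr[j]
        path.append(j)
    return path[::-1]
```

With uniform transition probabilities the search reduces to ordinary nearest-codeword quantization; with probabilities that strongly favor repeating an index, the preferred sequence can differ from the original one.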
  • Patent number: 12374341
    Abstract: A decoder-side method for outputting several audio channels of a sound program is described. An audio channel of the sound program, a residual signal, a gain parameter, and a delay parameter are received, for example within a bitstream. The audio channel is adjusted in accordance with the gain parameter and the delay parameter, to produce an adjusted audio signal, and is then combined with the residual signal to produce a combined signal. The audio channel is output as a first audio channel of the sound program for playback, while the combined signal is output as a second audio channel of the sound program. Other aspects are also described and claimed.
    Type: Grant
    Filed: April 14, 2023
    Date of Patent: July 29, 2025
    Assignee: Apple Inc.
    Inventor: Frank Baumgarte
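The decoder-side combination described in the abstract for patent 12374341 can be sketched in a few lines. The sketch assumes a single broadband gain, an integer delay in samples, and zero padding before the start of the signal; none of these details are stated in the abstract, and the function name `reconstruct_second_channel` is hypothetical.

```python
def reconstruct_second_channel(channel, residual, gain, delay):
    """Build the second channel as gain * delayed(channel) + residual.

    channel:  samples of the received audio channel (output unchanged as channel 1)
    residual: samples of the received residual signal
    gain:     assumed scalar gain parameter
    delay:    assumed non-negative integer delay in samples
    """
    out = []
    for n, r in enumerate(residual):
        k = n - delay
        # Samples before the start (or past the end) are treated as silence.
        delayed = channel[k] if 0 <= k < len(channel) else 0.0
        out.append(gain * delayed + r)
    return out
```

The received channel itself would be passed through as the first output channel, and the returned list would serve as the second.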
  • Patent number: 12367885
    Abstract: An inter-channel phase difference (IPD) parameter extraction method includes obtaining a parameter for obtaining an information extraction manner for a current frame of a multi-channel signal; obtaining an IPD parameter extraction manner for the current frame based on the parameter for obtaining the information extraction manner, where the obtained IPD parameter extraction manner is one of at least two preset IPD parameter extraction manners; and obtaining an IPD parameter of the current frame based on the obtained IPD parameter extraction manner for the current frame.
    Type: Grant
    Filed: January 19, 2024
    Date of Patent: July 22, 2025
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Xingtao Zhang, Haiting Li, Zexin Liu, Lei Miao
  • Patent number: 12361953
    Abstract: This disclosure provides an encoding method and an encoder for a multi-channel signal. The encoding method includes: obtaining a first ITD of a current frame of a multi-channel signal that includes an initial left channel signal and an initial right channel signal; obtaining a second ITD of the current frame based on the first ITD and a third ITD of a previous frame of the multi-channel signal; performing delay alignment on the initial left channel signal and the initial right channel signal based on the second ITD, to obtain an aligned left channel signal and an aligned right channel signal; and encoding the aligned left channel signal and the aligned right channel signal.
    Type: Grant
    Filed: July 12, 2023
    Date of Patent: July 15, 2025
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Eyal Shlomot, Haiting Li, Bin Wang
  • Patent number: 12347443
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Grant
    Filed: March 12, 2024
    Date of Patent: July 1, 2025
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 12334087
    Abstract: This application provides example psychoacoustics-based audio encoding methods and apparatuses. One example method includes receiving audio data. The audio data can be decoded. Auditory feature information of a user can be obtained, where the auditory feature information includes at least one of the following: personal information, listening test result information, or frequency response curve information. A psychoacoustics model parameter of the user can be calculated based on the auditory feature information of the user, where the psychoacoustics model parameter includes at least one of the following: an intra-band masking parameter, a slope of a low-frequency inter-band masking line, a slope of a high-frequency inter-band masking line, or a human ear quiet threshold curve. The decoded audio data can be encoded based on the psychoacoustics model parameter of the user.
    Type: Grant
    Filed: November 18, 2022
    Date of Patent: June 17, 2025
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Meng Wang, Zhuo Wang, Fan Fan
  • Patent number: 12334109
    Abstract: The present invention discloses a video processing circuit, which is coupled to a memory chip and includes an image processing circuit. The image processing circuit includes a first channel, a second channel and a compression circuit. The two channels process first image data and second image data to generate first processed image data and second processed image data, respectively. The compression circuit compresses the first processed image data and the second processed image data to generate first compressed image data and second compressed image data, respectively. A memory block in the memory chip is configured as a ring buffer shared by the first channel and the second channel so as to store the first compressed image data and the second compressed image data.
    Type: Grant
    Filed: March 13, 2023
    Date of Patent: June 17, 2025
    Assignee: SIGMASTAR TECHNOLOGY LTD.
    Inventor: Xiao-Ding Zhu
  • Patent number: 12333214
    Abstract: Embodiments are directed to a method and system for receiving, in a bitstream, metadata associated with audio data, and analyzing the metadata to determine whether loudness parameters for a first group of audio playback devices are available in the bitstream. Responsive to determining that the parameters are present for the first group, the system uses the parameters and audio data to render audio. Responsive to determining that the loudness parameters are not present for the first group, the system analyzes one or more characteristics of the first group, and determines the parameters based on the one or more characteristics.
    Type: Grant
    Filed: October 9, 2023
    Date of Patent: June 17, 2025
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey Riedmiller, Scott Gregory Norcross, Karl Jonas Roeden
  • Patent number: 12334083
    Abstract: Disclosed is an apparatus and method for audio encoding/decoding that is robust against coding distortion in a transition section. An audio encoding method includes outputting a frequency domain signal by time-to-frequency (T/F) transform of an input signal, outputting a frequency domain residual signal in which a frequency axis envelope is removed from the frequency domain signal by applying frequency domain noise shaping (FDNS) encoding to the frequency domain signal, outputting a time domain residual signal in which a time axis envelope is removed by performing linear prediction coefficient (LPC) analysis based on the frequency domain residual signal, and quantizing and transmitting the time domain residual signal.
    Type: Grant
    Filed: July 2, 2021
    Date of Patent: June 17, 2025
    Assignee: ELECTRONICS AND TELECOMMUNICATIONS RESEARCH INSTITUTE
    Inventors: Seung Kwon Beack, Jongmo Sung, Mi Suk Lee, Tae Jin Lee, Woo-taek Lim, Inseon Jang
  • Patent number: 12334085
    Abstract: The present document relates to a method of layered encoding of a frame of a compressed higher-order Ambisonics, HOA, representation of a sound or sound field. The compressed HOA representation comprises a plurality of transport signals. The method comprises assigning the plurality of transport signals to a plurality of hierarchical layers, the plurality of layers including a base layer and one or more hierarchical enhancement layers, generating, for each layer, a respective HOA extension payload including side information for parametrically enhancing a reconstructed HOA representation obtainable from the transport signals assigned to the respective layer and any layers lower than the respective layer, assigning the generated HOA extension payloads to their respective layers, and signaling the generated HOA extension payloads in an output bitstream.
    Type: Grant
    Filed: February 8, 2024
    Date of Patent: June 17, 2025
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 12335328
    Abstract: This disclosure provides a network call method and apparatus, a computer device, and a storage medium, and belongs to the field of audio data processing. The method includes: performing time-frequency transformation on an acquired audio signal, to obtain a plurality of pieces of frequency domain information of the audio signal; determining a target bit rate corresponding to the audio signal according to the plurality of pieces of frequency domain information; and encoding the audio signal based on the target bit rate, and performing a network call based on the encoded audio signal.
    Type: Grant
    Filed: October 21, 2021
    Date of Patent: June 17, 2025
    Assignee: Tencent Technology (Shenzhen) Company Limited
    Inventor: Junbin Liang
  • Patent number: 12302074
    Abstract: An apparatus for combining three or more audio signals is described. The apparatus includes a segmentation block for segmenting each audio signal into segments, a weight determination block, which is configured to determine a weight value for each of the temporally weighted audio signal segments, a combination block for combining the temporally weighted audio signal segments of each audio signal, and a synthesis block for generating an output audio signal. A method for combining three or more audio signals and a computer program product are also described.
    Type: Grant
    Filed: March 14, 2023
    Date of Patent: May 13, 2025
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventor: Christian Borss
  • Patent number: 12301829
    Abstract: Techniques and solutions are described for encoding digital video files, such as for streaming applications. Data associated with the digital video file forms a dataset that can be characterized by a measure of the dataset's center, such as an average, and a spread of the dataset, such as a deviation, with respect to a bitrate over a duration of the digital video file. The measure of center and spread are used to calculate a deviation-adjusted bitrate. A deviation-adjusted bitrate can be calculated for the entire digital video file, or for particular subsets of the digital video file, such as for segments of a duration forming units of video streaming. Disclosed techniques can provide various advantages, including using a reduced bitrate for video or video portions as compared with an average or static bitrate, for lower-complexity video, or using a higher bitrate for video or video portions for higher-complexity video.
    Type: Grant
    Filed: May 31, 2022
    Date of Patent: May 13, 2025
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Nitin Suri, Carlos Alberto Lopez Servin
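One way to read the "deviation-adjusted bitrate" in the abstract for patent 12301829 is as a center value shifted by a multiple of the spread. The sketch below assumes the mean as the measure of center, the population standard deviation as the measure of spread, and a hypothetical scale factor `k`; the abstract does not give the actual formula, so this is an illustration only.

```python
import statistics


def deviation_adjusted_bitrate(bitrates, k=1.0):
    """Return mean + k * standard deviation of the observed bitrates.

    bitrates: per-sample bitrate measurements for a file or a segment
    k:        hypothetical factor controlling how much spread is added
    """
    center = statistics.fmean(bitrates)
    spread = statistics.pstdev(bitrates)
    return center + k * spread


def per_segment_bitrates(bitrates, segment_len, k=1.0):
    """Apply the same calculation to consecutive segments of the file."""
    return [
        deviation_adjusted_bitrate(bitrates[i:i + segment_len], k)
        for i in range(0, len(bitrates), segment_len)
    ]
```

Computing the value per segment lets low-complexity segments receive a lower target than a single file-wide figure would give them, which matches the advantage the abstract describes.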
  • Patent number: 12283282
    Abstract: The processing load at a reception side is reduced in a case where a plurality of kinds of audio data items are transmitted. A container in a predetermined format having a predetermined number of audio streams including coded data items of a plurality of groups is transmitted. For example, the coded data items of the plurality of groups include either or both of channel coded data items and object coded data items. Attribute information indicating respective attributes of the coded data items of the plurality of groups is inserted into a layer of the container and/or a layer of an audio stream. For example, stream correspondence relation information indicating which audio stream includes each of the coded data items of the plurality of groups is further inserted.
    Type: Grant
    Filed: November 16, 2023
    Date of Patent: April 22, 2025
    Assignee: SONY GROUP CORPORATION
    Inventor: Ikuo Tsukagoshi
  • Patent number: 12243540
    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding comprising: means for determining at least two of a type of spatial audio parameter for one or more audio signals, wherein a first of the type of spatial audio parameter is associated with a first group of samples in a domain of the one or more audio signals and a second of the type of spatial audio parameter is associated with a second group of samples in the domain of the one or more audio signals; and means for merging the first of the type of spatial audio parameter and the second of the type of spatial audio parameter into a merged spatial audio parameter.
    Type: Grant
    Filed: November 13, 2020
    Date of Patent: March 4, 2025
    Assignee: NOKIA TECHNOLOGIES OY
    Inventors: Mikko-Ville Laitinen, Lasse Laaksonen, Adriana Vasilache, Tapani Pihlajakuja, Anssi Rämö
  • Patent number: 12236963
    Abstract: The present document relates to a method of layered encoding of a compressed sound representation of a sound or sound field. The compressed sound representation comprises a basic compressed sound representation comprising a plurality of components, basic side information for decoding the basic compressed sound representation to a basic reconstructed sound representation of the sound or sound field, and enhancement side information including parameters for improving the basic reconstructed sound representation.
    Type: Grant
    Filed: March 15, 2024
    Date of Patent: February 25, 2025
    Assignee: DOLBY INTERNATIONAL AB
    Inventors: Sven Kordon, Alexander Krueger
  • Patent number: 12231864
    Abstract: Disclosed is an apparatus and method for processing a multichannel audio signal. A multichannel audio signal processing method may include: generating an N-channel audio signal of N channels by down-mixing an M-channel audio signal of M channels; and generating a stereo audio signal by performing binaural rendering of the N-channel audio signal.
    Type: Grant
    Filed: December 1, 2023
    Date of Patent: February 18, 2025
    Assignee: Electronics and Telecommunications Research Institute
    Inventors: Yong Ju Lee, Jeong Il Seo, Seung Kwon Beack, Kyeong Ok Kang, Jin Woong Kim, Jae Hyoun Yoo
  • Patent number: 12210799
    Abstract: In an audio encoder, for audio content received in a source audio format, default gains are generated based on a default dynamic range compression (DRC) curve, and non-default gains are generated for a non-default gain profile. Based on the default gains and non-default gains, differential gains are generated. An audio signal comprising the audio content, the default DRC curve, and differential gains is generated. In an audio decoder, the default DRC curve and the differential gains are identified from the audio signal. Default gains are re-generated based on the default DRC curve. Based on the combination of the re-generated default gains and the differential gains, operations are performed on the audio content extracted from the audio signal.
    Type: Grant
    Filed: December 11, 2023
    Date of Patent: January 28, 2025
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeroen Koppens, Jeffrey Riedmiller, Kristofer Kjoerling, Alexander Stahlmann, Holger Hoerich, Alan J. Seefeldt
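The differential-gain scheme in the abstract for patent 12210799 can be illustrated with a round trip in the dB domain. The sketch assumes a simple threshold/ratio compressor as the default DRC curve, gains expressed in dB, and a decoder that sees the same signal levels as the encoder; the curve shape and all function names are hypothetical rather than taken from the patent.

```python
def default_drc_gain_db(level_db, threshold_db=-20.0, ratio=4.0):
    """Hypothetical default DRC curve: compress levels above the threshold."""
    if level_db <= threshold_db:
        return 0.0
    compressed = threshold_db + (level_db - threshold_db) / ratio
    return compressed - level_db


def encode_differential_gains(levels_db, non_default_gains_db):
    """Encoder side: differential gain = non-default gain - default gain."""
    return [
        g - default_drc_gain_db(level)
        for level, g in zip(levels_db, non_default_gains_db)
    ]


def decode_gains(levels_db, differential_gains_db):
    """Decoder side: re-generate default gains and add the differentials."""
    return [
        default_drc_gain_db(level) + d
        for level, d in zip(levels_db, differential_gains_db)
    ]
```

Because the decoder can re-generate the default gains from the transmitted curve, only the (typically small) differences need to be carried alongside the audio content.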
  • Patent number: 12200465
    Abstract: The technology generally relates to spatial audio communication between devices. For example, a first device and a second device may be connected via a communication link. The first device may capture audio signals in an environment through two or more microphones. The first device may encode the captured audio with spatial configuration data. The first device may transmit the encoded audio via the communication link to the second device. The second device may decode the encoded audio into binaural or ambisonic audio to be output by one or more speakers of the second device. The binaural or ambisonic audio may be converted into spatial audio to be output. The second device may output the binaural or spatial audio to create an immersive listening experience.
    Type: Grant
    Filed: May 19, 2022
    Date of Patent: January 14, 2025
    Assignee: Google LLC
    Inventors: Rajeev Conrad Nongpiur, Qian Zhang, Andrew James Sutter, Kung-Wei Liu, Jihan Li, Hélène Bahu, Leonardo Kusumo, Sze Chie Lim, Marco Tagliasacchi, Neil Zeghidour, Michael Takezo Chinen
  • Patent number: 12198706
    Abstract: An audio signal coding method is provided that includes: obtaining a current frame of an audio signal; obtaining a coding parameter based on a power spectrum ratio of a current frequency in a current frequency area of at least a part of signals of the current frame, where the coding parameter indicates tonal component information of the at least a part of signals, the tonal component information includes at least one of location information of a tonal component, quantity information of tonal components, amplitude information of the tonal component, or energy information of the tonal component, and the power spectrum ratio of the current frequency is a ratio of a value of a power spectrum of the current frequency to a mean value of power spectrums of the current frequency area; and performing bitstream multiplexing on the coding parameter to obtain a coded bitstream.
    Type: Grant
    Filed: October 19, 2022
    Date of Patent: January 14, 2025
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Bingyin Xia, Jiawei Li, Zhe Wang
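The power spectrum ratio defined in the abstract for patent 12198706 (the power at a frequency divided by the mean power of its frequency area) is straightforward to compute. The sketch below adds one assumption that is not in the abstract: a bin is flagged as tonal when its ratio exceeds a hypothetical `threshold`. The function names are also illustrative.

```python
def power_spectrum_ratios(power, start, stop):
    """Ratio of each bin's power to the mean power of the area [start, stop)."""
    area = power[start:stop]
    mean = sum(area) / len(area)
    if mean == 0:
        return [0.0 for _ in area]
    return [p / mean for p in area]


def tonal_component_info(power, start, stop, threshold=4.0):
    """Return assumed tonal-component location and quantity information."""
    ratios = power_spectrum_ratios(power, start, stop)
    positions = [start + i for i, r in enumerate(ratios) if r > threshold]
    return {"positions": positions, "count": len(positions)}
```

For example, a single strong bin in an otherwise flat area yields a ratio well above 1 at that bin and ratios below 1 elsewhere, so only that bin is reported.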
  • Patent number: 12183355
    Abstract: Many portable playback devices cannot decode and playback encoded audio content having wide bandwidth and wide dynamic range with consistent loudness and intelligibility unless the encoded audio content has been prepared specially for these devices. This problem can be overcome by including with the encoded content some metadata that specifies a suitable dynamic range compression profile by either absolute values or differential values relative to another known compression profile. A playback device may also adaptively apply gain and limiting to the playback audio. Implementations in encoders, in transcoders and in decoders are disclosed.
    Type: Grant
    Filed: March 28, 2024
    Date of Patent: December 31, 2024
    Assignees: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey Riedmiller, Harald Mundt, Michael Schug, Martin Wolters
  • Patent number: 12175991
    Abstract: An audio encoder for encoding audio input data has: a preprocessor for preprocessing the audio input data to obtain audio data to be coded; a coder processor for coding the audio data to be coded; and a controller for controlling the coder processor so that, depending on a first signal characteristic of a first frame of the audio data to be coded, a number of audio data items of the audio data to be coded by the coder processor for the first frame is reduced compared to a second signal characteristic of a second frame, and a first number of information units used for coding the reduced number of audio data items for the first frame is stronger enhanced compared to a second number of information units for the second frame.
    Type: Grant
    Filed: December 9, 2021
    Date of Patent: December 24, 2024
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Jan Büthe, Markus Schnell, Stefan Döhla, Bernhard Grill, Martin Dietz
  • Patent number: 12177037
    Abstract: A vehicle control system includes a transmitting electronic control unit for transmitting multiplexed protocol data units. The transmitting electronic control unit is enabled to transmit first and second protocol data units with corresponding first and second selector field values. A receiving electronic control unit is in electronic communication with the transmitting electronic control unit. The receiving electronic control unit is enabled to be programmed to operate in a first mode of operation or in a second mode of operation. The receiving electronic control unit is enabled to receive and process the first protocol data unit based on receipt of the first selector field value when operating in the first mode of operation and is enabled to receive and process the second protocol data unit based on receipt of the second selector field value when operating in the second mode of operation.
    Type: Grant
    Filed: February 8, 2023
    Date of Patent: December 24, 2024
    Assignee: GM GLOBAL TECHNOLOGY OPERATIONS LLC
    Inventors: Sudhakaran Maydiga, Steve DiBella, John Patrick Saavedra Gonzaga, Subhash Vijaya Doddachikkaveeranna
  • Patent number: 12175987
    Abstract: A time-domain stereo parameter encoding method and a related product are provided. The time-domain stereo parameter encoding method includes: determining a channel combination scheme for a current frame; determining a time-domain stereo parameter of the current frame based on the channel combination scheme for the current frame; and encoding the determined time-domain stereo parameter of the current frame, where the time-domain stereo parameter includes at least one of a channel combination ratio factor or an inter-channel time difference.
    Type: Grant
    Filed: June 21, 2023
    Date of Patent: December 24, 2024
    Assignee: Huawei Technologies Co., Ltd.
    Inventors: Haiting Li, Bin Wang, Lei Miao
  • Patent number: 12175992
    Abstract: On the basis of a bitstream (P), an n-channel audio signal (X) is reconstructed by deriving an m-channel core signal (Y) and multichannel coding parameters (α) from the bitstream, where 1 ≤ m < n. Also derived from the bitstream are pre-processing dynamic range control, DRC, parameters (DRC2) quantifying an encoder-side dynamic range limiting of the core signal. The n-channel audio signal is obtained by parametric synthesis in accordance with the multichannel coding parameters and while cancelling any encoder-side dynamic range limiting based on the pre-processing DRC parameters. In particular embodiments, the reconstruction further includes use of compensated post-processing DRC parameters quantifying a potential decoder-side dynamic range compression. Cancellation of an encoder-side range limitation and range compression are preferably performed by different decoder-side components. Cancellation and compression may be coordinated by a DRC pre-processor.
    Type: Grant
    Filed: July 19, 2023
    Date of Patent: December 24, 2024
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Jeffrey Riedmiller, Karl J. Roeden, Kristofer Kjoerling, Heiko Purnhagen, Vinay Melkote, Leif Sehlstrom
  • Patent number: 12170093
    Abstract: An audio decoder decodes a bit stream of encoded audio data, which bit stream represents a sequence of audio sample values and includes a plurality of frames, wherein each frame includes associated encoded audio sample values. The audio decoder includes a determiner configured to determine whether a frame of the encoded audio data is a special frame including encoded audio sample values associated with the special frame and additional information, wherein the additional information includes encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the special frame if the special frame is the first frame upon start-up of the decoder.
    Type: Grant
    Filed: April 26, 2023
    Date of Patent: December 17, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Daniel Fischer, Bernd Czelhan, Max Neuendorf, Nikolaus Rettelbach, Ingo Hofmann, Harald Fuchs, Stefan Doehla, Nikolaus Faerber
  • Patent number: 12165665
    Abstract: A method is described that processes an audio signal. A discontinuity between a filtered past frame and a filtered current frame of the audio signal is removed using linear predictive filtering.
    Type: Grant
    Filed: December 12, 2023
    Date of Patent: December 10, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Emmanuel Ravelli, Manuel Jander, Grzegorz Pietrzyk, Martin Dietz, Marc Gayer
  • Patent number: 12165664
    Abstract: An audio decoder decodes a bit stream of encoded audio data, which bit stream represents a sequence of audio sample values and includes a plurality of frames, wherein each frame includes associated encoded audio sample values. The audio decoder includes a determiner configured to determine whether a frame of the encoded audio data is a special frame including encoded audio sample values associated with the special frame and additional information, wherein the additional information includes encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the special frame if the special frame is the first frame upon start-up of the decoder.
    Type: Grant
    Filed: March 1, 2024
    Date of Patent: December 10, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Daniel Fischer, Bernd Czelhan, Max Neuendorf, Nikolaus Rettelbach, Ingo Hofmann, Harald Fuchs, Stefan Doehla, Nikolaus Faerber
  • Patent number: 12165661
    Abstract: There is inter alia disclosed an apparatus for spatial audio encoding which can receive or determine for one or more audio signals (102), spatial audio parameters (106) on a sub band basis for providing spatial audio reproduction, the spatial audio parameters can comprise a coherence value (112) for each sub band of a plurality of subbands (202) of a frame. The apparatus then determines a significance measure for the coherence values (401) of the plurality of sub bands of the frame and uses the significance measure to determine whether to encode (403) the coherence values of the plurality of sub bands of the frame.
    Type: Grant
    Filed: March 26, 2020
    Date of Patent: December 10, 2024
    Assignee: NOKIA TECHNOLOGIES OY
    Inventors: Mikko-Ville Laitinen, Adriana Vasilache
  • Patent number: 12165662
    Abstract: A downscaled version of an audio decoding procedure may more effectively and/or at improved compliance maintenance be achieved if the synthesis window used for downscaled audio decoding is a downsampled version of a reference synthesis window involved in the non-downscaled audio decoding procedure by downsampling by the downsampling factor by which the downsampled sampling rate and the original sampling rate deviate, and downsampled using a segmental interpolation in segments of ¼ of the frame length.
    Type: Grant
    Filed: May 9, 2023
    Date of Patent: December 10, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Markus Schnell, Manfred Lutzky, Eleni Fotopoulou, Konstantin Schmidt, Conrad Benndorf, Adrian Tomasek, Tobias Albert, Timon Seidl
  • Patent number: 12165037
    Abstract: An embodiment method comprises applying domain transformation processing to a time-series of signal samples, received from a sensor coupled to a dynamical system, to produce a dataset of transformed signal samples therefrom, buffering the transformed signal samples, obtaining a data buffer having transformed signal samples as entries, computing statistical parameters of the data buffer, producing a drift signal indicative of the evolution of the dynamical system as a function of the computed statistical parameters, selecting transformed signal samples buffered in the data buffer as a function of the drift signal, applying normalization processing to the buffered transformed signal samples, applying auto-encoder artificial neural network processing to a dataset of rescaled signal samples, and producing a dataset of reconstructed signal samples and calculating an error of reconstruction.
    Type: Grant
    Filed: August 2, 2021
    Date of Patent: December 10, 2024
    Assignee: STMicroelectronics S.R.L.
    Inventor: Angelo Bosco
  • Patent number: 12159641
    Abstract: An audio encoder for encoding audio input data has: a preprocessor for preprocessing the audio input data to obtain audio data to be coded; a coder processor for coding the audio data to be coded; and a controller for controlling the coder processor so that, depending on a first signal characteristic of a first frame of the audio data to be coded, a number of audio data items of the audio data to be coded by the coder processor for the first frame is reduced compared to a second signal characteristic of a second frame, and a first number of information units used for coding the reduced number of audio data items for the first frame is stronger enhanced compared to a second number of information units for the second frame.
    Type: Grant
    Filed: December 10, 2021
    Date of Patent: December 3, 2024
    Assignee: Fraunhofer-Gesellschaft zur Förderung der angewandten Forschung e.V.
    Inventors: Jan Büthe, Markus Schnell, Stefan Döhla, Bernhard Grill, Martin Dietz
  • Patent number: 12159638
    Abstract: A downscaled version of an audio decoding procedure may more effectively and/or at improved compliance maintenance be achieved if the synthesis window used for downscaled audio decoding is a downsampled version of a reference synthesis window involved in the non-downscaled audio decoding procedure by downsampling by the downsampling factor by which the downsampled sampling rate and the original sampling rate deviate, and downsampled using a segmental interpolation in segments of ¼ of the frame length.
    Type: Grant
    Filed: April 25, 2023
    Date of Patent: December 3, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Markus Schnell, Manfred Lutzky, Eleni Fotopoulou, Konstantin Schmidt, Conrad Benndorf, Adrian Tomasek, Tobias Albert, Timon Seidl
  • Patent number: 12154580
    Abstract: A downscaled version of an audio decoding procedure may more effectively and/or at improved compliance maintenance be achieved if the synthesis window used for downscaled audio decoding is a downsampled version of a reference synthesis window involved in the non-downscaled audio decoding procedure by downsampling by the downsampling factor by which the downsampled sampling rate and the original sampling rate deviate, and downsampled using a segmental interpolation in segments of ¼ of the frame length.
    Type: Grant
    Filed: May 9, 2023
    Date of Patent: November 26, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Markus Schnell, Manfred Lutzky, Eleni Fotopoulou, Konstantin Schmidt, Conrad Benndorf, Adrian Tomasek, Tobias Albert, Timon Seidl
  • Patent number: 12154583
    Abstract: An apparatus for processing an audio signal includes a configurable first audio signal processor for processing the audio signal in accordance with different configuration settings to obtain a processed audio signal, wherein the apparatus is adapted so that different configuration settings result in different sampling rates of the processed audio signal. The apparatus furthermore includes an analysis filter bank having a first number of analysis filter bank channels, a synthesis filter bank having a second number of synthesis filter bank channels, a second audio processor being adapted to receive and process an audio signal having a predetermined sampling rate, and a controller for controlling the first number of analysis filter bank channels or the second number of synthesis filter bank channels in accordance with a configuration setting.
    Type: Grant
    Filed: October 9, 2023
    Date of Patent: November 26, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Markus Lohwasser, Manuel Jander, Max Neuendorf, Ralf Geiger, Markus Schnell, Matthias Hildenbrand, Tobias Chalupka
  • Patent number: 12154104
    Abstract: Methods and apparatus provide digital media with spatial audio to a blockchain. The blockchain network executes a decentralized application (Dapp) with a user interface (UI) that enables a user to select audio for spatialization and uploading to the blockchain. The spatial audio transmits to the blockchain network to reduce processing and transmission of network data.
    Type: Grant
    Filed: June 4, 2021
    Date of Patent: November 26, 2024
    Inventor: Philip Scott Lyren
  • Patent number: 12154579
    Abstract: A downscaled version of an audio decoding procedure may more effectively and/or at improved compliance maintenance be achieved if the synthesis window used for downscaled audio decoding is a downsampled version of a reference synthesis window involved in the non-downscaled audio decoding procedure by downsampling by the downsampling factor by which the downsampled sampling rate and the original sampling rate deviate, and downsampled using a segmental interpolation in segments of ¼ of the frame length.
    Type: Grant
    Filed: May 9, 2023
    Date of Patent: November 26, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Markus Schnell, Manfred Lutzky, Eleni Fotopoulou, Konstantin Schmidt, Conrad Benndorf, Adrian Tomasek, Tobias Albert, Timon Seidl
  • Patent number: 12154582
    Abstract: A system and method code an object-based audio signal comprising audio objects in response to audio streams with associated metadata. In the system and method, a metadata processor codes the metadata and generates information about bit-budgets for the coding of the metadata of the audio objects. An encoder codes the audio streams while a bit-budget allocator is responsive to the information about the bit-budgets for the coding of the metadata of the audio objects from the metadata processor to allocate bitrates for the coding of the audio streams by the encoder.
    Type: Grant
    Filed: July 7, 2020
    Date of Patent: November 26, 2024
    Inventor: Vaclav Eksler
  • Patent number: 12154587
    Abstract: Some implementations involve receiving a content stream that includes audio data, receiving at least one type of level adjustment indication relating to playback of the audio data and controlling a level of the input audio data, based on the at least one type of level adjustment indication, to produce level-adjusted audio data. Some examples involve determining, based at least in part on the type(s) of level adjustment indication, a multiband limiter configuration, applying the multiband limiter to the level-adjusted audio data, to produce multiband limited audio data and providing the multiband limited audio data to one or more audio reproduction transducers of an audio environment.
    Type: Grant
    Filed: December 9, 2020
    Date of Patent: November 26, 2024
    Assignee: DOLBY LABORATORIES LICENSING CORPORATION
    Inventors: Timothy Alan Port, Benjamin Alexander Jancovich
  • Patent number: 12148437
    Abstract: A method of processing speech includes: providing a first set of audio data having audio features in a first bandwidth; down-sampling the first set of audio data to a second bandwidth lower than the first bandwidth; producing, by a high frequency reconstruction network (HFRN), an estimate of audio features in the first bandwidth for the first set of audio data, based on at least the down-sampled audio data; inputting, into the HFRN, a second set of audio data having audio features in the second bandwidth; producing, by the HFRN, based on a second set of audio data having audio features in the second bandwidth, an estimate of audio features in the first bandwidth for the second set of audio data; and training a speech processing system (SPS) using the estimates of audio features in the first bandwidth for the first and second sets of audio data.
    Type: Grant
    Filed: December 10, 2021
    Date of Patent: November 19, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventor: Dushyant Sharma
  • Patent number: 12136419
    Abstract: Systems and processes for operating an intelligent automated assistant are provided. An example process for determining user intent includes receiving a natural language input and detecting an event. The process further includes determining, at a first time, based on the natural language input, a first value for a first node of a parsing structure; and determining, at a second time, based on the detected data event, a second value for a second node of the parsing structure. The process further includes, in accordance with a determination that the first time and the second time are within a predetermined time: determining, using the parsing structure, the first value, and the second value, a user intent associated with the natural language input; initiating a task based on the determined intent; and providing an output indicative of the task.
    Type: Grant
    Filed: August 31, 2023
    Date of Patent: November 5, 2024
    Assignee: Apple Inc.
    Inventors: Pierre P. Greborio, Didier Rene Guzzoni, Philippe P. Piernot
  • Patent number: 12131744
    Abstract: A method for encoding an input audio stream including the steps of obtaining a first playback stream presentation of the input audio stream intended for reproduction on a first audio reproduction system, obtaining a second playback stream presentation of the input audio stream intended for reproduction on a second audio reproduction system, determining a set of transform parameters suitable for transforming an intermediate playback stream presentation to an approximation of the second playback stream presentation, wherein the transform parameters are determined by minimization of a measure of a difference between the approximation of the second playback stream presentation and the second playback stream presentation, and encoding the first playback stream presentation and the set of transform parameters for transmission to a decoder.
    Type: Grant
    Filed: October 16, 2023
    Date of Patent: October 29, 2024
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Dirk Jeroen Breebaart, David Matthew Cooper, Leif Jonas Samuelsson, Jeroen Koppens, Rhonda J. Wilson, Heiko Purnhagen, Alexander Stahlmann
  • Patent number: 12106763
    Abstract: An apparatus for encoding directional audio coding parameters comprising diffuseness parameters and direction parameters having a parameter calculator (100) for calculating the diffuseness parameters with a first time or frequency resolution and for calculating the direction parameters with a second time or frequency resolution; and a quantizer and encoder processor (200) for generating a quantized and encoded representation of the diffuseness parameters and the direction parameters.
    Type: Grant
    Filed: January 10, 2022
    Date of Patent: October 1, 2024
    Assignee: FRAUNHOFER-GESELLSCHAFT ZUR FÖRDERUNG DER ANGEWANDTEN FORSCHUNG E.V.
    Inventors: Guillaume Fuchs, Jürgen Herre, Fabian Küch, Stefan Döhla, Markus Multrus, Oliver Thiergart, Oliver Wübbolt, Florin Ghido, Stefan Bayer, Wolfgang Jaegers
  • Patent number: 12101618
    Abstract: A method for spatial audio signal encoding comprising: obtaining a plurality of audio direction parameters, wherein each parameter comprises an elevation value and an azimuth value and wherein each parameter has an ordered position; deriving for each of the plurality of audio direction parameters a corresponding derived audio direction parameter (SP) comprising an elevation and an azimuth value, corresponding derived audio direction parameters (SP) being arranged in a manner determined by a spatial utilization defined by the elevation values and the azimuth values of the plurality of audio direction parameters; rotating each derived audio direction parameter (SP) by the azimuth value (φ0) of an audio direction parameter in the first position of the plurality of audio direction parameters and quantizing the rotation to determine for each a corresponding quantized rotated derived audio direction parameter; changing the ordered position of an audio direction parameter to a further position coinciding with a posi
    Type: Grant
    Filed: July 27, 2020
    Date of Patent: September 24, 2024
    Assignee: NOKIA TECHNOLOGIES OY
    Inventors: Adriana Vasilache, Mikko-Ville Laitinen
  • Patent number: 12100418
    Abstract: Disclosed is a dialogue emotion correction method based on a graph neural network, including: extracting acoustic features, text features, and image features from a video file to fuse them into multi-modal features; obtaining an emotion prediction result of each sentence of a dialogue in the video file by using the multi-modal features; fusing the emotion prediction result of each sentence with interaction information between talkers in the video file to obtain interaction information fused emotion features; combining, on the basis of the interaction information fused emotion features, with the context-dependence relationship in the dialogue to obtain time-series information fused emotion features; correcting, by using the time-series information fused emotion features, the emotion prediction result of each sentence that was obtained previously, so as to obtain a more accurate emotion recognition result.
    Type: Grant
    Filed: September 10, 2021
    Date of Patent: September 24, 2024
    Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
    Inventors: Jianhua Tao, Zheng Lian, Bin Liu, Xuefei Liu
  • Patent number: 12094478
    Abstract: An audio decoder decodes a bit stream of encoded audio data, which bit stream represents a sequence of audio sample values and includes a plurality of frames, wherein each frame includes associated encoded audio sample values. The audio decoder includes a determiner configured to determine whether a frame of the encoded audio data is a special frame including encoded audio sample values associated with the special frame and additional information, wherein the additional information includes encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the special frame if the special frame is the first frame upon start-up of the decoder.
    Type: Grant
    Filed: March 1, 2024
    Date of Patent: September 17, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Daniel Fischer, Bernd Czelhan, Max Neuendorf, Nikolaus Rettelbach, Ingo Hofmann, Harald Fuchs, Stefan Doehla, Nikolaus Faerber
  • Patent number: 12094479
    Abstract: An audio decoder decodes a bit stream of encoded audio data, which bit stream represents a sequence of audio sample values and includes a plurality of frames, wherein each frame includes associated encoded audio sample values. The audio decoder includes a determiner configured to determine whether a frame of the encoded audio data is a special frame including encoded audio sample values associated with the special frame and additional information, wherein the additional information includes encoded audio sample values of a number of frames preceding the special frame, wherein the encoded audio sample values of the preceding frames are encoded using the same codec configuration as the special frame, wherein the number of preceding frames is sufficient to initialize the decoder to be in a position to decode the audio sample values associated with the special frame if the special frame is the first frame upon start-up of the decoder.
    Type: Grant
    Filed: March 1, 2024
    Date of Patent: September 17, 2024
    Assignee: Fraunhofer-Gesellschaft zur Foerderung der angewandten Forschung e.V.
    Inventors: Daniel Fischer, Bernd Czelhan, Max Neuendorf, Nikolaus Rettelbach, Ingo Hofmann, Harald Fuchs, Stefan Doehla, Nikolaus Faerber
  • Patent number: 12094476
    Abstract: Embodiments are disclosed for channel-based audio (CBA) (e.g., 22.2-ch audio) to object-based audio (OBA) conversion. The conversion includes converting CBA metadata to object audio metadata (OAMD) and reordering the CBA channels based on channel shuffle information derived in accordance with channel ordering constraints of the OAMD. The OBA with reordered channels is rendered in a playback device using the OAMD or in a source device, such as a set-top box or audio/video recorder. In an embodiment, the CBA metadata includes signaling that indicates a specific OAMD representation to be used in the conversion of the metadata. In an embodiment, pre-computed OAMD is transmitted in a native audio bitstream (e.g., AAC) for transmission (e.g., over HDMI) or for rendering in a source device. In an embodiment, pre-computed OAMD is transmitted in a transport layer bitstream (e.g., ISO BMFF, MPEG4 audio bitstream) to a playback device or source device.
    Type: Grant
    Filed: December 2, 2020
    Date of Patent: September 17, 2024
    Assignees: DOLBY LABORATORIES LICENSING CORPORATION, DOLBY INTERNATIONAL AB
    Inventors: Michael C. Ward, Freddie Sanchez, Christof Joseph Fersch
  • Patent number: 12087312
    Abstract: In a stereo encoding method, a channel combination encoding solution of a current frame is first obtained, and then a quantized channel combination ratio factor of the current frame and an encoding index of the quantized channel combination ratio factor are obtained based on the obtained channel combination encoding solution, so that an obtained primary channel signal and secondary channel signal of the current frame meet a characteristic of the current frame.
    Type: Grant
    Filed: September 6, 2023
    Date of Patent: September 10, 2024
    Assignee: HUAWEI TECHNOLOGIES CO., LTD.
    Inventors: Bin Wang, Haiting Li, Lei Miao
  • Patent number: 12087314
    Abstract: An encoder for encoding a parametric spectral representation (f) of auto-regressive coefficients that partially represent an audio signal. The encoder includes a low-frequency encoder configured to quantize elements of a part of the parametric spectral representation that correspond to a low-frequency part of the audio signal. It also includes a high-frequency encoder configured to encode a high-frequency part (fH) of the parametric spectral representation (f) by weighted averaging based on the quantized elements (f̂L) flipped around a quantized mirroring frequency (f̂m), which separates the low-frequency part from the high-frequency part, and a frequency grid determined from a frequency grid codebook in a closed-loop search procedure. Described are also a corresponding decoder, corresponding encoding/decoding methods and UEs including such an encoder/decoder.
    Type: Grant
    Filed: January 31, 2023
    Date of Patent: September 10, 2024
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Sigurdur Sverrisson