Patents by Inventor Toni Hirvonen
Toni Hirvonen has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 9666198
Abstract: Audio objects are associated with positional metadata. A received downmix signal comprises downmix channels that are linear combinations of one or more audio objects and are associated with respective positional locators. In a first aspect, the downmix signal, the positional metadata and frequency-dependent object gains are received. An audio object is reconstructed by applying the object gain to an upmix of the downmix signal in accordance with coefficients based on the positional metadata and the positional locators. In a second aspect, audio objects have been encoded together with at least one bed channel positioned at a positional locator of a corresponding downmix channel. The decoding system receives the downmix signal and the positional metadata of the audio objects. A bed channel is reconstructed by suppressing the content representing audio objects from the corresponding downmix channel on the basis of the positional locator of the corresponding downmix channel.
Type: Grant
Filed: May 23, 2014
Date of Patent: May 30, 2017
Assignee: Dolby International AB
Inventors: Toni Hirvonen, Heiko Purnhagen, Leif Jonas Samuelsson, Lars Villemoes
-
Patent number: 9502046
Abstract: A method for encoding sound field signals includes allocating coding rate by application of a uniform criterion to all subbands of all signals in a joint process. An allocation criterion may be based on a comparison, in a given subband, between a spectral envelope of the signals to be encoded and a coding noise profile, wherein the noise profile may be a sum of a noise shape and a noise offset, which noise offset is computed on the basis of the coding bit budget. The rate allocation process may be combined with an energy-compacting orthogonal transform, for which there is proposed a parameterization susceptible of efficient coding and having adjustable directivity. In a further aspect, the invention provides a corresponding decoding method.
Type: Grant
Filed: September 20, 2013
Date of Patent: November 22, 2016
Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Heiko Purnhagen, Leif Jonas Samuelsson, Janusz Klejsa, Lars Villemoes, Toni Hirvonen, Glenn N. Dickins, Richard James Cartwright
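Illustrative sketch (not taken from the patent): the joint rate allocation in the abstract above can be pictured as raising or lowering a single noise offset until the bits spent across all subbands of all signals fit the budget. The function names, the use of dB-domain envelopes, the bisection search, and the 6.02 dB-per-bit conversion are all assumptions made for this example; the patent's actual criterion may differ.

```python
DB_PER_BIT = 6.02  # assumption: roughly 6 dB of SNR per coded bit

def bits_per_band(envelope_db, noise_shape_db, offset_db):
    """Bits for each (signal, subband) entry: the part of the envelope that
    rises above the noise profile (shape + offset), converted from dB to bits."""
    return [max(0.0, (env - (shape + offset_db)) / DB_PER_BIT)
            for env, shape in zip(envelope_db, noise_shape_db)]

def allocate(envelope_db, noise_shape_db, bit_budget, iterations=60):
    """Bisect on the noise offset until the total bit count meets the budget."""
    margins = [env - shape for env, shape in zip(envelope_db, noise_shape_db)]
    lo = min(margins) - bit_budget * DB_PER_BIT  # total >= budget at this offset
    hi = max(margins)                            # zero bits at this offset
    for _ in range(iterations):
        mid = (lo + hi) / 2.0
        if sum(bits_per_band(envelope_db, noise_shape_db, mid)) > bit_budget:
            lo = mid   # too many bits: raise the noise floor
        else:
            hi = mid   # within budget: try a lower noise floor
    return hi, bits_per_band(envelope_db, noise_shape_db, hi)

# Entries from all signals are pooled into one flat list (the "joint" part).
offset, bits = allocate([60.0, 48.0, 10.0], [0.0, 0.0, 0.0], bit_budget=10)
```

In this toy run the weakest entry falls below the noise profile and receives no bits, while the same offset applies uniformly to the other two.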
-
Publication number: 20160261967
Abstract: An encoding system encodes multiple audio signals (X) as a downmix signal (Y) together with wet and dry upmix coefficients (P, C). In a decoding system, a pre-multiplier (101) computes an intermediate signal (W) by mapping the downmix signal linearly in accordance with a first set of coefficients (Q); a decorrelating section (102) outputs a decorrelated signal (Z) based on the intermediate signal; a wet upmix section (103) computes a wet upmix signal by mapping the decorrelated signal linearly in accordance with the wet upmix coefficients; a dry upmix section (104) computes a dry upmix signal by mapping the downmix signal linearly in accordance with the dry upmix coefficients; a combining section (105) provides a multidimensional reconstructed signal (X) by combining the wet and dry upmix signals; and a converter (106) computes the first set of coefficients based on the wet and dry upmix coefficients and supplies this to the pre-multiplier.
Type: Application
Filed: October 21, 2014
Publication date: September 8, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Lars VILLEMOES, Toni HIRVONEN, Heiko PURNHAGEN
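Illustrative sketch (not taken from the application): the decoder signal flow in the abstract above can be traced with plain matrix products. The symbols Y, C, P, Q, W, Z and the section numbers come from the abstract; everything else is hypothetical. In particular, the abstract does not state how the converter derives Q, so `assumed_converter` uses one dimensionally consistent choice purely as a placeholder, and the one-sample delay is a stand-in for a real decorrelator.

```python
def matmul(a, b):
    """Multiply matrix a (m x k) by matrix b (k x n); matrices are lists of rows."""
    return [[sum(a[i][t] * b[t][j] for t in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

def transpose(a):
    return [list(row) for row in zip(*a)]

def assumed_converter(c, p):
    """Placeholder for converter (106): Q = |P|^T C. This rule is an assumption."""
    abs_p_t = transpose([[abs(v) for v in row] for row in p])
    return matmul(abs_p_t, c)

def delay_decorrelator(w, delay=1):
    """Stand-in for section (102): delays each channel by `delay` samples."""
    return [[0.0] * delay + ch[:len(ch) - delay] for ch in w]

def decode(y, c, p, converter=assumed_converter, decorrelate=delay_decorrelator):
    q = converter(c, p)        # converter (106) supplies Q
    w = matmul(q, y)           # pre-multiplier (101): W = Q Y
    z = decorrelate(w)         # decorrelating section (102): Z from W
    wet = matmul(p, z)         # wet upmix section (103): P Z
    dry = matmul(c, y)         # dry upmix section (104): C Y
    # combining section (105): reconstructed signal = dry + wet
    return [[d + v for d, v in zip(dr, wr)] for dr, wr in zip(dry, wet)]

y = [[1.0, 2.0, 3.0]]          # one downmix channel, three samples
c = [[1.0], [0.5]]             # dry coefficients for two output channels
p = [[0.5], [-0.5]]            # wet coefficients for two output channels
x_hat = decode(y, c, p)
```

Both the converter and the decorrelator are passed in as arguments so that either placeholder can be swapped out without touching the signal flow.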
-
Publication number: 20160247514
Abstract: An encoding system (400) encodes an N-channel audio signal (X), wherein N≥3, as a single-channel downmix signal (Y) together with dry and wet upmix parameters (C, P). In a decoding system (200), a decorrelating section (101) outputs, based on the downmix signal, an (N-1)-channel decorrelated signal (Z); a dry upmix section (102) maps the downmix signal linearly in accordance with dry upmix coefficients (C) determined based on the dry upmix parameters; a wet upmix section (103) populates an intermediate matrix based on the wet upmix parameters and knowing that the intermediate matrix belongs to a predefined matrix class, obtains wet upmix coefficients (P) by multiplying the intermediate matrix by a predefined matrix, and maps the decorrelated signal linearly in accordance with the wet upmix coefficients; and a combining section (104) combines outputs from the upmix sections to obtain a reconstructed signal (X) corresponding to the signal to be reconstructed.
Type: Application
Filed: October 21, 2014
Publication date: August 25, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Lars VILLEMOES, Heidi-Maria LEHTONEN, Heiko PURNHAGEN, Toni HIRVONEN
-
Publication number: 20160240206
Abstract: This disclosure falls into the field of audio coding, in particular it is related to the field of spatial audio coding, where the audio information is represented by multiple signals, where the signals may comprise audio channels or/and audio objects. In particular the disclosure provides a method and apparatus for reconstructing audio objects in an audio decoding system. Furthermore, this disclosure provides a method and apparatus for encoding such audio objects.
Type: Application
Filed: October 21, 2014
Publication date: August 18, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Heiko Purnhagen, Janusz Klejsa, Lars Villemoes, Toni Hirvonen
-
Publication number: 20160155448
Abstract: The present document relates to multichannel audio coding and more precisely to techniques for discrete multichannel audio encoding and decoding. In particular, the present document relates to systems and method for coding soundfields. An audio encoder (200) configured to encode a frame of a soundfield signal (110) comprising a plurality of audio signals is described. The audio encoder (200) comprises a transform determination unit (203, 204) configured to determine an energy-compacting orthogonal transform (V) based on the frame of the soundfield signal (110). Furthermore, the encoder (200) comprises a transform unit (202) configured to apply the energy-compacting orthogonal transform (V) to the frame of the soundfield signal (110), and configured to provide a frame of a rotated soundfield signal (112) comprising a plurality of rotated audio signals (E1, E2, E3).
Type: Application
Filed: June 27, 2014
Publication date: June 2, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Heiko PURNHAGEN, Toni HIRVONEN, Leif Jonas SAMUELSSON, Lars VILLEMOES, Janusz KLEJSA, Harald MUNDT
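Illustrative sketch (not taken from the application): an energy-compacting orthogonal transform can be shown in its simplest form as a plane rotation that decorrelates two channels of one frame, which is the two-channel Karhunen-Loeve transform. The abstract describes three rotated signals (E1, E2, E3) and does not specify how V is determined; the two-channel reduction, the closed-form angle, and all function names here are assumptions for illustration.

```python
import math

def energy(x):
    return sum(v * v for v in x)

def compacting_rotation(x1, x2):
    """Angle of the rotation that removes the correlation between x1 and x2
    for this frame, putting the larger share of energy in the first output."""
    a, d = energy(x1), energy(x2)
    b = sum(u * v for u, v in zip(x1, x2))
    return 0.5 * math.atan2(2.0 * b, a - d)

def rotate(x1, x2, theta):
    """Apply the orthogonal transform V = [[cos, sin], [-sin, cos]]."""
    c, s = math.cos(theta), math.sin(theta)
    e1 = [c * u + s * v for u, v in zip(x1, x2)]
    e2 = [-s * u + c * v for u, v in zip(x1, x2)]
    return e1, e2

# Two identical channels: all energy should end up in the first rotated signal.
x1 = [1.0, 2.0, 3.0, 4.0]
x2 = [1.0, 2.0, 3.0, 4.0]
theta = compacting_rotation(x1, x2)
e1, e2 = rotate(x1, x2, theta)
```

Because the rotation is orthogonal, the total energy of the frame is preserved; only its distribution across the output signals changes.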
-
Publication number: 20160125887
Abstract: There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
Type: Application
Filed: May 23, 2014
Publication date: May 5, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Heiko PURNHAGEN, Kristofer KJOERLING, Toni HIRVONEN, Lars VILLEMOES, Dirk Jeroen BREEBAART, Leif Jonas SAMUELSSON
-
Publication number: 20160125888
Abstract: Exemplary embodiments provide encoding and decoding methods, and associated encoders and decoders, for encoding and decoding of an audio scene which at least comprises one or more audio objects (106a). The encoder (108, 110) generates a bit stream (116) which comprises downmix signals (112) and side information which includes individual matrix elements (114) of a reconstruction matrix which enables reconstruction of the one or more audio objects (106a) in the decoder (120).
Type: Application
Filed: May 23, 2014
Publication date: May 5, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Heiko PURNHAGEN, Lars VILLEMOES, Leif Jonas SAMUELSSON, Toni HIRVONEN
-
Publication number: 20160111097
Abstract: The present disclosure provides methods, devices and computer program products which provide less complex and more flexible control of the introduced decorrelation in an audio coding system. According to the disclosure, this is achieved by calculating and using two weighting factors, one for an approximated audio object and one for a decorrelated audio object, for introduction of decorrelation of audio objects in the audio coding system.
Type: Application
Filed: May 23, 2014
Publication date: April 21, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Heiko PURNHAGEN, Lars VILLEMOES, Leif Jonas SAMUELSSON, Toni HIRVONEN
-
Publication number: 20160111099
Abstract: Audio objects are associated with positional metadata. A received downmix signal comprises downmix channels that are linear combinations of one or more audio objects and are associated with respective positional locators. In a first aspect, the downmix signal, the positional metadata and frequency-dependent object gains are received. An audio object is reconstructed by applying the object gain to an upmix of the downmix signal in accordance with coefficients based on the positional metadata and the positional locators. In a second aspect, audio objects have been encoded together with at least one bed channel positioned at a positional locator of a corresponding downmix channel. The decoding system receives the downmix signal and the positional metadata of the audio objects. A bed channel is reconstructed by suppressing the content representing audio objects from the corresponding downmix channel on the basis of the positional locator of the corresponding downmix channel.
Type: Application
Filed: May 23, 2014
Publication date: April 21, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Toni HIRVONEN, Heiko PURNHAGEN, Leif Jonas SAMUELSSON, Lars VILLEMOES
-
Publication number: 20160104496
Abstract: There is provided encoding and decoding methods for encoding and decoding of object based audio. An exemplary encoding method includes inter alia calculating M downmix signals by forming combinations of N audio objects, wherein M≤N, and calculating parameters which allow reconstruction of a set of audio objects formed on basis of the N audio objects from the M downmix signals. The calculation of the M downmix signals is made according to a criterion which is independent of any loudspeaker configuration.
Type: Application
Filed: May 23, 2014
Publication date: April 14, 2016
Applicant: DOLBY INTERNATIONAL AB
Inventors: Heiko PURNHAGEN, Kristofer KJOERLING, Toni HIRVONEN, Lars VILLEMOES, Dirk Jeroen BREEBAART
-
Publication number: 20150221313
Abstract: A method for encoding sound field signals includes allocating coding rate by application of a uniform criterion to all subbands of all signals in a joint process. An allocation criterion may be based on a comparison, in a given subband, between a spectral envelope of the signals to be encoded and a coding noise profile, wherein the noise profile may be a sum of a noise shape and a noise offset, which noise offset is computed on the basis of the coding bit budget. The rate allocation process may be combined with an energy-compacting orthogonal transform, for which there is proposed a parameterization susceptible of efficient coding and having adjustable directivity. In a further aspect, the invention provides a corresponding decoding method.
Type: Application
Filed: September 20, 2013
Publication date: August 6, 2015
Applicants: DOLBY INTERNATIONAL AB, DOLBY LABORATORIES LICENSING CORPORATION
Inventors: Heiko Purnhagen, Leif Jonas Samuelsson, Janusz Klejsa, Lars Villemoes, Toni Hirvonen, Glenn N. Dickins, Richard James Cartwright
-
Patent number: 9094771
Abstract: In some embodiments, a method for upmixing input audio comprising N full range channels to generate 3D output audio comprising N+M full range channels, where the N+M full range channels are intended to be rendered by speakers including at least two speakers at different distances from the listener. The N channel input audio is a 2D audio program whose N full range channels are intended for rendering by N speakers nominally equidistant from the listener. The upmixing of the input audio to generate the 3D output audio is typically performed in an automated manner, in response to cues determined in automated fashion from stereoscopic 3D video corresponding to the input audio, or in response to cues determined in automated fashion from the input audio. Other aspects include a system configured to perform, and a computer readable medium which stores code for implementing any embodiment of the inventive method.
Type: Grant
Filed: April 5, 2012
Date of Patent: July 28, 2015
Assignees: Dolby Laboratories Licensing Corporation, Dolby International AB
Inventors: Nicolas R. Tsingos, Charles Q. Robinson, Christophe Chabanne, Toni Hirvonen, Patrick Griffis
-
Publication number: 20140037117
Abstract: In some embodiments, a method for upmixing input audio comprising N full range channels to generate 3D output audio comprising N+M full range channels, where the N+M full range channels are intended to be rendered by speakers including at least two speakers at different distances from the listener. The N channel input audio is a 2D audio program whose N full range channels are intended for rendering by N speakers nominally equidistant from the listener. The upmixing of the input audio to generate the 3D output audio is typically performed in an automated manner, in response to cues determined in automated fashion from stereoscopic 3D video corresponding to the input audio, or in response to cues determined in automated fashion from the input audio. Other aspects include a system configured to perform, and a computer readable medium which stores code for implementing any embodiment of the inventive method.
Type: Application
Filed: April 5, 2012
Publication date: February 6, 2014
Applicant: DOLBY INTERNATIONAL AB
Inventors: Nicolas R. Tsingos, Charles Q. Robinson, Christophe Chabanne, Toni Hirvonen, Patrick Griffis