Patents by Inventor Erlendur Karlsson
Erlendur Karlsson has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 11968520
Abstract: In one aspect, there is a method for rendering a spatially-heterogeneous audio element. In some embodiments, the method includes obtaining two or more audio signals representing the spatially-heterogeneous audio element, wherein a combination of the audio signals provides a spatial image of the spatially-heterogeneous audio element. The method also includes obtaining metadata associated with the spatially-heterogeneous audio element, the metadata comprising spatial extent information indicating a spatial extent of the audio element. The method further includes rendering the audio element using: i) the spatial extent information and ii) location information indicating a position (e.g. virtual position) and/or an orientation of the user relative to the audio element.
Type: Grant
Filed: December 20, 2019
Date of Patent: April 23, 2024
Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
Inventors: Tommy Falk, Werner De Bruijn, Erlendur Karlsson, Tomas Jansson Toftgård, Mengqiu Zhang
-
Patent number: 11930351
Abstract: A method of audio rendering. The method includes receiving an audio element, wherein the audio element comprises: i) an interior representation that is valid within a spatial region, the interior representation of the audio element being in a listener-centric format and ii) information indicating the spatial region. The method further includes determining that a listener is outside the spatial region. The method further includes deriving an exterior representation of the audio element and rendering the audio element using the exterior representation of the audio element. In another aspect, a method of providing a spatially-bounded audio element is provided. The method includes providing, to a rendering node, an audio element. The audio element includes: (i) an interior representation that is valid within a spatial region, the interior representation being in a listener-centric format; and (ii) information indicating the spatial region.
Type: Grant
Filed: December 20, 2019
Date of Patent: March 12, 2024
Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
Inventors: Tommy Falk, Werner De Bruijn, Erlendur Karlsson, Tomas Jansson Toftgård, Mengqiu Zhang
-
Publication number: 20230336936
Abstract: A method (1900) for audio signal filtering. The method includes generating (s1902) a pair of filters for a certain location specified by an elevation angle θ and an azimuth angle φ, the pair of filters consisting of a right filter (ĥ_r(θ, φ)) and a left filter (ĥ_l(θ, φ)); filtering (s1904) an audio signal using the right filter; and filtering (s1906) the audio signal using the left filter.
Type: Application
Filed: October 15, 2020
Publication date: October 19, 2023
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Mengqiu ZHANG, Erlendur KARLSSON
-
Patent number: 11749286
Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
Type: Grant
Filed: December 6, 2021
Date of Patent: September 5, 2023
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson, Maxim Teslenko, Konstantinos Vandikas, Aneta Vulgarakis Feljan
-
Patent number: 11610590
Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
Type: Grant
Filed: March 30, 2021
Date of Patent: March 21, 2023
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson, Maxim Teslenko, Konstantinos Vandikas, Aneta Vulgarakis Feljan
-
Publication number: 20220093107
Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
Type: Application
Filed: December 6, 2021
Publication date: March 24, 2022
Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON, Maxim TESLENKO, Konstantinos VANDIKAS, Aneta VULGARAKIS FELJAN
-
Publication number: 20220070606
Abstract: A method of audio rendering. The method includes receiving an audio element, wherein the audio element comprises: i) an interior representation that is valid within a spatial region, the interior representation of the audio element being in a listener-centric format and ii) information indicating the spatial region. The method further includes determining that a listener is outside the spatial region. The method further includes deriving an exterior representation of the audio element and rendering the audio element using the exterior representation of the audio element. In another aspect, a method of providing a spatially-bounded audio element is provided. The method includes providing, to a rendering node, an audio element. The audio element includes: (i) an interior representation that is valid within a spatial region, the interior representation being in a listener-centric format; and (ii) information indicating the spatial region.
Type: Application
Filed: December 20, 2019
Publication date: March 3, 2022
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Tommy FALK, Werner DE BRUIJN, Erlendur KARLSSON, Tomas JANSSON TOFTGÅRD, Mengqiu ZHANG
-
Publication number: 20220030375
Abstract: In one aspect, there is a method for rendering a spatially-heterogeneous audio element. In some embodiments, the method includes obtaining two or more audio signals representing the spatially-heterogeneous audio element, wherein a combination of the audio signals provides a spatial image of the spatially-heterogeneous audio element. The method also includes obtaining metadata associated with the spatially-heterogeneous audio element, the metadata comprising spatial extent information indicating a spatial extent of the audio element. The method further includes rendering the audio element using: i) the spatial extent information and ii) location information indicating a position (e.g. virtual position) and/or an orientation of the user relative to the audio element.
Type: Application
Filed: December 20, 2019
Publication date: January 27, 2022
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Tommy FALK, Werner DE BRUIJN, Erlendur KARLSSON, Tomas JANSSON TOFTGÅRD, Mengqiu ZHANG
-
Publication number: 20210358507
Abstract: A method for audio signal filtering. The method includes generating a pair of filters for a certain location specified by an elevation angle θ and an azimuth angle φ, the pair of filters consisting of a right filter (ĥ_r(θ, φ)) and a left filter (ĥ_l(θ, φ)); filtering an audio signal using the right filter; and filtering the audio signal using the left filter. Generating the pair of filters comprises: i) obtaining at least a first set of elevation basis function values at the elevation angle; ii) obtaining at least a first set of azimuth basis function values at the azimuth angle; iii) generating the right filter using: a) at least the first set of elevation basis function values, b) at least the first set of azimuth basis function values, and c) right filter model parameters; and iv) generating the left filter using: a) at least the first set of elevation basis function values, b) at least the first set of azimuth basis function values, and c) left filter model parameters.
Type: Application
Filed: July 29, 2021
Publication date: November 18, 2021
Applicant: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Mengqiu ZHANG, Erlendur KARLSSON
-
Publication number: 20210217424
Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
Type: Application
Filed: March 30, 2021
Publication date: July 15, 2021
Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON, Maxim TESLENKO, Konstantinos VANDIKAS, Aneta VULGARAKIS FELJAN
-
Patent number: 10984801
Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
Type: Grant
Filed: May 8, 2017
Date of Patent: April 20, 2021
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson, Maxim Teslenko, Konstantinos Vandikas, Aneta Vulgarakis Feljan
-
Patent number: 10915569
Abstract: There are provided mechanisms for associating metadata with a multimedia file. The method is performed by a mood detector. A method includes detecting presence of a mood indicator in the multimedia file by a mood detection module analysing the multimedia file. The method includes determining a mood descriptive value by a mood classification module analysing a segment of the multimedia file, wherein the segment is defined by the mood indicator. The method further includes associating the mood descriptive value with the multimedia file as metadata. Detecting presence of the mood indicator by the mood detection module acts as a trigger to determine the mood descriptive value by the mood classification module.
Type: Grant
Filed: March 15, 2016
Date of Patent: February 9, 2021
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Harald Pobloth, Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson
-
Patent number: 10825472
Abstract: Detecting voiced speech in an audio signal. A method comprises calculating an autocorrelation function (ACF) of a portion of an input audio signal and detecting a highest peak of said autocorrelation function within a determined range. A peak width and a peak height of said detected highest peak are determined and based on the peak width and the peak height it is decided whether a segment of an input audio signal comprises voiced speech.
Type: Grant
Filed: May 10, 2018
Date of Patent: November 3, 2020
Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
Inventors: Tommy Falk, Harald Pobloth, Erlendur Karlsson
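The decision described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the normalized autocorrelation, the 60 to 400 Hz pitch search range, the 0.5 peak-height threshold, the 3 ms width limit measured at half the peak height, and all function names are assumptions made for illustration. The rule "a tall and narrow peak indicates voiced speech" is likewise an assumed reading of how peak width and peak height are combined.

```python
import math


def autocorrelation(frame, max_lag):
    """Normalized autocorrelation of `frame` for lags 0..max_lag."""
    n = len(frame)
    energy = sum(x * x for x in frame)
    if energy == 0.0:
        return [0.0] * (max_lag + 1)
    return [
        sum(frame[i] * frame[i + lag] for i in range(n - lag)) / energy
        for lag in range(max_lag + 1)
    ]


def highest_peak(acf, min_lag, max_lag):
    """Return (lag, height, width) of the highest local maximum in the range.

    Width is the number of contiguous lags around the peak whose ACF value
    stays above half the peak height (an assumed width definition).
    """
    best = None
    for lag in range(max(min_lag, 1), min(max_lag, len(acf) - 2) + 1):
        if acf[lag] >= acf[lag - 1] and acf[lag] >= acf[lag + 1]:
            if best is None or acf[lag] > acf[best]:
                best = lag
    if best is None:
        return None
    height = acf[best]
    level = 0.5 * height
    left = best
    while left > 0 and acf[left - 1] > level:
        left -= 1
    right = best
    while right < len(acf) - 1 and acf[right + 1] > level:
        right += 1
    return best, height, right - left + 1


def is_voiced(frame, sample_rate, min_f0=60.0, max_f0=400.0,
              min_height=0.5, max_width_ms=3.0):
    """Decide voiced/unvoiced from the height and width of the ACF peak."""
    min_lag = int(sample_rate / max_f0)
    max_lag = int(sample_rate / min_f0)
    # One extra lag so the last candidate has a right-hand neighbour.
    acf = autocorrelation(frame, max_lag + 1)
    peak = highest_peak(acf, min_lag, max_lag)
    if peak is None:
        return False
    _, height, width = peak
    width_ms = 1000.0 * width / sample_rate
    return height > min_height and width_ms < max_width_ms
```

A frame would be passed as a list of samples, for example `is_voiced(frame, 8000)` for a 30 ms frame at 8 kHz. The thresholds would need tuning on real speech.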
-
Publication number: 20200066281
Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
Type: Application
Filed: May 8, 2017
Publication date: February 27, 2020
Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON, Maxim TESLENKO, Konstantinos VANDIKAS, Aneta VULGARAKIS FELJAN
-
Publication number: 20200036462
Abstract: A method in a device for predicting the number of views of broadcast media on a broadcast channel at a future instance in time n+k. The method comprises receiving past values representing the actual number of views of the broadcast media at particular instances in time, and defining a time window previous to the current time n. The past values received during the time window are analysed to determine if the time window is corrupt. The time window is utilised to predict the number of views of the broadcast channel at the future instance in time, n+k, depending on whether the time window is corrupt.
Type: Application
Filed: September 9, 2016
Publication date: January 30, 2020
Inventors: Volodya Grancharov, Erlendur Karlsson, Valentin Kulyk
-
Patent number: 10418037
Abstract: A sampled speech data sequence contains words spoken by a speaker. A sequence of feature vectors is generated characterizing spectral distribution of sampled speech data. A textual transcript of the words spoken by the speaker is obtained. Data structures of a universal background model of a Gaussian mixture model (UBM-GMM) and of an Enrolled speaker Gaussian mixture model (ENR-GMM) are adapted responsive to the textual transcript, to generate an adapted UBM-GMM and an adapted ENR-GMM, respectively. An enrolled speaker probability is generated based on the sequence of feature vectors and the adapted ENR-GMM, and a universal speaker probability is generated based on the sequence of feature vectors and the adapted UBM-GMM. A speaker verification indication of whether the speaker is an enrolled speaker is generated by comparing the enrolled speaker probability to the universal speaker probability.
Type: Grant
Filed: March 23, 2016
Date of Patent: September 17, 2019
Assignee: Telefonaktiebolaget LM Ericsson (publ)
Inventors: Volodya Grancharov, Erlendur Karlsson, Harald Pobloth, Sigurdur Sverrisson
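The final comparison step in this abstract, scoring the feature vectors against an enrolled-speaker GMM and a universal background GMM, can be sketched in Python as a log-likelihood ratio test. This is only an illustration under stated assumptions: diagonal covariances, an average per-frame log-likelihood, a decision threshold of 0.0, and the function names are all hypothetical. The transcript-driven adaptation of the two models described in the abstract is not shown; the sketch assumes the adapted models are already available.

```python
import math


def gmm_log_likelihood(features, weights, means, variances):
    """Average per-frame log-likelihood under a diagonal-covariance GMM."""
    total = 0.0
    for x in features:
        comp_logs = []
        for w, mu, var in zip(weights, means, variances):
            log_p = math.log(w)
            for xi, mi, vi in zip(x, mu, var):
                log_p += -0.5 * (math.log(2.0 * math.pi * vi)
                                 + (xi - mi) ** 2 / vi)
            comp_logs.append(log_p)
        # Log-sum-exp over mixture components for numerical stability.
        m = max(comp_logs)
        total += m + math.log(sum(math.exp(c - m) for c in comp_logs))
    return total / len(features)


def verify_speaker(features, enr_gmm, ubm_gmm, threshold=0.0):
    """Compare enrolled-speaker and universal scores.

    Each GMM is a (weights, means, variances) tuple. Returns the decision
    and the log-likelihood ratio it was based on.
    """
    llr = (gmm_log_likelihood(features, *enr_gmm)
           - gmm_log_likelihood(features, *ubm_gmm))
    return llr > threshold, llr
```

Working in the log domain turns the comparison of the two probabilities into a difference, which avoids underflow for long feature sequences.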
-
Patent number: 10313276
Abstract: It is presented a method for managing a jitter buffer depth for receiving real-time communication. The method is performed in a receiver and comprises the steps of: determining an adaptive bitrate state of the receiver when a current capacity of a communication channel for receiving the real-time communication is below a maximum bitrate for receiving the real-time communication; and increasing a depth of a jitter buffer for receiving the real-time communication when the adaptive bitrate state is determined.
Type: Grant
Filed: November 4, 2014
Date of Patent: June 4, 2019
Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
Inventors: Fredrik Jansson, Erlendur Karlsson, Jonas Lundberg, Yang Zuo
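The two steps named in this abstract can be sketched in a few lines of Python. The class, the function names, the 20 ms step, and the depth limits are hypothetical values chosen for illustration; the abstract does not say how much the depth is increased or whether it is capped.

```python
from dataclasses import dataclass


@dataclass
class JitterBuffer:
    depth_ms: int = 40       # current buffer depth (assumed starting value)
    max_depth_ms: int = 200  # assumed upper bound on the depth


def in_adaptive_bitrate_state(capacity_kbps, max_bitrate_kbps):
    """The receiver is in the adaptive bitrate state when the channel
    capacity is below the maximum bitrate of the real-time stream."""
    return capacity_kbps < max_bitrate_kbps


def update_depth(buf, capacity_kbps, max_bitrate_kbps, step_ms=20):
    """Increase the jitter buffer depth while in the adaptive bitrate state."""
    if in_adaptive_bitrate_state(capacity_kbps, max_bitrate_kbps):
        buf.depth_ms = min(buf.depth_ms + step_ms, buf.max_depth_ms)
    return buf.depth_ms
```

Calling `update_depth` once per capacity estimate would grow the buffer step by step while the channel stays constrained and leave it unchanged otherwise.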
-
Publication number: 20190080697
Abstract: A sampled speech data sequence contains words spoken by a speaker. A sequence of feature vectors is generated characterizing spectral distribution of sampled speech data. A textual transcript of the words spoken by the speaker is obtained. Data structures of a universal background model of a Gaussian mixture model (UBM-GMM) and of an Enrolled speaker Gaussian mixture model (ENR-GMM) are adapted responsive to the textual transcript, to generate an adapted UBM-GMM and an adapted ENR-GMM, respectively. An enrolled speaker probability is generated based on the sequence of feature vectors and the adapted ENR-GMM, and a universal speaker probability is generated based on the sequence of feature vectors and the adapted UBM-GMM. A speaker verification indication of whether the speaker is an enrolled speaker is generated by comparing the enrolled speaker probability to the universal speaker probability.
Type: Application
Filed: March 23, 2016
Publication date: March 14, 2019
Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Harald POBLOTH, Sigurdur SVERRISSON
-
Publication number: 20190034428
Abstract: There are provided mechanisms for associating metadata with a multimedia file. The method is performed by a mood detector. A method includes detecting presence of a mood indicator in the multimedia file by a mood detection module analysing the multimedia file. The method includes determining a mood descriptive value by a mood classification module analysing a segment of the multimedia file, wherein the segment is defined by the mood indicator. The method further includes associating the mood descriptive value with the multimedia file as metadata. Detecting presence of the mood indicator by the mood detection module acts as a trigger to determine the mood descriptive value by the mood classification module.
Type: Application
Filed: March 15, 2016
Publication date: January 31, 2019
Inventors: Harald POBLOTH, Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON
-
Publication number: 20180261239
Abstract: Detecting voiced speech in an audio signal. A method comprises calculating an autocorrelation function (ACF) of a portion of an input audio signal and detecting a highest peak of said autocorrelation function within a determined range. A peak width and a peak height of said detected highest peak are determined and based on the peak width and the peak height it is decided whether a segment of an input audio signal comprises voiced speech.
Type: Application
Filed: May 10, 2018
Publication date: September 13, 2018
Applicant: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
Inventors: Tommy FALK, Harald POBLOTH, Erlendur KARLSSON