Patents by Inventor Erlendur Karlsson

Erlendur Karlsson has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11968520
    Abstract: In one aspect, there is a method for rendering a spatially-heterogeneous audio element. In some embodiments, the method includes obtaining two or more audio signals representing the spatially-heterogeneous audio element, wherein a combination of the audio signals provides a spatial image of the spatially-heterogeneous audio element. The method also includes obtaining metadata associated with the spatially-heterogeneous audio element, the metadata comprising spatial extent information indicating a spatial extent of the audio element. The method further includes rendering the audio element using: i) the spatial extent information and ii) location information indicating a position (e.g. virtual position) and/or an orientation of the user relative to the audio element.
    Type: Grant
    Filed: December 20, 2019
    Date of Patent: April 23, 2024
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Tommy Falk, Werner De Bruijn, Erlendur Karlsson, Tomas Jansson Toftgård, Mengqiu Zhang
  • Patent number: 11930351
    Abstract: A method of audio rendering. The method includes receiving an audio element, wherein the audio element comprises: i) an interior representation that is valid within a spatial region, the interior representation of the audio element being in a listener-centric format and ii) information indicating the spatial region. The method further includes determining that a listener is outside the spatial region. The method further includes deriving an exterior representation of the audio element and rendering the audio element using the exterior representation of the audio element. In another aspect, a method of providing a spatially-bounded audio element is provided. The method includes providing, to a rendering node, an audio element. The audio element includes: (i) an interior representation that is valid within a spatial region, the interior representation being in a listener-centric format; and (ii) information indicating the spatial region.
    Type: Grant
    Filed: December 20, 2019
    Date of Patent: March 12, 2024
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Tommy Falk, Werner De Bruijn, Erlendur Karlsson, Tomas Jansson Toftgård, Mengqiu Zhang
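The abstract above turns on one decision: whether the listener is inside or outside the spatial region in which the interior representation is valid. A minimal sketch of that decision follows, assuming a spherical region. The function names, the spherical shape, and the string labels are illustrative assumptions and do not come from the patent. Deriving the exterior representation itself is not shown.

```python
import math

def listener_is_outside(listener_pos, region_centre, region_radius):
    """Return True when the listener lies outside a spherical region (assumed shape)."""
    return math.dist(listener_pos, region_centre) > region_radius

def choose_representation(listener_pos, region_centre, region_radius):
    """Pick which representation a renderer would use for this listener position."""
    if listener_is_outside(listener_pos, region_centre, region_radius):
        return "exterior"  # would be derived from the interior representation
    return "interior"      # listener-centric format, valid inside the region
```

A real renderer would carry whatever region description the audio element's metadata provides; the sphere here only keeps the example short.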
  • Publication number: 20230336936
    Abstract: A method (1900) for audio signal filtering. The method includes generating (s1902) a pair of filters for a certain location specified by an elevation angle θ and an azimuth angle φ, the pair of filters consisting of a right filter (ĥ_r(θ, φ)) and a left filter (ĥ_l(θ, φ)); filtering (s1904) an audio signal using the right filter; and filtering (s1906) the audio signal using the left filter.
    Type: Application
    Filed: October 15, 2020
    Publication date: October 19, 2023
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Mengqiu ZHANG, Erlendur KARLSSON
  • Patent number: 11749286
    Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
    Type: Grant
    Filed: December 6, 2021
    Date of Patent: September 5, 2023
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson, Maxim Teslenko, Konstantinos Vandikas, Aneta Vulgarakis Feljan
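The selection step described in the abstract above, where a set of identifiers picks the most suitable AM and LM parameters for each audio segment, can be sketched as a lookup with fallback. This is only an illustration: the store layout, the key names, and the most-specific-first fallback order are assumptions, not details taken from the patent.

```python
def select_parameters(store, speaker_id, speaker_domain_id, program_domain_id):
    """Return the (AM, LM) parameter entry for one audio segment.

    `store` maps (kind, identifier) tuples to parameter sets. The lookup tries
    the most specific identifier first (an assumed ordering) and falls back to
    a default entry when none of the identifiers is known.
    """
    candidates = (
        ("speaker", speaker_id),
        ("speaker_domain", speaker_domain_id),
        ("program_domain", program_domain_id),
    )
    for key in candidates:
        if key in store:
            return store[key]
    return store[("default", None)]
```

For a broadcast stream this function would be called once per segment, so the ASR model can switch parameters whenever the speaker or programme changes.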
  • Patent number: 11610590
    Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
    Type: Grant
    Filed: March 30, 2021
    Date of Patent: March 21, 2023
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson, Maxim Teslenko, Konstantinos Vandikas, Aneta Vulgarakis Feljan
  • Publication number: 20220093107
    Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
    Type: Application
    Filed: December 6, 2021
    Publication date: March 24, 2022
    Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON, Maxim TESLENKO, Konstantinos VANDIKAS, Aneta VULGARAKIS FELJAN
  • Publication number: 20220070606
    Abstract: A method of audio rendering. The method includes receiving an audio element, wherein the audio element comprises: i) an interior representation that is valid within a spatial region, the interior representation of the audio element being in a listener-centric format and ii) information indicating the spatial region. The method further includes determining that a listener is outside the spatial region. The method further includes deriving an exterior representation of the audio element and rendering the audio element using the exterior representation of the audio element. In another aspect, a method of providing a spatially-bounded audio element is provided. The method includes providing, to a rendering node, an audio element. The audio element includes: (i) an interior representation that is valid within a spatial region, the interior representation being in a listener-centric format; and (ii) information indicating the spatial region.
    Type: Application
    Filed: December 20, 2019
    Publication date: March 3, 2022
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Tommy FALK, Werner DE BRUIJN, Erlendur KARLSSON, Tomas JANSSON TOFTGÅRD, Mengqiu ZHANG
  • Publication number: 20220030375
    Abstract: In one aspect, there is a method for rendering a spatially-heterogeneous audio element. In some embodiments, the method includes obtaining two or more audio signals representing the spatially-heterogeneous audio element, wherein a combination of the audio signals provides a spatial image of the spatially-heterogeneous audio element. The method also includes obtaining metadata associated with the spatially-heterogeneous audio element, the metadata comprising spatial extent information indicating a spatial extent of the audio element. The method further includes rendering the audio element using: i) the spatial extent information and ii) location information indicating a position (e.g. virtual position) and/or an orientation of the user relative to the audio element.
    Type: Application
    Filed: December 20, 2019
    Publication date: January 27, 2022
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Tommy FALK, Werner DE BRUIJN, Erlendur KARLSSON, Tomas JANSSON TOFTGÅRD, Mengqiu ZHANG
  • Publication number: 20210358507
    Abstract: A method for audio signal filtering. The method includes generating a pair of filters for a certain location specified by an elevation angle θ and an azimuth angle φ, the pair of filters consisting of a right filter (ĥ_r(θ, φ)) and a left filter (ĥ_l(θ, φ)); filtering an audio signal using the right filter; and filtering the audio signal using the left filter. Generating the pair of filters comprises: i) obtaining at least a first set of elevation basis function values at the elevation angle; ii) obtaining at least a first set of azimuth basis function values at the azimuth angle; iii) generating the right filter using: a) at least the first set of elevation basis function values, b) at least the first set of azimuth basis function values, and c) right filter model parameters; and iv) generating the left filter using: a) at least the first set of elevation basis function values, b) at least the first set of azimuth basis function values, and c) left filter model parameters.
    Type: Application
    Filed: July 29, 2021
    Publication date: November 18, 2021
    Applicant: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Mengqiu ZHANG, Erlendur KARLSSON
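The filter-generation steps listed in the abstract above can be sketched as a weighted sum of model parameters, with each weight being the product of one elevation basis value and one azimuth basis value. The abstract does not say which basis functions are used or how the parameters are laid out, so the cosine basis, the `params[p][q][n]` layout, and all function names below are assumptions made for illustration.

```python
import math

def elevation_basis(theta, count):
    """Hypothetical elevation basis: cos(p * theta) for p = 0..count-1."""
    return [math.cos(p * theta) for p in range(count)]

def azimuth_basis(phi, count):
    """Hypothetical azimuth basis: cos(q * phi) for q = 0..count-1."""
    return [math.cos(q * phi) for q in range(count)]

def generate_filter(elev_values, azim_values, params):
    """Combine basis values with model parameters into filter taps.

    params[p][q] is the list of taps associated with elevation basis p and
    azimuth basis q (an assumed layout).
    """
    taps = len(params[0][0])
    h = [0.0] * taps
    for p, e in enumerate(elev_values):
        for q, a in enumerate(azim_values):
            for n in range(taps):
                h[n] += params[p][q][n] * e * a
    return h

def fir_filter(signal, h):
    """Filter a signal with taps h by direct convolution."""
    out = [0.0] * (len(signal) + len(h) - 1)
    for i, s in enumerate(signal):
        for j, c in enumerate(h):
            out[i + j] += s * c
    return out
```

The right and left filters would come from two calls to `generate_filter` that share the same basis values but use separate right and left parameter sets, after which the same audio signal is passed through `fir_filter` once per ear.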
  • Publication number: 20210217424
    Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
    Type: Application
    Filed: March 30, 2021
    Publication date: July 15, 2021
    Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON, Maxim TESLENKO, Konstantinos VANDIKAS, Aneta VULGARAKIS FELJAN
  • Patent number: 10984801
    Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
    Type: Grant
    Filed: May 8, 2017
    Date of Patent: April 20, 2021
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson, Maxim Teslenko, Konstantinos Vandikas, Aneta Vulgarakis Feljan
  • Patent number: 10915569
    Abstract: There are provided mechanisms for associating metadata with a multimedia file. The method is performed by a mood detector. A method includes detecting presence of a mood indicator in the multimedia file by a mood detection module analysing the multimedia file. The method includes determining a mood descriptive value by a mood classification module analysing a segment of the multimedia file, wherein the segment is defined by the mood indicator. The method further includes associating the mood descriptive value with the multimedia file as metadata. Detecting presence of the mood indicator by the mood detection module acts as a trigger to determine the mood descriptive value by the mood classification module.
    Type: Grant
    Filed: March 15, 2016
    Date of Patent: February 9, 2021
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Harald Pobloth, Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson
  • Patent number: 10825472
    Abstract: Detecting voiced speech in an audio signal. A method comprises calculating an autocorrelation function (ACF) of a portion of an input audio signal and detecting a highest peak of said autocorrelation function within a determined range. A peak width and a peak height of said detected highest peak are determined and based on the peak width and the peak height it is decided whether a segment of an input audio signal comprises voiced speech.
    Type: Grant
    Filed: May 10, 2018
    Date of Patent: November 3, 2020
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Tommy Falk, Harald Pobloth, Erlendur Karlsson
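The detection idea in the abstract above (find the highest autocorrelation peak in a lag range, then decide from its height and width) can be sketched in a few lines. The abstract does not define how peak width is measured or which thresholds apply, so the half-height width measure, the default thresholds, and the function names are assumptions.

```python
import math

def autocorrelation(x, max_lag):
    """Normalised autocorrelation r[k] for k = 0..max_lag, with r[0] == 1."""
    n = len(x)
    mean = sum(x) / n
    centred = [v - mean for v in x]
    energy = sum(v * v for v in centred)
    if energy == 0.0:
        return [0.0] * (max_lag + 1)
    return [
        sum(centred[i] * centred[i + k] for i in range(n - k)) / energy
        for k in range(max_lag + 1)
    ]

def is_voiced(x, min_lag, max_lag, min_height=0.5, max_width=20):
    """Decide voicing from the height and width of the highest ACF peak."""
    acf = autocorrelation(x, max_lag)
    # Highest peak within the search range [min_lag, max_lag].
    peak_lag = max(range(min_lag, max_lag + 1), key=lambda k: acf[k])
    height = acf[peak_lag]
    # Assumed width measure: contiguous lags around the peak above half its height.
    half = height / 2
    left = peak_lag
    while left > 0 and acf[left - 1] > half:
        left -= 1
    right = peak_lag
    while right < max_lag and acf[right + 1] > half:
        right += 1
    width = right - left + 1
    return height >= min_height and width <= max_width
```

A periodic input such as `[math.sin(2 * math.pi * i / 50) for i in range(400)]` gives a tall, narrow peak near lag 50 and is classified as voiced, while a single impulse or silence gives no such peak.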
  • Publication number: 20200066281
    Abstract: AM and LM parameters to be used for adapting an ASR model are derived for each audio segment of an audio stream comprising multiple audio programs. A set of identifiers, including a speaker identifier, a speaker domain identifier and a program domain identifier, is obtained for each audio segment. The set of identifiers are used to select most suitable AM and LM parameters for the particular audio segment. The embodiments enable provision of maximum constraints on the AMs and LMs and enable adaptation of the ASR model on the fly for audio streams of multiple audio programs, such as broadcast audio. This means that the embodiments enable selecting AM and LM parameters that are most suitable in terms of ASR performance for each audio segment.
    Type: Application
    Filed: May 8, 2017
    Publication date: February 27, 2020
    Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON, Maxim TESLENKO, Konstantinos VANDIKAS, Aneta VULGARAKIS FELJAN
  • Publication number: 20200036462
    Abstract: A method in a device for predicting the number of views of broadcast media on a broadcast channel at a future instance in time n+k. The method comprises receiving past values representing the actual number of views of the broadcast media at particular instances in time, and defining a time window previous to the current time n. The past values received during the time window are analysed to determine if the time window is corrupt. The time window is utilised to predict the number of views of the broadcast channel at the future instance in time, n+k, depending on whether the time window is corrupt.
    Type: Application
    Filed: September 9, 2016
    Publication date: January 30, 2020
    Inventors: Volodya Grancharov, Erlendur Karlsson, Valentin Kulyk
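The abstract above describes checking a window of past view counts for corruption before using it to predict the value at time n+k. A minimal sketch follows. The corruption test (missing or negative samples), the linear extrapolation, and the fallback to the last valid sample are all assumptions chosen for illustration; the abstract does not specify any of them.

```python
def window_is_corrupt(window):
    """Assumed corruption test: any missing (None) or negative sample."""
    return any(v is None or v < 0 for v in window)

def predict_views(past_values, window_size, k):
    """Predict the number of views k steps ahead of the latest sample.

    Uses linear extrapolation over the window when it is clean. When the
    window is corrupt, falls back to the most recent valid sample.
    """
    window = past_values[-window_size:]
    if window_is_corrupt(window):
        valid = [v for v in past_values if v is not None and v >= 0]
        return valid[-1] if valid else None
    if len(window) < 2:
        return float(window[-1])
    slope = (window[-1] - window[0]) / (len(window) - 1)
    return window[-1] + slope * k
```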
  • Patent number: 10418037
    Abstract: A sampled speech data sequence contains words spoken by a speaker. A sequence of feature vectors is generated characterizing spectral distribution of sampled speech data. A textual transcript of the words spoken by the speaker is obtained. Data structures of a universal background model of a Gaussian mixture model (UBM-GMM) and of an Enrolled speaker Gaussian mixture model (ENR-GMM) are adapted responsive to the textual transcript, to generate an adapted UBM-GMM and an adapted ENR-GMM, respectively. An enrolled speaker probability is generated based on the sequence of feature vectors and the adapted ENR-GMM, and a universal speaker probability is generated based on the sequence of feature vectors and the adapted UBM-GMM. A speaker verification indication of whether the speaker is an enrolled speaker is generated by comparing the enrolled speaker probability to the universal speaker probability.
    Type: Grant
    Filed: March 23, 2016
    Date of Patent: September 17, 2019
    Assignee: Telefonaktiebolaget LM Ericsson (publ)
    Inventors: Volodya Grancharov, Erlendur Karlsson, Harald Pobloth, Sigurdur Sverrisson
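The final comparison in the abstract above, between an enrolled speaker probability and a universal speaker probability, can be sketched as a log-likelihood ratio between two Gaussian mixture models. The sketch assumes diagonal covariances and a zero threshold, and it leaves out the transcript-driven adaptation of the two models. The data layout and function names are illustrative, not taken from the patent.

```python
import math

def gmm_log_likelihood(features, gmm):
    """Average per-frame log-likelihood under a diagonal-covariance GMM.

    `gmm` is a list of (weight, means, variances) tuples (assumed layout);
    `features` is a list of feature vectors.
    """
    total = 0.0
    for x in features:
        comp_logs = []
        for weight, means, variances in gmm:
            log_p = math.log(weight)
            for xi, m, v in zip(x, means, variances):
                log_p += -0.5 * (math.log(2 * math.pi * v) + (xi - m) ** 2 / v)
            comp_logs.append(log_p)
        # Log-sum-exp over components for numerical stability.
        peak = max(comp_logs)
        total += peak + math.log(sum(math.exp(c - peak) for c in comp_logs))
    return total / len(features)

def verify_speaker(features, enr_gmm, ubm_gmm, threshold=0.0):
    """Accept when the enrolled model explains the features better than the UBM."""
    score = gmm_log_likelihood(features, enr_gmm) - gmm_log_likelihood(features, ubm_gmm)
    return score > threshold
```

In the method described, both models would first be adapted using the textual transcript; here they are simply passed in as already adapted.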
  • Patent number: 10313276
    Abstract: It is presented a method for managing a jitter buffer depth for receiving real-time communication. The method is performed in a receiver and comprises the steps of: determining an adaptive bitrate state of the receiver when a current capacity of a communication channel for receiving the real-time communication is below a maximum bitrate for receiving the real-time communication; and increasing a depth of a jitter buffer for receiving the real-time communication when the adaptive bitrate state is determined.
    Type: Grant
    Filed: November 4, 2014
    Date of Patent: June 4, 2019
    Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Fredrik Jansson, Erlendur Karlsson, Jonas Lundberg, Yang Zuo
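The two steps in the abstract above, detecting the adaptive bitrate state and then increasing the jitter buffer depth, can be sketched as follows. The step size, the upper bound, the millisecond units, and the function names are assumptions for illustration; the abstract only states that the depth is increased when the state is detected.

```python
def in_adaptive_bitrate_state(channel_capacity_bps, max_bitrate_bps):
    """The receiver is in the adaptive bitrate state when the channel's current
    capacity is below the maximum bitrate of the real-time stream."""
    return channel_capacity_bps < max_bitrate_bps

def next_jitter_buffer_depth(depth_ms, channel_capacity_bps, max_bitrate_bps,
                             step_ms=20, max_depth_ms=200):
    """Increase the buffer depth by an assumed fixed step, up to an assumed cap,
    while the adaptive bitrate state holds. Otherwise leave the depth unchanged."""
    if in_adaptive_bitrate_state(channel_capacity_bps, max_bitrate_bps):
        return min(depth_ms + step_ms, max_depth_ms)
    return depth_ms
```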
  • Publication number: 20190080697
    Abstract: A sampled speech data sequence contains words spoken by a speaker. A sequence of feature vectors is generated characterizing spectral distribution of sampled speech data. A textual transcript of the words spoken by the speaker is obtained. Data structures of a universal background model of a Gaussian mixture model (UBM-GMM) and of an Enrolled speaker Gaussian mixture model (ENR-GMM) are adapted responsive to the textual transcript, to generate an adapted UBM-GMM and an adapted ENR-GMM, respectively. An enrolled speaker probability is generated based on the sequence of feature vectors and the adapted ENR-GMM, and a universal speaker probability is generated based on the sequence of feature vectors and the adapted UBM-GMM. A speaker verification indication of whether the speaker is an enrolled speaker is generated by comparing the enrolled speaker probability to the universal speaker probability.
    Type: Application
    Filed: March 23, 2016
    Publication date: March 14, 2019
    Inventors: Volodya GRANCHAROV, Erlendur KARLSSON, Harald POBLOTH, Sigurdur SVERRISSON
  • Publication number: 20190034428
    Abstract: There are provided mechanisms for associating metadata with a multimedia file. The method is performed by a mood detector. A method includes detecting presence of a mood indicator in the multimedia file by a mood detection module analysing the multimedia file. The method includes determining a mood descriptive value by a mood classification module analysing a segment of the multimedia file, wherein the segment is defined by the mood indicator. The method further includes associating the mood descriptive value with the multimedia file as metadata. Detecting presence of the mood indicator by the mood detection module acts as a trigger to determine the mood descriptive value by the mood classification module.
    Type: Application
    Filed: March 15, 2016
    Publication date: January 31, 2019
    Inventors: Harald POBLOTH, Volodya GRANCHAROV, Erlendur KARLSSON, Sigurdur SVERRISSON
  • Publication number: 20180261239
    Abstract: Detecting voiced speech in an audio signal. A method comprises calculating an autocorrelation function (ACF) of a portion of an input audio signal and detecting a highest peak of said autocorrelation function within a determined range. A peak width and a peak height of said detected highest peak are determined and based on the peak width and the peak height it is decided whether a segment of an input audio signal comprises voiced speech.
    Type: Application
    Filed: May 10, 2018
    Publication date: September 13, 2018
    Applicant: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)
    Inventors: Tommy FALK, Harald POBLOTH, Erlendur KARLSSON