Patents by Inventor Harald Pobloth

Harald Pobloth has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Coding of Multi-Channel Audio Signals

Publication number: 20230274748

Abstract: A method for assisting a selection of an encoding mode for a multi-channel audio signal encoding where different encoding modes may be chosen for the different channels. The method is performed in an audio encoder and comprises obtaining a plurality of audio signal channels and coordinating or synchronizing the selection of an encoding mode for a plurality of the obtained channels, wherein the coordination is based on an encoding mode selected for one of the obtained channels or for a group of the obtained channels.

Type: Application

Filed: February 16, 2023

Publication date: August 31, 2023

Applicant: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Harald Pobloth, Stefan Bruhn

Voice-controlled management of user profiles

Patent number: 11727939

Abstract: A network node in a communication network receives, from a user equipment, a cluster of audio segments. The network node calculates a first confidence measure representing a first probability that a first speaker model represents a speaker of the cluster of audio segments. The network node also calculates a second confidence measure representing a second probability that a second speaker model represents the speaker of the cluster of audio segments. In response to the first confidence measure and the second confidence measure both representing probabilities that are higher than a target probability, the network node updates a first user profile associated with the first speaker model and a second user profile associated with the second speaker model based on a user preference assigned to the cluster of audio segments.

Type: Grant

Filed: January 5, 2022

Date of Patent: August 15, 2023

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Volodya Grancharov, Tomer Amiaz, Hadar Gecht, Harald Pobloth

Object tracking in real-time applications

Patent number: 11694346

Abstract: Object tracking, in particular adapted for real-time augmented reality applications, involves determining a location of an object (20) in a current frame (10) of a video stream (15), at a point in time following output of a preceding frame (11) of the video stream (15) but preceding output of the current frame (10), by starting from a location of the object (20) determined by an object-detection server (5) for a previous frame (12) of the video stream (15), recursively tracking the location of the object (20) in frames (11) of the video stream (15) following the previous frame (12) up to the current frame (10), and recursively updating a model of the object (20). Accurate object detection from an object-detection server (5) can thereby be used even if the object was detected in a past frame (12) of the video stream (15) that has already been visualized.

Type: Grant

Filed: June 27, 2018

Date of Patent: July 4, 2023

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Volodya Grancharov, Harald Pobloth, Sigurdur Sverrisson

Voice-controlled management of user profiles

Patent number: 11430449

Abstract: Management of user profiles comprises calculating, for an audio segment, a user confidence measure representing a probability that the audio segment comprises speech of a user and a group confidence measure representing a probability that the audio segment comprises speech of a group of users. A user profile is then managed based on a comparison between the user confidence measure and a user confidence threshold and between the group confidence measure and a group confidence threshold. The embodiments thereby achieve an efficient voice-controlled user profile management by utilizing a layered approach that provides user profiles for groups of users as fallback when the identity of the speaking user cannot accurately be recognized.
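
The layered fallback described in this abstract could be sketched roughly as follows. This is only an illustration, not the patented method: the function name `select_profile`, the threshold values, and the rule of checking the individual user before the group are all assumptions, since the abstract states only that both confidence measures are compared against their thresholds.

```python
from typing import Optional


def select_profile(user_conf: float, group_conf: float,
                   user_threshold: float = 0.8,
                   group_threshold: float = 0.6) -> Optional[str]:
    """Pick which profile to manage for one audio segment.

    Returns "user", "group", or None. Threshold defaults are
    hypothetical values chosen for illustration only.
    """
    # First layer: the individual speaker is recognized confidently enough.
    if user_conf > user_threshold:
        return "user"
    # Second layer (fallback): only the group of users is recognized.
    if group_conf > group_threshold:
        return "group"
    # Neither measure clears its threshold: leave all profiles untouched.
    return None
```

For example, `select_profile(0.5, 0.7)` falls back to the group profile because only the group confidence clears its threshold.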

Type: Grant

Filed: September 11, 2017

Date of Patent: August 30, 2022

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Volodya Grancharov, Tomer Amiaz, Hadar Gecht, Harald Pobloth

Transform Encoding/Decoding of Harmonic Audio Signals

Publication number: 20220139408

Abstract: An encoder for encoding frequency transform coefficients of a harmonic audio signal includes the following elements: A peak locator configured to locate spectral peaks having magnitudes exceeding a predetermined frequency-dependent threshold. A peak region encoder configured to encode peak regions including and surrounding the located peaks. A low-frequency set encoder configured to encode at least one low-frequency set of coefficients outside the peak regions and below a crossover frequency that depends on the number of bits used to encode the peak regions. A noise-floor gain encoder configured to encode a noise-floor gain of at least one high-frequency set of not yet encoded coefficients outside the peak regions.
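
The peak locator and the peak regions named in this abstract could look roughly like the sketch below. It is an assumption-laden illustration: the names `locate_peaks` and `peak_regions`, the local-maximum test, and the fixed `half_width` of a region are not taken from the abstract, which specifies only a frequency-dependent threshold and regions "including and surrounding" each peak. The encoding steps themselves are not shown.

```python
from typing import Callable, List, Tuple


def locate_peaks(magnitudes: List[float],
                 threshold: Callable[[int], float]) -> List[int]:
    """Return bin indices that are local maxima above threshold(bin)."""
    peaks = []
    for k in range(1, len(magnitudes) - 1):
        m = magnitudes[k]
        is_local_max = m > magnitudes[k - 1] and m >= magnitudes[k + 1]
        # The threshold is a function of the bin index, i.e. of frequency.
        if is_local_max and m > threshold(k):
            peaks.append(k)
    return peaks


def peak_regions(peaks: List[int], half_width: int,
                 n_bins: int) -> List[Tuple[int, int]]:
    """Inclusive (start, end) bin ranges surrounding each located peak."""
    return [(max(0, p - half_width), min(n_bins - 1, p + half_width))
            for p in peaks]
```

With a threshold that rises with the bin index, a strong low-frequency peak is kept while a weaker peak higher up can be rejected.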

Type: Application

Filed: January 20, 2022

Publication date: May 5, 2022

Inventors: Volodya Grancharov, Tomas Jansson Toftgård, Sebastian Näslund, Harald Pobloth

Voice-Controlled Management of User Profiles

Publication number: 20220130395

Abstract: A network node in a communication network receives, from a user equipment, a cluster of audio segments. The network node calculates a first confidence measure representing a first probability that a first speaker model represents a speaker of the cluster of audio segments. The network node also calculates a second confidence measure representing a second probability that a second speaker model represents the speaker of the cluster of audio segments. In response to the first confidence measure and the second confidence measure both representing probabilities that are higher than a target probability, the network node updates a first user profile associated with the first speaker model and a second user profile associated with the second speaker model based on a user preference assigned to the cluster of audio segments.

Type: Application

Filed: January 5, 2022

Publication date: April 28, 2022

Inventors: Volodya Grancharov, Tomer Amiaz, Hadar Gecht, Harald Pobloth

Transform encoding/decoding of harmonic audio signals

Patent number: 11264041

Abstract: An encoder for encoding frequency transform coefficients of a harmonic audio signal includes the following elements: A peak locator configured to locate spectral peaks having magnitudes exceeding a predetermined frequency-dependent threshold. A peak region encoder configured to encode peak regions including and surrounding the located peaks. A low-frequency set encoder configured to encode at least one low-frequency set of coefficients outside the peak regions and below a crossover frequency that depends on the number of bits used to encode the peak regions. A noise-floor gain encoder configured to encode a noise-floor gain of at least one high-frequency set of not yet encoded coefficients outside the peak regions.

Type: Grant

Filed: January 8, 2020

Date of Patent: March 1, 2022

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Volodya Grancharov, Tomas Jansson Toftgård, Sebastian Näslund, Harald Pobloth

Object Location Determination in Frames of a Video Stream

Publication number: 20220027623

Abstract: An object locating method comprises deciding, for at least one frame of a video stream and based on at least one parameter representative of a change between a scene represented by the at least one frame and a scene represented by a reference frame of the video stream, whether determination of a location of at least one object (11, 13, 15, 17, 19) in the at least one frame is based on object detection applied to the at least one frame, or is based on a transformation of a location of the at least one object (11, 13, 15, 17, 19) detected in the reference frame.

Type: Application

Filed: December 5, 2018

Publication date: January 27, 2022

Inventors: Alfredo Fanghella, Volodya Grancharov, Harald Pobloth

Voice-controlled management of user profiles

Patent number: 11227605

Abstract: Management of user profiles comprises calculating, for each speaker model of at least one speaker model, a confidence measure representing a probability that the speaker model represents a speaker of a cluster of audio segments. A user profile associated with the speaker model is updated based on a user preference assigned to the cluster of audio segments if the confidence measure calculated for the speaker model represents a probability that is higher than a target probability. The embodiments achieve an efficient user profile management in a voice-controlled context but without the need for any dedicated enrollment sessions to train speaker models.
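
The per-model update rule in this abstract can be illustrated with a minimal sketch. The names and the data layout are hypothetical: a user profile is modeled here as a simple count of preferences per speaker model, and the default `target_probability` is an arbitrary illustrative value. How the confidence measures are computed is outside the scope of the sketch.

```python
from typing import Dict


def update_profiles(confidences: Dict[str, float],
                    profiles: Dict[str, Dict[str, int]],
                    preference: str,
                    target_probability: float = 0.9) -> None:
    """Record `preference` in every profile whose speaker model is likely
    enough to represent the speaker of the cluster of audio segments.

    `confidences` maps a speaker-model id to its confidence measure;
    `profiles` maps the same ids to preference counts (assumed layout).
    """
    for speaker_model, confidence in confidences.items():
        if confidence > target_probability:
            counts = profiles.setdefault(speaker_model, {})
            counts[preference] = counts.get(preference, 0) + 1
```

Because every model is tested independently, more than one profile may be updated from the same cluster when several confidence measures exceed the target.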

Type: Grant

Filed: September 11, 2017

Date of Patent: January 18, 2022

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Volodya Grancharov, Tomer Amiaz, Hadar Gecht, Harald Pobloth

Object Tracking in Real-Time Applications

Publication number: 20210264619

Abstract: Object tracking, in particular adapted for real-time augmented reality applications, involves determining a location of an object (20) in a current frame (10) of a video stream (15), at a point in time following output of a preceding frame (11) of the video stream (15) but preceding output of the current frame (10), by starting from a location of the object (20) determined by an object-detection server (5) for a previous frame (12) of the video stream (15), recursively tracking the location of the object (20) in frames (11) of the video stream (15) following the previous frame (12) up to the current frame (10), and recursively updating a model of the object (20). Accurate object detection from an object-detection server (5) can thereby be used even if the object was detected in a past frame (12) of the video stream (15) that has already been visualized.

Type: Application

Filed: June 27, 2018

Publication date: August 26, 2021

Inventors: Volodya Grancharov, Harald Pobloth, Sigurdur Sverrisson

Object Tracking and Detection in Augmented Reality Applications

Publication number: 20210118175

Abstract: Object tracking for augmented reality involves determining a location of an object in a current frame (10) of a video stream based on object feature representations extracted from different locations in the current frame (10) and object feature representations belonging to a model of the object. The model comprises at least one object feature representation extracted from at least one previous frame (20) of the video stream and at least one object feature representation from a feature set comprising at least one server-generated object feature representation of objects belonging to a same object class as the object.

Type: Application

Filed: April 26, 2018

Publication date: April 22, 2021

Inventors: Volodya Grancharov, Harald Pobloth, Sigurdur Sverrisson

Associating metadata with a multimedia file

Patent number: 10915569

Abstract: There are provided mechanisms for associating metadata with a multimedia file. A method, performed by a mood detector, includes detecting presence of a mood indicator in the multimedia file by a mood detection module analysing the multimedia file. The method includes determining a mood descriptive value by a mood classification module analysing a segment of the multimedia file, wherein the segment is defined by the mood indicator. The method further includes associating the mood descriptive value with the multimedia file as metadata. Detecting presence of the mood indicator by the mood detection module acts as a trigger to determine the mood descriptive value by the mood classification module.

Type: Grant

Filed: March 15, 2016

Date of Patent: February 9, 2021

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Harald Pobloth, Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson

Voice-Controlled Management of User Profiles

Publication number: 20210027789

Abstract: Management of user profiles comprises calculating, for an audio segment, a user confidence measure representing a probability that the audio segment comprises speech of a user and a group confidence measure representing a probability that the audio segment comprises speech of a group of users. A user profile is then managed based on a comparison between the user confidence measure and a user confidence threshold and between the group confidence measure and a group confidence threshold. The embodiments thereby achieve an efficient voice-controlled user profile management by utilizing a layered approach that provides user profiles for groups of users as fallback when the identity of the speaking user cannot accurately be recognized.

Type: Application

Filed: September 11, 2017

Publication date: January 28, 2021

Inventors: Volodya Grancharov, Tomer Amiaz, Hadar Gecht, Harald Pobloth

Method and apparatus for voiced speech detection

Patent number: 10825472

Abstract: Detecting voiced speech in an audio signal. A method comprises calculating an autocorrelation function (ACF) of a portion of an input audio signal and detecting a highest peak of said autocorrelation function within a determined range. A peak width and a peak height of said detected highest peak are determined and based on the peak width and the peak height it is decided whether a segment of an input audio signal comprises voiced speech.
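
The decision described in this abstract might be sketched as below. This is a simplified illustration under stated assumptions, not the patented procedure: the lag range, the `min_height` and `max_width` thresholds, the energy normalization, and the definition of peak width (contiguous lags that stay above half the peak height) are all chosen for the example and do not come from the abstract.

```python
import math
from typing import List, Tuple


def autocorrelation(x: List[float], max_lag: int) -> List[float]:
    """Energy-normalized ACF for lags 0..max_lag (requires len(x) > max_lag)."""
    energy = sum(v * v for v in x)
    if energy == 0.0:
        return [0.0] * (max_lag + 1)
    return [sum(x[n] * x[n + lag] for n in range(len(x) - lag)) / energy
            for lag in range(max_lag + 1)]


def highest_peak(acf: List[float], lag_min: int,
                 lag_max: int) -> Tuple[int, float, int]:
    """Return (lag, height, width) of the highest ACF value in the range."""
    lag = max(range(lag_min, lag_max + 1), key=lambda k: acf[k])
    height = acf[lag]
    # Assumed width definition: contiguous lags above half the peak height.
    left = lag
    while left > 0 and acf[left - 1] > 0.5 * height:
        left -= 1
    right = lag
    while right < len(acf) - 1 and acf[right + 1] > 0.5 * height:
        right += 1
    return lag, height, right - left + 1


def is_voiced(x: List[float], lag_min: int = 20, lag_max: int = 160,
              min_height: float = 0.5, max_width: int = 40) -> bool:
    """Decide voicing from peak height and width (hypothetical thresholds)."""
    acf = autocorrelation(x, lag_max)
    _, height, width = highest_peak(acf, lag_min, lag_max)
    return height >= min_height and width <= max_width


# A periodic tone with an 80-sample period stands in for voiced speech.
tone = [math.sin(2.0 * math.pi * n / 80.0) for n in range(640)]
```

Calling `is_voiced(tone)` yields a tall, reasonably narrow peak near lag 80, whereas silence or white noise produces no sufficiently high peak in the searched range.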

Type: Grant

Filed: May 10, 2018

Date of Patent: November 3, 2020

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Tommy Falk, Harald Pobloth, Erlendur Karlsson

Voice-Controlled Management of User Profiles

Publication number: 20200194006

Abstract: Management of user profiles comprises calculating, for each speaker model of at least one speaker model, a confidence measure representing a probability that the speaker model represents a speaker of a cluster of audio segments. A user profile associated with the speaker model is updated based on a user preference assigned to the cluster of audio segments if the confidence measure calculated for the speaker model represents a probability that is higher than a target probability. The embodiments achieve an efficient user profile management in a voice-controlled context but without the need for any dedicated enrollment sessions to train speaker models.

Type: Application

Filed: September 11, 2017

Publication date: June 18, 2020

Inventors: Volodya Grancharov, Tomer Amiaz, Hadar Gecht, Harald Pobloth

Transform Encoding/Decoding of Harmonic Audio Signals

Publication number: 20200143818

Abstract: An encoder for encoding frequency transform coefficients of a harmonic audio signal includes the following elements: A peak locator configured to locate spectral peaks having magnitudes exceeding a predetermined frequency-dependent threshold. A peak region encoder configured to encode peak regions including and surrounding the located peaks. A low-frequency set encoder configured to encode at least one low-frequency set of coefficients outside the peak regions and below a crossover frequency that depends on the number of bits used to encode the peak regions. A noise-floor gain encoder configured to encode a noise-floor gain of at least one high-frequency set of not yet encoded coefficients outside the peak regions.

Type: Application

Filed: January 8, 2020

Publication date: May 7, 2020

Inventors: Volodya Grancharov, Tomas Jansson Toftgård, Sebastian Näslund, Harald Pobloth

Transform encoding/decoding of harmonic audio signals

Patent number: 10566003

Abstract: An encoder for encoding frequency transform coefficients of a harmonic audio signal includes the following elements: A peak locator configured to locate spectral peaks having magnitudes exceeding a predetermined frequency-dependent threshold. A peak region encoder configured to encode peak regions including and surrounding the located peaks. A low-frequency set encoder configured to encode at least one low-frequency set of coefficients outside the peak regions and below a crossover frequency that depends on the number of bits used to encode the peak regions. A noise-floor gain encoder configured to encode a noise-floor gain of at least one high-frequency set of not yet encoded coefficients outside the peak regions.

Type: Grant

Filed: August 4, 2016

Date of Patent: February 18, 2020

Assignee: TELEFONAKTIEBOLAGET LM ERICSSON (PUBL)

Inventors: Volodya Grancharov, Tomas Jansson Toftgård, Sebastian Näslund, Harald Pobloth

Speaker verification computer system with textual transcript adaptations of universal background model and enrolled speaker model

Patent number: 10418037

Abstract: A sampled speech data sequence contains words spoken by a speaker. A sequence of feature vectors is generated characterizing spectral distribution of sampled speech data. A textual transcript of the words spoken by the speaker is obtained. Data structures of a universal background model of a Gaussian mixture model (UBM-GMM) and of an Enrolled speaker Gaussian mixture model (ENR-GMM) are adapted responsive to the textual transcript, to generate an adapted UBM-GMM and an adapted ENR-GMM, respectively. An enrolled speaker probability is generated based on the sequence of feature vectors and the adapted ENR-GMM, and a universal speaker probability is generated based on the sequence of feature vectors and the adapted UBM-GMM. A speaker verification indication of whether the speaker is an enrolled speaker is generated by comparing the enrolled speaker probability to the universal speaker probability.
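
Only the final comparison step of this abstract is sketched below: scoring a sequence of feature vectors against two Gaussian mixture models and comparing the results. The transcript-driven adaptation is not shown, so both models are assumed to be already adapted. Using an average log-likelihood ratio with a `threshold` is a common way to perform such a comparison and is an assumption here, as are the diagonal covariances and all names.

```python
import math
from typing import List, Sequence, Tuple

# One mixture component: (weight, means, variances), diagonal covariance.
Component = Tuple[float, Sequence[float], Sequence[float]]


def log_gaussian(x: Sequence[float], mean: Sequence[float],
                 var: Sequence[float]) -> float:
    """Log density of a diagonal-covariance Gaussian at x."""
    return sum(-0.5 * (math.log(2.0 * math.pi * v) + (xi - m) ** 2 / v)
               for xi, m, v in zip(x, mean, var))


def gmm_log_likelihood(features: List[Sequence[float]],
                       gmm: List[Component]) -> float:
    """Average per-frame log-likelihood of the features under the GMM."""
    total = 0.0
    for x in features:
        terms = [math.log(w) + log_gaussian(x, m, v) for w, m, v in gmm]
        peak = max(terms)  # log-sum-exp for numerical stability
        total += peak + math.log(sum(math.exp(t - peak) for t in terms))
    return total / len(features)


def verify(features: List[Sequence[float]],
           adapted_enr_gmm: List[Component],
           adapted_ubm_gmm: List[Component],
           threshold: float = 0.0) -> bool:
    """True if the enrolled-speaker model explains the features better."""
    score = (gmm_log_likelihood(features, adapted_enr_gmm)
             - gmm_log_likelihood(features, adapted_ubm_gmm))
    return score > threshold
```

The threshold trades false acceptances against false rejections; zero simply asks which of the two models fits the features better.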

Type: Grant

Filed: March 23, 2016

Date of Patent: September 17, 2019

Assignee: Telefonaktiebolaget LM Ericsson (publ)

Inventors: Volodya Grancharov, Erlendur Karlsson, Harald Pobloth, Sigurdur Sverrisson

Speaker Verification Computer System with Textual Transcript Adaptations of Universal Background Model and Enrolled Speaker Model

Publication number: 20190080697

Abstract: A sampled speech data sequence contains words spoken by a speaker. A sequence of feature vectors is generated characterizing spectral distribution of sampled speech data. A textual transcript of the words spoken by the speaker is obtained. Data structures of a universal background model of a Gaussian mixture model (UBM-GMM) and of an Enrolled speaker Gaussian mixture model (ENR-GMM) are adapted responsive to the textual transcript, to generate an adapted UBM-GMM and an adapted ENR-GMM, respectively. An enrolled speaker probability is generated based on the sequence of feature vectors and the adapted ENR-GMM, and a universal speaker probability is generated based on the sequence of feature vectors and the adapted UBM-GMM. A speaker verification indication of whether the speaker is an enrolled speaker is generated by comparing the enrolled speaker probability to the universal speaker probability.

Type: Application

Filed: March 23, 2016

Publication date: March 14, 2019

Inventors: Volodya Grancharov, Erlendur Karlsson, Harald Pobloth, Sigurdur Sverrisson

Associating Metadata with a Multimedia File

Publication number: 20190034428

Abstract: There are provided mechanisms for associating metadata with a multimedia file. A method, performed by a mood detector, includes detecting presence of a mood indicator in the multimedia file by a mood detection module analysing the multimedia file. The method includes determining a mood descriptive value by a mood classification module analysing a segment of the multimedia file, wherein the segment is defined by the mood indicator. The method further includes associating the mood descriptive value with the multimedia file as metadata. Detecting presence of the mood indicator by the mood detection module acts as a trigger to determine the mood descriptive value by the mood classification module.

Type: Application

Filed: March 15, 2016

Publication date: January 31, 2019

Inventors: Harald Pobloth, Volodya Grancharov, Erlendur Karlsson, Sigurdur Sverrisson
