Patents by Inventor Alexey OZEROV

Alexey OZEROV has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20240098434
    Abstract: A method and device for audio steering from a loudspeaker line array of a display device toward a user direction is disclosed. Data corresponding to a viewer gesture is obtained from at least one sensor of a display device. A distance and an angle between the viewer and a plurality of loudspeakers coupled to the display is determined based on the obtained data. Phase shifting is applied to an audio signal powering the plurality of loudspeakers based on the determined distance and angle to audio steer toward the user direction.
    Type: Application
    Filed: November 29, 2021
    Publication date: March 21, 2024
    Inventors: Hassane Guermoud, Michel Kerdranvat, Alexey Ozerov
  • Patent number: 11735199
    Abstract: Method for modifying a style of an audio object, and corresponding electronic device, computer readable program products and computer readable storage medium The disclosure relates to a method for processing an input audio signal. According to an embodiment, the method includes obtaining a base audio signal being a copy of the input audio signal and generating an output audio signal from the base signal, the output audio signal having style features obtained by modifying the base signal so that a distance between base style features representative of a style of the base signal and a reference style feature decreases. The disclosure also relates to corresponding electronic device, computer readable program product and computer readable storage medium.
    Type: Grant
    Filed: September 14, 2018
    Date of Patent: August 22, 2023
    Assignee: INTERDIGITAL MADISON PATENT HOLDINGS, SAS
    Inventors: Quang Khanh Ngoc Duong, Alexey Ozerov, Eric Grinstein, Patrick Perez
  • Publication number: 20200286499
    Abstract: Method for modifying a style of an audio object, and corresponding electronic device, computer readable program products and computer readable storage medium The disclosure relates to a method for processing an input audio signal. According to an embodiment, the method includes obtaining a base audio signal being a copy of the input audio signal and generating an output audio signal from the base signal, the output audio signal having style features obtained by modifying the base signal so that a distance between base style features representative of a style of the base signal and a reference style feature decreases. The disclosure also relates to corresponding electronic device, computer readable program product and computer readable storage medium.
    Type: Application
    Filed: September 14, 2018
    Publication date: September 10, 2020
    Inventors: Quang Khanh Ngoc DUONG, Alexey OZEROV, Eric GRINSTEIN, Patrick PEREZ
  • Patent number: 10235126
    Abstract: A method and a system (20) of audio source separation are described. The method comprises: receiving (10) an audio mixture and at least one text query associated to the audio mixture; retrieving (11) at least one audio sample from an auxiliary audio database; evaluating (12) the retrieved audio samples; and separating (13) the audio mixture into a plurality of audio sources using the audio samples. The corresponding system (20) comprises a receiving (21) and a processor (22) configured to implement the method.
    Type: Grant
    Filed: May 11, 2015
    Date of Patent: March 19, 2019
    Assignee: INTERDIGITAL CE PATENT HOLDINGS
    Inventors: Quang Khanh Ngoc Duong, Alexey Ozerov, Dalia Elbadawy
  • Publication number: 20180358025
    Abstract: To represent and recover the constituent sources present in an audio mixture, informed source separation techniques are used. In particular, a universal spectral model (USM) is used to obtain a sparse time activation matrix for an individual audio source in the audio mixture. The indices of non-zero groups in the time activation matrix are encoded as the side information into a bitstream. The non-zero coefficients of the time activation matrix may also be encoded into the bitstream. At the decoder side, when the coefficients of the time activation matrix are included in the bitstream, the matrix can be decoded from the bitstream. Otherwise, the time activation matrix can be estimated from the audio mixture, the non-zero indices included in the bitstream, and the USM model. Given the time activation matrix, the constituent audio sources can be recovered based on the audio mixture and the USM model.
    Type: Application
    Filed: November 25, 2016
    Publication date: December 13, 2018
    Inventors: Quang Khanh Ngoc DUONG, Alexey OZEROV
  • Patent number: 10114891
    Abstract: A method and a system of audio retrieval and source separation are described. The method comprises the steps of: receiving a textual query; retrieving a preliminary audio sample from an auxiliary audio database; retrieving a target audio sample from a target audio database; and separating the retrieved target audio sample into a plurality of audio source signals. The corresponding system comprises an input unit, a storing unit and a processing unit to implement the method.
    Type: Grant
    Filed: December 19, 2014
    Date of Patent: October 30, 2018
    Assignee: Thomson Licensing
    Inventors: Alexey Ozerov, Patrick Perez, Louis Chevallier, Lionel Oisel
  • Publication number: 20180308502
    Abstract: A method for processing an input signal having an audio component is described. The method includes obtaining a set of time parameters from a time frequency transformation of the audio component of the input signal, the audio component being a mixture of audio signals comprising at least one first audio signal of a first audio source; determining at least one motion feature of the first audio source from a visual sequence corresponding to the first audio signal; obtaining a weight vector of the set of time parameters based on the motion feature; and determining a time frequency transformation of the first audio signal based on the weight vector.
    Type: Application
    Filed: April 18, 2018
    Publication date: October 25, 2018
    Inventors: Sanjeel PAREKH, Alexey OZEROV, Quang Khanh Ngoc DUONG, Gael RICHARD, Slim ESSID, Patrick PEREZ
  • Publication number: 20180288452
    Abstract: A solution for delivery of audiovisual content to a receiver device is provided. At the transmitter side, a transmission buffer is constituted, while offering fast channel change and fast trick modes at the receiver side. At least one GoP, starting with a first I-frame is sought in the content that is to be transmitted. The timing references of the data in the at least one GoP that is prepared for delivery to a receiver device are modified so that the data is decoded by the receiver at a slowed-down rate for a given duration. This creates a lag between reading of data in by the transmitter and decoding of data by the receiver. The lag is used by the transmitter to fill the transmission buffer, while the receiver does not have to wait for the transmission buffer to be filled to start decoding.
    Type: Application
    Filed: April 1, 2018
    Publication date: October 4, 2018
    Inventors: Bruno LE GARJAN, Arunkumar PALANICHAMY, Philippe BORDES, Thierry QUERE, Alexey OZEROV
  • Publication number: 20180211672
    Abstract: A method for performing audio inpainting, wherein missing portions in an input audio signal are recovered and a recovered audio signal is obtained, comprises computing a Short-Time Fourier Transform (STFT) on portions of the input audio signal, computing conditional expectations of the source power spectra of the input audio signal, wherein estimated source power spectra P(f, n, j) are obtained and wherein the variance tensor V and complex Short-Time Fourier Transform (STFT) coefficients of the input audio signals are used, iteratively re-calculating the variance tensor V from the estimated power spectra P(f, n, j) and re-calculating updated estimated power spectra P(f, n, j), computing an array of STFT coefficients ? from the resulting variance tensor V according to ?(f, n, j)=E{S(f, n, j)|x, Is, IL, V}, and converting the array of STFT coefficients ? to the time domain, wherein coefficients {tilde over (s)}1, {tilde over (s)}2, . . . , {tilde over (s)}j of the recovered audio signal are obtained.
    Type: Application
    Filed: April 6, 2016
    Publication date: July 26, 2018
    Applicant: Dolby International AB
    Inventors: Cagdas BILEN, Alexey OZEROV, Patrick PEREZ
  • Patent number: 10021395
    Abstract: A particular implementation determines parameters of a generative probabilistic model from visual descriptors extracted from at least one image. The extracted visual descriptors are quantized and encoded using the model-based arithmetic encoding to be stored or for transmission to a decoder. The model parameters are also stored to be available to a decoder, or transmitted directly to a decoder. A decoder uses the stored, or received, model parameters to reconstruct the generative probabilistic model and then to decode the visual descriptors. The visual descriptors are used for image analysis tasks, such as image retrieval or object detection. A particular implementation uses a Gaussian mixture model as a generative probabilistic model.
    Type: Grant
    Filed: November 27, 2015
    Date of Patent: July 10, 2018
    Assignee: Thomson Licensing
    Inventors: Alexey Ozerov, Jean-Ronan Vigouroux, Frederic Lefebvre
  • Patent number: 9990936
    Abstract: A method and an apparatus for separating speech data from background data in an audio communication are suggested. The method comprises: applying a speech model to the audio communication for separating the speech data from the background data of the audio communication; and updating the speech model as a function of the speech data and the background data during the audio communication.
    Type: Grant
    Filed: October 12, 2015
    Date of Patent: June 5, 2018
    Assignee: THOMSON Licensing
    Inventors: Alexey Ozerov, Quang Khanh Ngoc Duong, Louis Chevallier
  • Patent number: 9930466
    Abstract: A method and apparatus for processing audio content is described. The method and apparatus include receiving (510) audio content, the audio content including an input audio signal, a first reference audio signal, and a second reference audio signal, determining (550) a processing function for the input audio signal, the processing function determined based on a cost function between the input audio signal, the first reference audio signal and a second reference audio signal, and processing (560) the input audio signal using the determined processing function in order to produce an output audio signal.
    Type: Grant
    Filed: December 1, 2016
    Date of Patent: March 27, 2018
    Assignee: THOMSON Licensing
    Inventors: Alexey Ozerov, Marie Guegan, Quang Khanh Ngoc Duong
  • Publication number: 20180082693
    Abstract: A method for encoding multiple audio signals comprises random sampling and quantizing each of the multiple audio signals, and encoding the sampled and quantized multiple audio signals as side information that can be used for decoding and separating the multiple audio signals from a mixture of said multiple audio signals. A method for decoding a mixture of multiple audio signals comprises decoding and demultiplexing side information, the side information comprising quantized samples of each of the multiple audio signals, receiving or retrieving from any data source a mixture of said multiple audio signals, and generating multiple estimated audio signals that approximate said multiple audio signals, wherein said quantized samples of each of the multiple audio signals are used.
    Type: Application
    Filed: March 10, 2016
    Publication date: March 22, 2018
    Inventors: Cagdas BILEN, Alexey OZEROV, Patrick PEREZ
  • Publication number: 20180075863
    Abstract: A method is proposed for encoding at least two signals. The method includes mixing the at least two signals in a mixture; sampling a map Z representative of locations of the at least two signals in a time-frequency plane at sampling locations, the sampling delivering a first list of values Z?; and transmitting the mixture of the at least two signals and information representative of the first list of values Z?. The disclosure also relates to the corresponding method for separating signals in a mixture, and corresponding computer program products, devices and bitstream.
    Type: Application
    Filed: September 7, 2017
    Publication date: March 15, 2018
    Inventors: Quang Khanh Ngoc DUONG, Gilles PUY, Alexey OZEROV, Patrick PEREZ
  • Publication number: 20170337913
    Abstract: An apparatus and method for generating visual content from an audio signal are described. The method includes receiving (310) audio content, processing (320) the audio content to separate into a first and second portion of the audio content, converting (330) the second portion into visual content, delaying (340) the first portion based on a time relationship between the audio content and the visual content, the delaying accounting for time to process the first portion and convert the second portion, and providing (350) the visual content and audio content for reproduction. The apparatus includes a source separation module (210) processing the received audio content to separate into a first and second portion of the audio content, a converter module (220) converting the second portion into visual content, and a synchronization module (230) delaying the first portion based on a time relationship between the audio content and the visual content.
    Type: Application
    Filed: November 24, 2016
    Publication date: November 23, 2017
    Inventors: Marie GUEGAN, Alexey OZEROV
  • Publication number: 20170309291
    Abstract: A method and an apparatus for separating speech data from background data in an audio communication are suggested. The method comprises: applying a speech model to the audio communication for separating the speech data from the background data of the audio communication; and updating the speech model as a function of the speech data and the background data during the audio communication.
    Type: Application
    Filed: October 12, 2015
    Publication date: October 26, 2017
    Inventors: Alexey OZEROV, Quang Khanh Ngoc DUONG, Louis CHEVALLIER
  • Patent number: 9734842
    Abstract: Separation of speech and background from an audio mixture by using a speech example, generated from a source associated with a speech component in the audio mixture, to guide the separation process.
    Type: Grant
    Filed: June 4, 2014
    Date of Patent: August 15, 2017
    Assignee: THOMSON LICENSING
    Inventors: Luc Le Magoarou, Alexey Ozerov, Quang Khanh Ngoc Duong
  • Publication number: 20170180903
    Abstract: A method and apparatus for processing audio content is described. The method and apparatus include receiving (510) audio content, the audio content including an input audio signal, a first reference audio signal, and a second reference audio signal, determining (550) a processing function for the input audio signal, the processing function determined based on a cost function between the input audio signal, the first reference audio signal and a second reference audio signal, and processing (560) the input audio signal using the determined processing function in order to produce an output audio signal.
    Type: Application
    Filed: December 1, 2016
    Publication date: June 22, 2017
    Inventors: Alexey Ozerov, Marie Guegan, Quang Khanh Ngoc Duong
  • Publication number: 20170075649
    Abstract: A method and a system (20) of audio source separation are described. The method comprises: receiving (10) an audio mixture and at least one text query associated to the audio mixture; retrieving (11) at least one audio sample from an auxiliary audio database; evaluating (12) the retrieved audio samples; and separating (13) the audio mixture into a plurality of audio sources using the audio samples. The corresponding system (20) comprises a receiving (21) and a processor (22) configured to implement the method.
    Type: Application
    Filed: May 11, 2015
    Publication date: March 16, 2017
    Inventors: Quang Khanh Ngoc DUONG, Alexey OZEROV, Dalia ELBADAWY
  • Publication number: 20160360150
    Abstract: Isolation of an active participant in a group of participants commences by first capturing images and audio of participants. Thereafter, an active one of the participants in the group of participants (e.g., a participant that is currently speaking) is identified. After identification of the active participant, at least one of participants' images and participants' audio are rendered to isolate the active participant.
    Type: Application
    Filed: June 3, 2016
    Publication date: December 8, 2016
    Inventors: Stephane ONNO, Alexey OZEROV, Quang Khanh Ngoc DUONG, Frederic LEFEBVRE