Patents by Inventor Juha Tapio Vilkamo

Juha Tapio Vilkamo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250095659
    Abstract: An apparatus for spatial audio signal decoding and rendering associated with a plurality of speaker nodes placed within a three-dimensional space having virtual surface arrangement comprising a plurality of virtual surfaces. The apparatus determines an azimuth angle for each virtual surface of the virtual surface set and the arrange the virtual surfaces of the virtual surface set into an order based on azimuth angles to give an ordered virtual surface set. The apparatus then associates a virtual surface of the ordered virtual surface set to a search sector and starting from the associated virtual surface for the search sector, search the ordered virtual surface set to determine a virtual surface that encloses a target panning direction.
    Type: Application
    Filed: January 18, 2022
    Publication date: March 20, 2025
    Inventors: Mikko-Ville LAITINEN, Tapani PIHLAJAKUJA, Juha Tapio VILKAMO
  • Publication number: 20250071497
    Abstract: Examples of the disclosure enable spatial audio rendering in a different format to the format that is used for the spatial audio coding. In examples of the disclosure spatial audio and first spatial metadata in a first format are obtained. The first spatial metadata enables rendering of spatial audio in a first audio format. In order to enable rendering of the spatial audio in a different format the spatial metadata is converted to second spatial metadata corresponding to a second audio format. The spatial audio can then be rendered for the second format using the second spatial metadata.
    Type: Application
    Filed: December 9, 2022
    Publication date: February 27, 2025
    Inventors: Mikko-Ville LAITINEN, Juha Tapio VILKAMO
  • Publication number: 20250039602
    Abstract: Examples of the disclosure relate to apparatus, methods and computer programs for providing spatial audio. In examples of the disclosure speech or other sources of a first category can be identified. Spatial information relating to the speech or sources of a first category can be determined using data obtained from audio signals. The identified speech or other sources of a first category can then be spatially reproduced using the spatial information. This can enable the speech or other sources of a first category to be enhanced compared to other parts of the sound signals. This can provide for improved spatial audio content.
    Type: Application
    Filed: November 25, 2022
    Publication date: January 30, 2025
    Inventors: Juha Tapio Vilkamo, Mikko-Ville LAITINEN
  • Patent number: 12192735
    Abstract: Examples of the disclosure relate to apparatus, methods and computer programs for repositioning spatial audio streams. The apparatus is configured to receive a plurality of spatial audio streams wherein the spatial audio streams include one or more audio signals and associated spatial metadata. The apparatus is also configured to receive obtaining repositioning information relating to at least one of the plurality of spatial audio streams and repositioning the at least one of the plurality of spatial audio streams based on the repositioning information.
    Type: Grant
    Filed: September 1, 2022
    Date of Patent: January 7, 2025
    Assignee: Nokia Technologies Oy
    Inventors: Mikko-Ville Laitinen, Juha Tapio Vilkamo, Jussi Kalevi Virolainen
  • Publication number: 20250008285
    Abstract: Examples of the disclosure relate to an apparatus (201) for determining whether one or more microphones (207) within a plurality of microphones is blocked. In examples of the disclosure correlation between at least two microphones is estimated so as to provide an indication of whether or not incoherent noise, such as wind noise (205), is present. This can be used to avoid incorrectly identifying a microphone as being blocked and so can help to maintain a higher quality level for the audio signals captured by the microphones.
    Type: Application
    Filed: June 20, 2022
    Publication date: January 2, 2025
    Inventors: Juha Tapio VILKAMO, Miikka Tapani VILERMO, Mikko Tapio TAMMI
  • Publication number: 20240298133
    Abstract: An apparatus includes circuitry for training a machine learning model such as a neural network to estimate spatial metadata for a spatial sound distribution. The apparatus includes circuitry for obtaining first capture data for a machine learning model where the first capture data is related to a plurality of spatial sound distributions and where the first capture data relates to a target device configured to obtain at least two microphone signals. The apparatus also includes circuitry for obtaining second capture data for the machine learning model where the second capture data is obtained using the same spatial sound distributions and where the data includes information indicative of spatial properties of the spatial sound distributions and the data is obtained using a reference capture method. The apparatus also includes circuitry for training the machine learning model to estimate the second capture data based on the first capture data.
    Type: Application
    Filed: May 24, 2022
    Publication date: September 5, 2024
    Inventors: Juha Tapio Vilkamo, Mikko Johannes HONKALA
  • Publication number: 20240284134
    Abstract: Examples of the disclosure relate to obtaining spatial metadata for use in rendering, or otherwise processing spatial audio. In examples of the disclosure a machine learning model can be used to process microphone signals, or data obtained from microphone signals, to obtain the spatial metadata. The machine learning model can be trained to enable high quality spatial metadata to be obtained from sub-optimal or low-quality microphone arrays. Examples of the disclosure include an apparatus including circuitry for: accessing a trained machine learning model; determining input data for the machine learning model based on two or more microphone signals; enabling using the machine learning model to process the input data to obtain spatial metadata; and associating the obtained spatial metadata with at least one signal based on the two or more microphone signals to enable processing of the at least one signal based on the obtained spatial metadata.
    Type: Application
    Filed: May 16, 2022
    Publication date: August 22, 2024
    Inventors: Juha Tapio VILKAMO, Mikko Johannes HONKALA
  • Publication number: 20240274137
    Abstract: An apparatus (317) comprising means configured to: receive a spatial audio signal, the spatial audio signal comprising at least one audio signal and spatial metadata (122) associated with the at least one audio signal; generate a mixing value (320) based on the spatial metadata (122) and a predefined parameter (322) which imparts effects of a rendering of a multichannel audio signal having a multichannel configuration to a further multichannel audio signal having a further multichannel configuration on generated output signals; and generating the output audio signals having the further multichannel configuration based on the mixing value (320) and the spatial audio signal.
    Type: Application
    Filed: June 10, 2021
    Publication date: August 15, 2024
    Applicant: NOKIA TECHNOLOGIES OY
    Inventors: Mikko-Ville LAITINEN, Juha Tapio VILKAMO, Lasse Juhani LAAKSONEN, Anssi Sakari RÄMÖ
  • Publication number: 20240267678
    Abstract: A method comprising capturing, using a first capturing mode, immersive audio using a first capturing device comprising a first microphone and a second capturing device comprising a second microphone, recognizing, based on obtaining data from one or more sensors, movement of the first capturing device, wherein the movement is with respect to the second capturing device, recognizing the movement as movement for changing from the first capturing mode to a second capturing mode, wherein the second capturing mode is for capturing immersive audio, and capturing the immersive audio using the second capturing mode.
    Type: Application
    Filed: January 23, 2024
    Publication date: August 8, 2024
    Inventors: Mikko Olavi HEIKKINEN, Matti Sakari HÄMÄLÄINEN, Juha Petteri OJANPERÄ, Juha Tapio VILKAMO
  • Publication number: 20240259758
    Abstract: Examples of the disclosure relate to apparatus, methods and computer programs for processing spatial audio. In examples, an apparatus can be configured to receive information indicative of a first orientation and/or location of a listener and also to obtain input audio signals. The apparatus can be configured to process the input audio signals using at least the information indicative of the first orientation and/or location of the listener to generate processed audio signals and to generate spatial metadata based at least on the input audio signals. The apparatus can also be configured to enable transmission of the processed audio signals and the spatial metadata, wherein the transmitted signals and the spatial metadata are configured to enable rendering of a spatial audio output based on the processed audio signals and the spatial metadata and information indicative of a second orientation and/or location of the listener.
    Type: Application
    Filed: January 25, 2024
    Publication date: August 1, 2024
    Inventors: Juha Tapio Vilkamo, Mikko-Ville Laitinen, Jussi Kalevi Virolainen
  • Publication number: 20240236611
    Abstract: A method for generating a parametric spatial audio stream, the method including: obtaining at least one mono-channel audio signal from at least one close microphone; obtaining at least one of: at least one reverberation parameter; at least one control parameter configured to control spatial features of the parametric spatial audio stream; generating, based on the at least one reverberation parameter, at least one reverberated audio signal from a respective at least one mono-channel audio signal; generating at least one spatial metadata parameter based on at least one of: the at least one mono-channel audio signal; the at least one reverberated audio signal; the at least one control parameter; and the at least one reverberation parameter; and encoding the at least one reverberated audio signal and the at least one spatial metadata parameter to generate the spatial audio stream.
    Type: Application
    Filed: October 19, 2023
    Publication date: July 11, 2024
    Inventors: Mikko-Ville Laitinen, Juha Tapio VILKAMO, Jussi Kalevi VIROLAINEN
  • Publication number: 20240236601
    Abstract: A method for generating a spatial audio stream, the method including: obtaining at least two audio signals from at least two microphones; extracting from the at least two audio signals a first audio signal, the first audio signal including at least partially speech of a user; extracting from the at least two audio signals a second audio signal, wherein speech of the user is substantially not present within the second audio signal; and encoding the first audio signal and the second audio signal to generate the spatial audio stream such that a rendering of speech of the user to a controllable direction and/or distance is enabled.
    Type: Application
    Filed: October 19, 2023
    Publication date: July 11, 2024
    Inventors: Mikko-Ville Laitinen, Juha Tapio Vilkamo, Jussi Kalevi Virolainen
  • Publication number: 20240171927
    Abstract: An apparatus for processing at least two audio signals and associated metadata, the apparatus including circuitry configured to: obtain the audio signals, the audio signals including at least one audio object portion and at least one non-audio object portion; obtain the associated metadata, wherein the associated metadata is configured to define at least one audio object position and at least one audio object energy proportion; obtain object position control information; determine mixing information based on the object position control information and the at least one audio object position and at least one audio object energy proportion; and process the at least two audio signals based on the mixing information, wherein the processing is configured to enable the at least one object portion of a first of the at least two audio signals to be at least partially moved to a second of the at least two audio signals.
    Type: Application
    Filed: February 25, 2022
    Publication date: May 23, 2024
    Inventors: Mikko-Ville LAITINEN, Juha Tapio VILKAMO
  • Publication number: 20240137723
    Abstract: A method for generating a spatial audio stream, the method including: obtaining at least two audio signals from at least two microphones; extracting from the at least two audio signals a first audio signal, the first audio signal including at least partially speech of a user; extracting from the at least two audio signals a second audio signal, wherein speech of the user is substantially not present within the second audio signal; and encoding the first audio signal and the second audio signal to generate the spatial audio stream such that a rendering of speech of the user to a controllable direction and/or distance is enabled.
    Type: Application
    Filed: October 18, 2023
    Publication date: April 25, 2024
    Inventors: Mikko-Ville Laitinen, Juha Tapio Vilkamo, Jussi Kalevi Virolainen
  • Publication number: 20240137728
    Abstract: A method for generating a parametric spatial audio stream, the method including: obtaining at least one mono-channel audio signal from at least one close microphone; obtaining at least one of: at least one reverberation parameter; at least one control parameter configured to control spatial features of the parametric spatial audio stream; generating, based on the at least one reverberation parameter, at least one reverberated audio signal from a respective at least one mono-channel audio signal; generating at least one spatial metadata parameter based on at least one of: the at least one mono-channel audio signal; the at least one reverberated audio signal; the at least one control parameter; and the at least one reverberation parameter; and encoding the at least one reverberated audio signal and the at least one spatial metadata parameter to generate the spatial audio stream.
    Type: Application
    Filed: October 18, 2023
    Publication date: April 25, 2024
    Inventors: Mikko-Ville Laitinen, Juha Tapio VILKAMO, Jussi Kalevi VIROLAINEN
  • Publication number: 20240087589
    Abstract: Examples of the disclosure relate to apparatus, methods and computer programs for spatial processing audio scenes with improved intelligibility for speech or other key sounds. In examples of the disclosure at least one audio signal including two or more channels is obtained. The audio signal is processed with program code to identify at least a first portion of the audio signal wherein the first portion predominantly includes audio of interest. The first portion is processed using a first process. The second portion is processed using a second process including spatial audio processing. The first process includes no spatial audio processing or a low level of spatial audio processing compared to the second process and the second portion predominantly includes a remainder. The processed first portion and second portion can be played back using two or more loudspeakers.
    Type: Application
    Filed: September 13, 2023
    Publication date: March 14, 2024
    Inventors: Juha Tapio VILKAMO, Mikko-Ville LAITINEN, Sampo VESA
  • Publication number: 20230402050
    Abstract: Examples of the disclosure relate to speech enhancement that can be adapted for varying sound scenes. In examples of the disclosure a control parameter for speech enhancement is obtained. The control parameter indicates a user preference for speech enhancement. One or more audio signals are obtained and the one or more audio signals are processed to determine a sound classification based at least on the one or more audio signals. The control parameter and the sound classification are used to determine a processing parameter. Speech enhancement is enabled on the one or more audio signals. The speech enhancement uses the processing parameter such that the processing parameter is configured to control proportions of speech and remainder in an output signal.
    Type: Application
    Filed: June 8, 2023
    Publication date: December 14, 2023
    Inventors: Juha Tapio Vilkamo, Kai Petteri Havukainen, Toni Makinen
  • Publication number: 20230110257
    Abstract: An apparatus for generating a spatialized audio output based on a listener position, the apparatus including circuitry configured to: obtain two or more audio signal sets; obtain a listener position within an audio environment, wherein the audio environment includes one or more area having one or more inside and outside regions in relation to the respective audio signal set positions; obtain metadata based on a processing of the at least two audio signals; determine, for the listener position within an audio environment outside the inside region, a second listener position; determine modified metadata for the second listener position based on the metadata; determine at least two modified audio signals for the second listener position based on the at least two audio signals; determine spatial metadata for the listener position; and output the at least two modified audio signals and the spatial metadata.
    Type: Application
    Filed: October 5, 2022
    Publication date: April 13, 2023
    Inventors: Mikko-Ville Laitinen, Archontis POLITIS, Lauros Anton PAJUNEN, Juha Tapio VILKAMO, Antti Johannes ERONEN
  • Publication number: 20230084225
    Abstract: Examples of the disclosure relate to apparatus, methods and computer programs for repositioning spatial audio streams. The apparatus is configured to receive a plurality of spatial audio streams wherein the spatial audio streams include one or more audio signals and associated spatial metadata. The apparatus is also configured to receive obtaining repositioning information relating to at least one of the plurality of spatial audio streams and repositioning the at least one of the plurality of spatial audio streams based on the repositioning information.
    Type: Application
    Filed: September 1, 2022
    Publication date: March 16, 2023
    Inventors: Mikko-Ville Laitinen, Juha Tapio Vilkamo, Jussi Kalevi Virolainen
  • Patent number: 11302339
    Abstract: An apparatus for spatial audio signal decoding associated with a plurality of speaker nodes (201, 203, 205, 207, 209) placed within a three dimensional space, the apparatus comprising at least one processor and at least one memory including a computer program code. The at least one memory and the computer program code configured to, with the at least one processor, cause the apparatus at least to determine a non-overlapping virtual surface arrangement (400), the virtual surface arrangement (400) comprising a plurality of virtual surfaces (421, 423, 431, 433) with corners positioned at at least three speaker nodes of the plurality of speaker nodes (201, 203, 205, 207, 209) and sides connecting pairs of corners configured to be non-intersecting with at least one defined virtual plane within the three dimensional space.
    Type: Grant
    Filed: March 7, 2019
    Date of Patent: April 12, 2022
    Assignee: NOKIA TECHNOLOGIES OY
    Inventors: Mikko-Ville Laitinen, Juha Tapio Vilkamo, Tapani Pihlajakuja, Antti Johannes Eronen