Patents by Inventor Ferdinando OLIVIERI

Ferdinando OLIVIERI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 12177644
    Abstract: Methods, systems, computer-readable media, and apparatuses for manipulating a soundfield are presented. Some configurations include receiving a bitstream that comprises metadata and a soundfield description; parsing the metadata to obtain an effect identifier and at least one effect parameter value; and applying, to the soundfield description, an effect identified by the effect identifier. The applying may include using the at least one effect parameter value to apply the identified effect to the soundfield description.
    Type: Grant
    Filed: October 29, 2020
    Date of Patent: December 24, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Nils Gunther Peters, Shankar Thagadur Shivappa, S M Akramus Salehin, Jason Filos, Siddhartha Goutham Swaminathan, Ferdinando Olivieri
  • Patent number: 12142285
    Abstract: In general, techniques are described for quantizing spatial components based on bit allocations determined for psychoacoustic audio coding. A device comprising a memory and one or more processors may perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing the psychoacoustic audio decoding, a first bit allocation for the encoded foreground audio signal. The one or more processors may also determine, based on the first bit allocation, a second bit allocation, and dequantize, based on the second bit allocation, the quantized spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: November 12, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Patent number: 12141498
    Abstract: A device for managing sound playback includes one or more processors configured to receive an indication of a user-device interaction between a user and an audio interface device during a sound playback operation of a multi-speaker audio playback system. The one or more processors are also configured to, based on receiving the indication of the user-device interaction, initiate a selective adjustment of the sound playback operation to reduce a playback sound of the multi-speaker audio playback system based on a position of the user.
    Type: Grant
    Filed: November 17, 2020
    Date of Patent: November 12, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Jason Filos, Ferdinando Olivieri, Nils Gunther Peters
  • Patent number: 12073842
    Abstract: In general, techniques are described for psychoacoustic audio coding of ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the bitstream that includes an encoded audio object and a corresponding spatial component that defines spatial characteristics of the encoded foreground audio signal. The encoded foreground audio signal may include a coded gain and a coded shape. The one or more processors may perform a gain and shape synthesis with respect to the coded gain and the coded shape to obtain a foreground audio signal, and reconstruct, based on the foreground audio signal and the spatial component, the ambisonic audio data.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: August 27, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Patent number: 11889128
    Abstract: A device includes one or more processors configured to, during a call, receive a sequence of audio frames from a first device. The one or more processors are configured to, in response to determining that no audio frame of the sequence has been received for a threshold duration since a last received audio frame of the sequence, initiate transmission of a frame loss indication to the first device. The one or more processors are also configured to, responsive to the frame loss indication, receive a set of audio frames of the sequence and an indication of a second playback speed from the first device. The one or more processors are configured to initiate playback, via a speaker, of the set of audio frames based on the second playback speed. The second playback speed is greater than a first playback speed of a first set of audio frames of the sequence.
    Type: Grant
    Filed: January 5, 2021
    Date of Patent: January 30, 2024
    Assignee: QUALCOMM Incorporated
    Inventors: Shankar Thagadur Shivappa, Reid Westburg, Ferdinando Olivieri
  • Publication number: 20230079716
    Abstract: Various aspects of the present disclosure generally relate to media systems. In some aspects, a media device may monitor, using a radio frequency (RF) sensor, an environment of the media device; determine, from a received RF signal obtained by the RF sensor, a user attribute of a user within the environment; and control an audio system, associated with the media device, to direct an audio beam toward or away from the user. Numerous other aspects are provided.
    Type: Application
    Filed: February 4, 2021
    Publication date: March 16, 2023
    Inventors: Sharad SAMBHWANI, Ferdinando OLIVIERI, Menucher MENUCHEHRY, Ariel Yaakov SAGI, Evyatar HEMO
  • Publication number: 20230051841
    Abstract: A device includes a memory configured to store instructions and also includes one or more processors configured to execute the instructions to obtain audio data corresponding to a sound source and metadata indicative of a direction of the sound source. The one or more processors are configured to execute the instructions to obtain direction data indicating a viewing direction associated with a user of a playback device. The one or more processors are configured to execute the instructions to determine a resolution setting based on a similarity between the viewing direction and the direction of the sound source. The one or more processors are also configured to execute the instructions to process the audio data based on the resolution setting to generate processed audio data.
    Type: Application
    Filed: July 30, 2021
    Publication date: February 16, 2023
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo
  • Patent number: 11580954
    Abstract: A device for communication includes one or more processors configured to receive, during an online meeting, a speech audio stream representing speech of a first user. The one or more processors are also configured to receive a text stream representing the speech of the first user. The one or more processors are further configured to selectively generate an output based on the text stream in response to an interruption in the speech audio stream.
    Type: Grant
    Filed: February 3, 2021
    Date of Patent: February 14, 2023
    Assignee: QUALCOMM Incorporated
    Inventors: Ferdinando Olivieri, Reid Westburg, Shankar Thagadur Shivappa
  • Patent number: 11538489
    Abstract: In general, techniques are described by which to correlate scene-based audio data for psychoacoustic audio coding. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including a plurality of encoded correlated components of a soundfield represented by scene-based audio data. The one or more processors may perform psychoacoustic audio decoding with respect to one or more of the plurality of encoded correlated components to obtain a plurality of correlated components, and obtain, from the bitstream, an indication representative of how the one or more of the plurality of correlated components were reordered in the bitstream. The one or more processors may reorder, based on the indication, the plurality of correlated components to obtain a plurality of reordered components, and reconstruct, based on the plurality of reordered components, the scene-based audio data.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: December 27, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Publication number: 20220385748
    Abstract: A device includes a memory configured to store instructions and one or more processors configured to execute the instructions to receive a media packet and to determine, based on a field of the media packet, whether the media packet includes motion data. The one or more processors are also configured to execute the instructions to, based on the media packet including motion data, extract the motion data from the media packet.
    Type: Application
    Filed: May 27, 2021
    Publication date: December 1, 2022
    Inventors: Joel LINSKY, Andre SCHEVCIW, Mayank BATRA, Ferdinando OLIVIERI
  • Publication number: 20220383881
    Abstract: A device includes a memory configured to store instructions and one or more processors configured to execute the instructions. The one or more processors are configured to execute the instructions to obtain link data corresponding to a communication link to a second device. The one or more processors are configured to execute the instructions to select, at least partially based on the link data, between an ambisonics mode and a stereo mode.
    Type: Application
    Filed: May 27, 2021
    Publication date: December 1, 2022
    Inventors: Taher SHAHBAZI MIRZAHASANLOO, Joel LINSKY, Ferdinando OLIVIERI, Mayank BATRA
  • Publication number: 20220386060
    Abstract: Methods, systems, computer-readable media, and apparatuses for manipulating a soundfield are presented. Some configurations include receiving a bitstream that comprises metadata and a soundfield description; parsing the metadata to obtain an effect identifier and at least one effect parameter value; and applying, to the soundfield description, an effect identified by the effect identifier. The applying may include using the at least one effect parameter value to apply the identified effect to the soundfield description.
    Type: Application
    Filed: October 29, 2020
    Publication date: December 1, 2022
    Inventors: Nils Gunther PETERS, Shankar THAGADUR SHIVAPPA, S M Akramus SALEHIN, Jason FILOS, Siddhartha Goutham SWAMINATHAN, Ferdinando OLIVIERI
  • Publication number: 20220382509
    Abstract: A device for managing sound playback includes one or more processors configured to receive an indication of a user-device interaction between a user and an audio interface device during a sound playback operation of a multi-speaker audio playback system. The one or more processors are also configured to, based on receiving the indication of the user-device interaction, initiate a selective adjustment of the sound playback operation to reduce a playback sound of the multi-speaker audio playback system based on a position of the user.
    Type: Application
    Filed: November 17, 2020
    Publication date: December 1, 2022
    Inventors: Jason FILOS, Ferdinando Olivieri, Nils Gunther Peters
  • Publication number: 20220246133
    Abstract: A device for communication includes one or more processors configured to receive, during an online meeting, a speech audio stream representing speech of a first user. The one or more processors are also configured to receive a text stream representing the speech of the first user. The one or more processors are further configured to selectively generate an output based on the text stream in response to an interruption in the speech audio stream.
    Type: Application
    Filed: February 3, 2021
    Publication date: August 4, 2022
    Inventors: Ferdinando OLIVIERI, Reid WESTBURG, Shankar THAGADUR SHIVAPPA
  • Publication number: 20220217425
    Abstract: A device includes one or more processors configured to, during a call, receive a sequence of audio frames from a first device. The one or more processors are configured to, in response to determining that no audio frame of the sequence has been received for a threshold duration since a last received audio frame of the sequence, initiate transmission of a frame loss indication to the first device. The one or more processors are also configured to, responsive to the frame loss indication, receive a set of audio frames of the sequence and an indication of a second playback speed from the first device. The one or more processors are configured to initiate playback, via a speaker, of the set of audio frames based on the second playback speed. The second playback speed is greater than a first playback speed of a first set of audio frames of the sequence.
    Type: Application
    Filed: January 5, 2021
    Publication date: July 7, 2022
    Inventors: Shankar THAGADUR SHIVAPPA, Reid WESTBURG, Ferdinando OLIVIERI
  • Patent number: 11361776
    Abstract: In general, techniques are described by which to code scaled spatial components. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing psychoacoustic audio decoding, a bit allocation for the encoded foreground audio signal. The one or more processors may dequantize the quantized spatial component to obtain a scaled spatial component, and descale, based on the bit allocation, the scaled spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.
    Type: Grant
    Filed: June 22, 2020
    Date of Patent: June 14, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Patent number: 11356796
    Abstract: An example device configured to obtain image data includes a memory configured to store one or more priority values, each of the one or more priority values being associated with a type of image object associated with the image data. The device includes one or more processors coupled to the memory, and configured to associate image objects in the image data with one or more audio sources represented in one or more audio streams. The one or more processors are also configured to assign a respective priority value to each of the one or more audio sources represented in the one or more streams and code ambisonic coefficients based on the assigned priority value.
    Type: Grant
    Filed: November 18, 2020
    Date of Patent: June 7, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Patent number: 11317236
    Abstract: An example device includes a memory configured to store at least one spatial component and at least one audio source within a plurality of audio streams. The device also includes one or more processors coupled to the memory. The one or more processors are configured to receive, from motion sensors, rotation information. The one or more processors are configured to rotate the at least one spatial component based on the rotation information to form at least one rotated spatial component. The one or more processors are also configured to reconstruct ambisonic signals from the at least one rotated spatial component and the at least one audio source, wherein the at least one spatial component describes spatial characteristics associated with the at least one audio source in a spherical harmonic domain representation.
    Type: Grant
    Filed: November 18, 2020
    Date of Patent: April 26, 2022
    Assignee: Qualcomm Incorporated
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
  • Patent number: 11062713
    Abstract: In general, techniques are described by which to specify spatially formatted enhanced audio data for backward compatible audio bitstreams. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the backward compatible bitstream that conforms to a legacy transport format. The processor(s) may obtain, from the backward compatible bitstream, legacy audio data that conforms to a legacy audio format and a spatially formatted extended audio stream. The processor(s) may process the spatially formatted extended audio stream to obtain extended audio data that enhances the legacy audio data. The processor(s) may next obtain, based on the legacy audio data and the extended audio data, enhanced audio data that conforms to an enhanced audio format. The processor(s) may output the enhanced audio data to one or more speakers.
    Type: Grant
    Filed: June 24, 2019
    Date of Patent: July 13, 2021
    Assignee: Qualcomm Incorported
    Inventors: Nils Günther Peters, Ferdinando Olivieri, Moo Young Kim, Dipanjan Sen, Shankar Thagadur Shivappa
  • Publication number: 20210160644
    Abstract: An example device configured to obtain image data includes a memory configured to store one or more priority values, each of the one or more priority values being associated with a type of image object associated with the image data. The device includes one or more processors coupled to the memory, and configured to associate image objects in the image data with one or more audio sources represented in one or more audio streams. The one or more processors are also configured to assign a respective priority value to each of the one or more audio sources represented in the one or more streams and code ambisonic coefficients based on the assigned priority value.
    Type: Application
    Filed: November 18, 2020
    Publication date: May 27, 2021
    Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters