Patents by Inventor Taher Shahbazi Mirzahasanloo
Taher Shahbazi Mirzahasanloo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).
-
Patent number: 10951975Abstract: Methods, systems, and devices for signal processing are described. Generally, in one example as provided for by the described techniques, a wearable device includes a processor configured to retrieve a plurality of external microphone signals that includes audio sound from outside of the device from a memory; to separate, based on at least information from an internal microphone signal, a self-voice component from a background component; to perform a first listen-through operation on the separated self-voice component to produce a first listen-through signal; and to produce an output audio signal that is based on at least the first listen-through signal, wherein the output audio signal includes an audio zoom signal that includes audio sound of the plurality of external microphone signals.Type: GrantFiled: June 8, 2020Date of Patent: March 16, 2021Assignee: Qualcomm IncorporatedInventors: Lae-Hoon Kim, Dongmei Wang, Fatemeh Saki, Taher Shahbazi Mirzahasanloo, Erik Visser, Rogerio Guedes Alves
-
Publication number: 20200402523Abstract: In general, techniques are described for psychoacoustic audio coding of ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the bitstream that includes an encoded audio object and a corresponding spatial component that defines spatial characteristics of the encoded foreground audio signal. The encoded foreground audio signal may include a coded gain and a coded shape. The one or more processors may perform a gain and shape synthesis with respect to the coded gain and the coded shape to obtain a foreground audio signal, and reconstruct, based on the foreground audio signal and the spatial component, the ambisonic audio data.Type: ApplicationFiled: June 22, 2020Publication date: December 24, 2020Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
-
Publication number: 20200402529Abstract: In general, techniques are described by which to correlate scene-based audio data for psychoacoustic audio coding. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including a plurality of encoded correlated components of a soundfield represented by scene-based audio data. The one or more processors may perform psychoacoustic audio decoding with respect to one or more of the plurality of encoded correlated components to obtain a plurality of correlated components, and obtain, from the bitstream, an indication representative of how the one or more of the plurality of correlated components were reordered in the bitstream. The one or more processors may reorder, based on the indication, the plurality of correlated components to obtain a plurality of reordered components, and reconstruct, based on the plurality of reordered components, the scene-based audio data.Type: ApplicationFiled: June 22, 2020Publication date: December 24, 2020Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
-
Publication number: 20200402522Abstract: In general, techniques are described for quantizing spatial components based on bit allocations determined for psychoacoustic audio coding. A device comprising a memory and one or more processors may perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing the psychoacoustic audio decoding, a first bit allocation for the encoded foreground audio signal. The one or more processors may also determine, based on the first bit allocation, a second bit allocation, and dequantize, based on the second bit allocation, the quantized spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.Type: ApplicationFiled: June 22, 2020Publication date: December 24, 2020Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
-
Publication number: 20200402519Abstract: In general, techniques are described by which to code scaled spatial components. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing psychoacoustic audio decoding, a bit allocation for the encoded foreground audio signal. The one or more processors may dequantize the quantized spatial component to obtain a scaled spatial component, and descale, based on the bit allocation, the scaled spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.Type: ApplicationFiled: June 22, 2020Publication date: December 24, 2020Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
-
Publication number: 20200402521Abstract: In general, various aspects of the techniques described in this disclosure are directed to performing psychoacoustic audio coding based on operating conditions. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store the encoded scene-based audio data. The one or more processors may be configured to obtain an operating condition of the device for decoding the encoded scene-based audio data and perform, based on the operating condition, psychoacoustic audio decoding with respect to the encoded scene-based audio data to obtain ambisonic transport format audio data. The one or more processors may also be configured to perform spatial audio decoding with respect to the ambisonic transport format audio data to obtain scene-based audio data.Type: ApplicationFiled: June 22, 2020Publication date: December 24, 2020Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
-
Patent number: 10854214Abstract: A device includes a memory and one or more processors coupled to the memory. The one or more processors are configured to perform an active noise cancellation (ANC) operation on noisy input speech as captured by a first microphone, the noisy input speech as captured by a second microphone, or both, to suppress a noise level associated with the noisy input speech. The one or more processors are configured to match a second frequency spectrum of a second signal with a first frequency spectrum of a first signal. The first signal is representative of the noisy input speech as captured by the first microphone, and the second signal is representative of the noisy input speech as captured by the second microphone. The one or more processors are also configured to generate an output speech signal that is representative of input speech based on the second signal.Type: GrantFiled: March 29, 2019Date of Patent: December 1, 2020Assignee: QUALCOMM IncorporatedInventors: Rogerio Guedes Alves, Taher Shahbazi Mirzahasanloo, Erik Visser, Lae-Hoon Kim, Fatemeh Saki, Dongmei Wang
-
Publication number: 20200312341Abstract: A device includes a memory and one or more processors coupled to the memory. The one or more processors are configured to perform an active noise cancellation (ANC) operation on noisy input speech as captured by a first microphone, the noisy input speech as captured by a second microphone, or both, to suppress a noise level associated with the noisy input speech. The one or more processors are configured to match a second frequency spectrum of a second signal with a first frequency spectrum of a first signal. The first signal is representative of the noisy input speech as captured by the first microphone, and the second signal is representative of the noisy input speech as captured by the second microphone. The one or more processors are also configured to generate an output speech signal that is representative of input speech based on the second signal.Type: ApplicationFiled: March 29, 2019Publication date: October 1, 2020Inventors: Rogerio Guedes ALVES, Taher SHAHBAZI MIRZAHASANLOO, Erik VISSER, Lae-Hoon KIM, Fatemeh SAKI, Dongmei WANG
-
Publication number: 20200304903Abstract: Methods, systems, and devices for signal processing are described. Generally, in one example as provided for by the described techniques, a wearable device includes a processor configured to retrieve a plurality of external microphone signals that includes audio sound from outside of the device from a memory; to separate, based on at least information from an internal microphone signal, a self-voice component from a background component; to perform a first listen-through operation on the separated self-voice component to produce a first listen-through signal; and to produce an output audio signal that is based on at least the first listen-through signal, wherein the output audio signal includes an audio zoom signal that includes audio sound of the plurality of external microphone signals.Type: ApplicationFiled: June 8, 2020Publication date: September 24, 2020Inventors: Lae-Hoon KIM, Dongmei WANG, Fatemeh SAKI, Taher SHAHBAZI MIRZAHASANLOO, Erik VISSER, Rogerio Guedes ALVES
-
Publication number: 20200278832Abstract: In general, techniques are described that enable voice activation for computing devices. A computing device configured to support an audible interface that comprises a memory and one or more processors may be configured to perform the techniques. The memory may store a first audio signal representative of an environment external to a user associated with the computing device and a second audio signal sensed by a microphone coupled to a housing of the computing device. The one or more processors may verify, based on the first audio signal and the second audio signal, that the user activated the audible interface of the computing device, and obtain, based on the verification, additional audio signals representative of one or more audible commands.Type: ApplicationFiled: February 25, 2020Publication date: September 3, 2020Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves, Lae-Hoon Kim, Erik Visser, Dongmei Wang, Fatemeh Saki
-
Patent number: 10762910Abstract: In general, this disclosure describes techniques for seamless audio data compression control system that can respond to the increasing adaptation demands for source coding characteristics to accommodate co-existence of various devices in a personal area network. In one example, a source device includes a memory and one or more processors in communication with the memory. The memory is configured to store audio data. The one or more processors are configured to select, based on a nominal compression ratio or a streaming rate, a compression level of multiple compression levels, the multiple compression levels arranged hierarchically to allow for dynamic transitioning between the compression levels. The one or more processors are further configured to encode, based on the selected compression level of the multiple compression levels, the audio data stored to the memory.Type: GrantFiled: July 31, 2018Date of Patent: September 1, 2020Assignee: Qualcomm IncorporatedInventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
-
Patent number: 10734006Abstract: In general, techniques are described by which to perform audio coding based on audio pattern recognition. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store audio data. The processor may obtain, from a plurality of categories, a category to which the audio data corresponds, and obtain, based on the category, a set of pyramid vector quantization (PVQ) parameters from a plurality of sets of PVQ parameters. The processor may also perform, based on the set of PVQ parameters, PVQ with respect to the audio data to obtain a residual identifier representative of the audio data, and specify, in the bitstream, the residual identifier.Type: GrantFiled: July 31, 2018Date of Patent: August 4, 2020Assignee: Qualcomm IncorporatedInventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
-
Patent number: 10681452Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal (e.g., including both an external signal and a self-voice signal). The wearable device may detect the self-voice signal in the input audio signal based on a self-voice activity detection (SVAD) procedure, and may implement the described techniques based thereon. The wearable device may perform beamforming operations or other separation procedures to isolate the external signal and the self-voice signal from the input audio signal. The wearable device may apply a first filter to the external signal, and a second filter to the self-voice signal. The wearable device may then mix the filtered signals, and generate an output signal that sounds natural to the user.Type: GrantFiled: February 26, 2019Date of Patent: June 9, 2020Assignee: Qualcomm IncorporatedInventors: Lae-Hoon Kim, Dongmei Wang, Fatemeh Saki, Taher Shahbazi Mirzahasanloo, Erik Visser, Rogerio Guedes Alves
-
Patent number: 10586546Abstract: An example apparatus includes a memory configured to store the audio data; and one or more processors in communication with the memory, the one or more processors configured to: decode, from an encoded audio bitstream, a unique identifier for each of a plurality of subbands of audio data; perform inverse pyramid vector quantization (PVQ) using a compact map to reconstruct a residual vector for each subband of the plurality of subbands of the audio data based on the unique identifier for the respective subband of the plurality of subbands of the audio data, wherein the compact map is generated using structural unification of vectors across subbands and relational compression, and wherein the unique identifiers correspond to codevectors; and reconstruct, based on the residual vectors and energy scalars for each subband, the plurality of subbands of the audio data.Type: GrantFiled: July 31, 2018Date of Patent: March 10, 2020Assignee: Qualcomm IncorporatedInventors: Taher Shahbazi Mirzahasanloo, Walter Andres Zuluaga, Rogerio Guedes Alves
-
Patent number: 10580424Abstract: In general, techniques are described by which to perform perceptual audio coding as sequential decision making problems. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store at least a portion of the audio data. The processor may apply a filter to the audio data to obtain subbands of the audio data. The processor may adapt a controller according to a machine learning algorithm, the controller configured to determine bit distributions across the subbands of the audio data. The processor may specify, based on the bit distributions and in a bitstream representative of the audio data, one or more indications representative of the subbands of the audio data, and output the bitstream via a wireless connection in accordance with a wireless communication protocol.Type: GrantFiled: July 31, 2018Date of Patent: March 3, 2020Assignee: Qualcomm IncorporatedInventor: Taher Shahbazi Mirzahasanloo
-
Patent number: 10573331Abstract: An example apparatus includes a memory configured to store the audio data; and one or more processors in communication with the memory, the one or more processors configured to: obtain, for each of a plurality of subbands of audio data, a respective energy scalar and a respective residual identifier; determine overall distortion levels for a plurality of candidate subband pulse allocations for performing pyramid vector dequantization (PVdQ) of the residual identifiers; select, from the plurality of subband pulse allocations and based on the overall distortion levels, a candidate subband pulse allocation; and perform, using the candidate subband pulse allocation, PVdQ on the residual identifiers to reconstruct a residual vector for each subband.Type: GrantFiled: July 31, 2018Date of Patent: February 25, 2020Assignee: Qualcomm IncorporatedInventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
-
Patent number: 10559315Abstract: A method of encoding audio data includes determining an energy level of a first subband of frequency domain audio data, determining a bit allocation for a coarse quantization process and a fine quantization process, determining that the energy level of the first subband of frequency domain audio data is outside a predetermined range of energy levels for the coarse quantization process, reallocating bits assigned to the fine quantization process to an extended-range coarse quantization process, the extended-range coarse quantization process using an extended range of energy levels, wherein the extended range of energy levels is larger than the predetermined range of energy levels for the coarse quantization process, and quantizing the energy level of the first subband of frequency domain audio data using the extended-range coarse quantization process to produce a quantized extended-range coarse energy level.Type: GrantFiled: July 25, 2018Date of Patent: February 11, 2020Assignee: Qualcomm IncorporatedInventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
-
Publication number: 20190371349Abstract: In general, techniques are described by which to perform audio coding based on audio pattern recognition. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store audio data. The processor may obtain, from a plurality of categories, a category to which the audio data corresponds, and obtain, based on the category, a set of pyramid vector quantization (PVQ) parameters from a plurality of sets of PVQ parameters. The processor may also perform, based on the set of PVQ parameters, PVQ with respect to the audio data to obtain a residual identifier representative of the audio data, and specify, in the bitstream, the residual identifier.Type: ApplicationFiled: July 31, 2018Publication date: December 5, 2019Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
-
Publication number: 20190371347Abstract: In general, this disclosure describes techniques for seamless audio data compression control system that can respond to the increasing adaptation demands for source coding characteristics to accommodate co-existence of various devices in a personal area network. In one example, a source device includes a memory and one or more processors in communication with the memory. The memory is configured to store audio data. The one or more processors are configured to select, based on a nominal compression ratio or a streaming rate, a compression level of multiple compression levels, the multiple compression levels arranged hierarchically to allow for dynamic transitioning between the compression levels. The one or more processors are further configured to encode, based on the selected compression level of the multiple compression levels, the audio data stored to the memory.Type: ApplicationFiled: July 31, 2018Publication date: December 5, 2019Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
-
Publication number: 20190371348Abstract: In general, techniques are described by which to perform perceptual audio coding as sequential decision making problems. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store at least a portion of the audio data. The processor may apply a filter to the audio data to obtain subbands of the audio data. The processor may adapt a controller according to a machine learning algorithm, the controller configured to determine bit distributions across the subbands of the audio data. The processor may specify, based on the bit distributions and in a bitstream representative of the audio data, one or more indications representative of the subbands of the audio data, and output the bitstream via a wireless connection in accordance with a wireless communication protocol.Type: ApplicationFiled: July 31, 2018Publication date: December 5, 2019Inventor: Taher Shahbazi Mirzahasanloo