Patents by Inventor Taher Shahbazi Mirzahasanloo

Taher Shahbazi Mirzahasanloo has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Seamless listen-through for a wearable device

Patent number: 10951975

Abstract: Methods, systems, and devices for signal processing are described. Generally, in one example as provided for by the described techniques, a wearable device includes a processor configured to retrieve a plurality of external microphone signals that includes audio sound from outside of the device from a memory; to separate, based on at least information from an internal microphone signal, a self-voice component from a background component; to perform a first listen-through operation on the separated self-voice component to produce a first listen-through signal; and to produce an output audio signal that is based on at least the first listen-through signal, wherein the output audio signal includes an audio zoom signal that includes audio sound of the plurality of external microphone signals.

Type: Grant

Filed: June 8, 2020

Date of Patent: March 16, 2021

Assignee: Qualcomm Incorporated

Inventors: Lae-Hoon Kim, Dongmei Wang, Fatemeh Saki, Taher Shahbazi Mirzahasanloo, Erik Visser, Rogerio Guedes Alves
PSYCHOACOUSTIC AUDIO CODING OF AMBISONIC AUDIO DATA

Publication number: 20200402523

Abstract: In general, techniques are described for psychoacoustic audio coding of ambisonic audio data. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store the bitstream that includes an encoded audio object and a corresponding spatial component that defines spatial characteristics of the encoded foreground audio signal. The encoded foreground audio signal may include a coded gain and a coded shape. The one or more processors may perform a gain and shape synthesis with respect to the coded gain and the coded shape to obtain a foreground audio signal, and reconstruct, based on the foreground audio signal and the spatial component, the ambisonic audio data.

Type: Application

Filed: June 22, 2020

Publication date: December 24, 2020

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
CORRELATING SCENE-BASED AUDIO DATA FOR PSYCHOACOUSTIC AUDIO CODING

Publication number: 20200402529

Abstract: In general, techniques are described by which to correlate scene-based audio data for psychoacoustic audio coding. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including a plurality of encoded correlated components of a soundfield represented by scene-based audio data. The one or more processors may perform psychoacoustic audio decoding with respect to one or more of the plurality of encoded correlated components to obtain a plurality of correlated components, and obtain, from the bitstream, an indication representative of how the one or more of the plurality of correlated components were reordered in the bitstream. The one or more processors may reorder, based on the indication, the plurality of correlated components to obtain a plurality of reordered components, and reconstruct, based on the plurality of reordered components, the scene-based audio data.

Type: Application

Filed: June 22, 2020

Publication date: December 24, 2020

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
QUANTIZING SPATIAL COMPONENTS BASED ON BIT ALLOCATIONS DETERMINED FOR PSYCHOACOUSTIC AUDIO CODING

Publication number: 20200402522

Abstract: In general, techniques are described for quantizing spatial components based on bit allocations determined for psychoacoustic audio coding. A device comprising a memory and one or more processors may perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing the psychoacoustic audio decoding, a first bit allocation for the encoded foreground audio signal. The one or more processors may also determine, based on the first bit allocation, a second bit allocation, and dequantize, based on the second bit allocation, the quantized spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.

Type: Application

Filed: June 22, 2020

Publication date: December 24, 2020

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
CODING SCALED SPATIAL COMPONENTS

Publication number: 20200402519

Abstract: In general, techniques are described by which to code scaled spatial components. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may store a bitstream including an encoded foreground audio signal and a corresponding quantized spatial component. The one or more processors may perform psychoacoustic audio decoding with respect to the encoded foreground audio signal to obtain a foreground audio signal, and determine, when performing psychoacoustic audio decoding, a bit allocation for the encoded foreground audio signal. The one or more processors may dequantize the quantized spatial component to obtain a scaled spatial component, and descale, based on the bit allocation, the scaled spatial component to obtain a spatial component. The one or more processors may reconstruct, based on the foreground audio signal and the spatial component, scene-based audio data.

Type: Application

Filed: June 22, 2020

Publication date: December 24, 2020

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
PERFORMING PSYCHOACOUSTIC AUDIO CODING BASED ON OPERATING CONDITIONS

Publication number: 20200402521

Abstract: In general, various aspects of the techniques described in this disclosure are directed to performing psychoacoustic audio coding based on operating conditions. A device comprising a memory and one or more processors may be configured to perform the techniques. The memory may be configured to store the encoded scene-based audio data. The one or more processors may be configured to obtain an operating condition of the device for decoding the encoded scene-based audio data and perform, based on the operating condition, psychoacoustic audio decoding with respect to the encoded scene-based audio data to obtain ambisonic transport format audio data. The one or more processors may also be configured to perform spatial audio decoding with respect to the ambisonic transport format audio data to obtain scene-based audio data.

Type: Application

Filed: June 22, 2020

Publication date: December 24, 2020

Inventors: Ferdinando Olivieri, Taher Shahbazi Mirzahasanloo, Nils Günther Peters
Noise suppression wearable device

Patent number: 10854214

Abstract: A device includes a memory and one or more processors coupled to the memory. The one or more processors are configured to perform an active noise cancellation (ANC) operation on noisy input speech as captured by a first microphone, the noisy input speech as captured by a second microphone, or both, to suppress a noise level associated with the noisy input speech. The one or more processors are configured to match a second frequency spectrum of a second signal with a first frequency spectrum of a first signal. The first signal is representative of the noisy input speech as captured by the first microphone, and the second signal is representative of the noisy input speech as captured by the second microphone. The one or more processors are also configured to generate an output speech signal that is representative of input speech based on the second signal.

Type: Grant

Filed: March 29, 2019

Date of Patent: December 1, 2020

Assignee: QUALCOMM Incorporated

Inventors: Rogerio Guedes Alves, Taher Shahbazi Mirzahasanloo, Erik Visser, Lae-Hoon Kim, Fatemeh Saki, Dongmei Wang
NOISE SUPPRESSION WEARABLE DEVICE

Publication number: 20200312341

Abstract: A device includes a memory and one or more processors coupled to the memory. The one or more processors are configured to perform an active noise cancellation (ANC) operation on noisy input speech as captured by a first microphone, the noisy input speech as captured by a second microphone, or both, to suppress a noise level associated with the noisy input speech. The one or more processors are configured to match a second frequency spectrum of a second signal with a first frequency spectrum of a first signal. The first signal is representative of the noisy input speech as captured by the first microphone, and the second signal is representative of the noisy input speech as captured by the second microphone. The one or more processors are also configured to generate an output speech signal that is representative of input speech based on the second signal.

Type: Application

Filed: March 29, 2019

Publication date: October 1, 2020

Inventors: Rogerio Guedes ALVES, Taher SHAHBAZI MIRZAHASANLOO, Erik VISSER, Lae-Hoon KIM, Fatemeh SAKI, Dongmei WANG
SEAMLESS LISTEN-THROUGH FOR A WEARABLE DEVICE

Publication number: 20200304903

Abstract: Methods, systems, and devices for signal processing are described. Generally, in one example as provided for by the described techniques, a wearable device includes a processor configured to retrieve a plurality of external microphone signals that includes audio sound from outside of the device from a memory; to separate, based on at least information from an internal microphone signal, a self-voice component from a background component; to perform a first listen-through operation on the separated self-voice component to produce a first listen-through signal; and to produce an output audio signal that is based on at least the first listen-through signal, wherein the output audio signal includes an audio zoom signal that includes audio sound of the plurality of external microphone signals.

Type: Application

Filed: June 8, 2020

Publication date: September 24, 2020

Inventors: Lae-Hoon KIM, Dongmei WANG, Fatemeh SAKI, Taher SHAHBAZI MIRZAHASANLOO, Erik VISSER, Rogerio Guedes ALVES
VOICE ACTIVATION FOR COMPUTING DEVICES

Publication number: 20200278832

Abstract: In general, techniques are described that enable voice activation for computing devices. A computing device configured to support an audible interface that comprises a memory and one or more processors may be configured to perform the techniques. The memory may store a first audio signal representative of an environment external to a user associated with the computing device and a second audio signal sensed by a microphone coupled to a housing of the computing device. The one or more processors may verify, based on the first audio signal and the second audio signal, that the user activated the audible interface of the computing device, and obtain, based on the verification, additional audio signals representative of one or more audible commands.

Type: Application

Filed: February 25, 2020

Publication date: September 3, 2020

Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves, Lae-Hoon Kim, Erik Visser, Dongmei Wang, Fatemeh Saki
Hierarchical fine quantization for audio coding

Patent number: 10762910

Abstract: In general, this disclosure describes techniques for seamless audio data compression control system that can respond to the increasing adaptation demands for source coding characteristics to accommodate co-existence of various devices in a personal area network. In one example, a source device includes a memory and one or more processors in communication with the memory. The memory is configured to store audio data. The one or more processors are configured to select, based on a nominal compression ratio or a streaming rate, a compression level of multiple compression levels, the multiple compression levels arranged hierarchically to allow for dynamic transitioning between the compression levels. The one or more processors are further configured to encode, based on the selected compression level of the multiple compression levels, the audio data stored to the memory.

Type: Grant

Filed: July 31, 2018

Date of Patent: September 1, 2020

Assignee: Qualcomm Incorporated

Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
Audio coding based on audio pattern recognition

Patent number: 10734006

Abstract: In general, techniques are described by which to perform audio coding based on audio pattern recognition. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store audio data. The processor may obtain, from a plurality of categories, a category to which the audio data corresponds, and obtain, based on the category, a set of pyramid vector quantization (PVQ) parameters from a plurality of sets of PVQ parameters. The processor may also perform, based on the set of PVQ parameters, PVQ with respect to the audio data to obtain a residual identifier representative of the audio data, and specify, in the bitstream, the residual identifier.

Type: Grant

Filed: July 31, 2018

Date of Patent: August 4, 2020

Assignee: Qualcomm Incorporated

Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
Seamless listen-through for a wearable device

Patent number: 10681452

Abstract: Methods, systems, and devices for signal processing are described. Generally, as provided for by the described techniques, a wearable device may receive an input audio signal (e.g., including both an external signal and a self-voice signal). The wearable device may detect the self-voice signal in the input audio signal based on a self-voice activity detection (SVAD) procedure, and may implement the described techniques based thereon. The wearable device may perform beamforming operations or other separation procedures to isolate the external signal and the self-voice signal from the input audio signal. The wearable device may apply a first filter to the external signal, and a second filter to the self-voice signal. The wearable device may then mix the filtered signals, and generate an output signal that sounds natural to the user.

Type: Grant

Filed: February 26, 2019

Date of Patent: June 9, 2020

Assignee: Qualcomm Incorporated

Inventors: Lae-Hoon Kim, Dongmei Wang, Fatemeh Saki, Taher Shahbazi Mirzahasanloo, Erik Visser, Rogerio Guedes Alves
Inversely enumerated pyramid vector quantizers for efficient rate adaptation in audio coding

Patent number: 10586546

Abstract: An example apparatus includes a memory configured to store the audio data; and one or more processors in communication with the memory, the one or more processors configured to: decode, from an encoded audio bitstream, a unique identifier for each of a plurality of subbands of audio data; perform inverse pyramid vector quantization (PVQ) using a compact map to reconstruct a residual vector for each subband of the plurality of subbands of the audio data based on the unique identifier for the respective subband of the plurality of subbands of the audio data, wherein the compact map is generated using structural unification of vectors across subbands and relational compression, and wherein the unique identifiers correspond to codevectors; and reconstruct, based on the residual vectors and energy scalars for each subband, the plurality of subbands of the audio data.

Type: Grant

Filed: July 31, 2018

Date of Patent: March 10, 2020

Assignee: Qualcomm Incorporated

Inventors: Taher Shahbazi Mirzahasanloo, Walter Andres Zuluaga, Rogerio Guedes Alves
Perceptual audio coding as sequential decision-making problems

Patent number: 10580424

Abstract: In general, techniques are described by which to perform perceptual audio coding as sequential decision making problems. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store at least a portion of the audio data. The processor may apply a filter to the audio data to obtain subbands of the audio data. The processor may adapt a controller according to a machine learning algorithm, the controller configured to determine bit distributions across the subbands of the audio data. The processor may specify, based on the bit distributions and in a bitstream representative of the audio data, one or more indications representative of the subbands of the audio data, and output the bitstream via a wireless connection in accordance with a wireless communication protocol.

Type: Grant

Filed: July 31, 2018

Date of Patent: March 3, 2020

Assignee: Qualcomm Incorporated

Inventor: Taher Shahbazi Mirzahasanloo
Cooperative pyramid vector quantizers for scalable audio coding

Patent number: 10573331

Abstract: An example apparatus includes a memory configured to store the audio data; and one or more processors in communication with the memory, the one or more processors configured to: obtain, for each of a plurality of subbands of audio data, a respective energy scalar and a respective residual identifier; determine overall distortion levels for a plurality of candidate subband pulse allocations for performing pyramid vector dequantization (PVdQ) of the residual identifiers; select, from the plurality of subband pulse allocations and based on the overall distortion levels, a candidate subband pulse allocation; and perform, using the candidate subband pulse allocation, PVdQ on the residual identifiers to reconstruct a residual vector for each subband.

Type: Grant

Filed: July 31, 2018

Date of Patent: February 25, 2020

Assignee: Qualcomm Incorporated

Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
Extended-range coarse-fine quantization for audio coding

Patent number: 10559315

Abstract: A method of encoding audio data includes determining an energy level of a first subband of frequency domain audio data, determining a bit allocation for a coarse quantization process and a fine quantization process, determining that the energy level of the first subband of frequency domain audio data is outside a predetermined range of energy levels for the coarse quantization process, reallocating bits assigned to the fine quantization process to an extended-range coarse quantization process, the extended-range coarse quantization process using an extended range of energy levels, wherein the extended range of energy levels is larger than the predetermined range of energy levels for the coarse quantization process, and quantizing the energy level of the first subband of frequency domain audio data using the extended-range coarse quantization process to produce a quantized extended-range coarse energy level.

Type: Grant

Filed: July 25, 2018

Date of Patent: February 11, 2020

Assignee: Qualcomm Incorporated

Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
AUDIO CODING BASED ON AUDIO PATTERN RECOGNITION

Publication number: 20190371349

Abstract: In general, techniques are described by which to perform audio coding based on audio pattern recognition. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store audio data. The processor may obtain, from a plurality of categories, a category to which the audio data corresponds, and obtain, based on the category, a set of pyramid vector quantization (PVQ) parameters from a plurality of sets of PVQ parameters. The processor may also perform, based on the set of PVQ parameters, PVQ with respect to the audio data to obtain a residual identifier representative of the audio data, and specify, in the bitstream, the residual identifier.

Type: Application

Filed: July 31, 2018

Publication date: December 5, 2019

Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
HIERARCHICAL FINE QUANTIZATION FOR AUDIO CODING

Publication number: 20190371347

Abstract: In general, this disclosure describes techniques for seamless audio data compression control system that can respond to the increasing adaptation demands for source coding characteristics to accommodate co-existence of various devices in a personal area network. In one example, a source device includes a memory and one or more processors in communication with the memory. The memory is configured to store audio data. The one or more processors are configured to select, based on a nominal compression ratio or a streaming rate, a compression level of multiple compression levels, the multiple compression levels arranged hierarchically to allow for dynamic transitioning between the compression levels. The one or more processors are further configured to encode, based on the selected compression level of the multiple compression levels, the audio data stored to the memory.

Type: Application

Filed: July 31, 2018

Publication date: December 5, 2019

Inventors: Taher Shahbazi Mirzahasanloo, Rogerio Guedes Alves
PERCEPTUAL AUDIO CODING AS SEQUENTIAL DECISION-MAKING PROBLEMS

Publication number: 20190371348

Abstract: In general, techniques are described by which to perform perceptual audio coding as sequential decision making problems. A source device comprising a memory and a processor may be configured to perform the techniques. The memory may store at least a portion of the audio data. The processor may apply a filter to the audio data to obtain subbands of the audio data. The processor may adapt a controller according to a machine learning algorithm, the controller configured to determine bit distributions across the subbands of the audio data. The processor may specify, based on the bit distributions and in a bitstream representative of the audio data, one or more indications representative of the subbands of the audio data, and output the bitstream via a wireless connection in accordance with a wireless communication protocol.

Type: Application

Filed: July 31, 2018

Publication date: December 5, 2019

Inventor: Taher Shahbazi Mirzahasanloo

prev 1 2 3 next