Patents by Inventor Anshuman Ganguly

Anshuman Ganguly has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Beamforming using multiple sensor data

Patent number: 12288566

Abstract: A device capable of using data from multiple sensors to determine an estimated position/direction of a user with respect to the device. The device may use estimated position data, along with confidence data, that originated from a plurality of sensors to fuse the data to determine the user's estimated position and comprehensive confidence of the estimated position. The system may use the location information to perform beamforming/beam steering and/or other downstream operations using the comprehensive estimated position.

Type: Grant

Filed: June 27, 2022

Date of Patent: April 29, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Anshuman Ganguly, Srivatsan Kandadai, Trausti Thor Kristjansson, Wontak Kim
TECHNIQUES FOR ESTIMATING ROOM BOUNDARIES AND LAYOUT USING MICROPHONE PAIRS

Publication number: 20250085420

Abstract: In various embodiments, a method for estimating a room layout includes receiving candidate wall distance estimates for walls in an acoustic environment and confidence scores; receiving device location data for devices; determining, based on the device location data, a location of each wall relative to each audio processing device; generating a room layout matrix that includes the candidate wall distance estimates, wherein each row of the room layout matrix is associated with a respective device and each column is associated with a respective wall and each of the candidate wall distance estimates is associated with the respective walls based on the confidence scores; determining, based on a highest confidence score in each column of the room layout matrix and the location data, a set of wall distance estimates from an ordinal device; and determining, based on the set of wall distance estimates, room layout estimates of the acoustic environment.

Type: Application

Filed: September 9, 2024

Publication date: March 13, 2025

Inventors: Anshuman GANGULY, Abdullah KUCUK, Kadagattur Gopinatha SRINIDHI
TECHNIQUES FOR ESTIMATING ROOM BOUNDARIES AND LAYOUT USING MICROPHONE PAIRS

Publication number: 20250085421

Abstract: In various embodiments, a computer-implemented method comprises emitting a reference audio signal using one or more speakers of an audio processing device located in an acoustic environment, receiving, by each microphone in a microphone pair of the audio processing device, a response of the acoustic environment to the reference audio signal, determining an impulse response of the acoustic environment based on the reference audio signal and the received response of the acoustic environment, determining, based on the impulse response, a plurality of candidate wall distance estimates and a plurality of confidence scores, and determining based on the plurality of candidate wall distance estimates and confidence scores, including candidate wall distance estimates and confidence scores from one or more other audio processing devices, a set of wall distance estimates, each wall distance estimate in the set of wall distance estimates being associated with a different wall in a plurality of walls.

Type: Application

Filed: September 9, 2024

Publication date: March 13, 2025

Inventors: Anshuman GANGULY, Abdullah KUCUK, Kadagattur Gopinatha SRINIDHI
DETERMINING SPATIAL IMPULSE RESPONSE VIA ACOUSTIC SCRAMBLING

Publication number: 20250048055

Abstract: Disclosed embodiments include techniques for determining spatial impulse response via acoustic scrambling. These techniques include a computer-implemented method for generating a frequency sweep signal, the method comprising generating a frequency sweep signal having a monotonically increasing frequency, partitioning the frequency sweep signal into N input segments, each of the N input segments representing a different frequency range, generating an encoding key having a sequence of N non-consecutive numbers, wherein each number in the sequence appears once, generating an output signal by selecting each of the N input segments in an order based on the sequence of N non-consecutive numbers in the encoding key, and causing a speaker to produce audio tones in an audio space based on the output signal.

Type: Application

Filed: July 31, 2023

Publication date: February 6, 2025

Inventors: Anshuman GANGULY, Kadagattur Gopinatha SRINIDHI
Sound source localization

Patent number: 11762052

Abstract: Techniques for improving sound source localization (SSL) are provided. A method for probabilistic SSL using a deep neural network (DNN) may include receiving audio data including a representation of audio such as a wakeword from a microphone array. The audio data may be processed by a DNN to output a plurality of values where each value indicates a probability that the audio originated from a direction corresponding to that value. A sensor may provide computer vision or other data which may be used to inform the plurality of values based on detecting presence of a human or obstacle. A probability that the audio originated from one of the directions of the plurality of directions may be determined based at least in part on the DNN output and the computer vision or other data.

Type: Grant

Filed: September 15, 2021

Date of Patent: September 19, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Anshuman Ganguly, Mrudula V. Athi, Spencer Russell, Alexander M. Epstein, Wontak Kim
Autonomously motile device with audio reflection detection

Patent number: 11217235

Abstract: A device capable of autonomous motion may move in response to a user speaking an utterance, such as a command. Before moving, the device processes audio data received from a microphone array to identify different audio signals arriving at the device from different directions. Based on properties of the audio signals, the device determines which of the audio signals are merely reflections of other audio.

Type: Grant

Filed: November 18, 2019

Date of Patent: January 4, 2022

Assignee: Amazon Technologies, Inc.

Inventors: Wai Chung Chu, Anshuman Ganguly, Carlo Murgia
Audio beam selection

Patent number: 11158335

Abstract: A voice-controlled device includes a beamformer for determining audio data corresponding to one or more directions and a beam selector for selecting in which direction a source of target audio lies. The device determines magnitude spectrums for each beam and for each frequency bin in each beam for each frame of audio data. The device determines frame-by-frame changes in the magnitude and filters the changes to smooth them. The device selects the beam having the greatest smoothed change in magnitude as corresponding to speech.

Type: Grant

Filed: March 28, 2019

Date of Patent: October 26, 2021

Assignee: Amazon Technologies, Inc.

Inventors: Anshuman Ganguly, Srivatsan Kandadai, Wontak Kim

Beamforming using multiple sensor data

TECHNIQUES FOR ESTIMATING ROOM BOUNDARIES AND LAYOUT USING MICROPHONE PAIRS

TECHNIQUES FOR ESTIMATING ROOM BOUNDARIES AND LAYOUT USING MICROPHONE PAIRS

DETERMINING SPATIAL IMPULSE RESPONSE VIA ACOUSTIC SCRAMBLING

Sound source localization

Autonomously motile device with audio reflection detection

Audio beam selection