Patents by Inventor Mrudula V. Athi

Mrudula V. Athi has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Dereverberation and noise reduction

Patent number: 12272369

Abstract: A system configured to improve audio processing by performing dereverberation and noise reduction during a communication session. In some examples, the system may include a deep neural network (DNN) configured to perform speech enhancement, which is located after an Acoustic Echo Cancellation (AEC) component. For example, the DNN may process isolated audio data output by the AEC component to jointly mitigate additive noise and reverberation. In other examples, the system may include a DNN configured to perform acoustic interference cancellation, which may jointly mitigate additive noise, reverberation, and residual echo, removing the need to perform residual echo suppression processing. The DNN is configured to process complex-valued spectrograms corresponding to the isolated audio data and/or estimated echo data generated by the AEC component.

Type: Grant

Filed: January 19, 2022

Date of Patent: April 8, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Amit Singh Chhetri, Mrudula V. Athi, Pradeep Kumar Govindaraju, Rong Hu
User orientation estimation

Patent number: 12200449

Abstract: A system configured to perform user orientation estimation to determine a direction a user is facing using a deep neural network (DNN). As a directionality of human speech increases with frequency, the DNN may estimate the user orientation by comparing high-frequency components detected by each of the multiple devices. For example, a group of devices may individually generate feature data, which represents audio features and spatial information, and send the feature data to the other devices. Thus, each device in the group receives feature data generated by the other devices and processes this feature data using a DNN to determine an estimate of user orientation. In some examples, the DNN may also generate sound source localization (SSL) data and/or a confidence score associated with the user orientation estimate. A post-processing step may process the individual user orientation estimates generated by the individual devices and determine a final user orientation estimate.

Type: Grant

Filed: December 14, 2022

Date of Patent: January 14, 2025

Assignee: Amazon Technologies, Inc.

Inventors: Mahathir Monjur, Mrudula V Athi, Md Tamzeed Islam, Wontak Kim
Sound source localization

Patent number: 11762052

Abstract: Techniques for improving sound source localization (SSL) are provided. A method for probabilistic SSL using a deep neural network (DNN) may include receiving audio data including a representation of audio such as a wakeword from a microphone array. The audio data may be processed by a DNN to output a plurality of values where each value indicates a probability that the audio originated from a direction corresponding to that value. A sensor may provide computer vision or other data which may be used to inform the plurality of values based on detecting presence of a human or obstacle. A probability that the audio originated from one of the directions of the plurality of directions may be determined based at least in part on the DNN output and the computer vision or other data.

Type: Grant

Filed: September 15, 2021

Date of Patent: September 19, 2023

Assignee: Amazon Technologies, Inc.

Inventors: Anshuman Ganguly, Mrudula V. Athi, Spencer Russell, Alexander M. Epstein, Wontak Kim

Dereverberation and noise reduction

User orientation estimation

Sound source localization