Patents by Inventor Vikramjit Mitra

Vikramjit Mitra has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Audio detection and monitoring of respiration

Patent number: 12551113

Abstract: The subject technology provides a framework for estimating respiratory rates from audio data recordings. A multi-task learning network may be trained to output respiratory rates, breathing conditions, and/or noise conditions based on input audio data recordings. The audio data recordings may be generated using wearable audio devices with near-field microphones. The respiratory rates may be provided along with other workout information by a health application of an electronic device. Additional sensor data and/or health data may be used in combination with the audio data and/or the respiratory rates and/or breathing conditions for respiratory and/or other health monitoring by an electronic device.

Type: Grant

Filed: August 4, 2021

Date of Patent: February 17, 2026

Assignee: Apple Inc.

Inventors: Vikramjit Mitra, Agni Kumar, Carolyn R. Oliver, Adeeti V. Ullal, Matthew Biddulph, Irida Mance
PHYSIOLOGICAL STATE PREDICTION BASED ON ACOUSTIC DATA USING MACHINE LEARNING

Publication number: 20250221671

Abstract: The subject technology provides physiological state prediction based on acoustic data using machine learning. An apparatus receives input data comprising acoustic signal information associated with a user. The apparatus extracts one or more acoustic features from the acoustic signal information. The apparatus produces a trained machine learning model by training a neural network to predict one or more physiological states of the user from the one or more acoustic features.

Type: Application

Filed: August 27, 2024

Publication date: July 10, 2025

Inventors: Vikramjit MITRA, Jingping NIE, Erdrin AZEMI
Determining Respiration Rates Based On Audio Streams and User Conditions

Publication number: 20250134408

Abstract: A system can receive an input indicating a user condition. The system can also receive an internal audio stream from an in-ear microphone and an external audio stream from an external microphone of a head worn system. The system can determine a respiration rate of a user based on the internal audio stream, the external audio stream, and the input indicating the user condition. In some implementations, the respiration rate may be determined from a respiration signal in the internal audio stream and/or the external audio stream. The respiration signal may measure breathing of the user. In some implementations, the system can invoke a machine learning model to determine the respiration signal from the internal audio stream and/or the external audio stream based on the user condition.

Type: Application

Filed: September 30, 2024

Publication date: May 1, 2025

Inventors: Juri Minxha, Narimene Lezzoum, Vikramjit Mitra, Erdrin Azemi
Systems and methods for speech recognition in unseen and noisy channel conditions

Patent number: 11217228

Abstract: Systems and methods for speech recognition are provided. In some aspects, the method comprises receiving, using an input, an audio signal. The method further comprises splitting the audio signal into auditory test segments. The method further comprises extracting, from each of the auditory test segments, a set of acoustic features. The method further comprises applying the set of acoustic features to a deep neural network to produce a hypothesis for the corresponding auditory test segment. The method further comprises selectively performing one or more of: indirect adaptation of the deep neural network and direct adaptation of the deep neural network.

Type: Grant

Filed: March 22, 2017

Date of Patent: January 4, 2022

Assignee: SRI International

Inventors: Vikramjit Mitra, Horacio E. Franco, Chris D. Bartels, Dimitra Vergyri, Julien van Hout, Martin Graciarena
Time-frequency convolutional neural network with bottleneck architecture for query-by-example processing

Patent number: 10777188

Abstract: A computing system determines whether a reference audio signal contains a query. A time-frequency convolutional neural network (TFCNN) comprises a time and frequency convolutional layers and a series of additional layers, which include a bottleneck layer. The computation engine applies the TFCNN to samples of a query utterance at least through the bottleneck layer. A query feature vector comprises output values of the bottleneck layer generated when the computation engine applies the TFCNN to the samples of the query utterance. The computation engine also applies the TFCNN to samples of the reference audio signal at least through the bottleneck layer. A reference feature vector comprises output values of the bottleneck layer generated when the computation engine applies the TFCNN to the samples of the reference audio signal. The computation engine determines at least one detection score based on the query feature vector and the reference feature vector.

Type: Grant

Filed: November 14, 2018

Date of Patent: September 15, 2020

Assignee: SRI International

Inventors: Julien van Hout, Vikramjit Mitra, Horacio Franco, Emre Yilmaz
SYSTEMS AND METHODS FOR SPEECH RECOGNITION IN UNSEEN AND NOISY CHANNEL CONDITIONS

Publication number: 20200168208

Abstract: Systems and methods for speech recognition are provided. In some aspects, the method comprises receiving, using an input, an audio signal. The method further comprises splitting the audio signal into auditory test segments. The method further comprises extracting, from each of the auditory test segments, a set of acoustic features. The method further comprises applying the set of acoustic features to a deep neural network to produce a hypothesis for the corresponding auditory test segment. The method further comprises selectively performing one or more of: indirect adaptation of the deep neural network and direct adaptation of the deep neural network.

Type: Application

Filed: March 22, 2017

Publication date: May 28, 2020

Inventors: Vikramjit Mitra, Horacio E. Franco, Chris D. Bartels, Dimitra Vergyri, Julien van Hout, Martin Graciarena
TIME-FREQUENCY CONVOLUTIONAL NEURAL NETWORK WITH BOTTLENECK ARCHITECTURE FOR QUERY-BY-EXAMPLE PROCESSING

Publication number: 20200152179

Abstract: A computing system determines whether a reference audio signal contains a query. A time-frequency convolutional neural network (TFCNN) comprises a time and frequency convolutional layers and a series of additional layers, which include a bottleneck layer. The computation engine applies the TFCNN to samples of a query utterance at least through the bottleneck layer. A query feature vector comprises output values of the bottleneck layer generated when the computation engine applies the TFCNN to the samples of the query utterance. The computation engine also applies the TFCNN to samples of the reference audio signal at least through the bottleneck layer. A reference feature vector comprises output values of the bottleneck layer generated when the computation engine applies the TFCNN to the samples of the reference audio signal. The computation engine determines at least one detection score based on the query feature vector and the reference feature vector.

Type: Application

Filed: November 14, 2018

Publication date: May 14, 2020

Inventors: Julien van Hout, Vikramjit Mitra, Horacio Franco, Emre Yilmaz
Systems for speech-based assessment of a patient's state-of-mind

Patent number: 10478111

Abstract: A computer-implemented method can include a speech collection module collecting a speech pattern from a patient, a speech feature computation module computing at least one speech feature from the collected speech pattern, a mental health determination module determining a state-of-mind of the patient based at least in part on the at least one computed speech feature, and an output module providing an indication of a diagnosis with regard to a possibility that the patient is suffering from a certain condition such as depression or Post-Traumatic Stress Disorder (PTSD).

Type: Grant

Filed: August 5, 2015

Date of Patent: November 19, 2019

Assignee: SRI International

Inventors: Bruce Knoth, Dimitra Vergyri, Elizabeth Shriberg, Vikramjit Mitra, Mitchell McLaren, Andreas Kathol, Colleen Richey, Martin Graciarena
SYSTEMS FOR SPEECH-BASED ASSESSMENT OF A PATIENT'S STATE-OF-MIND

Publication number: 20180214061

Abstract: A computer-implemented method can include a speech collection module collecting a speech pattern from a patient, a speech feature computation module computing at least one speech feature from the collected speech pattern, a mental health determination module determining a state-of-mind of the patient based at least in part on the at least one computed speech feature, and an output module providing an indication of a diagnosis with regard to a possibility that the patient is suffering from a certain condition such as depression or Post-Traumatic Stress Disorder (PTSD).

Type: Application

Filed: August 5, 2015

Publication date: August 2, 2018

Inventors: Bruce Knoth, Dimitra Vergyri, Elizabeth Shriberg, Vikramjit Mitra, Mitchell McLaren, Andreas Kathol, Colleen Richey, Martin Graciarena