Patents by Inventor Daniel A. Barreda

Daniel A. Barreda has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Voice activity detection (VAD) for a coded speech bitstream without decoding

Patent number: 9997172

Abstract: A system, method and computer program product are described for voice activity detection (VAD) within a digitally encoded bitstream. A parameter extraction module is configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech. A VAD classifier is configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames.

Type: Grant

Filed: December 2, 2013

Date of Patent: June 12, 2018

Assignee: Nuance Communications, Inc.

Inventors: Daniel A. Barreda, Jose E. G. Lainez, Dushyant Sharma, Patrick Naylor
System and method to reduce transmission bandwidth via improved discontinuous transmission

Patent number: 9489958

Abstract: The present disclosure is directed towards a method for discontinuous transmission (“DTX”) bandwidth reduction. The method may include receiving, at a processor, a frame identified as speech and determining that the frame was mistakenly identified as speech based upon, at least in part, a voice activity detection algorithm. The method may further include labeling the frame as a silence indicator frame.

Type: Grant

Filed: July 31, 2014

Date of Patent: November 8, 2016

Assignee: Nuance Communications, Inc.

Inventors: Sridhar Pilli, Jose Lainez, Dushyant Sharma, Daniel A. Barreda, Patrick Naylor, Mahesh Godavarti
System and method for compressed domain estimation of the signal to noise ratio of a coded speech signal

Patent number: 9361899

Abstract: The present disclosure is directed towards a process for estimating the signal to noise ratio of a speech signal. The process may include receiving, at a computing device, a speech signal having a bitstream and a signal-to-noise ratio (“SNR”) associated therewith. The process may further include estimating the SNR directly from the bitstream or using a partial decoder that is configured to extract one or more parameters, the parameters including at least one of a fixed codebook gain, an adaptive codebook gain, a pitch lag, and a line spectral frequency (“LSF”) coefficient.

Type: Grant

Filed: July 2, 2014

Date of Patent: June 7, 2016

Assignee: Nuance Communications, Inc.

Inventors: Jose Lainez, Daniel A. Barreda, Dushyant Sharma, Patrick Naylor, Sridhar Pilli
SYSTEM AND METHOD TO REDUCE TRANSMISSION BANDWIDTH VIA IMPROVED DISCONTINUOUS TRANSMISSION

Publication number: 20160035359

Abstract: The present disclosure is directed towards a method for discontinuous transmission (“DTX”) bandwidth reduction. The method may include receiving, at a processor, a frame identified as speech and determining that the frame was mistakenly identified as speech based upon, at least in part, a voice activity detection algorithm. The method may further include labeling the frame as a silence indicator frame.

Type: Application

Filed: July 31, 2014

Publication date: February 4, 2016

Inventors: Sridhar Pilli, Jose Lainez, Dushyant Sharma, Daniel A. Barreda, Patrick Naylor, Mahesh Godavarti
SYSTEM AND METHOD FOR COMPRESSED DOMAIN ESTIMATION OF THE SIGNAL TO NOISE RATIO OF A CODED SPEECH SIGNAL

Publication number: 20160005414

Abstract: The present disclosure is directed towards a process for estimating the signal to noise ratio of a speech signal. The process may include receiving, at a computing device, a speech signal having a bitstream and a signal-to-noise ratio (“SNR”) associated therewith. The process may further include estimating the SNR directly from the bitstream or using a partial decoder that is configured to extract one or more parameters, the parameters including at least one of a fixed codebook gain, an adaptive codebook gain, a pitch lag, and a line spectral frequency (“LSF”) coefficient.

Type: Application

Filed: July 2, 2014

Publication date: January 7, 2016

Inventors: Jose Lainez, Daniel A. Barreda, Dushyant Sharma, Patrick Naylor, Sridhar Pilli
Voice Activity Detection (VAD) for a Coded Speech Bitstream without Decoding

Publication number: 20150154981

Abstract: A system, method and computer program product are described for voice activity detection (VAD) within a digitally encoded bitstream. A parameter extraction module is configured to extract parameters from a sequence of coded frames from a digitally encoded bitstream containing speech. A VAD classifier is configured to operate with input of the digitally encoded bitstream to evaluate each coded frame based on bitstream coding parameter classification features to output a VAD decision indicative of whether or not speech is present in one or more of the coded frames.

Type: Application

Filed: December 2, 2013

Publication date: June 4, 2015

Applicant: Nuance Communications, Inc.

Inventors: Daniel A. Barreda, Jose E.G. Lainez, Dushyant Sharma, Patrick Naylor

Voice activity detection (VAD) for a coded speech bitstream without decoding

System and method to reduce transmission bandwidth via improved discontinuous transmission

System and method for compressed domain estimation of the signal to noise ratio of a coded speech signal

SYSTEM AND METHOD TO REDUCE TRANSMISSION BANDWIDTH VIA IMPROVED DISCONTINUOUS TRANSMISSION

SYSTEM AND METHOD FOR COMPRESSED DOMAIN ESTIMATION OF THE SIGNAL TO NOISE RATIO OF A CODED SPEECH SIGNAL

Voice Activity Detection (VAD) for a Coded Speech Bitstream without Decoding