Patents by Inventor David SEETAPUN

David SEETAPUN has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

ACCELERATING CONVOLUTIONS FOR SPARSE INPUTS

Publication number: 20230008777

Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for performing an accelerated convolution on sparse inputs. In one aspect, a method comprises receiving sensor data input comprising input features for input spatial locations; and processing the sensor data input using a convolutional neural network having a first convolutional layer with a filter having multiple filter spatial locations to generate a network output comprising output features for output spatial locations, wherein processing the sensor data input comprises: obtaining a rule book tensor that identifies for each filter spatial location (i) a subset of the input features, and (ii) for each input feature in the subset, a respective output feature; for each particular filter spatial location: generating input tile, filter tile, and output tile sets in accordance with the rule book tensor; and generating the output features in the output tile set based on the tile sets.

Type: Application

Filed: July 9, 2021

Publication date: January 12, 2023

Inventor: David Seetapun
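
As an illustration of the rule-book idea in the abstract above, the following minimal sketch applies a 1D convolution only at active (non-empty) input locations. It is an assumption-laden toy, not the patented method: features are scalars, output locations are assumed to coincide with the active input locations, the input/filter/output tiling step is omitted, and the names `build_rule_book` and `sparse_conv1d` are hypothetical.

```python
from collections import defaultdict


def build_rule_book(active_positions, filter_offsets):
    """Map each filter location k to its (input_index, output_index) pairs.

    Assumption: output sites are the same as the active input sites.
    """
    position_to_index = {pos: i for i, pos in enumerate(active_positions)}
    rule_book = defaultdict(list)
    for out_idx, out_pos in enumerate(active_positions):
        for k, offset in enumerate(filter_offsets):
            # Only record a pair when the neighbouring input site is active.
            in_idx = position_to_index.get(out_pos + offset)
            if in_idx is not None:
                rule_book[k].append((in_idx, out_idx))
    return rule_book


def sparse_conv1d(features, active_positions, weights, filter_offsets):
    """Accumulate weight * input feature into each output listed in the rule book."""
    rule_book = build_rule_book(active_positions, filter_offsets)
    outputs = [0.0] * len(active_positions)
    for k, pairs in rule_book.items():
        for in_idx, out_idx in pairs:
            outputs[out_idx] += weights[k] * features[in_idx]
    return outputs


# Three active sites out of a much longer, mostly empty input.
result = sparse_conv1d(
    features=[1.0, 2.0, 3.0],
    active_positions=[2, 3, 7],
    weights=[0.5, 1.0, 0.25],
    filter_offsets=[-1, 0, 1],
)
print(result)  # [1.5, 2.5, 3.0]
```

Because the work is driven by the rule book, the cost scales with the number of active input/output pairs rather than with the full spatial extent of the input.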

Systems and methods for principled bias reduction in production speech models

Patent number: 10657955

Abstract: Described herein are systems and methods to identify and address sources of bias in an end-to-end speech model. In one or more embodiments, the end-to-end model may be a recurrent neural network with two 2D-convolutional input layers, followed by multiple bidirectional recurrent layers and one fully connected layer before a softmax layer. In one or more embodiments, the network is trained end-to-end using the CTC loss function to directly predict sequences of characters from log spectrograms of audio. With optimized recurrent layers and training together with alignment information, some unwanted bias induced by using purely forward only recurrences may be removed in a deployed model.

Type: Grant

Filed: January 30, 2018

Date of Patent: May 19, 2020

Assignee: Baidu USA LLC

Inventors: Eric Battenberg, Rewon Child, Adam Coates, Christopher Fougner, Yashesh Gaur, Jiaji Huang, Heewoo Jun, Ajay Kannan, Markus Kliegl, Atul Kumar, Hairong Liu, Vinay Rao, Sanjeev Satheesh, David Seetapun, Anuroop Sriram, Zhenyao Zhu
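
The abstract above describes a network trained with the CTC loss to predict character sequences. As a rough illustration of how per-frame CTC outputs are commonly turned into text, the sketch below uses greedy (best-path) decoding: take the highest-scoring symbol per frame, collapse consecutive repeats, then drop the blank symbol. The choice of greedy decoding, the function name `ctc_greedy_decode`, and the toy alphabet are assumptions for illustration and are not taken from the patent.

```python
def ctc_greedy_decode(frame_scores, alphabet, blank_index=0):
    """Collapse a per-frame best path into a character string.

    frame_scores: list of per-frame score lists, one score per alphabet entry.
    alphabet: symbols indexed like the scores; alphabet[blank_index] is the CTC blank.
    """
    # Pick the highest-scoring symbol index for every frame.
    best_path = [
        max(range(len(scores)), key=scores.__getitem__) for scores in frame_scores
    ]
    decoded = []
    previous = None
    for index in best_path:
        # Emit a symbol only when it differs from the previous frame and is not blank.
        if index != previous and index != blank_index:
            decoded.append(alphabet[index])
        previous = index
    return "".join(decoded)


# Toy example: the best path is a, a, blank, a, b, b.
scores = [
    [0.1, 0.8, 0.1],
    [0.2, 0.7, 0.1],
    [0.9, 0.05, 0.05],
    [0.1, 0.6, 0.3],
    [0.1, 0.2, 0.7],
    [0.0, 0.3, 0.7],
]
print(ctc_greedy_decode(scores, alphabet=["_", "a", "b"]))  # aab
```

The blank between the second and third "a" frames is what lets CTC represent a genuinely repeated character.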

SYSTEMS AND METHODS FOR PRINCIPLED BIAS REDUCTION IN PRODUCTION SPEECH MODELS

Publication number: 20180247643

Abstract: Described herein are systems and methods to identify and address sources of bias in an end-to-end speech model. In one or more embodiments, the end-to-end model may be a recurrent neural network with two 2D-convolutional input layers, followed by multiple bidirectional recurrent layers and one fully connected layer before a softmax layer. In one or more embodiments, the network is trained end-to-end using the CTC loss function to directly predict sequences of characters from log spectrograms of audio. With optimized recurrent layers and training together with alignment information, some unwanted bias induced by using purely forward only recurrences may be removed in a deployed model.

Type: Application

Filed: January 30, 2018

Publication date: August 30, 2018

Applicant: Baidu USA LLC

Inventors: Eric BATTENBERG, Rewon CHILD, Adam COATES, Christopher FOUGNER, Yashesh GAUR, Jiaji HUANG, Heewoo JUN, Ajay KANNAN, Markus KLIEGL, Atul KUMAR, Hairong LIU, Vinay RAO, Sanjeev SATHEESH, David SEETAPUN, Anuroop SRIRAM, Zhenyao ZHU
