Patents by Inventor Richard Socher

Richard Socher has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 10282663
    Abstract: The technology disclosed uses a 3D deep convolutional neural network architecture (DCNNA) equipped with so-called subnetwork modules which perform dimensionality reduction operations on a 3D radiological volume before the 3D radiological volume is subjected to computationally expensive operations. Also, the subnetworks convolve 3D data at multiple scales by subjecting the 3D data to parallel processing by different 3D convolutional layer paths. Such multi-scale operations are computationally cheaper than the serial convolutions of traditional CNNs. In addition, performance of the subnetworks is further improved through 3D batch normalization (BN), which normalizes the 3D input fed to the subnetworks and in turn allows higher learning rates for the 3D DCNNA. After several layers of 3D convolution and 3D sub-sampling across a series of subnetwork modules, a feature map with reduced vertical dimensionality is generated from the 3D radiological volume and fed into one or more fully connected layers.
    Type: Grant
    Filed: August 15, 2016
    Date of Patent: May 7, 2019
    Assignee: salesforce.com, inc.
    Inventors: Richard Socher, Caiming Xiong, Kai Sheng Tai
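    A minimal PyTorch sketch of the multi-scale subnetwork idea described in patent 10282663: parallel 3D convolution paths preceded by cheap 1x1x1 dimensionality-reduction convolutions, each followed by 3D batch normalization. Module names, channel counts, and kernel sizes are illustrative assumptions, not the patented architecture.
    ```python
    import torch
    import torch.nn as nn

    class SubnetworkModule3D(nn.Module):
        """Hypothetical multi-scale 3D subnetwork: three parallel conv paths."""
        def __init__(self, in_ch, out_ch=16):
            super().__init__()
            # Cheap 1x1x1 reduction runs before each expensive convolution.
            self.path1 = nn.Sequential(
                nn.Conv3d(in_ch, out_ch, kernel_size=1),
                nn.BatchNorm3d(out_ch), nn.ReLU())
            self.path3 = nn.Sequential(
                nn.Conv3d(in_ch, out_ch, kernel_size=1),
                nn.Conv3d(out_ch, out_ch, kernel_size=3, padding=1),
                nn.BatchNorm3d(out_ch), nn.ReLU())
            self.path5 = nn.Sequential(
                nn.Conv3d(in_ch, out_ch, kernel_size=1),
                nn.Conv3d(out_ch, out_ch, kernel_size=5, padding=2),
                nn.BatchNorm3d(out_ch), nn.ReLU())

        def forward(self, x):  # x: (batch, channels, depth, height, width)
            # The paths process the same 3D input in parallel at different
            # scales; their feature maps are concatenated channel-wise.
            return torch.cat([self.path1(x), self.path3(x), self.path5(x)], dim=1)

    volume = torch.randn(2, 1, 16, 64, 64)          # toy 3D radiological volume
    features = SubnetworkModule3D(in_ch=1)(volume)  # -> (2, 48, 16, 64, 64)
    ```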
  • Publication number: 20190130312
    Abstract: The disclosed technology reveals a hierarchical policy network, for use by a software agent, to accomplish an objective that requires execution of multiple tasks. A terminal policy, learned by training the agent on a terminal task set, serves as the base policy of the intermediate task set. An intermediate policy, learned by training the agent on an intermediate task set, serves as the base policy of the top task set. A top policy is learned by training the agent on a top task set. The agent is configurable to accomplish the objective by traversal of the hierarchical policy network. A current task in a current task set is executed either by executing a previously-learned task selected from the corresponding base task set, governed by the corresponding base policy, or by performing a primitive action selected from a library of primitive actions.
    Type: Application
    Filed: January 31, 2018
    Publication date: May 2, 2019
    Applicant: salesforce.com, inc.
    Inventors: Caiming XIONG, Tianmin SHU, Richard SOCHER
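    A toy Python sketch of the traversal described in publication 20190130312: each policy either delegates a previously-learned task to the base policy one level down or emits a primitive action. The random choice stands in for a trained policy's decision; all names are hypothetical.
    ```python
    import random

    PRIMITIVE_ACTIONS = ["move", "turn", "grasp"]  # illustrative action library

    class Policy:
        def __init__(self, name, base_policy=None):
            self.name = name
            self.base_policy = base_policy  # None for the terminal policy

        def act(self, state):
            # A trained policy would score its options with a network;
            # a coin flip stands in here just to show the control flow.
            if self.base_policy is not None and random.random() < 0.5:
                return self.base_policy.act(state)  # run a learned base task
            return random.choice(PRIMITIVE_ACTIONS)

    terminal = Policy("terminal")                    # learned on terminal tasks
    intermediate = Policy("intermediate", terminal)  # builds on the terminal policy
    top = Policy("top", intermediate)                # builds on the intermediate policy
    print(top.act(state={}))  # traversal bottoms out in a primitive action
    ```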
  • Publication number: 20190130896
    Abstract: The disclosed technology teaches regularizing a deep end-to-end speech recognition model to reduce overfitting and improve generalization: sample speech variations are synthesized from original speech samples labelled with text transcriptions, with a particular original speech sample modified to independently vary its tempo and pitch while retaining its labelled text transcription, thereby producing multiple sample speech variations having multiple degrees of variation from the original. The disclosed technology includes training the deep end-to-end speech recognition model on thousands to millions of original speech samples and their sample speech variations, so that it outputs recognized text transcriptions corresponding to speech detected in both the original samples and the variations.
    Type: Application
    Filed: December 21, 2017
    Publication date: May 2, 2019
    Applicant: salesforce.com, inc.
    Inventors: Yingbo ZHOU, Caiming XIONG, Richard SOCHER
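    A minimal sketch of the tempo/pitch augmentation described in publication 20190130896, written with librosa rather than whatever implementation the filing covers; the rates, step sizes, and function name are assumptions.
    ```python
    import librosa

    def augment(wav_path, transcript, tempo_rates=(0.9, 1.1), pitch_steps=(-2, 2)):
        """Produce labelled variations that vary tempo and pitch independently."""
        y, sr = librosa.load(wav_path, sr=None)
        variations = []
        for rate in tempo_rates:   # change tempo, preserve pitch
            variations.append((librosa.effects.time_stretch(y, rate=rate), transcript))
        for steps in pitch_steps:  # change pitch, preserve tempo
            variations.append((librosa.effects.pitch_shift(y, sr=sr, n_steps=steps), transcript))
        return variations          # every variation keeps the original transcription
    ```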
  • Publication number: 20190130248
    Abstract: A computer-implemented method for dual sequence inference using a neural network model includes generating a codependent representation based on a first input representation of a first sequence and a second input representation of a second sequence using an encoder of the neural network model and generating an inference based on the codependent representation using a decoder of the neural network model. The neural network model includes a plurality of model parameters learned according to a machine learning process. The encoder includes a plurality of coattention layers arranged sequentially, each coattention layer being configured to receive a pair of layer input representations and generate one or more summary representations, and an output layer configured to receive the one or more summary representations from a last layer among the plurality of coattention layers and generate the codependent representation.
    Type: Application
    Filed: January 26, 2018
    Publication date: May 2, 2019
    Inventors: Victor Zhong, Caiming XIONG, Richard SOCHER
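    A hedged PyTorch sketch of a single coattention step in the spirit of publication 20190130248: an affinity matrix couples the two sequences, and each is summarized in terms of the other. Shapes and the function name are illustrative, not the claimed layer.
    ```python
    import torch
    import torch.nn.functional as F

    def coattention(a, b):
        """a: (batch, m, d) first sequence; b: (batch, n, d) second sequence."""
        affinity = torch.bmm(a, b.transpose(1, 2))       # (batch, m, n)
        attn_over_b = F.softmax(affinity, dim=2)         # one distribution per a-position
        attn_over_a = F.softmax(affinity, dim=1)         # one distribution per b-position
        summary_a = torch.bmm(attn_over_b, b)            # b summarized for each a-position
        summary_b = torch.bmm(attn_over_a.transpose(1, 2), a)  # a summarized for b
        return summary_a, summary_b

    s_a, s_b = coattention(torch.randn(4, 7, 64), torch.randn(4, 9, 64))
    ```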
  • Publication number: 20190130273
    Abstract: A method for sequence-to-sequence prediction using a neural network model includes generating an encoded representation based on an input sequence using an encoder of the neural network model and predicting an output sequence based on the encoded representation using a decoder of the neural network model. The neural network model includes a plurality of model parameters learned according to a machine learning process. At least one of the encoder or the decoder includes a branched attention layer. Each branch of the branched attention layer includes an interdependent scaling node configured to scale an intermediate representation of the branch by a learned scaling parameter. The learned scaling parameter depends on one or more other learned scaling parameters of one or more other interdependent scaling nodes of one or more other branches of the branched attention layer.
    Type: Application
    Filed: January 30, 2018
    Publication date: May 2, 2019
    Inventors: Nitish Shirish Keskar, Karim Ahmed, Richard SOCHER
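    A hedged PyTorch sketch of one way to realize the interdependent scaling in publication 20190130273: each branch's output is scaled by a learned weight, and the weights are softmax-normalized across branches, so each branch's scale depends on the parameters of every other branch. The softmax coupling, the stock multi-head attention branches, and the sizes are my assumptions.
    ```python
    import torch
    import torch.nn as nn
    import torch.nn.functional as F

    class BranchedAttention(nn.Module):
        def __init__(self, d_model=64, n_branches=4):
            super().__init__()
            self.branches = nn.ModuleList(
                nn.MultiheadAttention(d_model, num_heads=1, batch_first=True)
                for _ in range(n_branches))
            self.kappa = nn.Parameter(torch.zeros(n_branches))  # learned scales

        def forward(self, x):
            # Interdependent: normalization couples every branch's scale to
            # the others, since the scales must sum to one.
            scales = F.softmax(self.kappa, dim=0)
            outputs = [attn(x, x, x)[0] for attn in self.branches]
            return sum(s * o for s, o in zip(scales, outputs))

    y = BranchedAttention()(torch.randn(2, 10, 64))
    ```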
  • Publication number: 20190130206
    Abstract: Approaches for interpretable counting for visual question answering include a digital image processor, a language processor, a scorer, and a counter. The digital image processor identifies objects in an image, maps the identified objects into an embedding space, generates bounding boxes for each of the identified objects, and outputs the embedded objects paired with their bounding boxes. The language processor embeds a question into the embedding space. The scorer determines scores for the identified objects, each respective score measuring how well a corresponding one of the identified objects is responsive to the question. The counter determines a count of the objects in the digital image that are responsive to the question based on the scores. The count and a corresponding bounding box for each object included in the count are output. In some embodiments, the counter determines the count interactively based on interactions between counted and uncounted objects.
    Type: Application
    Filed: January 29, 2018
    Publication date: May 2, 2019
    Inventors: Alexander Richard Trott, Caiming XIONG, Richard SOCHER
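    An illustrative numpy sketch of the scoring-then-counting flow in publication 20190130206 (not the patented scorer or counter): objects and the question share an embedding space, each object is scored against the question, and objects above a threshold are counted with their bounding boxes returned.
    ```python
    import numpy as np

    def count_objects(object_embs, boxes, question_emb, threshold=0.5):
        """object_embs: (k, d); boxes: k (x1, y1, x2, y2) tuples; question_emb: (d,)."""
        # Cosine similarity scores how responsive each object is to the question.
        norms = np.linalg.norm(object_embs, axis=1) * np.linalg.norm(question_emb)
        scores = object_embs @ question_emb / np.maximum(norms, 1e-8)
        keep = scores > threshold
        # Return the count plus each counted object's box, for interpretability.
        return int(keep.sum()), [b for b, k in zip(boxes, keep) if k]

    count, counted_boxes = count_objects(
        np.random.randn(5, 32), [(0, 0, 10, 10)] * 5, np.random.randn(32))
    ```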
  • Publication number: 20180373682
    Abstract: A system is provided for natural language processing. In some embodiments, the system includes an encoder for generating context-specific word vectors for at least one input sequence of words. The encoder is pre-trained using training data for performing a first natural language processing task. A neural network performs a second natural language processing task on the at least one input sequence of words using the context-specific word vectors. The first natural language processing task is different from the second natural language processing task, and the neural network is trained separately from the encoder.
    Type: Application
    Filed: May 17, 2018
    Publication date: December 27, 2018
    Applicant: salesforce.com, inc.
    Inventors: Bryan McCann, Caiming Xiong, Richard Socher
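    A hedged PyTorch sketch of the transfer setup in publication 20180373682: an encoder pre-trained on one task produces context-specific word vectors, and a separately trained network consumes them for a different task. The module choices (a frozen biLSTM, a small classifier) are assumptions.
    ```python
    import torch
    import torch.nn as nn

    # Encoder pre-trained on a first NLP task (e.g. translation), then frozen.
    encoder = nn.LSTM(input_size=300, hidden_size=300, batch_first=True,
                      bidirectional=True)
    for p in encoder.parameters():
        p.requires_grad = False

    # Network for the second task, trained separately from the encoder.
    task_net = nn.Sequential(nn.Linear(600, 128), nn.ReLU(), nn.Linear(128, 2))

    word_vectors = torch.randn(8, 20, 300)          # an input sequence of words
    context_vectors, _ = encoder(word_vectors)      # context-specific word vectors
    logits = task_net(context_vectors.mean(dim=1))  # second-task prediction
    ```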
  • Publication number: 20180349359
    Abstract: A system includes a neural network for performing a first natural language processing task. The neural network includes a first rectified linear unit (ReLU) capable of executing an activation function on a first input related to a first word sequence, and a second rectified linear unit capable of executing an activation function on a second input related to a second word sequence. A first encoder is capable of receiving the result from the first rectified linear unit and generating a first task-specific representation relating to the first word sequence, and a second encoder is capable of receiving the result from the second rectified linear unit and generating a second task-specific representation relating to the second word sequence. A biattention mechanism is capable of computing, based on the first and second task-specific representations, an interdependent representation related to the first and second word sequences.
    Type: Application
    Filed: June 5, 2018
    Publication date: December 6, 2018
    Applicant: salesforce.com, inc.
    Inventors: Bryan McCann, Caiming Xiong, Richard Socher
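    A hedged PyTorch sketch of the pipeline in publication 20180349359: each word sequence passes through a ReLU network and an encoder, and biattention couples the two task-specific representations. The layer sizes and the LSTM encoders are illustrative stand-ins.
    ```python
    import torch
    import torch.nn as nn

    class BiattentivePair(nn.Module):
        def __init__(self, d=64):
            super().__init__()
            self.relu1 = nn.Sequential(nn.Linear(d, d), nn.ReLU())  # first ReLU net
            self.relu2 = nn.Sequential(nn.Linear(d, d), nn.ReLU())  # second ReLU net
            self.enc1 = nn.LSTM(d, d, batch_first=True)  # first encoder
            self.enc2 = nn.LSTM(d, d, batch_first=True)  # second encoder

        def forward(self, w1, w2):
            r1, _ = self.enc1(self.relu1(w1))  # first task-specific representation
            r2, _ = self.enc2(self.relu2(w2))  # second task-specific representation
            affinity = torch.bmm(r1, r2.transpose(1, 2))
            # Biattention: each sequence is summarized in terms of the other,
            # yielding an interdependent representation of the pair.
            c1 = torch.bmm(torch.softmax(affinity, dim=2), r2)
            c2 = torch.bmm(torch.softmax(affinity.transpose(1, 2), dim=2), r1)
            return c1, c2

    c1, c2 = BiattentivePair()(torch.randn(2, 8, 64), torch.randn(2, 11, 64))
    ```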
  • Publication number: 20180336453
    Abstract: A system automatically generates recurrent neural network (RNN) architectures for performing specific tasks, for example, machine translation. The system represents RNN architectures using a domain specific language (DSL). The system generates candidate RNN architectures. The system predicts performances of the generated candidate RNN architectures, for example, using a neural network. The system filters the candidate RNN architectures based on their predicted performance. The system generates code for the selected candidate architectures. The generated code represents an RNN that is configured to perform the specific task. The system executes the generated code, for example, to evaluate an RNN or to use the RNN in an application.
    Type: Application
    Filed: April 13, 2018
    Publication date: November 22, 2018
    Inventors: Stephen Joseph Merity, Richard Socher, James Bradbury, Caiming Xiong
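    An illustrative sketch of the search loop in publication 20180336453: candidate RNN cells are expressed in a tiny DSL (nested tuples of operators), a performance estimator ranks them, and only the survivors would be compiled and trained. The operators and the random estimator are placeholders, not the patented DSL.
    ```python
    import random

    OPS = ["add", "mul", "tanh", "sigmoid"]

    def random_candidate(depth=2):
        """Grow a random DSL expression over the cell inputs x_t and h_prev."""
        if depth == 0:
            return random.choice(["x_t", "h_prev"])
        op = random.choice(OPS)
        arity = 2 if op in ("add", "mul") else 1
        return (op, *[random_candidate(depth - 1) for _ in range(arity)])

    def predicted_performance(candidate):
        return random.random()  # stand-in for the learned performance predictor

    candidates = [random_candidate() for _ in range(100)]
    # Filter on predicted performance; only survivors get code generation.
    best = sorted(candidates, key=predicted_performance, reverse=True)[:5]
    print(best[0])  # e.g. ('add', ('tanh', 'x_t'), 'h_prev')
    ```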
  • Publication number: 20180336198
    Abstract: A computing system uses neural networks to translate natural language queries to database queries. The computing system uses a plurality of machine learning based models, each generating a portion of the database query. The machine learning models use an input representation generated based on terms of the input natural language query, a set of columns of the database schema, and the vocabulary of a database query language, for example, structured query language (SQL). The plurality of machine learning based models may include an aggregation classifier model for determining an aggregation operator in the database query, a result column predictor model for determining the result columns of the database query, and a condition clause predictor model for determining the condition clause of the database query. The condition clause predictor is based on reinforcement learning.
    Type: Application
    Filed: January 31, 2018
    Publication date: November 22, 2018
    Inventors: Victor Zhong, Caiming Xiong, Richard Socher
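    A hedged sketch of the decomposition in publication 20180336198: three models each predict one part of the SQL query, and the parts are assembled. The predictors are stubbed with lambdas; only the structure mirrors the abstract.
    ```python
    def translate(question, columns):
        agg = aggregation_classifier(question)                 # e.g. "" or "MAX"
        col = result_column_predictor(question, columns)       # e.g. "salary"
        cond = condition_clause_predictor(question, columns)   # learned via RL
        select = f"{agg}({col})" if agg else col
        return f"SELECT {select} FROM table WHERE {cond}"

    # Stub predictors so the sketch runs end to end.
    aggregation_classifier = lambda q: "MAX"
    result_column_predictor = lambda q, cols: cols[0]
    condition_clause_predictor = lambda q, cols: f"{cols[1]} = 'ops'"

    print(translate("highest salary in ops?", ["salary", "dept"]))
    # SELECT MAX(salary) FROM table WHERE dept = 'ops'
    ```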
  • Publication number: 20180268287
    Abstract: The technology disclosed proposes using a combination of a computationally cheap, less-accurate bag of words (BoW) model and a computationally expensive, more-accurate long short-term memory (LSTM) model to perform natural language processing tasks such as sentiment analysis. The use of the cheap, less-accurate BoW model is referred to herein as “skimming”. The use of the expensive, more-accurate LSTM model is referred to herein as “reading”. The technology disclosed presents a probability-based guider (PBG). PBG combines the use of the BoW model and the LSTM model. PBG uses a probability thresholding strategy to determine, based on the results of the BoW model, whether to invoke the LSTM model for reliably classifying a sentence as positive or negative. The technology disclosed also presents a deep neural network-based decision network (DDN) that is trained to learn the relationship between the BoW model and the LSTM model and to invoke only one of the two models.
    Type: Application
    Filed: December 22, 2017
    Publication date: September 20, 2018
    Applicant: salesforce.com, inc.
    Inventors: Alexander Rosenberg Johansen, Bryan McCann, James Bradbury, Richard Socher
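    A minimal sketch of the probability-based guider in publication 20180268287 above (and its companion 20180268298 below): skim with the cheap BoW classifier first, keep its answer if its confidence clears a threshold, and otherwise read with the expensive LSTM. The models are stubbed; only the thresholding strategy is shown.
    ```python
    def classify_sentiment(sentence, bow_model, lstm_model, threshold=0.9):
        prob_positive = bow_model(sentence)                # cheap "skim"
        if max(prob_positive, 1 - prob_positive) >= threshold:
            return prob_positive > 0.5                     # confident: stop here
        return lstm_model(sentence) > 0.5                  # uncertain: "read"

    bow_model = lambda s: 0.55   # stand-in probabilities of a positive label
    lstm_model = lambda s: 0.10
    print(classify_sentiment("the plot was fine, I guess", bow_model, lstm_model))
    ```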
  • Publication number: 20180268298
    Abstract: The technology disclosed proposes using a combination of a computationally cheap, less-accurate bag of words (BoW) model and a computationally expensive, more-accurate long short-term memory (LSTM) model to perform natural language processing tasks such as sentiment analysis. The use of the cheap, less-accurate BoW model is referred to herein as “skimming”. The use of the expensive, more-accurate LSTM model is referred to herein as “reading”. The technology disclosed presents a probability-based guider (PBG). PBG combines the use of the BoW model and the LSTM model. PBG uses a probability thresholding strategy to determine, based on the results of the BoW model, whether to invoke the LSTM model for reliably classifying a sentence as positive or negative. The technology disclosed also presents a deep neural network-based decision network (DDN) that is trained to learn the relationship between the BoW model and the LSTM model and to invoke only one of the two models.
    Type: Application
    Filed: December 22, 2017
    Publication date: September 20, 2018
    Applicant: salesforce.com, inc.
    Inventors: Alexander Rosenberg Johansen, Bryan McCann, James Bradbury, Richard Socher
  • Publication number: 20180144248
    Abstract: The technology disclosed presents a novel spatial attention model that uses current hidden state information of a decoder long short-term memory (LSTM) to guide attention and to extract spatial image features for use in image captioning. The technology disclosed also presents a novel adaptive attention model for image captioning that mixes visual information from a convolutional neural network (CNN) and linguistic information from an LSTM. At each timestep, the adaptive attention model automatically decides how heavily to rely on the image, as opposed to the linguistic model, to emit the next caption word. The technology disclosed further adds a new auxiliary sentinel gate to an LSTM architecture and produces a sentinel LSTM (Sn-LSTM). The sentinel gate produces a visual sentinel at each timestep, which is an additional representation, derived from the LSTM's memory, of long and short term visual and linguistic information.
    Type: Application
    Filed: November 18, 2017
    Publication date: May 24, 2018
    Applicant: salesforce.com, inc.
    Inventors: Jiasen LU, Caiming XIONG, Richard SOCHER
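    A hedged PyTorch sketch of the sentinel gate and adaptive mixing described in publication 20180144248 above (and repeated in 20180144208 and 20180143966 below); the weights, shapes, and mixing scalar are illustrative.
    ```python
    import torch

    d = 64
    x_t, h_prev, c_t = torch.randn(d), torch.randn(d), torch.randn(d)
    W_x, W_h = torch.randn(d, d), torch.randn(d, d)

    # Auxiliary sentinel gate, computed like an extra LSTM gate.
    g_t = torch.sigmoid(W_x @ x_t + W_h @ h_prev)
    # Visual sentinel: a fallback representation distilled from LSTM memory.
    s_t = g_t * torch.tanh(c_t)

    # Adaptive mixing: beta decides how heavily to rely on the sentinel
    # (linguistic memory) versus attended image features for the next word.
    v_attended = torch.randn(d)   # stand-in for spatially attended CNN features
    beta = torch.rand(())         # produced by the attention module in practice
    c_hat = beta * s_t + (1 - beta) * v_attended  # context for the caption word
    ```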
  • Publication number: 20180144208
    Abstract: The technology disclosed presents a novel spatial attention model that uses current hidden state information of a decoder long short-term memory (LSTM) to guide attention and to extract spatial image features for use in image captioning. The technology disclosed also presents a novel adaptive attention model for image captioning that mixes visual information from a convolutional neural network (CNN) and linguistic information from an LSTM. At each timestep, the adaptive attention model automatically decides how heavily to rely on the image, as opposed to the linguistic model, to emit the next caption word. The technology disclosed further adds a new auxiliary sentinel gate to an LSTM architecture and produces a sentinel LSTM (Sn-LSTM). The sentinel gate produces a visual sentinel at each timestep, which is an additional representation, derived from the LSTM's memory, of long and short term visual and linguistic information.
    Type: Application
    Filed: November 17, 2017
    Publication date: May 24, 2018
    Applicant: salesforce.com, inc.
    Inventors: Jiasen LU, Caiming XIONG, Richard SOCHER
  • Publication number: 20180143966
    Abstract: The technology disclosed presents a novel spatial attention model that uses current hidden state information of a decoder long short-term memory (LSTM) to guide attention and to extract spatial image features for use in image captioning. The technology disclosed also presents a novel adaptive attention model for image captioning that mixes visual information from a convolutional neural network (CNN) and linguistic information from an LSTM. At each timestep, the adaptive attention model automatically decides how heavily to rely on the image, as opposed to the linguistic model, to emit the next caption word. The technology disclosed further adds a new auxiliary sentinel gate to an LSTM architecture and produces a sentinel LSTM (Sn-LSTM). The sentinel gate produces a visual sentinel at each timestep, which is an additional representation, derived from the LSTM's memory, of long and short term visual and linguistic information.
    Type: Application
    Filed: November 17, 2017
    Publication date: May 24, 2018
    Applicant: salesforce.com, inc.
    Inventors: Jiasen LU, Caiming XIONG, Richard SOCHER
  • Publication number: 20180129931
    Abstract: The technology disclosed provides a quasi-recurrent neural network (QRNN) encoder-decoder model that alternates convolutional layers, which apply in parallel across timesteps, and minimalist recurrent pooling layers that apply in parallel across feature dimensions.
    Type: Application
    Filed: January 31, 2017
    Publication date: May 10, 2018
    Applicant: salesforce.com, inc.
    Inventors: James BRADBURY, Stephen Joseph MERITY, Caiming XIONG, Richard SOCHER
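    A hedged PyTorch sketch of one QRNN layer per publication 20180129931 above (20180129937 below describes the same construction without the encoder-decoder framing): a causal 1-D convolution over timesteps produces candidate and forget gates for every position in parallel, then a minimalist elementwise recurrence pools them across time. The kernel size and dimensions are illustrative.
    ```python
    import torch
    import torch.nn as nn

    class QRNNLayer(nn.Module):
        def __init__(self, d_in, d_hidden, k=2):
            super().__init__()
            # One convolution computes both Z (candidate) and F (forget) gates.
            self.conv = nn.Conv1d(d_in, 2 * d_hidden, kernel_size=k, padding=k - 1)
            self.d_hidden = d_hidden

        def forward(self, x):                    # x: (batch, time, d_in)
            g = self.conv(x.transpose(1, 2))[:, :, :x.size(1)]  # causal trim
            z, f = g.split(self.d_hidden, dim=1)
            z, f = torch.tanh(z), torch.sigmoid(f)
            # Recurrent pooling is elementwise, so it runs in parallel
            # across feature dimensions even though it is serial in time.
            h, hs = torch.zeros_like(z[:, :, 0]), []
            for t in range(z.size(2)):
                h = f[:, :, t] * h + (1 - f[:, :, t]) * z[:, :, t]
                hs.append(h)
            return torch.stack(hs, dim=1)        # (batch, time, d_hidden)

    out = QRNNLayer(32, 64)(torch.randn(4, 10, 32))
    ```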
  • Publication number: 20180129937
    Abstract: The technology disclosed provides a quasi-recurrent neural network (QRNN) that alternates convolutional layers, which apply in parallel across timesteps, and minimalist recurrent pooling layers that apply in parallel across feature dimensions.
    Type: Application
    Filed: January 31, 2017
    Publication date: May 10, 2018
    Applicant: salesforce.com, inc.
    Inventors: James BRADBURY, Stephen Joseph MERITY, Caiming XIONG, Richard SOCHER
  • Publication number: 20180129938
    Abstract: The technology disclosed relates to an end-to-end neural network for question answering, referred to herein as “dynamic coattention network (DCN)”. Roughly described, the DCN includes an encoder neural network and a coattentive encoder that capture the interactions between a question and a document in a so-called “coattention encoding”. The DCN also includes a decoder neural network and highway maxout networks that process the coattention encoding to estimate start and end positions of a phrase in the document that responds to the question.
    Type: Application
    Filed: January 31, 2017
    Publication date: May 10, 2018
    Applicant: salesforce.com, inc.
    Inventors: Caiming XIONG, Victor ZHONG, Richard SOCHER
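    A hedged PyTorch sketch of a maxout layer, the building block of the highway maxout networks that score candidate start and end positions in publication 20180129938. The pool size and dimensions are illustrative; the real decoder iterates over every document position.
    ```python
    import torch
    import torch.nn as nn

    class Maxout(nn.Module):
        def __init__(self, d_in, d_out, pool=4):
            super().__init__()
            self.lin = nn.Linear(d_in, d_out * pool)
            self.d_out, self.pool = d_out, pool

        def forward(self, x):
            # Max over the pool dimension gives a piecewise-linear activation.
            return self.lin(x).view(*x.shape[:-1], self.d_out, self.pool).max(-1)[0]

    # Score one document position's coattention encoding as an answer start;
    # end positions would be scored the same way by a second network.
    scorer = nn.Sequential(Maxout(600, 200), Maxout(200, 1))
    score = scorer(torch.randn(600))
    ```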
  • Publication number: 20180121787
    Abstract: The technology disclosed provides a so-called “joint many-task neural network model” to solve a variety of increasingly complex natural language processing (NLP) tasks using a growing depth of layers in a single end-to-end model. The model is successively trained by considering linguistic hierarchies, directly connecting word representations to all model layers, explicitly using predictions in lower tasks, and applying a so-called “successive regularization” technique to prevent catastrophic forgetting. Three examples of lower-level model layers are a part-of-speech (POS) tagging layer, a chunking layer, and a dependency parsing layer. Two examples of higher-level model layers are a semantic relatedness layer and a textual entailment layer. The model achieves state-of-the-art results on chunking, dependency parsing, semantic relatedness, and textual entailment.
    Type: Application
    Filed: January 31, 2017
    Publication date: May 3, 2018
    Applicant: salesforce.com, inc.
    Inventors: Kazuma HASHIMOTO, Caiming XIONG, Richard SOCHER
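    A hedged PyTorch sketch of the successive regularization idea in publication 20180121787 above (and its companion 20180121788 below): while training a higher-level task, drift of parameters away from their snapshot after the previous training stage is penalized, limiting catastrophic forgetting. The delta coefficient and the toy task loss are placeholders.
    ```python
    import torch

    def successive_loss(task_loss, params, snapshot, delta=1e-2):
        # Penalize movement away from the previous stage's parameters.
        penalty = sum(((p - s) ** 2).sum() for p, s in zip(params, snapshot))
        return task_loss + delta * penalty

    layer = torch.nn.Linear(16, 16)
    snapshot = [p.detach().clone() for p in layer.parameters()]  # after prior stage
    toy_task_loss = layer(torch.randn(4, 16)).pow(2).mean()
    loss = successive_loss(toy_task_loss, list(layer.parameters()), snapshot)
    loss.backward()
    ```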
  • Publication number: 20180121788
    Abstract: The technology disclosed provides a so-called “joint many-task neural network model” to solve a variety of increasingly complex natural language processing (NLP) tasks using a growing depth of layers in a single end-to-end model. The model is successively trained by considering linguistic hierarchies, directly connecting word representations to all model layers, explicitly using predictions in lower tasks, and applying a so-called “successive regularization” technique to prevent catastrophic forgetting. Three examples of lower-level model layers are a part-of-speech (POS) tagging layer, a chunking layer, and a dependency parsing layer. Two examples of higher-level model layers are a semantic relatedness layer and a textual entailment layer. The model achieves state-of-the-art results on chunking, dependency parsing, semantic relatedness, and textual entailment.
    Type: Application
    Filed: January 31, 2017
    Publication date: May 3, 2018
    Applicant: salesforce.com, inc.
    Inventors: Kazuma HASHIMOTO, Caiming XIONG, Richard SOCHER