Patents by Inventor Caiming Xiong

Caiming Xiong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

QUASI-RECURRENT NEURAL NETWORK BASED ENCODER-DECODER MODEL

Publication number: 20210103816

Abstract: The technology disclosed provides a quasi-recurrent neural network (QRNN) encoder-decoder model that alternates convolutional layers, which apply in parallel across timesteps, and minimalist recurrent pooling layers that apply in parallel across feature dimensions.

Type: Application

Filed: December 15, 2020

Publication date: April 8, 2021

Inventors: James BRADBURY, Stephen Joseph MERITY, Caiming XIONG, Richard SOCHER
Structured text translation

Patent number: 10963652

Abstract: Approaches for the translation of structured text include an embedding module for encoding and embedding source text in a first language, an encoder for encoding output of the embedding module, a decoder for iteratively decoding output of the encoder based on generated tokens in translated text from previous iterations, a beam module for constraining output of the decoder with respect to possible embedded tags to include in the translated text for a current iteration using a beam search, and a layer for selecting a token to be included in the translated text for the current iteration. The translated text is in a second language different from the first language. In some embodiments, the approach further includes scoring and pointer modules for selecting the token based on the output of the beam module or copied from the source text or reference text from a training pair best matching the source text.

Type: Grant

Filed: January 31, 2019

Date of Patent: March 30, 2021

Assignee: salesforce.com, inc.

Inventors: Kazuma Hashimoto, Raffaella Buschiazzo, James Bradbury, Teresa Marshall, Caiming Xiong, Richard Socher
Dynamic coattention network for question answering

Patent number: 10963782

Abstract: The technology disclosed relates to an end-to-end neural network for question answering, referred to herein as “dynamic coattention network (DCN)”. Roughly described, the DCN includes an encoder neural network and a coattentive encoder that capture the interactions between a question and a document in a so-called “coattention encoding”. The DCN also includes a decoder neural network and highway maxout networks that process the coattention encoding to estimate start and end positions of a phrase in the document that responds to the question.

Type: Grant

Filed: January 31, 2017

Date of Patent: March 30, 2021

Assignee: salesforce.com, inc.

Inventors: Caiming Xiong, Victor Zhong, Richard Socher
Near-Zero-Cost Differentially Private Deep Learning with Teacher Ensembles

Publication number: 20210089882

Abstract: Systems and methods are provided for near-zero-cost (NZC) query framework or approach for differentially private deep learning. To protect the privacy of training data during learning, the near-zero-cost query framework transfers knowledge from an ensemble of teacher models trained on partitions of the data to a student model. Privacy guarantees may be understood intuitively and expressed rigorously in terms of differential privacy. Other features are also provided.

Type: Application

Filed: October 21, 2019

Publication date: March 25, 2021

Inventors: Lichao SUN, Jia LI, Caiming XIONG, Yingbo ZHOU
Dense video captioning

Patent number: 10958925

Abstract: Systems and methods for dense captioning of a video include a multi-layer encoder stack configured to receive information extracted from a plurality of video frames, a proposal decoder coupled to the encoder stack and configured to receive one or more outputs from the encoder stack, a masking unit configured to mask the one or more outputs from the encoder stack according to one or more outputs from the proposal decoder, and a decoder stack coupled to the masking unit and configured to receive the masked one or more outputs from the encoder stack. Generating the dense captioning based on one or more outputs of the decoder stack. In some embodiments, the one or more outputs from the proposal decoder include a differentiable mask. In some embodiments, during training, error in the dense captioning is back propagated to the decoder stack, the encoder stack, and the proposal decoder.

Type: Grant

Filed: November 18, 2019

Date of Patent: March 23, 2021

Assignee: salesforce.com, inc.

Inventors: Yingbo Zhou, Luowei Zhou, Caiming Xiong, Richard Socher
NATURAL LANGUAGE PROCESSING USING CONTEXT-SPECIFIC WORD VECTORS

Publication number: 20210073459

Abstract: A system is provided for natural language processing. In some embodiments, the system includes an encoder for generating context-specific word vectors for at least one input sequence of words. The encoder is pre-trained using training data for performing a first natural language processing task. A neural network performs a second natural language processing task on the at least one input sequence of words using the context-specific word vectors. The first natural language process task is different from the second natural language processing task and the neural network is separately trained from the encoder.

Type: Application

Filed: September 21, 2020

Publication date: March 11, 2021

Inventors: Bryan McCann, Caiming Xiong, Richard Socher
Dialogue state tracking using a global-local encoder

Patent number: 10929607

Abstract: A method for maintaining a dialogue state associated with a dialogue between a user and a digital system includes receiving, by a dialogue state tracker associated with the digital system, a representation of a user communication, updating, by the dialogue state tracker, the dialogue state and providing a system response based on the updated dialogue state. The dialogue state is updated by evaluating, based on the representation of the user communication, a plurality of member scores corresponding to a plurality of ontology members of an ontology set, and selecting, based on the plurality of member scores, zero or more of the plurality of ontology members to add to or remove from the dialogue state.

Type: Grant

Filed: May 14, 2018

Date of Patent: February 23, 2021

Assignee: salesforce.com, inc.

Inventors: Victor Zhong, Caiming Xiong
JOINT MANY-TASK NEURAL NETWORK MODEL FOR MULTIPLE NATURAL LANGUAGE PROCESSING (NLP) TASKS

Publication number: 20210042604

Abstract: The technology disclosed provides a so-called “joint many-task neural network model” to solve a variety of increasingly complex natural language processing (NLP) tasks using growing depth of layers in a single end-to-end model. The model is successively trained by considering linguistic hierarchies, directly connecting word representations to all model layers, explicitly using predictions in lower tasks, and applying a so-called “successive regularization” technique to prevent catastrophic forgetting. Three examples of lower level model layers are part-of-speech (POS) tagging layer, chunking layer, and dependency parsing layer. Two examples of higher level model layers are semantic relatedness layer and textual entailment layer. The model achieves the state-of-the-art results on chunking, dependency parsing, semantic relatedness and textual entailment.

Type: Application

Filed: October 26, 2020

Publication date: February 11, 2021

Inventors: Kazuma Hashimoto, Caiming Xiong, Richard SOCHER
Abstraction of text summarization

Patent number: 10909157

Abstract: A system is disclosed for providing an abstractive summary of a source textual document. The system includes an encoder, a decoder, and a fusion layer. The encoder is capable of generating an encoding for the source textual document. The decoder is separated into a contextual model and a language model. The contextual model is capable of extracting words from the source textual document using the encoding. The language model is capable of generating vectors paraphrasing the source textual document based on pre-training with a training dataset. The fusion layer is capable of generating the abstractive summary of the source textual document from the extracted words and the generated vectors for paraphrasing. In some embodiments, the system utilizes a novelty metric to encourage the generation of novel phrases for inclusion in the abstractive summary.

Type: Grant

Filed: July 31, 2018

Date of Patent: February 2, 2021

Assignee: salesforce.com, inc.

Inventors: Romain Paulus, Wojciech Kryscinski, Caiming Xiong
Two-stage online detection of action start in untrimmed videos

Patent number: 10902289

Abstract: Embodiments described herein provide a two-stage online detection of action start system including a classification module and a localization module. The classification module generates a set of action scores corresponding to a first video frame from the video, based on the first video frame and video frames before the first video frames in the video. Each action score indicating a respective probability that the first video frame contains a respective action class. The localization module is coupled to the classification module for receiving the set of action scores from the classification module and generating an action-agnostic start probability that the first video frame contains an action start.

Type: Grant

Filed: April 25, 2019

Date of Patent: January 26, 2021

Assignee: salesforce.com, inc.

Inventors: Mingfei Gao, Richard Socher, Caiming Xiong
Multitask Learning As Question Answering

Publication number: 20200380213

Abstract: Approaches for multitask learning as question answering include an input layer for encoding a context and a question, a self-attention based transformer including an encoder and a decoder, a first bi-directional long-term short-term memory (biLSTM) for further encoding an output of the encoder, a long-term short-term memory (LSTM) for generating a context-adjusted hidden state from the output of the decoder and a hidden state, an attention network for generating first attention weights based on an output of the first biLSTM and an output of the LSTM, a vocabulary layer for generating a distribution over a vocabulary, a context layer for generating a distribution over the context, and a switch for generating a weighting between the distributions over the vocabulary and the context, generating a composite distribution based on the weighting, and selecting a word of an answer using the composite distribution.

Type: Application

Filed: August 18, 2020

Publication date: December 3, 2020

Inventors: Bryan McCann, Nitish Shirish Keskar, Caiming Xiong, Richard Socher
SYSTEMS AND METHODS FOR VERIFICATION OF DISCRIMINATIVE MODELS

Publication number: 20200372339

Abstract: Verification of discriminative models includes receiving an input; receiving a prediction from a discriminative model for the input; encoding, using an encoder, a latent variable based on the input; decoding, using a decoder, a reconstructed input based on the prediction and the latent variable; and determining, using an anomaly detection module, whether the prediction is reliable based on the input, the reconstructed input, and the latent variable. The encoder and the decoder are jointly trained to maximize an evidence lower bound of the encoder and the decoder. In some embodiments, the encoder and the decoder are further trained using a disentanglement constraint between the prediction and the latent variable. In some embodiments, the encoder and the decoder are further trained without using inputs that are out of a distribution of inputs used to train the discriminative model or that are adversarial to the discriminative model.

Type: Application

Filed: October 3, 2019

Publication date: November 26, 2020

Inventors: Tong Che, Caiming Xiong
Systems and Methods for Reading Comprehension for a Question Answering Task

Publication number: 20200372341

Abstract: Embodiments described herein provide a pipelined natural language question answering system that improves a BERT-based system. Specifically, the natural language question answering system uses a pipeline of neural networks each trained to perform a particular task. The context selection network identifies premium context from context for the question. The question type network identifies the natural language question as a yes, no, or span question and a yes or no answer to the natural language question when the question is a yes or no question. The span extraction model determines an answer span to the natural language question when the question is a span question.

Type: Application

Filed: November 26, 2019

Publication date: November 26, 2020

Inventors: Akari ASAI, Kazuma HASHIMOTO, Richard SOCHER, Caiming XIONG
Weakly Supervised Natural Language Localization Networks

Publication number: 20200372116

Abstract: Systems and methods are provided for weakly supervised natural language localization (WSNLL), for example, as implemented in a neural network or model. The WSNLL network is trained with long, untrimmed videos, i.e., videos that have not been temporally segmented or annotated. The WSNLL network or model defines or generates a video-sentence pair, which corresponds to a pairing of an untrimmed video with an input text sentence. According to some embodiments, the WSNLL network or model is implemented with a two-branch architecture, where one branch performs segment sentence alignment and the other one conducts segment selection.

Type: Application

Filed: August 5, 2019

Publication date: November 26, 2020

Inventors: Mingfei GAO, Richard SOCHER, Caiming Xiong
Robustness Evaluation via Natural Typos

Publication number: 20200372319

Abstract: A method for evaluating robustness of one or more target neural network models using natural typos. The method includes receiving one or more natural typo generation rules associated with a first task associated with a first input document type, receiving a first target neural network model, and receiving a first document and corresponding its ground truth labels. The method further includes generating one or more natural typos for the first document based on the one or more natural typo generation rules, and providing, to the first target neural network model, a test document generated based on the first document and the one or more natural typos as an input document to generate a first output. A robustness evaluation result of the first target neural network model is generated based on a comparison between the output and the ground truth labels.

Type: Application

Filed: September 3, 2019

Publication date: November 26, 2020

Inventors: Lichao SUN, Kazuma HASHIMOTO, Jia LI, Richard SOCHER, Caiming XIONG
Spatial attention model for image captioning

Patent number: 10846478

Abstract: The technology disclosed presents a novel spatial attention model that uses current hidden state information of a decoder long short-term memory (LSTM) to guide attention and to extract spatial image features for use in image captioning. The technology disclosed also presents a novel adaptive attention model for image captioning that mixes visual information from a convolutional neural network (CNN) and linguistic information from an LSTM. At each timestep, the adaptive attention model automatically decides how heavily to rely on the image, as opposed to the linguistic model, to emit the next caption word. The technology disclosed further adds a new auxiliary sentinel gate to an LSTM architecture and produces a sentinel LSTM (Sn-LSTM). The sentinel gate produces a visual sentinel at each timestep, which is an additional representation, derived from the LSTM's memory, of long and short term visual and linguistic information.

Type: Grant

Filed: October 23, 2019

Date of Patent: November 24, 2020

Assignee: salesforce.com, inc.

Inventors: Jiasen Lu, Caiming Xiong, Richard Socher
SYSTEMS AND METHODS FOR UNSUPERVISED AUTOREGRESSIVE TEXT COMPRESSION

Publication number: 20200364299

Abstract: Embodiments described herein provide a provide a fully unsupervised model for text compression. Specifically, the unsupervised model is configured to identify an optimal deletion path for each input sequence of texts (e.g., a sentence) and words from the input sequence are gradually deleted along the deletion path. To identify the optimal deletion path, the unsupervised model may adopt a pretrained bidirectional language model (BERT) to score each candidate deletion based on the average perplexity of the resulting sentence and performs a simple greedy look-ahead tree search to select the best deletion for each step.

Type: Application

Filed: August 23, 2019

Publication date: November 19, 2020

Inventors: Tong Niu, Caiming Xiong, Richard Socher
Joint many-task neural network model for multiple natural language processing (NLP) tasks

Patent number: 10839284

Abstract: The technology disclosed provides a so-called “joint many-task neural network model” to solve a variety of increasingly complex natural language processing (NLP) tasks using growing depth of layers in a single end-to-end model. The model is successively trained by considering linguistic hierarchies, directly connecting word representations to all model layers, explicitly using predictions in lower tasks, and applying a so-called “successive regularization” technique to prevent catastrophic forgetting. Three examples of lower level model layers are part-of-speech (POS) tagging layer, chunking layer, and dependency parsing layer. Two examples of higher level model layers are semantic relatedness layer and textual entailment layer. The model achieves the state-of-the-art results on chunking, dependency parsing, semantic relatedness and textual entailment.

Type: Grant

Filed: January 31, 2017

Date of Patent: November 17, 2020

Assignee: salesforce.com, inc.

Inventors: Kazuma Hashimoto, Caiming Xiong, Richard Socher
Natural language processing using context specific word vectors

Patent number: 10817650

Abstract: A system is provided for natural language processing. In some embodiments, the system includes an encoder for generating context-specific word vectors for at least one input sequence of words. The encoder is pre-trained using training data for performing a first natural language processing task. A neural network performs a second natural language processing task on the at least one input sequence of words using the context-specific word vectors. The first natural language process task is different from the second natural language processing task and the neural network is separately trained from the encoder. In some embodiments, the first natural processing task can be machine translation, and the second natural processing task can be one of sentiment analysis, question classification, entailment classification, and question answering.

Type: Grant

Filed: May 17, 2018

Date of Patent: October 27, 2020

Assignee: salesforce.com, inc.

Inventors: Bryan McCann, Caiming Xiong, Richard Socher
SYSTEMS AND METHODS FOR UNIFYING QUESTION ANSWERING AND TEXT CLASSIFICATION VIA SPAN EXTRACTION

Publication number: 20200334334

Abstract: Systems and methods for unifying question answering and text classification via span extraction include a preprocessor for preparing a source text and an auxiliary text based on a task type of a natural language processing task, an encoder for receiving the source text and the auxiliary text from the preprocessor and generating an encoded representation of a combination of the source text and the auxiliary text, and a span-extractive decoder for receiving the encoded representation and identifying a span of text within the source text that is a result of the NLP task. The task type is one of entailment, classification, or regression. In some embodiments, the source text includes one or more of text received as input when the task type is entailment, a list of classifications when the task type is entailment or classification, or a list of similarity options when the task type is regression.

Type: Application

Filed: July 22, 2019

Publication date: October 22, 2020

Inventors: Nitish Shirish Keskar, Bryan McCann, Richard Socher, Caiming Xiong

prev … 5 6 7 8 9 10 11 12 13 next