Patents by Inventor Caiming Xiong

Caiming Xiong has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

SYSTEMS AND METHODS FOR STRUCTURED TEXT TRANSLATION WITH TAG ALIGNMENT

Publication number: 20210397799

Abstract: Approaches for the translation of structured text include an embedding module for encoding and embedding source text in a first language, an encoder for encoding output of the embedding module, a decoder for iteratively decoding output of the encoder based on generated tokens in translated text from previous iterations, a beam module for constraining output of the decoder with respect to possible embedded tags to include in the translated text for a current iteration using a beam search, and a layer for selecting a token to be included in the translated text for the current iteration. The translated text is in a second language different from the first language. In some embodiments, the approach further includes scoring and pointer modules for selecting the token based on the output of the beam module or copied from the source text or reference text from a training pair best matching the source text.

Type: Application

Filed: August 31, 2021

Publication date: December 23, 2021

Inventors: Kazuma Hashimoto, Raffaella Buschiazzo, James Bradbury, Teresa Anna Marshall, Caiming Xiong, Richard Socher
SYSTEMS AND METHODS FOR LEARNING FOR DOMAIN ADAPTATION

Publication number: 20210389736

Abstract: A method for training parameters of a first domain adaptation model. The method includes evaluating a cycle consistency objective using a first task specific model associated with a first domain and a second task specific model associated with a second domain, and evaluating one or more first discriminator models to generate a first discriminator objective using the second task specific model. The one or more first discriminator models include a plurality of discriminators corresponding to a plurality of bands that corresponds domain variable ranges of the first and second domains respectively. The method further includes updating, based on the cycle consistency objective and the first discriminator objective, one or more parameters of the first domain adaptation model for adapting representations from the first domain to the second domain.

Type: Application

Filed: August 30, 2021

Publication date: December 16, 2021

Inventors: Ehsan Hosseini-Asl, Caiming Xiong, Yingbo Zhou, Richard Socher
SYSTEMS AND METHODS FOR SAFE POLICY IMPROVEMENT FOR TASK ORIENTED DIALOGUES

Publication number: 20210383212

Abstract: Embodiments described herein provide safe policy improvement (SPI) in a batch reinforcement learning framework for a task-oriented dialogue. Specifically, a batch reinforcement learning framework for dialogue policy learning is provided, which improves the performance of the dialogue and learns to shape a reward that reasons the invention behind human response rather than just imitating the human demonstration.

Type: Application

Filed: November 25, 2020

Publication date: December 9, 2021

Inventors: Govardana Sachithanandam Ramachandran, Kazuma Hashimoto, Caiming Xiong, Richard Socher
SYSTEMS AND METHODS FOR COMPOSED VARIATIONAL NATURAL LANGUAGE GENERATION

Publication number: 20210374358

Abstract: Embodiments described herein provide a composed variational natural language generation (CLANG) model that is configured to generate training samples for few-shot intents. Specifically, the CLANG model may build connections between existing training samples of many-shot intents and new training samples of few-shot intents by modeling an intent as a combination of a domain and an action. In this way, the CLANG model transfers knowledge from existing many-shot intents to few-shot intents in natural language generation by learning how to compose utterances with many-shot intents and transferring such knowledge to few-shot intents.

Type: Application

Filed: September 2, 2020

Publication date: December 2, 2021

Inventors: Congying Xia, Caiming Xiong
SYSTEMS AND METHODS FOR DOMAIN ADAPTATION IN DIALOG ACT TAGGING

Publication number: 20210375269

Abstract: Embodiments described herein utilize pre-trained masked language models as the backbone for dialogue act tagging and provide cross-domain generalization of the resulting dialogue acting taggers. For example, a pre-trained MASK token of BERT model may be used as a controllable mechanism for augmenting text input, e.g., generating tags for an input of unlabeled dialogue history. The pre-trained MASK model can be trained with semi-supervised learning, e.g., using multiple objectives from supervised tagging loss, masked tagging loss, masked language model loss, and/or a disagreement loss.

Type: Application

Filed: August 21, 2020

Publication date: December 2, 2021

Inventors: Semih Yavuz, Kazuma Hashimoto, Wenhao Liu, Nitish Shirish Keskar, Richard Socher, Caiming Xiong
SYSTEMS AND METHODS FOR COMPOSED VARIATIONAL NATURAL LANGUAGE GENERATION

Publication number: 20210374603

Abstract: Embodiments described herein provide a composed variational natural language generation (CLANG) model that is configured to generate training samples for few-shot intents. Specifically, the CLANG model may build connections between existing training samples of many-shot intents and new training samples of few-shot intents by modeling an intent as a combination of a domain and an action. In this way, the CLANG model transfers knowledge from existing many-shot intents to few-shot intents in natural language generation by learning how to compose utterances with many-shot intents and transferring such knowledge to few-shot intents.

Type: Application

Filed: September 2, 2020

Publication date: December 2, 2021

Inventors: Congying Xia, Caiming Xiong
Systems and Methods for Out-of-Distribution Detection

Publication number: 20210374524

Abstract: Some embodiments of the current disclosure disclose methods and systems for detecting out-of-distribution (ODD) data. For example, a method for detecting ODD data includes obtaining, at a neural network composed of a plurality of layers, a set of training data generated according to a distribution. Further, the method comprises generating, via a processor, a feature map by combining mapping functions corresponding to the plurality of layers into a vector of mapping function elements and mapping, by the feature map, the set of training data to a set of feature space training data in a feature space. Further, the method comprises identifying, via the processor, a hyper-ellipsoid in the feature space enclosing the feature space training data based on the generated feature map. In addition, the method comprises determining, via the processor, the first test data sample is OOD data when a mapped first test data sample in the feature space is outside the hyper-ellipsoid.

Type: Application

Filed: November 13, 2020

Publication date: December 2, 2021

Inventors: Yihao Feng, Caiming Xiong
BRIDGING TEXTUAL AND TABULAR DATA FOR CROSS DOMAIN TEXT-TO-QUERY LANGUAGE SEMANTIC PARSING WITH A PRE-TRAINED TRANSFORMER LANGUAGE ENCODER AND ANCHOR TEXT

Publication number: 20210374133

Abstract: A text-to-database neural network architecture is provided. The architecture receives a natural language question and a database schema and generates a serialized question-schema representation that includes a question and at least one table and at least one field from the database schema. The serialized question-schema representation is appended with at least one value that matches a word in the natural language question and at least one field in a database picklist. An encoder in the architecture generates question and schema encodings from the appended question-schema representation. Schema encodings are associated with metadata that indicates a data type of the fields and whether fields are associated with primary or foreign keys. A decoder in the architecture generates an executable query from the question encodings and schema encodings.

Type: Application

Filed: October 6, 2020

Publication date: December 2, 2021

Inventors: Xi Lin, Caiming Xiong
Diversity and Explainability Parameters for Recommendation Accuracy in Machine Learning Recommendation Systems

Publication number: 20210374132

Abstract: Embodiments are directed to a machine learning recommendation system. The system receives a user query for generating a recommendation for one or more items with an explanation associated with recommending the one or more items. The system obtains first features of at least one user and second features of a set of items. The system provides the first features and the second features to a first machine learning network for determining a predicted score for an item. The system provides a portion of the first features and a portion of the second features to second machine learning networks for determining explainability scores for an item and generating corresponding explanation narratives. The system provides the recommendation for one or more items and corresponding explanation narratives based on ranking predicted scores and explainability scores for the items.

Type: Application

Filed: November 10, 2020

Publication date: December 2, 2021

Inventors: Wenzhuo Yang, Jia Li, Chenxi Li, Latrice Barnett, Markus Anderle, Simo Arajarvi, Harshavardhan Utharavalli, Caiming Xiong, Richard Socher, Chu Hong Hoi
EFFICIENT DETERMINATION OF USER INTENT FOR NATURAL LANGUAGE EXPRESSIONS BASED ON MACHINE LEARNING

Publication number: 20210374353

Abstract: An online system allows user interactions using natural language expressions. The online system uses a machine learning based model to infer an intent represented by a user expression. The machine learning based model takes as input a user expression and an example expression to compute a score indicating whether the user expression matches the example expression. Based on the scores, the intent inference module determines a most applicable intent for the expression. The online system determines a confidence threshold such that user expressions indicating a high confidence are assigned the most applicable intent and user expressions indicating a low confidence are assigned an out-of-scope intent. The online system encodes the example expressions using the machine learning based model. The online system may compare an encoded user expression with encoded example expressions to identify a subset of example expressions used to determine the most applicable intent.

Type: Application

Filed: August 28, 2020

Publication date: December 2, 2021

Inventors: Jianguo Zhang, Kazuma Hashimoto, Chien-Sheng Wu, Wenhao Liu, Richard Socher, Caiming Xiong
PREDICTION-CORRECTION APPROACH TO ZERO SHOT LEARNING

Publication number: 20210365740

Abstract: Approaches to zero-shot learning include partitioning training data into first and second sets according to classes assigned to the training data, training a prediction module based on the first set to predict a cluster center based on a class label, training a correction module based on the second set and each of the class labels in the first set to generate a correction to a cluster center predicted by the prediction module, presenting a new class label for a new class to the prediction module to predict a new cluster center, presenting the new class label, the predicted new cluster center, and each of the class labels in the first set to the correction module to generate a correction for the predicted new cluster center, augmenting a classifier based on the corrected cluster center for the new class, and classifying input data into the new class using the classifier.

Type: Application

Filed: August 9, 2021

Publication date: November 25, 2021

Inventors: Lily HU, Caiming XIONG, Richard SOCHER
SYSTEMS AND METHODS FOR PARTIALLY SUPERVISED ONLINE ACTION DETECTION IN UNTRIMMED VIDEOS

Publication number: 20210357687

Abstract: Embodiments described herein provide systems and methods for a partially supervised training model for online action detection. Specifically, the online action detection framework may include two modules that are trained jointly—a Temporal Proposal Generator (TPG) and an Online Action Recognizer (OAR). In the training phase, OAR performs both online per-frame action recognition and start point detection. At the same time, TPG generates class-wise temporal action proposals serving as noisy supervisions for OAR. TPG is then optimized with the video-level annotations. In this way, the online action detection framework can be trained with video-category labels only without pre-annotated segment-level boundary labels.

Type: Application

Filed: July 16, 2020

Publication date: November 18, 2021

Inventors: Mingfei Gao, Yingbo Zhou, Ran Xu, Caiming Xiong
Generating dual sequence inferences using a neural network model

Patent number: 11170287

Abstract: A computer-implemented method for dual sequence inference using a neural network model includes generating a codependent representation based on a first input representation of a first sequence and a second input representation of a second sequence using an encoder of the neural network model and generating an inference based on the codependent representation using a decoder of the neural network model. The neural network model includes a plurality of model parameters learned according to a machine learning process. The encoder includes a plurality of coattention layers arranged sequentially, each coattention layer being configured to receive a pair of layer input representations and generate one or more summary representations, and an output layer configured to receive the one or more summary representations from a last layer among the plurality of coattention layers and generate the codependent representation.

Type: Grant

Filed: January 26, 2018

Date of Patent: November 9, 2021

Assignee: salesforce.com, inc.

Inventors: Victor Zhong, Caiming Xiong, Richard Socher
Phone-Based Sub-Word Units for End-to-End Speech Recognition

Publication number: 20210319796

Abstract: System and methods for identifying a text word from a spoken utterance are provided. An ensemble BPE system that includes a phone BPE system and a character BPE system receives a spoken utterance. Both BPE systems include a multi-level language model (LM) and an acoustic model. The phone BPE system identifies first words from the spoken utterance and determine a first score for each first word. The first words are converted into character sequences. The character BPE model converts the character sequences into second words and determines a second score for each second word. For each word from the first words that matches a word in the second words the first and second scores are combined. The text word is the word with a highest score.

Type: Application

Filed: June 17, 2020

Publication date: October 14, 2021

Inventors: Weiran Wang, Yingbo Zhou, Caiming Xiong
SELF-AWARE VISUAL-TEXTUAL CO-GROUNDED NAVIGATION AGENT

Publication number: 20210286369

Abstract: An agent for navigating a mobile automated system is disclosed herein. The navigation agent receives a navigation instruction and visual information for one or more observed images. The navigation agent is provided or equipped with self-awareness, which provides or supports the following abilities: identifying which direction to go or proceed by determining the part of the instruction that corresponds to the observed images (visual grounding), and identifying which part of the instruction has been completed or ongoing and which part is potentially needed for the next action selection (textual grounding). In some embodiments, the navigation agent applies regularization to ensures that the grounded instruction can correctly be used to estimate the progress made towards the navigation goal (progress monitoring).

Type: Application

Filed: May 27, 2021

Publication date: September 16, 2021

Inventors: Chih-Yao Ma, Caiming Xiong
TRAINING A JOINT MANY-TASK NEURAL NETWORK MODEL USING SUCCESSIVE REGULARIZATION

Publication number: 20210279551

Abstract: The technology disclosed provides a so-called “joint many-task neural network model” to solve a variety of increasingly complex natural language processing (NLP) tasks using growing depth of layers in a single end-to-end model. The model is successively trained by considering linguistic hierarchies, directly connecting word representations to all model layers, explicitly using predictions in lower tasks, and applying a so-called “successive regularization” technique to prevent catastrophic forgetting. Three examples of lower level model layers are part-of-speech (POS) tagging layer, chunking layer, and dependency parsing layer. Two examples of higher level model layers are semantic relatedness layer and textual entailment layer. The model achieves the state-of-the-art results on chunking, dependency parsing, semantic relatedness and textual entailment.

Type: Application

Filed: May 26, 2021

Publication date: September 9, 2021

Inventors: Kazuma Hashimoto, Caiming Xiong, Richard Socher
Dynamic memory network

Patent number: 11113598

Abstract: A novel unified neural network framework, the dynamic memory network, is disclosed. This unified framework reduces every task in natural language processing to a question answering problem over an input sequence. Inputs and questions are used to create and connect deep memory sequences. Answers are then generated based on dynamically retrieved memories.

Type: Grant

Filed: July 27, 2016

Date of Patent: September 7, 2021

Assignee: salesforce.com, inc.

Inventors: Richard Socher, Ankit Kumar, Ozan Irsoy, Mohit Iyyer, Caiming Xiong, Stephen Merity, Romain Paulus
Systems and methods for learning for domain adaptation

Patent number: 11106182

Abstract: A method for training parameters of a first domain adaptation model includes evaluating a cycle consistency objective using a first task specific model associated with a first domain and a second task specific model associated with a second domain. The evaluating the cycle consistency objective is based on one or more first training representations adapted from the first domain to the second domain by a first domain adaptation model and from the second domain to the first domain by a second domain adaptation model, and one or more second training representations adapted from the second domain to the first domain by the second domain adaptation model and from the first domain to the second domain by the first domain adaptation model. The method further includes evaluating a learning objective based on the cycle consistency objective, and updating parameters of the first domain adaptation model based on learning objective.

Type: Grant

Filed: August 3, 2018

Date of Patent: August 31, 2021

Assignee: salesforce.com, inc.

Inventors: Ehsan Hosseini-Asl, Caiming Xiong, Yingbo Zhou, Richard Socher
SYSTEM AND METHOD FOR GRAPH-BASED RESOURCE ALLOCATION USING NEURAL NETWORKS

Publication number: 20210256370

Abstract: A method for using a neural network to generate an improved graph model includes receiving, by the neural network, a graph model. The graph model is based on data relating to an environment for allocating resources to a first group and a second group. The method further includes receiving, by the neural network, a budget for editing the graph model based on a cost of corresponding modification to the environment, and determining, by the neural network, a fairness representation based on a fairness requirement between the first and second groups. It is determined by the neural network, a utility function for the graph model based on first and second group utilities representing resource allocation to the first and second groups respectively. Reinforcement learning is performed on the neural network to generate the improved graph model using the utility function and the fairness representation.

Type: Application

Filed: November 17, 2020

Publication date: August 19, 2021

Inventors: Govardana Sachithanandam Ramachandran, Ivan BRUGERE, Lav Varshney, Caiming Xiong
Agent persona grounded chit-chat generation framework

Patent number: 11087092

Abstract: Approaches for determining a response for an agent in an undirected dialogue are provided. The approaches include a dialogue generating framework comprising an encoder neural network, a decoder neural network, and a language model neural network. The dialogue generating framework generates a sketch sentence response with at least one slot. The sketch sentence response is generated word by word and takes into account the undirected dialogue and agent traits of the agent making the response. The dialogue generating framework generates sentence responses by filling the slot with words from the agent traits. The dialogue generating framework ranks the sentence responses according to perplexity by passing the sentence responses through a language model and selects a final response which is a sentence response that has a lowest perplexity.

Type: Grant

Filed: April 30, 2019

Date of Patent: August 10, 2021

Assignee: salesforce.com, inc.

Inventors: Stephan Zheng, Wojciech Kryscinski, Michael Shum, Richard Socher, Caiming Xiong

prev … 3 4 5 6 7 8 9 10 11 … next