Patents by Inventor Semih Yavuz

Semih Yavuz has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Systems and Methods for Constrained Text Generation Using Large Language Models

Publication number: 20250111198

Abstract: In view of the need to improve text generation technology, embodiments described herein provide a neural network model that generates a text output with constraints to achieve desired output behavior, such as reduced toxicity or hallucinations, and inclusion of certain keywords.

Type: Application

Filed: January 25, 2024

Publication date: April 3, 2025

Inventors: Lifu TU, Semih YAVUZ, Yingbo ZHOU
SYSTEMS AND METHODS FOR TEXT GENERATION WITH VOCABULARY DETOXIFICATION

Publication number: 20250111155

Abstract: Embodiments described herein provide a method for mitigating toxic content in text generation by a neural network based framework. The method includes the following operations. A text input of a sequence of tokens is received via a communication interface. A first output probability for a next token generating is generated by a first neural network model that is trained to generate tokens belonging to a prioritized category of vocabulary, in response to the text input. A second output probability of the next token is generated by a second neural network model that is trained to generate tokens belonging to an indiscriminate vocabulary, in response to the text input. The next token for a text output based on a combined output probability computed based on a correction item reflective of the first output probability and the second output probability is generated in response to the text input.

Type: Application

Filed: January 18, 2024

Publication date: April 3, 2025

Inventors: Tong Niu, Yingbo Zhou, Silvio Savarese, Semih Yavuz, Caiming Xiong
SYSTEMS AND METHODS FOR QUESTION ANSWERING WITH DIVERSE KNOWLEDGE SOURCES

Publication number: 20250103592

Abstract: Embodiments described herein provide systems and methods for retrieval augmented generation. A neural network based language model may be provided a question as a user input. Based on the user input, semantically diverse queries may be generated for retrieval from diverse data sources. For example, a structured data source (e.g., database or knowledge base) and unstructured data (e.g., text articles) may be used to retrieve information relevant to the user input. The retrieve information may be ranked so that the most relevant information is used by the language model in generating an answer to the question in the user input. A non-retrieval based answer generated by the language model may be utilized in some embodiments in generating the final answer.

Type: Application

Filed: January 26, 2024

Publication date: March 27, 2025

Inventors: Tong Niu, Shafiq Rayhan Joty, Yingbo Zhou, Semih Yavuz, Wenting Zhao, Ye Liu
SYSTEMS AND METHODS FOR SELECTING NEURAL NETWORK MODELS FOR BUILDING A CUSTOM ARTIFICIAL INTELLIGENCE STACK

Publication number: 20250005276

Abstract: Embodiments described herein provide a system for selecting a neural network based natural language processing (NLP) model for building a custom artificial intelligence (AI) stack for a user. The system includes a communication interface that established connections to one or more external servers hosting one or more neural network based NLP models, a memory; and a processor executing operations including: selecting a source document based on a custom NLP application; generating, by a first language model, a summary of the source document; generating, by a second language model, one or more questions based on at least one of the summary or the source document; transmitting, via the communication interface, the one or more questions to the one or more neural network based NLP models; receiving, via the communication interface, one or more answers generated by the one or more neural network based NLP models.

Type: Application

Filed: October 31, 2023

Publication date: January 2, 2025

Inventors: Meghana Bhat, Semih Yavuz, Rui Meng, Yingbo Zhou
SYSTEMS AND METHODS FOR RETRIEVAL BASED QUESTION ANSWERING USING NEURA NETWORK MODELS

Publication number: 20240428044

Abstract: Embodiments described herein provide a framework that integrates a retriever model and the LLM to feed retrieved passages to an LLM to generate an answer conditioned on the retrieved passages in response to a query. For example, in one embodiment, a single-round approach is implemented, which involves directly transmitting the retrieved passages to the LLM. For another example, a multi-round methodology is implemented, which involves initially presenting the retrieved passages to the Language Model, collecting its responses, and then adjusting our interaction with the Language Model based on this acquired feedback.

Type: Application

Filed: October 30, 2023

Publication date: December 26, 2024

Inventors: Ye Liu, Semih Yavuz, Meghana Moorthy Bhat, Rui Meng, Shafiq Joty, Caiming Xiong, Yingbo Zhou
GENERATING AUTOMATED ASSISTANT RESPONSES AND/OR ACTIONS DIRECTLY FROM DIALOG HISTORY AND RESOURCES

Publication number: 20240347061

Abstract: Training and/or utilizing a single neural network model to generate, at each of a plurality of assistant turns of a dialog session between a user and an automated assistant, a corresponding automated assistant natural language response and/or a corresponding automated assistant action. For example, at a given assistant turn of a dialog session, both a corresponding natural language response and a corresponding action can be generated jointly and based directly on output generated using the single neural network model. The corresponding response and/or corresponding action can be generated based on processing, using the neural network model, dialog history and a plurality of discrete resources. For example, the neural network model can be used to generate a response and/or action on a token-by-token basis.

Type: Application

Filed: June 24, 2024

Publication date: October 17, 2024

Inventors: Arvind Neelakantan, Daniel Duckworth, Ben Goodrich, Vishaal Prasad, Chinnadhurai Sankar, Semih Yavuz
Systems and methods for semantic parsing with primitive level enumeration

Patent number: 12105744

Abstract: Embodiments described herein provide a semantic parsing framework which may be referred to as Uni-Parser. The Uni-Parser framework may be applied to question answering on both knowledge bases and databases. The three main stages of the Uni-Parser framework are enumeration, ranking, and generation. At the enumeration stage, primitives are enumerated based on matching the question to the data structure. After enumerating primitives, the Uni-Parser framework may rank the primitives used a trained ranker model. The top ranked primitives may then be used as inputs to a generator which is a learned sequence to sequence model which produces a logical form.

Type: Grant

Filed: November 29, 2022

Date of Patent: October 1, 2024

Assignee: Salesforce, Inc.

Inventors: Ye Liu, Semih Yavuz, Yingbo Zhou, Rui Meng
SYSTEMS AND METHODS FOR SEMANTIC PARSING WITH EXECUTION FOR ANSWERING QUESTIONS OF VARYING COMPLEXITY FROM UNSTRUCTURED TEXT

Publication number: 20240249113

Abstract: Embodiments described herein provide systems and methods for question answering using a hybrid question parser and executor model. The hybrid question parser and executor model includes a hybrid parser model and a hybrid executor model. The hybrid parser model includes a first neural network model, and generates a representation of an input question. The representation includes primitives and operations representing relationships among the primitives. The hybrid executor model generates an answer to the input question by executing the representation based on an input text document. The hybrid executor model includes an execution neural network model for executing the primitives of the representation, and an execution programming model for executing the operations of the representation.

Type: Application

Filed: June 14, 2023

Publication date: July 25, 2024

Inventors: Ye LIU, Semih YAVUZ, Rui MENG, Yingbo ZHOU
Generating automated assistant responses and/or actions directly from dialog history and resources

Patent number: 12020706

Abstract: Training and/or utilizing a single neural network model to generate, at each of a plurality of assistant turns of a dialog session between a user and an automated assistant, a corresponding automated assistant natural language response and/or a corresponding automated assistant action. For example, at a given assistant turn of a dialog session, both a corresponding natural language response and a corresponding action can be generated jointly and based directly on output generated using the single neural network model. The corresponding response and/or corresponding action can be generated based on processing, using the neural network model, dialog history and a plurality of discrete resources. For example, the neural network model can be used to generate a response and/or action on a token-by-token basis.

Type: Grant

Filed: August 30, 2022

Date of Patent: June 25, 2024

Assignee: GOOGLE LLC

Inventors: Arvind Neelakantan, Daniel Duckworth, Ben Goodrich, Vishaal Prasad, Chinnadhurai Sankar, Semih Yavuz
SYSTEMS AND METHODS FOR UNSUPERVISED TRAINING IN TEXT RETRIEVAL TASKS

Publication number: 20240202530

Abstract: Embodiments described herein provide systems and methods for training a text retrieval model. A system may generate queries associated with provided documents. The queries may be generated in one or more different manners. Examples of query generation may include extracting relevant spans of text from the documents, prompting a language model for a topic, title, abstractive summary, and/or extractive summary based on the documents. Metadata such as title or other HTML tags may be used as queries. Using the one or more queries, the text retrieval model may be trained using contrastive learning, using the generated query, and positive and negative sample documents. A fine-tuning training phase may be performed using domain-specific data which may also be done with generated query pairs, or may be done in a supervised fashion with provided queries. The text retrieval model may be used to locate documents given an input query.

Type: Application

Filed: April 19, 2023

Publication date: June 20, 2024

Inventors: Rui Meng, Yingbo Zhou, Ye Liu, Semih Yavuz, Ning Yu
SYSTEMS AND METHODS FOR SEMANTIC PARSING WITH PRIMITIVE LEVEL ENUMERATION

Publication number: 20240176805

Abstract: Embodiments described herein provide a semantic parsing framework which may be referred to as Uni-Parser. The Uni-Parser framework may be applied to question answering on both knowledge bases and databases. The three main stages of the Uni-Parser framework are enumeration, ranking, and generation. At the enumeration stage, primitives are enumerated based on matching the question to the data structure. After enumerating primitives, the Uni-Parser framework may rank the primitives used a trained ranker model. The top ranked primitives may then be used as inputs to a generator which is a learned sequence to sequence model which produces a logical form.

Type: Application

Filed: November 29, 2022

Publication date: May 30, 2024

Inventors: Ye LIU, Semih YAVUZ, Yingbo ZHOU, Rui MENG
SYSTEMS AND METHODS FOR SHARED LATENT SPACE PROMPT TUNING

Publication number: 20230419027

Abstract: Embodiments described herein provide a prompt-based transfer learning method that employs shared latent space prompt tuning). Specifically, a shared latent space is assumed, among all source and target tasks, where each vector in the space captures a basis skill to do a particular task. Given an instance (from either a source task or a target task), it is first encoded into an instance representation vector and then queries the latent space, which yields a skill vector for this instance. This vector modulates a frozen model, via soft prompts which are a simple prompt transformation (the prompt generator in FIG. 3) of the basis skill vector, to generate an answer for the instance. The latent space and prompt transformation are learned end-to-end in upstream pre-training on source tasks.

Type: Application

Filed: November 30, 2022

Publication date: December 28, 2023

Inventors: Bo Pang, Semih Yavuz, Caiming Xiong, Yingbo Zhou
Systems and methods for unsupervised paraphrase generation

Patent number: 11829721

Abstract: Embodiments described herein provide dynamic blocking, a decoding algorithm which enables large-scale pretrained language models to generate high-quality paraphrases in an un-supervised setting. Specifically, in order to obtain an alternative surface form, when the language model emits a token that is present in the source sequence, the language model is prevented from generating the next token that is the same as the subsequent source token in the source sequence at the next time step. In this way, the language model is forced to generate a paraphrased sequence of the input source sequence, but with mostly different wording.

Type: Grant

Filed: January 28, 2021

Date of Patent: November 28, 2023

Assignee: salesforce.com, inc.

Inventors: Tong Niu, Semih Yavuz, Yingbo Zhou, Nitish Shirish Keskar, Huan Wang, Caiming Xiong
Systems and methods for abstractive document summarization with entity coverage control

Patent number: 11741142

Abstract: Embodiments described herein provide document summarization systems and methods that utilize fine-tuning of pre-trained abstractive summarization models to produce summaries that more faithfully track the content of the documents. Such abstractive summarization models may be pre-trained using a corpus consisting of pairs of articles and associated summaries. For each article-summary pair, a pseudo label or control code is generated and represents a faithfulness of the summary with respect to the article. The pre-trained model is then fine-tuned based on the article-summary pairs and the corresponding control codes. The resulting fine-tuned models then provide improved faithfulness in document summarization tasks.

Type: Grant

Filed: January 31, 2022

Date of Patent: August 29, 2023

Assignee: salesforce.com, inc.

Inventors: Haopeng Zheng, Semih Yavuz, Wojciech Kryscinski, Kazuma Hashimoto, Yingbo Zhou
Structured graph-to-text generation with two step fine-tuning

Patent number: 11727210

Abstract: Embodiments described herein provide systems and methods for data-to-text generation. The embodiments receive input data that includes a resource description framework (RDF) triples in an RDF graph. A data-to-text generation system generates position aware embeddings, including position embeddings, triple role embeddings, and tree-level embeddings. Using the position aware embeddings and the RDF graph, the data-to-text generation system generates a textual description for the RDF graph.

Type: Grant

Filed: January 29, 2021

Date of Patent: August 15, 2023

Assignee: Salesforce.com, Inc.

Inventors: Qingyun Wang, Nazneen Rajani, Semih Yavuz, Xi Lin
SUBCOMPONENT MODEL TRAINING

Publication number: 20230229957

Abstract: Methods, apparatuses, and computer-program products are disclosed. The method may include inputting one or more subcomponent training datasets into the plurality of subcomponent models of the machine learning model, the machine learning model may be configured to perform a final task, and the plurality of subcomponent models may be configured to perform sequential subtasks that result in the final task. The method may include computing one or more weights for data points of the one or more subcomponent training datasets and the one or more weights may be based on a contribution of the data points to an end-to-end error loss measurement associated with performing the final task of the machine learning model. The method may include training the plurality of subcomponent models based on the one or more weights for the data points of the one or more subcomponent training datasets.

Type: Application

Filed: January 14, 2022

Publication date: July 20, 2023

Inventors: Shuyang Li, Yingbo Zhou, Semih Yavuz, Govardana Sachithanandam Ramachandran
SYSTEMS AND METHODS FOR ABSTRACTIVE DOCUMENT SUMMARIZATION WITH ENTITY COVERAGE CONTROL

Publication number: 20230054068

Abstract: Embodiments described herein provide document summarization systems and methods that utilize fine-tuning of pre-trained abstractive summarization models to produce summaries that more faithfully track the content of the documents. Such abstractive summarization models may be pre-trained using a corpus consisting of pairs of articles and associated summaries. For each article-summary pair, a pseudo label or control code is generated and represents a faithfulness of the summary with respect to the article. The pre-trained model is then fine-tuned based on the article-summary pairs and the corresponding control codes. The resulting fine-tuned models then provide improved faithfulness in document summarization tasks.

Type: Application

Filed: January 31, 2022

Publication date: February 23, 2023

Inventors: Haopeng Zheng, Semih Yavuz, Wojciech Kryscinski, Kazuma Hashimoto, Yingbo Zhou
SYSTEMS AND METHODS FOR KNOWLEDGE BASE QUESTION ANSWERING USING GENERATION AUGMENTED RANKING

Publication number: 20230059870

Abstract: Embodiments described herein provide a question answering approach that answers a question by generating an executable logical form. First, a ranking model is used to select a set of good logical forms from a pool of logical forms obtained by searching over a knowledge graph. The selected logical forms are good in the sense that they are close to (or exactly match, in some cases) the intents in the question and final desired logical form. Next, a generation model is adopted conditioned on the question as well as the selected logical forms to generate the target logical form and execute it to obtain the final answer. For example, at inference stage, when a question is received, a matching logical form is identified from the question, based on which the final answer can be generated based on the node that is associated with the matching logical form in the knowledge base.

Type: Application

Filed: December 29, 2021

Publication date: February 23, 2023

Inventors: Xi Ye, Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou
SYSTEMS AND METHODS FOR KNOWLEDGE BASE QUESTION ANSWERING USING GENERATION AUGMENTED RANKING

Publication number: 20230055188

Abstract: Embodiments described herein provide a question answering approach that answers a question by generating an executable logical form. First, a ranking model is used to select a set of good logical forms from a pool of logical forms obtained by searching over a knowledge graph. The selected logical forms are good in the sense that they are close to (or exactly match, in some cases) the intents in the question and final desired logical form. Next, a generation model is adopted conditioned on the question as well as the selected logical forms to generate the target logical form and execute it to obtain the final answer. For example, at inference stage, when a question is received, a matching logical form is identified from the question, based on which the final answer can be generated based on the node that is associated with the matching logical form in the knowledge base.

Type: Application

Filed: December 29, 2021

Publication date: February 23, 2023

Inventors: Xi Ye, Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou
GENERATING AUTOMATED ASSISTANT RESPONSES AND/OR ACTIONS DIRECTLY FROM DIALOG HISTORY AND RESOURCES

Publication number: 20220415324

Abstract: Training and/or utilizing a single neural network model to generate, at each of a plurality of assistant turns of a dialog session between a user and an automated assistant, a corresponding automated assistant natural language response and/or a corresponding automated assistant action. For example, at a given assistant turn of a dialog session, both a corresponding natural language response and a corresponding action can be generated jointly and based directly on output generated using the single neural network model. The corresponding response and/or corresponding action can be generated based on processing, using the neural network model, dialog history and a plurality of discrete resources. For example, the neural network model can be used to generate a response and/or action on a token-by-token basis.

Type: Application

Filed: August 30, 2022

Publication date: December 29, 2022

Inventors: Arvind Neelakantan, Daniel Duckworth, Ben Goodrich, Vishaal Prasad, Chinnadhurai Sankar, Semih Yavuz

1 2 next