Patents by Inventor Junyi CHAI

Junyi CHAI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

Generating diverse electronic summary documents for a landing page

Patent number: 11914663

Abstract: Technologies related to generating diverse electronic summary documents for a webpage are described herein. A sequence of tokens is extracted from the webpage, and the sequence of tokens is provided to several computer-implemented models. The computer-implemented models output respective sets of candidate assets based upon the sequence of tokens, where the candidate assets are potentially included in an electronic summary document for the webpage. Subsequently, a user query is received, and at least one candidate asset from the candidate assets are selected for inclusion in the electronic summary document based upon the query. Thus, different electronic summary documents can be generated for the webpage when different queries are received.

Type: Grant

Filed: December 29, 2021

Date of Patent: February 27, 2024

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Junyi Chai, Konstantin Andreyevich Golobokov, Bingyu Chi, Fang Gu, Ye Dong, Jie Cao, Yi Liu
TRAINING A COMPUTER-IMPLEMENTED CONDITIONAL LANGUAGE MODEL FOR IMPROVED PERFORMANCE

Publication number: 20230409826

Abstract: Technologies related to computer-implemented conditional language models (CLMs) are described. A first CLM is trained to generate output texts based upon input texts and conditions. Output texts generated by the first CLM are included in a training set, and a second CLM is trained based upon the training set. The second CLM is then configured to receive input text and a condition and generate an output text based upon the input text and the condition.

Type: Application

Filed: June 16, 2022

Publication date: December 21, 2023

Inventors: Junyi CHAI, Konstantin GOLOBOKOV, Ye DONG, Reid Allen PRYZANT, Yi LIU
GENERATING DIVERSE ELECTRONIC SUMMARY DOCUMENTS FOR A LANDING PAGE

Publication number: 20230205832

Abstract: Technologies related to generating diverse electronic summary documents for a webpage are described herein. A sequence of tokens is extracted from the webpage, and the sequence of tokens is provided to several computer-implemented models. The computer-implemented models output respective sets of candidate assets based upon the sequence of tokens, where the candidate assets are potentially included in an electronic summary document for the webpage. Subsequently, a user query is received, and at least one candidate asset from the candidate assets are selected for inclusion in the electronic summary document based upon the query. Thus, different electronic summary documents can be generated for the webpage when different queries are received.

Type: Application

Filed: December 29, 2021

Publication date: June 29, 2023

Inventors: Junyi CHAI, Konstantin Andreyevich GOLOBOKOV, Bingyu CHI, Fang GU, Ye DONG, Jie CAO, Yi LIU
Annotations for enterprise knowledge graphs using multiple toolkits

Patent number: 11544323

Abstract: Mining of a set of enterprise source documents within an enterprise intranet is performed, by a plurality of knowledge mining toolkits, to determine a plurality of entity names. A plurality of entity records are generated within a knowledge graph for mined entity names from the entity names based on an entity schema and ones of the set of enterprise source documents associated with the mined entity names. Pattern recognition is applied to an active document using an enterprise named entity recognition (ENER) system to identify potential entity names within the document that match a respective one of a plurality of entity records in the knowledge graph. One or more matching entity names are annotated within the document with information from the knowledge graph for the respective ones of the plurality of entity records. The annotated information is displayed with the active document.

Type: Grant

Filed: July 20, 2020

Date of Patent: January 3, 2023

Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC

Inventors: Dmitriy Meyerzon, Omar Zia Khan, Hui Li, John M. Winn, John Guiver, Ivan Korostelev, Matteo Venanzi, Alexander Armin Spengler, Pavel Myshkov, Elena Pochernina, Martin Kukla, Yordan Kirilov Zaykov, Junyi Chai, Noura Farra, Sravya Narala
ANNOTATIONS FOR ENTERPRISE KNOWLEDGE GRAPHS USING MULTIPLE TOOLKITS

Publication number: 20220019622

Abstract: Mining of a set of enterprise source documents within an enterprise intranet is performed, by a plurality of knowledge mining toolkits, to determine a plurality of entity names. A plurality of entity records are generated within a knowledge graph for mined entity names from the entity names based on an entity schema and ones of the set of enterprise source documents associated with the mined entity names. Pattern recognition is applied to an active document using an enterprise named entity recognition (ENER) system to identify potential entity names within the document that match a respective one of a plurality of entity records in the knowledge graph. One or more matching entity names are annotated within the document with information from the knowledge graph for the respective ones of the plurality of entity records. The annotated information is displayed with the active document.

Type: Application

Filed: July 20, 2020

Publication date: January 20, 2022

Inventors: Dmitriy MEYERZON, Omar Zia KHAN, Hui LI, John M. WINN, John GUIVER, Ivan KOROSTELEV, Matteo VENANZI, Alexander Armin SPENGLER, Pavel MYSHKOV, Elena POCHERNINA, Martin KUKLA, Yordan Kirilov ZAYKOV, Junyi CHAI, Noura FARRA, Sravya NARALA
ENTERPRISE KNOWLEDGE GRAPHS USING ENTERPRISE NAMED ENTITY RECOGNITION

Publication number: 20220019740

Abstract: Examples described herein generally relate to a computer system including a knowledge graph storing a plurality of entities. A mining of a set of enterprise source documents within an enterprise intranet is performed, by an enterprise named entity recognition (ENER) model, to determine a plurality of entity names. An entity record is generated within a knowledge graph for a mined entity name from the linked entity names based on an entity schema and ones of the set of enterprise source documents associated with the mined entity name. The entity record includes attributes aggregated from the ones of the set of enterprise source documents associated with the mined entity name.

Type: Application

Filed: July 20, 2020

Publication date: January 20, 2022

Inventors: Dmitriy MEYERZON, Hui LI, Junyi CHAI, Noura FARRA
Querying knowledge graph with natural language input

Patent number: 10970278

Abstract: A server computing device, including memory storing a knowledge graph. The server computing device may further include a processor configured to receive a natural language input and generate a tokenized utterance based on the natural language input. Based on the tokenized utterance, the processor may select a predefined intention indicating a target ontology entity type of the natural language input. The processor may identify at least one input ontology entity token included in the tokenized utterance and may identify at least one relation between the predefined intention and the input ontology entity token. Based on the predefined intention, the at least one input ontology entity token, and the relation, the processor may generate a structured query. Based on the structured query and the knowledge graph, the processor may output an output ontology entity token having the target ontology entity type.

Type: Grant

Filed: March 29, 2019

Date of Patent: April 6, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Rui Yan, Yonggang Deng, Junyi Chai, Maochen Guan, Yujie He, Bing Li
Training utterance generation

Patent number: 10916237

Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities connected by a plurality of edges. The server computing device may further include a processor configured to generate a glossary file based on the knowledge graph. The glossary file may include a plurality of ontology entities included in the knowledge graph. The processor may receive a plurality of utterance templates. Each utterance template may include an utterance and a predefined intention. For each utterance template, the processor may generate one or more utterance template copies in which one or more ontology entities included in the utterance are replaced with one or more utterance template fields. The processor may generate a plurality of training utterances at least in part by filling the one or more utterance template fields of the one or more utterance template copies with respective ontology entities included in the glossary file.

Type: Grant

Filed: March 29, 2019

Date of Patent: February 9, 2021

Assignee: Microsoft Technology Licensing, LLC

Inventors: Rui Yan, Yonggang Deng, Junyi Chai, Maochen Guan, Yujie He, Bing Li
Ontology entity type detection from tokenized utterance

Patent number: 10867132

Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities. The server computing device may further include a processor configured to receive a tokenized utterance including a plurality of words and one or more metadata tokens. The processor may extract a respective word embedding vector from each word included in the tokenized utterance. Based on a glossary file, the processor may determine a respective ontology entity type of each word included in the tokenized utterance. The processor may extract a character embedding vector from each character included in the tokenized utterance. Based on the plurality of word embedding vectors, the plurality of respective ontology entity types of the words, and the plurality of character embedding vectors, the processor may determine a predefined intention of the tokenized utterance using at least one recurrent neural network. The predefined intention may indicate a target ontology entity type.

Type: Grant

Filed: March 29, 2019

Date of Patent: December 15, 2020

Assignee: Microsoft Technology Licensing, LLC

Inventors: Rui Yan, Junyi Chai, Maochen Guan, Yujie He, Bing Li, Yonggang Deng
TRAINING UTTERANCE GENERATION

Publication number: 20200312300

Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities connected by a plurality of edges. The server computing device may further include a processor configured to generate a glossary file based on the knowledge graph. The glossary file may include a plurality of ontology entities included in the knowledge graph. The processor may receive a plurality of utterance templates. Each utterance template may include an utterance and a predefined intention. For each utterance template, the processor may generate one or more utterance template copies in which one or more ontology entities included in the utterance are replaced with one or more utterance template fields. The processor may generate a plurality of training utterances at least in part by filling the one or more utterance template fields of the one or more utterance template copies with respective ontology entities included in the glossary file.

Type: Application

Filed: March 29, 2019

Publication date: October 1, 2020

Applicant: Microsoft Technology Licensing, LLC

Inventors: Rui YAN, Yonggang DENG, Junyi CHAI, Maochen GUAN, Yujie HE, Bing LI
QUERYING KNOWLEDGE GRAPH WITH NATURAL LANGUAGE INPUT

Publication number: 20200311070

Abstract: A server computing device, including memory storing a knowledge graph. The server computing device may further include a processor configured to receive a natural language input and generate a tokenized utterance based on the natural language input. Based on the tokenized utterance, the processor may select a predefined intention indicating a target ontology entity type of the natural language input. The processor may identify at least one input ontology entity token included in the tokenized utterance and may identify at least one relation between the predefined intention and the input ontology entity token. Based on the predefined intention, the at least one input ontology entity token, and the relation, the processor may generate a structured query. Based on the structured query and the knowledge graph, the processor may output an output ontology entity token having the target ontology entity type.

Type: Application

Filed: March 29, 2019

Publication date: October 1, 2020

Applicant: Microsoft Technology Licensing, LLC

Inventors: Rui YAN, Yonggang DENG, Junyi CHAI, Maochen GUAN, Yujie HE, Bing LI
ONTOLOGY ENTITY TYPE DETECTION FROM TOKENIZED UTTERANCE

Publication number: 20200311199

Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities. The server computing device may further include a processor configured to receive a tokenized utterance including a plurality of words and one or more metadata tokens. The processor may extract a respective word embedding vector from each word included in the tokenized utterance. Based on a glossary file, the processor may determine a respective ontology entity type of each word included in the tokenized utterance. The processor may extract a character embedding vector from each character included in the tokenized utterance. Based on the plurality of word embedding vectors, the plurality of respective ontology entity types of the words, and the plurality of character embedding vectors, the processor may determine a predefined intention of the tokenized utterance using at least one recurrent neural network. The predefined intention may indicate a target ontology entity type.

Type: Application

Filed: March 29, 2019

Publication date: October 1, 2020

Applicant: Microsoft Technology Licensing, LLC

Inventors: Rui YAN, Junyi CHAI, Maochen GUAN, Yujie HE, Bing LI, Yonggang DENG