Patents by Inventor Junyi CHAI

Junyi CHAI has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Patent number: 11914663
    Abstract: Technologies related to generating diverse electronic summary documents for a webpage are described herein. A sequence of tokens is extracted from the webpage, and the sequence of tokens is provided to several computer-implemented models. The computer-implemented models output respective sets of candidate assets based upon the sequence of tokens, where the candidate assets are potentially included in an electronic summary document for the webpage. Subsequently, a user query is received, and at least one candidate asset from the candidate assets are selected for inclusion in the electronic summary document based upon the query. Thus, different electronic summary documents can be generated for the webpage when different queries are received.
    Type: Grant
    Filed: December 29, 2021
    Date of Patent: February 27, 2024
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Junyi Chai, Konstantin Andreyevich Golobokov, Bingyu Chi, Fang Gu, Ye Dong, Jie Cao, Yi Liu
  • Publication number: 20230409826
    Abstract: Technologies related to computer-implemented conditional language models (CLMs) are described. A first CLM is trained to generate output texts based upon input texts and conditions. Output texts generated by the first CLM are included in a training set, and a second CLM is trained based upon the training set. The second CLM is then configured to receive input text and a condition and generate an output text based upon the input text and the condition.
    Type: Application
    Filed: June 16, 2022
    Publication date: December 21, 2023
    Inventors: Junyi CHAI, Konstantin GOLOBOKOV, Ye DONG, Reid Allen PRYZANT, Yi LIU
  • Publication number: 20230205832
    Abstract: Technologies related to generating diverse electronic summary documents for a webpage are described herein. A sequence of tokens is extracted from the webpage, and the sequence of tokens is provided to several computer-implemented models. The computer-implemented models output respective sets of candidate assets based upon the sequence of tokens, where the candidate assets are potentially included in an electronic summary document for the webpage. Subsequently, a user query is received, and at least one candidate asset from the candidate assets are selected for inclusion in the electronic summary document based upon the query. Thus, different electronic summary documents can be generated for the webpage when different queries are received.
    Type: Application
    Filed: December 29, 2021
    Publication date: June 29, 2023
    Inventors: Junyi CHAI, Konstantin Andreyevich GOLOBOKOV, Bingyu CHI, Fang GU, Ye DONG, Jie CAO, Yi LIU
  • Patent number: 11544323
    Abstract: Mining of a set of enterprise source documents within an enterprise intranet is performed, by a plurality of knowledge mining toolkits, to determine a plurality of entity names. A plurality of entity records are generated within a knowledge graph for mined entity names from the entity names based on an entity schema and ones of the set of enterprise source documents associated with the mined entity names. Pattern recognition is applied to an active document using an enterprise named entity recognition (ENER) system to identify potential entity names within the document that match a respective one of a plurality of entity records in the knowledge graph. One or more matching entity names are annotated within the document with information from the knowledge graph for the respective ones of the plurality of entity records. The annotated information is displayed with the active document.
    Type: Grant
    Filed: July 20, 2020
    Date of Patent: January 3, 2023
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Dmitriy Meyerzon, Omar Zia Khan, Hui Li, John M. Winn, John Guiver, Ivan Korostelev, Matteo Venanzi, Alexander Armin Spengler, Pavel Myshkov, Elena Pochernina, Martin Kukla, Yordan Kirilov Zaykov, Junyi Chai, Noura Farra, Sravya Narala
  • Publication number: 20220019622
    Abstract: Mining of a set of enterprise source documents within an enterprise intranet is performed, by a plurality of knowledge mining toolkits, to determine a plurality of entity names. A plurality of entity records are generated within a knowledge graph for mined entity names from the entity names based on an entity schema and ones of the set of enterprise source documents associated with the mined entity names. Pattern recognition is applied to an active document using an enterprise named entity recognition (ENER) system to identify potential entity names within the document that match a respective one of a plurality of entity records in the knowledge graph. One or more matching entity names are annotated within the document with information from the knowledge graph for the respective ones of the plurality of entity records. The annotated information is displayed with the active document.
    Type: Application
    Filed: July 20, 2020
    Publication date: January 20, 2022
    Inventors: Dmitriy MEYERZON, Omar Zia KHAN, Hui LI, John M. WINN, John GUIVER, Ivan KOROSTELEV, Matteo VENANZI, Alexander Armin SPENGLER, Pavel MYSHKOV, Elena POCHERNINA, Martin KUKLA, Yordan Kirilov ZAYKOV, Junyi CHAI, Noura FARRA, Sravya NARALA
  • Publication number: 20220019740
    Abstract: Examples described herein generally relate to a computer system including a knowledge graph storing a plurality of entities. A mining of a set of enterprise source documents within an enterprise intranet is performed, by an enterprise named entity recognition (ENER) model, to determine a plurality of entity names. An entity record is generated within a knowledge graph for a mined entity name from the linked entity names based on an entity schema and ones of the set of enterprise source documents associated with the mined entity name. The entity record includes attributes aggregated from the ones of the set of enterprise source documents associated with the mined entity name.
    Type: Application
    Filed: July 20, 2020
    Publication date: January 20, 2022
    Inventors: Dmitriy MEYERZON, Hui LI, Junyi CHAI, Noura FARRA
  • Patent number: 10970278
    Abstract: A server computing device, including memory storing a knowledge graph. The server computing device may further include a processor configured to receive a natural language input and generate a tokenized utterance based on the natural language input. Based on the tokenized utterance, the processor may select a predefined intention indicating a target ontology entity type of the natural language input. The processor may identify at least one input ontology entity token included in the tokenized utterance and may identify at least one relation between the predefined intention and the input ontology entity token. Based on the predefined intention, the at least one input ontology entity token, and the relation, the processor may generate a structured query. Based on the structured query and the knowledge graph, the processor may output an output ontology entity token having the target ontology entity type.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: April 6, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Rui Yan, Yonggang Deng, Junyi Chai, Maochen Guan, Yujie He, Bing Li
  • Patent number: 10916237
    Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities connected by a plurality of edges. The server computing device may further include a processor configured to generate a glossary file based on the knowledge graph. The glossary file may include a plurality of ontology entities included in the knowledge graph. The processor may receive a plurality of utterance templates. Each utterance template may include an utterance and a predefined intention. For each utterance template, the processor may generate one or more utterance template copies in which one or more ontology entities included in the utterance are replaced with one or more utterance template fields. The processor may generate a plurality of training utterances at least in part by filling the one or more utterance template fields of the one or more utterance template copies with respective ontology entities included in the glossary file.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: February 9, 2021
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Rui Yan, Yonggang Deng, Junyi Chai, Maochen Guan, Yujie He, Bing Li
  • Patent number: 10867132
    Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities. The server computing device may further include a processor configured to receive a tokenized utterance including a plurality of words and one or more metadata tokens. The processor may extract a respective word embedding vector from each word included in the tokenized utterance. Based on a glossary file, the processor may determine a respective ontology entity type of each word included in the tokenized utterance. The processor may extract a character embedding vector from each character included in the tokenized utterance. Based on the plurality of word embedding vectors, the plurality of respective ontology entity types of the words, and the plurality of character embedding vectors, the processor may determine a predefined intention of the tokenized utterance using at least one recurrent neural network. The predefined intention may indicate a target ontology entity type.
    Type: Grant
    Filed: March 29, 2019
    Date of Patent: December 15, 2020
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Rui Yan, Junyi Chai, Maochen Guan, Yujie He, Bing Li, Yonggang Deng
  • Publication number: 20200312300
    Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities connected by a plurality of edges. The server computing device may further include a processor configured to generate a glossary file based on the knowledge graph. The glossary file may include a plurality of ontology entities included in the knowledge graph. The processor may receive a plurality of utterance templates. Each utterance template may include an utterance and a predefined intention. For each utterance template, the processor may generate one or more utterance template copies in which one or more ontology entities included in the utterance are replaced with one or more utterance template fields. The processor may generate a plurality of training utterances at least in part by filling the one or more utterance template fields of the one or more utterance template copies with respective ontology entities included in the glossary file.
    Type: Application
    Filed: March 29, 2019
    Publication date: October 1, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Rui YAN, Yonggang DENG, Junyi CHAI, Maochen GUAN, Yujie HE, Bing LI
  • Publication number: 20200311070
    Abstract: A server computing device, including memory storing a knowledge graph. The server computing device may further include a processor configured to receive a natural language input and generate a tokenized utterance based on the natural language input. Based on the tokenized utterance, the processor may select a predefined intention indicating a target ontology entity type of the natural language input. The processor may identify at least one input ontology entity token included in the tokenized utterance and may identify at least one relation between the predefined intention and the input ontology entity token. Based on the predefined intention, the at least one input ontology entity token, and the relation, the processor may generate a structured query. Based on the structured query and the knowledge graph, the processor may output an output ontology entity token having the target ontology entity type.
    Type: Application
    Filed: March 29, 2019
    Publication date: October 1, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Rui YAN, Yonggang DENG, Junyi CHAI, Maochen GUAN, Yujie HE, Bing LI
  • Publication number: 20200311199
    Abstract: A server computing device, including memory storing a knowledge graph including a plurality of ontology entities. The server computing device may further include a processor configured to receive a tokenized utterance including a plurality of words and one or more metadata tokens. The processor may extract a respective word embedding vector from each word included in the tokenized utterance. Based on a glossary file, the processor may determine a respective ontology entity type of each word included in the tokenized utterance. The processor may extract a character embedding vector from each character included in the tokenized utterance. Based on the plurality of word embedding vectors, the plurality of respective ontology entity types of the words, and the plurality of character embedding vectors, the processor may determine a predefined intention of the tokenized utterance using at least one recurrent neural network. The predefined intention may indicate a target ontology entity type.
    Type: Application
    Filed: March 29, 2019
    Publication date: October 1, 2020
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Rui YAN, Junyi CHAI, Maochen GUAN, Yujie HE, Bing LI, Yonggang DENG