Patents by Inventor Yijian Xiang

Yijian Xiang has filed for patents to protect the following inventions. This listing includes pending patent applications as well as patents already granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250139154
    Abstract: A molecule representation is extracted from a document and associated with the document in a metadata database. For example, an image of a molecular structure may be extracted from a document and stored in the metadata database in a text-based representation such as SMILES. The metadata database may be searched to identify documents that mention a particular molecule. Continuing the example, the metadata database may be searched with a SMILES representation to identify the document and other documents that refer to the same molecule. The metadata database may index documents based on different types of molecule representations, including text-based, image-based, graph-based, name, abbreviation, etc. This allows search over multiple representations of a molecule, improving accuracy and thoroughness. These improvements reduce the time and computational resources needed to search for documents that refer to a particular molecule.
    Type: Application
    Filed: October 31, 2023
    Publication date: May 1, 2025
    Inventors: Yijian XIANG, Rohith Venkata PESALA, Nilgoon ZAREI, Pramod Kumar SHARMA, Liang DU, Robin ABRAHAM, J Brandon SMOCK
  • Publication number: 20250077844
    Abstract: The present disclosure relates to efficiently receiving and processing input tasks in a way that is scalable and which reduces both the quantity of tokens processed by a foundation model (e.g., an LLM) as well as the number of API calls that are made in processing the input tasks. A system batches a set of inputs to provide as a single batch of input(s) into an LLM. The system generates one or more permutations of the batched input(s) to determine outputs based on variable orders in which the input data is provided within the respective permutations of the batched inputs. The system further may eliminate one or more of the data inputs within the respective batches to facilitate smaller batched inputs without sacrificing accuracy in a set of outputs generated by the LLM responsive to the batch permutations.
    Type: Application
    Filed: December 8, 2023
    Publication date: March 6, 2025
    Inventors: Jianzhe LIN, Maurice DIESENDRUCK, Manqing MAO, Yijian XIANG, Julia T. CHEN, Paishun TING, Mingyang XU, Liang DU, Robin ABRAHAM
  • Patent number: 12218890
    Abstract: The present disclosure relates to methods and systems for sharing with a plurality of users a chat session that uses large language models to provide responses for input messages received for the chat session. The methods and systems provide access to the chat session to the users and update the chat session in response to any changes made to the chat session by any of the users. The methods and systems allow the users to resume the chat session at a future time using the chat session history.
    Type: Grant
    Filed: October 19, 2023
    Date of Patent: February 4, 2025
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Robin Abraham, Liang Du, Manqing Mao, Paishun Ting, Julia Chen, Jianzhe Lin, Yijian Xiang, Mingyang Xu, Wenhan Wang, Fahimeh Raja
  • Publication number: 20240430216
    Abstract: The present disclosure relates to methods and systems for sharing with a plurality of users a chat session that uses large language models to provide responses for input messages received for the chat session. The methods and systems provide access to the chat session to the users and update the chat session in response to any changes made to the chat session by any of the users. The methods and systems allow the users to resume the chat session at a future time using the chat session history.
    Type: Application
    Filed: October 19, 2023
    Publication date: December 26, 2024
    Inventors: Robin ABRAHAM, Liang DU, Manqing MAO, Paishun TING, Julia CHEN, Jianzhe LIN, Yijian XIANG, Mingyang XU, Wenhan WANG, Fahimeh RAJA
  • Publication number: 20240428005
    Abstract: The present disclosure relates to methods and systems for automatically generating documents for a specific topic using large language models. The methods and systems receive an input query that identifies a topic for the document. The methods and systems automatically generate, using the large language models, a framework for the document with sections and subsections for the document. The methods and systems write the document, using the large language models, and provide references for the data sources used to obtain the data that the large language model used to write the document.
    Type: Application
    Filed: June 20, 2023
    Publication date: December 26, 2024
    Inventors: Robin ABRAHAM, Mingyang XU, Julia CHEN, Yijian XIANG, Manqing MAO, Jianzhe LIN, Paishun TING, Liang DU
  • Patent number: 11868358
    Abstract: A data processing system implements obtaining query parameters for a query for content items in a datastore, the query parameters including attributes of content items for which a search is to be conducted; obtaining a first set of content items from a content datastore based on the query parameters; analyzing the first set of content items using a first machine learning model trained to generate relevant content information that identifies a plurality of relevant content items included in the first set of content items; and analyzing the plurality of relevant content items using a second machine learning model configured to output novel content information, the novel content information including a plurality of content items predicted to be relevant and novel, the novel content information ranking the plurality of content items predicted to be relevant and novel based on a novelty score associated with each respective content item.
    Type: Grant
    Filed: June 15, 2022
    Date of Patent: January 9, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Leo Moreno Betthauser, Jing Tian, Yijian Xiang, Pramod Kumar Sharma
  • Publication number: 20230409581
    Abstract: A data processing system implements obtaining query parameters for a query for content items in a datastore, the query parameters including attributes of content items for which a search is to be conducted; obtaining a first set of content items from a content datastore based on the query parameters; analyzing the first set of content items using a first machine learning model trained to generate relevant content information that identifies a plurality of relevant content items included in the first set of content items; and analyzing the plurality of relevant content items using a second machine learning model configured to output novel content information, the novel content information including a plurality of content items predicted to be relevant and novel, the novel content information ranking the plurality of content items predicted to be relevant and novel based on a novelty score associated with each respective content item.
    Type: Application
    Filed: June 15, 2022
    Publication date: December 21, 2023
    Applicant: Microsoft Technology Licensing, LLC
    Inventors: Leo Moreno BETTHAUSER, Jing TIAN, Yijian XIANG, Pramod Kumar SHARMA
  • Publication number: 20230088925
    Abstract: A computer implemented method includes receiving an image that includes a type of object, segmenting the object into multiple segments via a trained segmentation machine learning model, and inputting the segments into multiple different attribute extraction models to extract different types of attributes from each of the multiple segments.
    Type: Application
    Filed: September 21, 2021
    Publication date: March 23, 2023
    Inventors: Pramod Kumar Sharma, Yijian Xiang, Yiran Li, Paul Pangilinan Del Villar, Liang Du, Robin Abraham, Nilgoon Zarei, Mandar Dilip Dixit
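
The abstract for publication 20250077844 above describes batching inputs into a single model call, reordering the batch across permutations, and dropping inputs from later batches. A minimal sketch of that general idea follows; it is not the patented method. The function name `permuted_batch_vote`, the majority-vote aggregation, the `agree_after` stopping rule, and the `toy_model` stand-in for a foundation model are all assumptions made for illustration.

```python
import random
from collections import Counter
from typing import Callable, List, Sequence, Tuple

# Hypothetical stand-in for a foundation-model call: it takes an ordered batch
# of inputs and returns one label per input, in the same order.
ModelFn = Callable[[Sequence[str]], List[str]]


def permuted_batch_vote(
    inputs: Sequence[str],
    model: ModelFn,
    rounds: int = 3,
    agree_after: int = 2,
    seed: int = 0,
) -> Tuple[List[str], int]:
    """Label inputs using shuffled batches; return (labels, number of model calls).

    Assumption (not stated in the abstract): an input is dropped from later
    batches once any single label has been returned for it `agree_after` times.
    """
    if rounds < 1:
        raise ValueError("rounds must be at least 1")
    rng = random.Random(seed)
    votes = [Counter() for _ in inputs]
    active = list(range(len(inputs)))
    calls = 0
    for _ in range(rounds):
        if not active:
            break  # every input is settled, so no further calls are needed
        order = active[:]
        rng.shuffle(order)  # one permutation of the remaining batch
        labels = model([inputs[i] for i in order])  # one call for the whole batch
        calls += 1
        for i, label in zip(order, labels):
            votes[i][label] += 1
        # Keep only inputs whose leading label has not yet reached the threshold.
        active = [i for i in active if votes[i].most_common(1)[0][1] < agree_after]
    return [v.most_common(1)[0][0] for v in votes], calls


def toy_model(batch: Sequence[str]) -> List[str]:
    """Deterministic placeholder model: labels each text by its length."""
    return ["long" if len(text) > 5 else "short" for text in batch]


if __name__ == "__main__":
    labels, calls = permuted_batch_vote(["hi", "elephant", "cat"], toy_model)
    print(labels, calls)
```

Because `toy_model` ignores ordering, every input settles after two rounds and the third call is skipped. With a real model whose answers depend on position in the batch, the vote across permutations is what smooths out that sensitivity.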