Patents by Inventor Robin Abraham

Robin Abraham has filed for patents to protect the following inventions. This listing includes patent applications that are pending as well as patents that have already been granted by the United States Patent and Trademark Office (USPTO).

  • Publication number: 20250139154
    Abstract: A molecule representation is extracted from a document and associated with the document in a metadata database. For example, an image of a molecular structure may be extracted from a document and stored in the metadata database in a text-based representation such as SMILES. The metadata database may be searched to identify documents that mention a particular molecule. Continuing the example, the metadata database may be searched with a SMILES representation to identify the document and other documents that refer to the same molecule. The metadata database may index documents based on different types of molecule representations, including text-based, image-based, graph-based, name, abbreviation, etc. This allows search over multiple representations of a molecule, improving accuracy and thoroughness. These improvements reduce the time and computational resources needed to search for documents that refer to a particular molecule.
    Type: Application
    Filed: October 31, 2023
    Publication date: May 1, 2025
    Inventors: Yijian XIANG, Rohith Venkata PESALA, Nilgoon ZAREI, Pramod Kumar SHARMA, Liang DU, Robin ABRAHAM, J Brandon SMOCK
  • Patent number: 12260662
    Abstract: A computer implemented method includes rendering a document page as an image; detecting tables, columns, and other associated table objects within the image via one or more table recognition models that model objects in the image as overlapping bounding boxes; transforming the set of objects into a structured representation of the table; extracting data from the objects into the structured representation; and exporting the table into the desired output format.
    Type: Grant
    Filed: June 21, 2021
    Date of Patent: March 25, 2025
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: J Brandon Smock, Pramod Kumar Sharma, Natalia Larios Delgado, Rohith Venkata Pesala, Robin Abraham
  • Publication number: 20250087311
    Abstract: This disclosure describes a machine learning system that includes a contrastive learning based two-tower model for retrieval of relevant chemical reaction procedures given a query chemical reaction. The two-tower model uses attention-based transformers and neural networks to convert tokenized representations of chemical reactions and chemical reaction procedures to embeddings in a shared embedding space. Each tower can include a transformer network, a pooling layer, a normalization layer, and a neural network. The model is trained with labeled data pairs that include a chemical reaction and the text of a chemical reaction procedure for that chemical reaction. New queries can locate chemical reaction procedures for performing a given chemical reaction as well as procedures for similar chemical reactions. The architecture and training of the model make it possible to perform semantic matching based on chemical structures. The model is highly accurate providing an average recall at K=5 of 95.9%.
    Type: Application
    Filed: November 26, 2024
    Publication date: March 13, 2025
    Inventors: Sudipto MUKHERJEE, Liang DU, Ke JIANG, Robin ABRAHAM
  • Publication number: 20250077844
    Abstract: The present disclosure relates to efficiently receiving and processing input tasks in a way that is scalable and which reduces both the quantity of tokens processed by a foundation model (e.g., an LLM) as well as the number of API calls that are made in processing the input tasks. A system batches a set of inputs to provide as a single batch of input(s) into an LLM. The system generates one or more permutations of the batched input(s) to determine outputs based on variable orders in which the input data is provided within the respective permutations of the batched inputs. The system further may eliminate one or more of the data inputs within the respective batches to facilitate smaller batched inputs without sacrificing accuracy in a set of outputs generated by the LLM responsive to the batch permutations.
    Type: Application
    Filed: December 8, 2023
    Publication date: March 6, 2025
    Inventors: Jianzhe LIN, Maurice DIESENDRUCK, Manqing MAO, Yijian XIANG, Julia T. CHEN, Paishun TING, Mingyang XU, Liang DU, Robin ABRAHAM
  • Publication number: 20250068849
    Abstract: The present disclosure relates to systems, methods, and computer-readable media for utilizing a concept graphing system to determine and provide relationships between concepts within document collections or corpora. For example, the concept graphing system can generate and utilize machine-learning models, such as a sparse graph recovery machine-learning model, to identify less-obvious correlations between concepts, including positive and negative concept connections, as well as provide these connections within a visual concept graph. Additionally, the concept graphing system can provide a visual concept graph that determines and displays concept correlations based on the input of a single concept, multiple concepts, or no concepts.
    Type: Application
    Filed: November 8, 2024
    Publication date: February 27, 2025
    Inventors: Harsh SHRIVASTAVA, Maurice DIESENDRUCK, Robin ABRAHAM
  • Patent number: 12218890
    Abstract: The present disclosure relates to methods and systems for sharing with a plurality of users a chat session that uses large language models to provide responses for input messages received for the chat session. The methods and systems provide access to the chat session to the users and update the chat session in response to any changes made to the chat session by any of the users. The methods and systems allow the users to resume the chat session at a future time using the chat session history.
    Type: Grant
    Filed: October 19, 2023
    Date of Patent: February 4, 2025
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Robin Abraham, Liang Du, Manqing Mao, Paishun Ting, Julia Chen, Jianzhe Lin, Yijian Xiang, Mingyang Xu, Wenhan Wang, Fahimeh Raja
  • Patent number: 12205202
    Abstract: The present disclosure relates to systems, methods, and computer-readable media for utilizing an interactive graphing system to achieve improved dataset exploration utilizing an intelligent workflow and an interactive user interface. More specifically, the interactive graphing system facilitates generating updated network graphs that include inferred user influences based on implicit user action. Indeed, the interactive graphing system can automatically generate and present a user with an updated network graph that includes added, removed, or subsetted elements and relationships that are otherwise hidden from a user. Additionally, the interactive graphing system facilitates network graph exploration and processing of customized combined network graphs that join otherwise separate network graphs.
    Type: Grant
    Filed: June 6, 2022
    Date of Patent: January 21, 2025
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Harsh Shrivastava, Maurice Diesendruck, Robin Abraham
  • Publication number: 20250021596
    Abstract: A system for implementing object state distributions obtains a first object state distribution associated with a first image set. The first object state distribution includes a first plurality of object state vectors generated based upon (i) a first set of object type classifications associated with the first image set and (ii) a first set of image information based on the first image set. The system obtains a second object state distribution associated with a second image set. The second object state distribution comprises a second plurality of object state vectors generated based upon (i) a second set of object type classifications associated with the second image set and (ii) a second set of image information based on the second image set. The system determines a distance measure between the first object state distribution and the second object state distribution and assigns a label based upon the distance measure.
    Type: Application
    Filed: October 1, 2024
    Publication date: January 16, 2025
    Inventors: Maurice DIESENDRUCK, Robin ABRAHAM
  • Patent number: 12191004
    Abstract: This disclosure describes a machine learning system that includes a contrastive learning based two-tower model for retrieval of relevant chemical reaction procedures given a query chemical reaction. The two-tower model uses attention-based transformers and neural networks to convert tokenized representations of chemical reactions and chemical reaction procedures to embeddings in a shared embedding space. Each tower can include a transformer network, a pooling layer, a normalization layer, and a neural network. The model is trained with labeled data pairs that include a chemical reaction and the text of a chemical reaction procedure for that chemical reaction. New queries can locate chemical reaction procedures for performing a given chemical reaction as well as procedures for similar chemical reactions. The architecture and training of the model make it possible to perform semantic matching based on chemical structures. The model is highly accurate providing an average recall at K=5 of 95.9%.
    Type: Grant
    Filed: June 27, 2022
    Date of Patent: January 7, 2025
    Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
    Inventors: Sudipto Mukherjee, Liang Du, Ke Jiang, Robin Abraham
  • Publication number: 20250006303
    Abstract: An ensemble machine learning model is used to visualize complex data relationships. Complex data is provided to multiple generator models that each independently create a graph representing relationships in the data. At least one of the generator models is a machine learning model that can be trained with a generator model loss function. The graphs produced by the separate generator models are combined by an ensemble model to create a consensus graph. The ensemble model may be implemented as an edge-selector neural network. The models are trained jointly with a ensemble model loss function that includes the loss functions of the generator models added as regularization terms. A visualization of the ensemble graph is created to aid a user in understanding the complex data relationships.
    Type: Application
    Filed: June 28, 2023
    Publication date: January 2, 2025
    Inventors: Harsh SHRIVASTAVA, Robin ABRAHAM
  • Publication number: 20240428008
    Abstract: The present disclosure relates to methods and systems for using large language models to support research activities. The methods and systems include a copilot engine that creates input prompts to provide to the large language model to use in generating responses to input messages. The copilot engine infers an intent of the input messages and sends the intent with the input message in the input prompt to the large language model. The large language model generates different types of responses for different intents.
    Type: Application
    Filed: June 22, 2023
    Publication date: December 26, 2024
    Inventors: Robin ABRAHAM, Liang DU, Fahimeh RAJA, Wenhan WANG, Dustin James STEWART, Lipsa PATNAIK, Stuart Richard LONG, Timothy EARNHEART, Sam Daniel GAMMON, Sacha AROZARENA VALLADARE, Jedediah Miller SINGER, Henrique DANTAS
  • Publication number: 20240428005
    Abstract: The present disclosure relates to methods and systems for automatically generating documents for a specific topic using large language models. The methods and systems receive an input query that identifies a topic for the document. The methods and systems automatically generate, using the large language models, a framework for the document with sections and subsections for the document. The methods and systems write the document, using the large language models, and provide references for the data sources used to obtain the data that the large language model used to write the document.
    Type: Application
    Filed: June 20, 2023
    Publication date: December 26, 2024
    Inventors: Robin ABRAHAM, Mingyang XU, Julia CHEN, Yijian XIANG, Manqing MAO, Jianzhe LIN, Paishun TING, Liang DU
  • Publication number: 20240430216
    Abstract: The present disclosure relates to methods and systems for sharing with a plurality of users a chat session that uses large language models to provide responses for input messages received for the chat session. The methods and systems provide access to the chat session to the users and update the chat session in response to any changes made to the chat session by any of the users. The methods and systems allow the users to resume the chat session at a future time using the chat session history.
    Type: Application
    Filed: October 19, 2023
    Publication date: December 26, 2024
    Inventors: Robin ABRAHAM, Liang DU, Manqing MAO, Paishun TING, Julia CHEN, Jianzhe LIN, Yijian XIANG, Mingyang XU, Wenhan WANG, Fahimeh RAJA
  • Patent number: 12169680
    Abstract: The present disclosure relates to methods and systems for converting Portable Document Format (PDF) documents to LaTeX files. The methods and systems use machine learning models to identify and extract PDF portions of a PDF document. The methods and systems create a LaTeX file for the PDF document using the PDF portions extracted by the machine learning models. The methods and systems provide an output with the LaTeX file for the PDF document. The LaTeX file is used to perform different actions on the PDF document.
    Type: Grant
    Filed: June 6, 2022
    Date of Patent: December 17, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Harsh Shrivastava, Sarah Panda, Liang Du, Robin Abraham
  • Patent number: 12159722
    Abstract: A relevance system ranks a set of medical studies based on a relevance of each medical study in the set of medical studies to a patient profile. The relevance system includes a relevance model. The relevance model determines a relevance of each medical study to the patient profile based on a semantic relationship score, a concept relationship score, and a term-occurrence score. The semantic relationship score is a measure of a similarity in semantic meaning of a medical study and a patient profile. The concept relationship score is a measure of the closeness of medical concepts in a medical study to medical concepts in a patient profile. The term-occurrence score is a measure of occurrences of terms in a medical study that also appear in a patient profile and the statistical significances of the terms.
    Type: Grant
    Filed: November 23, 2020
    Date of Patent: December 3, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Nut Limsopatham, Liang Du, Robin Abraham
  • Patent number: 12158910
    Abstract: A system for implementing object state distributions obtains a first object state distribution associated with a first image set. The first object state distribution includes a first plurality of object state vectors generated based upon (i) a first set of object type classifications associated with the first image set and (ii) a first set of image information based on the first image set. The system obtains a second object state distribution associated with a second image set. The second object state distribution comprises a second plurality of object state vectors generated based upon (i) a second set of object type classifications associated with the second image set and (ii) a second set of image information based on the second image set. The system determines a distance measure between the first object state distribution and the second object state distribution and assigns a label based upon the distance measure.
    Type: Grant
    Filed: December 17, 2021
    Date of Patent: December 3, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Maurice Diesendruck, Robin Abraham
  • Patent number: 12159110
    Abstract: The present disclosure relates to systems, methods, and computer-readable media for utilizing a concept graphing system to determine and provide relationships between concepts within document collections or corpora. For example, the concept graphing system can generate and utilize machine-learning models, such as a sparse graph recovery machine-learning model, to identify less-obvious correlations between concepts, including positive and negative concept connections, as well as provide these connections within a visual concept graph. Additionally, the concept graphing system can provide a visual concept graph that determines and displays concept correlations based on the input of a single concept, multiple concepts, or no concepts.
    Type: Grant
    Filed: June 6, 2022
    Date of Patent: December 3, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Harsh Shrivastava, Maurice Diesendruck, Robin Abraham
  • Publication number: 20240331235
    Abstract: A machine learning model is used to generate molecular images by text-to-image diffusion techniques based on natural language text inputs. The machine learning model is trained on combinations of molecule images and corresponding text such that representatives of both are embedded in latent space. Users provide natural language text describing molecular characteristics and the machine learning model generates an image of a molecule with those characteristics. Existing molecular images or those generated by the system can be further edited and refined with additional natural language text instructions. The system also uses machine vision techniques to understand the molecule represented by a molecular image and translate that image into other representations of the molecule.
    Type: Application
    Filed: March 31, 2023
    Publication date: October 3, 2024
    Inventors: J Brandon SMOCK, Robin ABRAHAM, Maurice DIESENDRUCK, Rohith Venkata PESALA
  • Patent number: 12086551
    Abstract: A computer implemented method determines differences between documents. The method includes parsing a first document and a second document into respective distinct instances of content. The distinct instances of content are classified into different categories. Category specific matching algorithms are applied to each of the respective instances of content to determine a similarity score for each of the respective instances of content. Semantic differences between the first document and the second document are analyzed as a function of the similarity scores. A characterization of the semantic differences is generated.
    Type: Grant
    Filed: June 23, 2021
    Date of Patent: September 10, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Robin Abraham, J Brandon Smock, Owen Stephenson Whiting, Henry Hun-Li Reid Pan
  • Patent number: 12072935
    Abstract: Machine learning to predict a layout type that each of a plurality of portions of a document appears in. This is done even though the computer-readable representation of the document does not contain information at the granularity of the prediction to be made that identifies which layout type that each of the plurality of document portions belongs in. For each of a plurality of the portions, the machine-learning system predicts the layout type that the respective portion appears in, and indexes the document using the predictions so as to result in a computer-readable index. The index represents a predicted layout type associated with each of the plurality of portions of the document. Thus, the index can be used to search based on position of a searched term within the document.
    Type: Grant
    Filed: September 8, 2021
    Date of Patent: August 27, 2024
    Assignee: Microsoft Technology Licensing, LLC
    Inventors: Yao Li, Liang Du, Robin Abraham