Patents Assigned to Elsevier, Inc.
-
Publication number: 20260099532Abstract: A method for classifying a document into a hierarchical taxonomy associated with a corpus of documents, the document being associated with document information, the hierarchical taxonomy comprising a plurality of levels with each level comprising one or more nodes, each node comprising a label; the method may include inputting the taxonomy and the document information into a large language model, inputting a prompt into the large language model to cause the large language model to output one or more nodes of the taxonomy for classifying the document based on the document information, and classifying the document into each of the nodes output by the large language model.Type: ApplicationFiled: October 3, 2025Publication date: April 9, 2026Applicant: Elsevier, Inc.Inventors: Seyedamin Tabatabaei, Georgios Tsatsaronis, Michael Parsons, Georgia Hellard Timm, Sarah Fancher, Gregory J. Gordon
-
Patent number: 11704922Abstract: A method of extracting information from a flowchart image comprising a plurality of closed-shaped data nodes having text enclosed within, connecting lines connecting the plurality of closed-shaped data nodes and free text adjacent to the connecting lines includes receiving the flowchart image, detecting the closed-shaped data nodes, localizing the text enclosed within the closed-shaped data nodes, and masking the localized text.to generate an annotated image. Lines in the annotated image are the detected to reconstruct them as closed-shaped data nodes and connecting lines. A tree frame with the plurality of closed-shaped data nodes and the connecting lines is extracted. The free text is then localized. Chunks of the free text oriented and positioned proximally together are assembled into text blocks using an orientation-based two-dimensional clustering.Type: GrantFiled: August 10, 2021Date of Patent: July 18, 2023Assignee: ELSEVIER, INC.Inventors: Atul Kakrana, Kaushik Raha
-
Patent number: 11687734Abstract: A method for performing a search for a result set of documents comprises receiving, at a computing device, an electronic document, identifying a numerical value in the document, extracting the numerical value and a portion of text surrounding the numerical value from the document to obtain extracted text, creating a vector representation of the extracted text, generating a series of questions associated with the extracted text, generating answers to the series of questions based on the vector representation of the extracted text, determining a context associated with the numerical value based on the answers to the plurality of questions, and storing the numerical value and the context associated with the numerical value in a database.Type: GrantFiled: July 2, 2020Date of Patent: June 27, 2023Assignee: ELSEVIER, INC.Inventors: Corey A. Harper, Jessica Rose Cox, Antony Jason Scerri, Ronald E. Daniel, Jr.
-
Patent number: 11550835Abstract: A method of automatically generating content summaries for topics includes receiving a taxonomy for a concept and a text corpus. The method further includes generating an annotated dataset having term annotations corresponding to the concept from the text corpus based on the taxonomy, parsing the annotated dataset into a custom generated document object having a structured layout, determining features for the term annotations, and extracting snippets from the custom generated document object, where each of the snippets corresponds to a section of the custom generated document object. The method further includes scoring the snippets based on the features such that each of the snippets corresponds to a score, filtering one or more snippets from the snippets when one or more snippet filtering conditions is met, ranking the snippets into an ordered list for the concept based on the score, and providing, to a user computing device, the ordered list.Type: GrantFiled: June 15, 2018Date of Patent: January 10, 2023Assignee: ELSEVIER, INC.Inventors: Marius Doornenbal, Srinivasa Satya Sameer Kumar Chivukula, Judson Dunham, Rick Misra, Michelle Gregory
-
Patent number: 11537788Abstract: Methods, systems, and non-transitory media for training a chemical entity recognition system to extract chemical compounds from a patent document and determine a relevance of the chemical compounds to the patent document are disclosed. A method includes obtaining patent documents from patent databases, normalizing each patent document into a unified format, and generating a chemical patent corpus. The chemical patent corpus includes chemical entities, each having relevancy annotations that indicate a relevance to the patent document from which the chemical entity is extracted.Type: GrantFiled: March 6, 2019Date of Patent: December 27, 2022Assignees: Elsevier, Inc.Inventors: Saber A. Akhondi, Hinnerk Rey, Markus Schwoerer, Heike Nau, Gabriele Ilchmann, Matthias Irmer, Claudia Bobach
-
Patent number: 11366814Abstract: A method comprises receiving at a computing device, a search query, performing, by the computing device, a semantic analysis of the search query to identify one or more semantic concepts contained within the query, selecting, by the computing device, one or more corpora, or portions thereof, based on the identified semantic concepts, and performing, by the computing device, a search of the one or more corpora based on the search query.Type: GrantFiled: March 20, 2020Date of Patent: June 21, 2022Assignee: ELSEVIER, Inc.Inventor: Keith Gutfreund
-
Patent number: 11226999Abstract: Systems, methods, and readable memory for providing recommendations. A method includes receiving data corresponding to one or more user interactions with a user interface, where the one or more user interactions indicate a research topic, searching one or more databases for references relating to the research topic, extracting names from the references, the names corresponding to potential collaborators, placing the names into a ranked list, where the names are arranged in the ranked list according to a predicted relevance to a user, and providing the ranked list via the user interface to the user.Type: GrantFiled: October 4, 2018Date of Patent: January 18, 2022Assignee: ELSEVIER, INC.Inventors: Antonio Gulli, Maya Hristakeva, Kris Jack
-
Patent number: 11151372Abstract: A method of extracting information from a flowchart image comprising a plurality of closed-shaped data nodes having text enclosed within, connecting lines connecting the plurality of closed-shaped data nodes and free text adjacent to the connecting lines includes receiving the flowchart image, detecting the closed-shaped data nodes, localizing the text enclosed within the closed-shaped data nodes, and masking the localized text.to generate an annotated image. Lines in the annotated image are the detected to reconstruct them as closed-shaped data nodes and connecting lines. A tree frame with the plurality of closed-shaped data nodes and the connecting lines is extracted. The free text is then localized. Chunks of the free text oriented and positioned proximally together are assembled into text blocks using an orientation-based two-dimensional clustering.Type: GrantFiled: October 9, 2019Date of Patent: October 19, 2021Assignee: ELSEVIER, INC.Inventors: Atul Kakrana, Kaushik Raha
-
Publication number: 20210303604Abstract: Systems and methods for indexing geological features are disclosed. In one embodiment, a method for indexing geological features includes accessing a database storing a plurality of map objects that originate from documents. Each map object includes a map defined by a geographical boundary and a text caption. The method includes, for each map object, determining a plurality of geohashes within the geographical boundary, and includes, for each map object, comparing terms of the text caption with a list of geological keywords. For each map object, the method includes identifying one or more geological noun phrases within the text caption that match one or more geological noun phrases of the list. The method includes determining, for each geological noun phrase, one or more geohashes associated with the geological noun phrase and, for each geohash, determining a frequency that the geohash is associated with the geological noun phrase.Type: ApplicationFiled: June 10, 2021Publication date: September 30, 2021Applicant: Elsevier, Inc.Inventors: Corey A. Harper, Chi Yeung Cheung, Sandra Merten, Antony Jason Scerri
-
Patent number: 11061940Abstract: Systems and methods for indexing geological features are disclosed. In one embodiment, a method for indexing geological features includes accessing a database storing a plurality of map objects that originate from documents. Each map object includes a map defined by a geographical boundary and a text caption. The method includes, for each map object, determining a plurality of geohashes within the geographical boundary, and includes, for each map object, comparing terms of the text caption with a list of geological keywords. For each map object, the method includes identifying one or more geological noun phrases within the text caption that match one or more geological noun phrases of the list. The method includes determining, for each geological noun phrase, one or more geohashes associated with the geological noun phrase and, for each geohash, determining a frequency that the geohash is associated with the geological noun phrase.Type: GrantFiled: October 9, 2019Date of Patent: July 13, 2021Assignee: ELSEVIER, INC.Inventors: Corey A. Harper, Chi Yeung Cheung, Sandra Merten, Antony Jason Scerri
-
Patent number: 11061873Abstract: Systems and methods for normalizing and searching electronic data, such as chemical material property data, are disclosed. In one embodiment, a method includes receiving electronic data from a source. The electronic data is formatted in a source format. The method further includes converting the source data into a normalized format, and storing normalized electronic data in levels of a nested model. The method further includes receiving a search or browse query directed toward normalized properties in a first level of the nested model or a second level of the nested model, in any non-hierarchical order. The method also includes searching the nested model and causing for display on an electronic display one or more entities satisfying the query and maintaining the integrity of all parameters of the query across all selected properties queried in any non-hierarchical order.Type: GrantFiled: June 15, 2017Date of Patent: July 13, 2021Assignee: ELSEVIER, INC.Inventors: Venkatesh Natarajan, Yusufee Nathani, Avin Sijariya, Chi Yeung Cheung
-
Publication number: 20210110150Abstract: A method of extracting information from a flowchart image comprising a plurality of closed-shaped data nodes having text enclosed within, connecting lines connecting the plurality of closed-shaped data nodes and free text adjacent to the connecting lines includes receiving the flowchart image, detecting the closed-shaped data nodes, localizing the text enclosed within the closed-shaped data nodes, and masking the localized text.to generate an annotated image. Lines in the annotated image are the detected to reconstruct them as closed-shaped data nodes and connecting lines. A tree frame with the plurality of closed-shaped data nodes and the connecting lines is extracted. The free text is then localized. Chunks of the free text oriented and positioned proximally together are assembled into text blocks using an orientation-based two-dimensional clustering.Type: ApplicationFiled: October 9, 2019Publication date: April 15, 2021Applicant: Elsevier, Inc.Inventors: Atul KAKRANA, Kaushik RAHA
-
Publication number: 20210109607Abstract: A method of generating a user affect prediction includes receiving a label for a user-reported affect corresponding to interactions with the user interface, receiving events corresponding to the interactions with the user interface, identifying one or more patterns of the events as one or more gestures and extracting one or more features of the gestures. The method uses a machine learning model to generate a user affect prediction based on the training features. The user affect prediction represents a predicted user affect corresponding to the interactions with the user interface. The machine learning model may be trained by modifying one or more parameters of the machine learning model using a difference between the label and the generated user affect prediction.Type: ApplicationFiled: October 14, 2020Publication date: April 15, 2021Applicant: Elsevier, Inc.Inventors: Steven Stalzer, Paul D. Crockett, Gabriel Gabra Zaccak
-
Publication number: 20210004586Abstract: Methods, systems, and non-transitory media for training a chemical entity recognition system to extract chemical compounds from a patent document and determine a relevance of the chemical compounds to the patent document are disclosed. A method includes obtaining patent documents from patent databases, normalizing each patent document into a unified format, and generating a chemical patent corpus. The chemical patent corpus includes chemical entities, each having relevancy annotations that indicate a relevance to the patent document from which the chemical entity is extracted.Type: ApplicationFiled: March 6, 2019Publication date: January 7, 2021Applicant: Elsevier, Inc.Inventors: Saber A. AKHONDI, Hinnerk REY, Markus SCHWOERER, Heike NAU, Gabriele ILCHMANN, Matthias IRMER, Claudia BOBACH
-
Publication number: 20200394197Abstract: A method comprises receiving at a computing device, a search query, performing, by the computing device, a semantic analysis of the search query to identify one or more semantic concepts contained within the query, selecting, by the computing device, one or more corpora, or portions thereof, based on the identified semantic concepts, and performing, by the computing device, a search of the one or more corpora based on the search query.Type: ApplicationFiled: March 20, 2020Publication date: December 17, 2020Applicant: Elsevier, Inc.Inventor: Keith Gutfreund
-
Patent number: 10826781Abstract: A method for extracting structure from networks includes receiving an edge list, where the edge list defines a network including nodes and edges connecting the nodes to each other, where the edges define a strength of a relationship between connected nodes and filtering nodes from the edge list based on a predetermined filter parameter, thereby forming a filtered network. The method further includes identifying distinct connected components within the filtered network, analyzing each of the distinct connected components of the filtered network for the presence of additional structures within the distinct connected components, where the additional structures are decomposed into additional distinct connected components.Type: GrantFiled: July 31, 2018Date of Patent: November 3, 2020Assignee: ELSEVIER, INC.Inventors: Matt Hobby, Barry Norton, Jacek Szejda, Peter Wooldridge
-
Patent number: 10817582Abstract: A system, method, and electronic device for providing concomitant augmentation via learning interstitials for publications includes activating a scan mode, where the scan mode causes a camera to capture image data; determining the presence of a publication captured in the image data; and analyzing the image data to determine the presence of an augmented reality (AR) identifier. In response to identifying the presence of the AR identifier within the publication captured in the image data, the image data and an AR link that corresponds to the AR identifier is displayed as an AR overlay to the image data of the publication. In response to failing to identify the AR identifier within the publication, a user is prompted to input a page number of the publication; and the AR link that corresponds to the page number of the publication input by the user is displayed in a list view.Type: GrantFiled: July 19, 2019Date of Patent: October 27, 2020Assignee: Elsevier, Inc.Inventors: Hans-Frederick Brown, Christian Michael Fazio, Ethan Paul Furstoss, Gboinyee Kevin Tarr, Susanne Marcy Cohen, Daniel Dewitt Barber
-
Patent number: 10740560Abstract: Systems and methods of extracting funding information from text are disclosed herein. The method includes receiving a text document, extracting paragraphs from the text document using a natural language processing model or a machine learning model, and classifying, using a machine learning classifier, the paragraphs as having funding information or not having funding information. The method further includes labeling, using a first annotator, potential entities within the paragraphs classified as having funding information, and labeling, using a second annotator, potential entities within the paragraphs classified as having funding information, where the first annotator implements a first named-entity recognition model and the second annotator implements a second named-entity recognition model that is different from the first named-entity recognition model.Type: GrantFiled: June 27, 2018Date of Patent: August 11, 2020Assignee: Elsevier, Inc.Inventors: Michelle Gregory, Subhradeep Kayal, Georgios Tsatsaronis, Zubair Afzal
-
Publication number: 20200042545Abstract: Systems and methods for indexing geological features are disclosed. In one embodiment, a method for indexing geological features includes accessing a database storing a plurality of map objects that originate from documents. Each map object includes a map defined by a geographical boundary and a text caption. The method includes, for each map object, determining a plurality of geohashes within the geographical boundary, and includes, for each map object, comparing terms of the text caption with a list of geological keywords. For each map object, the method includes identifying one or more geological noun phrases within the text caption that match one or more geological noun phrases of the list. The method includes determining, for each geological noun phrase, one or more geohashes associated with the geological noun phrase and, for each geohash, determining a frequency that the geohash is associated with the geological noun phrase.Type: ApplicationFiled: October 9, 2019Publication date: February 6, 2020Applicant: Elsevier, Inc.Inventors: Corey A. Harper, Chi Yeung Cheung, Sandra Merten, Antony Jason Scerri
-
Publication number: 20200026737Abstract: A system, method, and electronic device for providing concomitant augmentation via learning interstitials for publications includes activating a scan mode, where the scan mode causes a camera to capture image data; determining the presence of a publication captured in the image data; and analyzing the image data to determine the presence of an augmented reality (AR) identifier. In response to identifying the presence of the AR identifier within the publication captured in the image data, the image data and an AR link that corresponds to the AR identifier is displayed as an AR overlay to the image data of the publication. In response to failing to identify the AR identifier within the publication, a user is prompted to input a page number of the publication; and the AR link that corresponds to the page number of the publication input by the user is displayed in a list view.Type: ApplicationFiled: July 19, 2019Publication date: January 23, 2020Applicant: Elsevier, Inc.Inventors: Hans-Frederick Brown, Christian Michael Fazio, Ethan Paul Furstoss, Gboinyee Kevin Tarr, Susanne Marcy Cohen, Daniel Dewitt Barber