Abstract: Preferred embodiments of the present invention comprise methods and software for processing text documents and extracting chemical data therein. Preferred method embodiments comprise: (a) identifying and tagging one or more chemical compounds within a text document; (b) identifying and tagging physical properties related to one or more of those compounds; (c) translating one or more of those compounds into a chemical structure; (d) identifying and tagging one or more chemical reaction descriptions within the text document; and (e) extracting at least some of the tagged information and storing it in a database.
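Steps (a) through (e) of the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the name dictionary, the regular expressions, the SMILES strings used as the "chemical structure", the table layout, and all function names are assumptions chosen for illustration only.

```python
import re
import sqlite3

# Hypothetical name-to-structure lookup (SMILES strings); a real system
# would use a full name-to-structure translator instead of a dictionary.
NAME_TO_SMILES = {
    "ethanol": "CCO",
    "acetic acid": "CC(=O)O",
    "ethyl acetate": "CCOC(C)=O",
}

# Assumed toy patterns: longest names first so multi-word names match whole.
COMPOUND_RE = re.compile(
    r"\b(" + "|".join(re.escape(n) for n in
                      sorted(NAME_TO_SMILES, key=len, reverse=True)) + r")\b",
    re.IGNORECASE,
)
PROPERTY_RE = re.compile(
    r"(melting point|boiling point)\s+of\s+(-?\d+(?:\.\d+)?)\s*°C",
    re.IGNORECASE,
)
REACTION_RE = re.compile(
    r"[^.]*\b(?:reacted with|treated with)\b[^.]*\.", re.IGNORECASE
)

SAMPLE_TEXT = (
    "Ethanol was reacted with acetic acid to give ethyl acetate. "
    "Ethyl acetate has a boiling point of 77 °C."
)


def tag_compounds(text):
    """Step (a): return (name, start, end) for each known compound name."""
    return [(m.group(1).lower(), m.start(), m.end())
            for m in COMPOUND_RE.finditer(text)]


def tag_properties(text, compounds):
    """Step (b): attach each property to the nearest preceding compound."""
    tagged = []
    for m in PROPERTY_RE.finditer(text):
        preceding = [c for c in compounds if c[2] <= m.start()]
        if preceding:
            tagged.append((preceding[-1][0], m.group(1).lower(),
                           float(m.group(2))))
    return tagged


def tag_reactions(text):
    """Step (d): return sentences that look like reaction descriptions."""
    return [m.group(0).strip() for m in REACTION_RE.finditer(text)]


def extract_to_db(text):
    """Steps (c) and (e): translate names to structures and store all tags."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE compounds (name TEXT PRIMARY KEY, smiles TEXT)")
    conn.execute(
        "CREATE TABLE properties (compound TEXT, property TEXT, value REAL)")
    conn.execute("CREATE TABLE reactions (description TEXT)")

    compounds = tag_compounds(text)
    for name, _, _ in compounds:
        # Repeated mentions of the same compound are stored once.
        conn.execute("INSERT OR IGNORE INTO compounds VALUES (?, ?)",
                     (name, NAME_TO_SMILES[name]))
    conn.executemany("INSERT INTO properties VALUES (?, ?, ?)",
                     tag_properties(text, compounds))
    conn.executemany("INSERT INTO reactions VALUES (?)",
                     [(r,) for r in tag_reactions(text)])
    conn.commit()
    return conn
```

Calling `extract_to_db(SAMPLE_TEXT)` returns an in-memory SQLite connection whose `compounds`, `properties`, and `reactions` tables hold the tagged information. The dictionary lookup and sentence-level regular expressions stand in for whatever recognition logic an actual embodiment uses.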
Type: Grant
Filed: April 30, 2004
Date of Patent: April 26, 2011
Assignee: MDL Information Systems, GmbH
Inventors: Alexander Johnston Lawson, Stefan Roller, Helmut Grotz, Janusz L. Wisniewski, Libuse Goebels