Abstract: Provided are systems and methods for building a domain-specific facts network. A system includes an optical character recognition (OCR) system configured to perform OCR on an image of a domain-specific document. The system also includes an OCR results analysis system configured to analyze the results of the OCR of the domain-specific document. The system also includes a fact extraction system configured to extract data from the domain-specific document based on the analysis of the results of the OCR. The system also includes a web fact extraction system configured to extract data from the Internet, wherein the data is related to the data in the domain-specific document. The system also includes a validation system configured to validate the data extracted from the domain-specific document and the Internet. The validated data is stored in a domain-specific facts network.
Type:
Application
Filed:
November 17, 2014
Publication date:
March 5, 2015
Applicant:
Glenbrook Networks
Inventors:
Julia Komissarchik, Edward Komissarchik
Abstract: Provided are a system and methods for automatically assigning classification codes to a business based on information about the business collected from the Internet. Data extracted by trawling the Internet is compared to a node structure based on a taxonomy of a selected business classification code system.
Type:
Grant
Filed:
March 14, 2013
Date of Patent:
February 24, 2015
Assignee:
Glenbrook Networks
Inventors:
Julia Komissarchik, Edward Komissarchik
Abstract: Provided are a system and methods for automatically assigning classification codes to a business based on information about the business collected from the Internet. Data extracted by trawling the Internet is compared to a node structure based on a taxonomy of a selected business classification code system.
Type:
Application
Filed:
March 14, 2013
Publication date:
September 18, 2014
Applicant:
Glenbrook Networks
Inventors:
Julia Komissarchik, Edward Komissarchik
Abstract: Provided are a system and methods for automatically generating a temporal social network. A method includes extracting a plurality of emails from an email server and extracting pre-facts from the plurality of emails. The method further includes navigating the Internet and extracting pre-facts from the Internet that are related to the pre-facts extracted from the plurality of emails and to facts already stored in a temporal social network database. The method further includes determining which pre-facts can be declared facts and storing the facts in the temporal social network database.
Type:
Application
Filed:
March 13, 2014
Publication date:
July 10, 2014
Applicant:
Glenbrook Networks
Inventors:
Julia Komissarchik, Edward Komissarchik, Charles W. Stryker
Abstract: Provided are methods and systems that extract facts from unstructured documents and build an oracle for various domains. The present invention addresses the problem of efficiently finding and extracting facts about a particular subject domain from semi-structured and unstructured documents, infers new facts from the extracted facts, and provides ways of verifying the facts, thus becoming a source of knowledge about the domain that can be effectively queried. The methods and systems can also extract temporal information from unstructured and semi-structured documents, and can find and extract dynamically generated documents from the Deep or Dynamic Web.
Type:
Grant
Filed:
March 13, 2013
Date of Patent:
March 25, 2014
Assignee:
Glenbrook Networks
Inventors:
Julia Komissarchik, Edward Komissarchik
Abstract: Provided are methods and systems that extract facts from unstructured documents and build an oracle for various domains. The present invention addresses the problem of efficiently finding and extracting facts about a particular subject domain from semi-structured and unstructured documents, infers new facts from the extracted facts, and provides ways of verifying the facts, thus becoming a source of knowledge about the domain that can be effectively queried. The methods and systems can also extract temporal information from unstructured and semi-structured documents, and can find and extract dynamically generated documents from the Deep or Dynamic Web.
Type:
Grant
Filed:
March 13, 2013
Date of Patent:
December 31, 2013
Assignee:
Glenbrook Networks
Inventors:
Julia Komissarchik, Edward Komissarchik
Abstract: Provided are methods and systems that extract facts from unstructured documents and build an oracle for various domains. The present invention addresses the problem of efficiently finding and extracting facts about a particular subject domain from semi-structured and unstructured documents, infers new facts from the extracted facts, and provides ways of verifying the facts, thus becoming a source of knowledge about the domain that can be effectively queried. The methods and systems can also extract temporal information from unstructured and semi-structured documents, and can find and extract dynamically generated documents from the Deep or Dynamic Web.
Type:
Grant
Filed:
July 9, 2010
Date of Patent:
August 14, 2012
Assignee:
Glenbrook Networks
Inventors:
Edward Komissarchik, Julia Komissarchik
Abstract: Provided are methods and systems that extract facts from unstructured documents and build an oracle for various domains. The present invention addresses the problem of efficiently finding and extracting facts about a particular subject domain from semi-structured and unstructured documents, infers new facts from the extracted facts, and provides ways of verifying the facts, thus becoming a source of knowledge about the domain that can be effectively queried. The methods and systems can also extract temporal information from unstructured and semi-structured documents, and can find and extract dynamically generated documents from the Deep or Dynamic Web.
Type:
Grant
Filed:
June 13, 2005
Date of Patent:
November 18, 2008
Assignee:
Glenbrook Networks
Inventors:
Julia Komissarchik, Edward Komissarchik