Abstract: Aspects of the subject technology include an event processing and prospect identifying platform. It accepts as input a set of storylines (a sequence of entities and their relationships) and processes them as follows: (1) uses different algorithms (LDA, SVM, information gain, rule sets) to identify themes from storylines; (2) identifies top locations and times in storylines and combines with themes to generate events that are meaningful in a specific scenario for categorizing storylines; and (3) extracts top prospects as people and organizations from data elements contained in storylines. The output comprises sets of events in different categories and storylines under them along with top prospects identified. Aspects use in-memory distributed processing that scales to high data volumes and categorizes generated storylines in near real-time.
Abstract: A method and apparatus for recognizing and extracting data from a form depicted within an image of a document are described. The method may include receiving the image of the document, the image depicting the form and data contained one the form. The method may also include transforming the image of the document to a set of one or more key, value pairs by processing the image of the document with a sequence of two or more trained machine learning based image analysis processes, wherein keys are relevant to forms of the type depicted in the form, and wherein each value is associated with a key. The method may also include generating a data output that comprises the set of key, value pairs for textual data recognized and extracted from the form depicted in the image.
Type:
Grant
Filed:
October 22, 2018
Date of Patent:
December 15, 2020
Assignee:
OMNISCIENCE CORPORATION
Inventors:
Alexander Wesley Contryman, Jacob Ryan van Gogh, Manu Shukla
Abstract: Aspects of the present disclosure relate to a distributed storytelling framework. A server receives an adjacency list comprising a set of nodes linked together by edges. The server converts the adjacency list to a set of generated storylines, each storyline being represented as a key-value pair. A key represents a first node and a value represents a second node linked to the first node by an edge. The server combines first and second storylines, of the set of generated storylines, to generate an additional storyline in response to a value from a first storyline matching a key from a second storyline. The additional storyline includes a single key and multiple values, and is added to the set of generated storylines. The server repeats combining storylines, of the set of generated storylines, to generate additional storylines. The server provides an output corresponding to at least one of the generated storylines.
Abstract: Drones have become ubiquitous in performing risky and labor intensive areal tasks cheaply and safely. To allow them to be autonomous, their flight plan needs to be pre-built for them. Existing works do not precalculate flight paths but instead focus on navigation through camera based image processing techniques, genetic or geometric algorithms to guide the drone during flight. That makes flight navigation complex and risky. We present automated flight plan builder DIFPL which pre-builds flight plans for drones to survey a large area. The flight plans are built for subregions and fed into drones which allow them to navigate autonomously. DIFPL employs distributed paradigm on Hadoop MapReduce framework. Distribution is achieved by processing sections or subregions in parallel. Experiments performed with network and elevation datasets validate the efficiency of DIFPL in building optimal flight plans.