Abstract: Systems and methods for aggregating, distilling, and analyzing large scale data sets are disclosed. In some embodiments a task object is generated based on inputs received via a user interface defining actions to perform on one or more data sets. In some embodiments, the task object is forwarded to a pipeline processor configured to invoke a distributed computing cluster to process the data sets according to the defined actions.
Abstract: Systems and methods for aggregating, distilling, and analyzing large scale data sets are disclosed. In some embodiments a task object is generated based on inputs received via a user interface defining actions to perform on one or more data sets. In some embodiments, the task object is forwarded to a pipeline processor configured to invoke a distributed computing cluster to process the data sets according to the defined actions.