Patents Assigned to Data World
  • Publication number: 20190066052
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, network communications to interface among repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate data projects by providing an interactive, project-centric workspace interface that may include, for example, a unified view in which to identify data sources, generate transformative datasets, and/or disseminate insights to collaborative computing devices and user accounts.
    Type: Application
    Filed: May 22, 2018
    Publication date: February 28, 2019
    Applicant: data.world, Inc.
    Inventors: Joseph Boutros, Sharon Brener, Alexander John Zelenak, Robert Thomas Grochowicz, Mark Joseph DiMarco, Bryon Kristen Jacob, David Lee Griffith, Shad William Reynolds
  • Publication number: 20190065567
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate development and management of data projects, including implementation of extended computerized query language syntax to analyze, for example, multiple tabular data arrangements in data-driven collaborative projects. For example, a method may include generating data to present a query editor in a data project interface, receiving data representing a first query command to select one or more subsets of data, identifying in the data representing a second query command a subset of datasets from which to extract the data, and applying a query based on a first query command and a second query command.
    Type: Application
    Filed: July 16, 2018
    Publication date: February 28, 2019
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Shad William Reynolds, Bryon Kristen Jacob
  • Publication number: 20190065569
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, network communications to interface among repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate data projects by providing an interactive, project-centric workspace interface that may include, for example, a unified view in which to identify data sources, generate transformative datasets, and/or form queries over a composite data dictionary coupled to collaborative computing devices and user accounts. For example, a method may include forming a first data dictionary, linking a dataset associated with the first data dictionary to another dataset, which may be associated with a second data dictionary, and forming a dynamic composite data dictionary.
    Type: Application
    Filed: May 22, 2018
    Publication date: February 28, 2019
    Applicant: Data.World, Inc.
    Inventors: Joseph Boutros, Sharon Brener, Alexander John Zelenak, Robert Thomas Grochowicz, Mark Joseph DiMarco, Bryon Kristen Jacob, David Lee Griffith, Shad William Reynolds
  • Publication number: 20190050459
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate development and management of data projects, including implementation of localized link identifiers to perform implicitly federated queries using, in some examples, extended computerized query language syntax to analyze multiple tabular data arrangements in data-driven collaborative projects.
    Type: Application
    Filed: July 16, 2018
    Publication date: February 14, 2019
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Shad William Reynolds, Bryon Kristen Jacob
  • Publication number: 20190050445
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby logic is configured to remediate anomalies in a data set originating in a first format prior to enrichment and conversion into a second format that facilitates forming a collaborative dataset and, for example, interrelations among a system of networked collaborative datasets, whereby, at least in some implementations, data interrelations between different formats may be disposed in one or more data layers (e.g., layered data files and/or data arrangements). In some examples, a method may include analyzing data to detect a non-compliant data attribute, detecting a condition based on the non-compliant data attribute, invoking an action to modify a subset of data, and generating a graph data arrangement linkable to other graph data arrangements to form a collaborative dataset.
    Type: Application
    Filed: March 20, 2018
    Publication date: February 14, 2019
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
  • Publication number: 20190042606
    Abstract: Various embodiments relate generally to data science and data analysis and computer software and systems to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to transmute associations between data arrangements of different formats or different data models to facilitate data operations, such as queries, configured to enhance, for example, an ingested dataset via transmuted associations as, for example, interrelations among a system of networked collaborative datasets.
    Type: Application
    Filed: April 2, 2018
    Publication date: February 7, 2019
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
  • Publication number: 20190034491
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby logic is configured to remediate anomalies in a data set originating in a first format prior to enrichment and conversion into a second format that facilitates forming a collaborative dataset and, for example, interrelations among a system of networked collaborative datasets, whereby, at least in some implementations, data interrelations between different formats may be disposed in one or more data layers (e.g., layered data files and/or data arrangements). In some examples, a method may include converting a dataset from a data format at a format converter to form an atomized dataset in a graph data arrangement, the atomized dataset being a collaborative dataset including atomized descriptor data and atomized source data.
    Type: Application
    Filed: March 20, 2018
    Publication date: January 31, 2019
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
  • Publication number: 20180314705
    Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby data ingestion is performed to form data representing layered data files and data arrangements to facilitate, for example, interrelations among a system of networked collaborative datasets. In some examples, a method may include forming a first layer data file and a second layer data file, assigning addressable identifiers to uniquely identify units of data and data units to facilitate the linking of data, and implementing selectively one or more of a unit of data and a data unit as a function of a context of a data access request for a collaborative dataset.
    Type: Application
    Filed: March 20, 2018
    Publication date: November 1, 2018
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
  • Patent number: 10102258
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query into a collaborative dataset consolidation system, identifying datasets relevant to the query, generating one or more queries to access disparate data repositories, and retrieving data representing query results. In some cases, one or more queries are applied (e.g.
    Type: Grant
    Filed: June 19, 2016
    Date of Patent: October 16, 2018
    Assignee: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Arthur Albert Keen, Alexander John Zelenak, Jon Loyens, Brett A. Hurt, Shad William Reynolds, Joseph Boutros
  • Publication number: 20180262864
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more computerized tools may be configured to discover, form, and analyze, for example, via one or more layered data files, interrelations among a system of networked collaborative datasets. In some examples, a method may include transforming of a set of data to an atomized format to form an atomized dataset that includes a derived dataset attribute. The method may also include presenting data representing an annotation at the user interface based on the derived dataset attribute. In some examples, the annotation may be associated with a layer file.
    Type: Application
    Filed: March 9, 2017
    Publication date: September 13, 2018
    Applicant: Data.World, Inc.
    Inventors: Shad William Reynolds, David Lee Griffith, Bryon Kristen Jacob
  • Publication number: 20180210936
    Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems, to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby user interfaces may be implemented as computerized tools for presenting summarization of dataset attributes to facilitate discovery, formation, and analysis of interrelated collaborative datasets. In some examples, a method may include receiving data resulting from insight calculations. Insight calculations may be based on a derived dataset attribute. Also, the method may include presenting a data arrangement overview summarizing the data attributes as an aggregation of data attributes in a portion of the user interface.
    Type: Application
    Filed: March 9, 2017
    Publication date: July 26, 2018
    Applicant: Data.World, Inc.
    Inventors: Shad William Reynolds, David Lee Griffith, Jon Loyens, Bryon Kristen Jacob
  • Publication number: 20170371881
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more computerized tools may be configured to discover, form, and analyze, for example, via one or more user interface applications, interrelations among a system of networked collaborative datasets. In some examples, a method may include causing transformation of a set of data to an atomized format to form an atomized dataset, monitoring creation of a dataset, and presenting data representing a status of a portion of the creation of the dataset. The status may depict an atomized dataset linked to at least one other dataset.
    Type: Application
    Filed: March 9, 2017
    Publication date: December 28, 2017
    Applicant: Data.World, Inc.
    Inventors: Shad William Reynolds, Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Triet Minh Le, Joseph Boutros
  • Publication number: 20170364703
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset and dataset attributes and identifying a first version of the dataset. The method may include identifying data that varies from a first version of the dataset, and generating a second version of the dataset to include a first subset and a second subset of atomized data. The method may include storing subsets of atomized data points as an atomized dataset.
    Type: Application
    Filed: June 19, 2016
    Publication date: December 21, 2017
    Applicant: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Publication number: 20170364568
    Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems, to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more interfaces, such as user interfaces, may be implemented as computerized tools for presenting summarization of dataset attributes to facilitate discovery, formation, and analysis of interrelated collaborative datasets. In some examples, a method may include presenting data representing summary characteristic data in a user interface. This may include user interface elements each specifying a value of a dataset attribute for a collaborative dataset. Also, the method may include presenting aggregated data attributes for a subset of the collaborative dataset associated with the linked atomized datasets.
    Type: Application
    Filed: March 9, 2017
    Publication date: December 21, 2017
    Applicant: Data.World, Inc.
    Inventors: Shad William Reynolds, David Lee Griffith, Jon Loyens, Bryon Kristen Jacob
  • Publication number: 20170364570
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset having a data format into a dataset ingestion controller configured to form a collaborative dataset, interpreting data of the dataset against data classifications at an inference engine to derive at least an inferred attribute, associating the data with annotative data identifying the inferred attribute, and converting the dataset at a format converter to form an atomized dataset.
    Type: Application
    Filed: June 19, 2016
    Publication date: December 21, 2017
    Applicant: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Publication number: 20170364539
    Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema by converting the copy of the query into a triple associated with the query and another triple associated with the access control condition.
    Type: Application
    Filed: February 22, 2017
    Publication date: December 21, 2017
    Applicant: Data.World, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
  • Publication number: 20170364553
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query of a consolidated dataset that may include datasets formatted as atomized datasets, analyzing the query to classify portions of the query to form classified query portions, partitioning the query into sub-queries as a function of a classification type for each of the classified query portions, and retrieving data representing a query result from distributed data repositories.
    Type: Application
    Filed: June 19, 2016
    Publication date: December 21, 2017
    Applicant: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Publication number: 20170364564
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a system may include a data ingestion controller configured to format datasets to form a first and a second atomized dataset, the second atomized dataset including the first atomized dataset and one or more other atomized datasets. The system may include a dataset query engine configured to identify a portion of a dataset relevant to a query, and to retrieve query results from at least one of different data repositories.
    Type: Application
    Filed: June 19, 2016
    Publication date: December 21, 2017
    Applicant: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Arthur Albert Keen, Alexander John Zelenak, Jon Loyens, Brett A. Hurt, Shad William Reynolds, Joseph Boutros
  • Publication number: 20170364538
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a system may include an atomized workflow loader configured to receive an atomized dataset to load into a data store, and to determine resource requirements data to describe at least one resource requirement. The atomized workflow loader may be further configured to select a data store type based on a resource requirement, and perform a load operation of the atomized dataset as a function of the data store type.
    Type: Application
    Filed: June 19, 2016
    Publication date: December 21, 2017
    Applicant: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Publication number: 20170364694
    Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema, and optimizing the rewriting by identifying a database engine to execute the query and including other data converted into another triple associated with an attribute of the query.
    Type: Application
    Filed: February 22, 2017
    Publication date: December 21, 2017
    Applicant: Data.World, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
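
Several of the abstracts above refer to converting a dataset into an "atomized" format in a graph data arrangement and then linking it to other datasets. The listing does not say how the applicant implements this, so the following is only a minimal sketch under stated assumptions: each table cell becomes one (subject, predicate, object) triple, and two datasets are linked when they share a cell value. The function names atomize and link_by_value, and the identifier scheme, are hypothetical and do not come from the patents.

```python
from typing import Dict, List, Set, Tuple

# Assumption: an "atomized" data point is modeled as a plain triple of strings.
Triple = Tuple[str, str, str]


def atomize(dataset_id: str, rows: List[Dict[str, str]]) -> List[Triple]:
    """Turn tabular rows into triples, one per cell (hypothetical scheme)."""
    triples: List[Triple] = []
    for index, row in enumerate(rows):
        subject = f"{dataset_id}/row/{index}"
        for column, value in row.items():
            predicate = f"{dataset_id}/col/{column}"
            triples.append((subject, predicate, str(value)))
    return triples


def link_by_value(a: List[Triple], b: List[Triple]) -> List[Tuple[str, str]]:
    """Pair subjects from a and b that share at least one object value."""
    subjects_by_value: Dict[str, Set[str]] = {}
    for subject, _, value in a:
        subjects_by_value.setdefault(value, set()).add(subject)

    links: Set[Tuple[str, str]] = set()
    for subject, _, value in b:
        for other in subjects_by_value.get(value, ()):
            links.add((other, subject))
    return sorted(links)


if __name__ == "__main__":
    cities = atomize("cities", [{"name": "Austin", "state": "TX"}])
    states = atomize("states", [{"code": "TX", "capital": "Austin"}])
    for left, right in link_by_value(cities, states):
        print(left, "<->", right)
```

Matching on raw cell values is the simplest possible linking rule and is used here only for illustration; the abstracts mention inference engines and derived attributes, which this sketch does not attempt to model.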