Patents Assigned to Data World
  • Publication number: 20230376496
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a system may include an atomized workflow loader configured to receive an atomized dataset to load into a data store, and to determine resource requirements data to describe at least one resource requirement. The atomized workflow loader may be further configured to select a data store type based on a resource requirement, and perform a load operation of the atomized dataset as a function of the data store type.
    Type: Application
    Filed: May 17, 2022
    Publication date: November 23, 2023
    Applicant: data.world.Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Patent number: 11816118
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query into a collaborative dataset consolidation system, identifying datasets relevant to the query, generating one or more queries to access disparate data repositories, and retrieving data representing query results. In some cases, one or more queries are applied (e.g.
    Type: Grant
    Filed: August 22, 2022
    Date of Patent: November 14, 2023
    Assignee: data.world, Inc.
    Inventors: Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Brett A. Hurt, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen, Joseph Boutros, Alexander John Zelenak
  • Publication number: 20230359615
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools to deploy predictive data models based on in-situ auxiliary query commands implemented in a query, and configured to facilitate development and management of data projects by providing an interactive, project-centric workspace interface coupled to collaborative computing devices and user accounts.
    Type: Application
    Filed: May 9, 2022
    Publication date: November 9, 2023
    Applicant: data.world, Inc.
    Inventors: Shad William Reynolds, David Lee Griffith, Bryon Kristen Jacob
  • Patent number: 11775547
    Abstract: Various techniques are disclosed for computerized tools to discover, form, and analyze dataset interrelations among a system of networked collaborative datasets including a repository configured to receive and store a dataset, and a dataset consolidation system configured to receive data to form a first input to initiate creation of a dataset based on a set of data, to activate a programmatic interface, to transform the set of data from a first format to an atomized format to form an atomized dataset, to monitor the creation of the dataset, to present data representing a status of a portion of the creation of the dataset, to calculate automatically dataset attributes of the linked dataset, to generate a plurality of sub-queries, and to retrieve data representing query results from the at least one of the different data repositories.
    Type: Grant
    Filed: December 27, 2021
    Date of Patent: October 3, 2023
    Assignee: data.world, Inc.
    Inventors: Shad William Reynolds, Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Triet Minh Le, Joseph Boutros
  • Patent number: 11755602
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and data-driven control systems and algorithms based on graph-based data arrangements, among other things, and, more specifically, to a computing platform configured to receive or analyze datasets in parallel by implementing, for example, parallel computing processor systems to correlate subsets of parallelized data from disparately-formatted data sources to identify entity data and to aggregate graph data portions. In some examples, a method may include classifying data parallelized data to identify a class of observation data, constructing one or more content graphs in a graph data format, correlating parallelized data to other subsets of parallelized data associated with a class of observation data; and aggregating observation data to represent an individual entity.
    Type: Grant
    Filed: August 30, 2021
    Date of Patent: September 12, 2023
    Assignee: data.world, Inc.
    Inventors: Shawn Andrew Pardue Smith, Bryon Kristen Jacob
  • Patent number: 11734564
    Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema by converting the copy of the query into a triple associated with the query and another triple associated with the access control condition.
    Type: Grant
    Filed: May 16, 2022
    Date of Patent: August 22, 2023
    Assignee: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
  • Patent number: 11726992
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query of a consolidated dataset that may include datasets formatted atomized datasets, analyzing the query to classify portions of the query to form classified query portions, partitioning the query into sub-queries as a function of a classification type for each of the classified query portions, and retrieving data representing a query result from distributed data repositories.
    Type: Grant
    Filed: April 25, 2022
    Date of Patent: August 15, 2023
    Assignee: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Patent number: 11675808
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset having a data format into a dataset ingestion controller configured to form a collaborative dataset, interpreting data of the dataset against data classifications at an inference engine to derive at least an inferred attribute, associating the data with annotative data identifying the inferred attribute, and converting the dataset at a format converter to form an atomized dataset.
    Type: Grant
    Filed: January 31, 2022
    Date of Patent: June 13, 2023
    Assignee: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
  • Patent number: 11669540
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to identify and match equivalent subsets of data between an ingested dataset, such as in a tabular data arrangement, and one or more graph-based data arrangements, according to at least some examples. For example, a method may include identifying a tabular data arrangement including a subset of data as a column, computing a compressed data representation for a column of data, correlating a compressed data representation to a reference compressed data representations, detecting a link between a column of data associated with a correlated compressed data representation to a dataset stored in a graph data arrangement, and forming an expanded tabular data arrangement.
    Type: Grant
    Filed: August 27, 2020
    Date of Patent: June 6, 2023
    Assignee: data.world, Inc.
    Inventor: David Lee Griffith
  • Publication number: 20230169124
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to implement computerized tools to identify data classifications and similar subsets of graph-based data arrangements with which to join, according to at least some examples. For example, a method may include determining a classification type for a subset of data based on a graph data arrangement, generating presentation data as a first user input to detect selection of the first user input. The method may include predicting a classification type for data. The method may also include generating other presentation data configured to join datasets, such as a column of tabular-formatted data with one or more portions of a graph data arrangement.
    Type: Application
    Filed: November 30, 2021
    Publication date: June 1, 2023
    Applicant: data.world, Inc.
    Inventors: Sharon Brener, Shad William Reynolds, David Lee Griffith
  • Publication number: 20230169123
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to implement computerized tools to facilitate expedited queries based on query results generated by disparate computing and database architectures, according to at least some examples. For example, a method may include generating multiple results of queries, storing the multiple results of queries in a memory, and linking each of the multiple results of queries to a portion of data stored in a graph. The method can include receiving data representing a query and accessing data representing quiescent data values to form a query result.
    Type: Application
    Filed: November 30, 2021
    Publication date: June 1, 2023
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Shad William Reynolds
  • Patent number: 11657089
    Abstract: A system and method for creating and editing graph schema data structures in a collaborative, real time graphical editing environment is disclosed. The system and method further operates to integrate external sources of data as components of the graph schema. The system and method outputs definition files for the graph schema data structure and mapping files that enable the population of a graph database.
    Type: Grant
    Filed: August 31, 2019
    Date of Patent: May 23, 2023
    Assignee: data.world, Inc.
    Inventors: Juan Federico Sequeda, Wayne Phillip Heideman, Daniel Paul Miranker
  • Patent number: 11657043
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, network communications to interface among repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate data projects by providing an interactive, project-centric workspace interface that may include, for example, a unified view in which to identify data sources, generate transformative datasets, and/or disseminate insights to collaborative computing devices and user accounts.
    Type: Grant
    Filed: May 28, 2021
    Date of Patent: May 23, 2023
    Assignee: data.world, Inc.
    Inventors: Joseph Boutros, Sharon Brener, Alexander John Zelenak, Robert Thomas Grochowicz, Mark Joseph DiMarco, Bryon Kristen Jacob, David Lee Griffith, Shad William Reynolds
  • Publication number: 20230153312
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query into a collaborative dataset consolidation system, identifying datasets relevant to the query, generating one or more queries to access disparate data repositories, and retrieving data representing query results. In some cases, one or more queries are applied (e.g.
    Type: Application
    Filed: August 22, 2022
    Publication date: May 18, 2023
    Applicant: data.world, Inc.
    Inventors: Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Brett A. Hurt, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen, Joseph Boutros, Alexander John Zelenak
  • Publication number: 20230109821
    Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby data ingestion is performed to form data representing layered data files and data arrangements to facilitate, for example, interrelations among a system of networked collaborative datasets. In some examples, a method may include forming a first layer data file and a second layer data file, assigning addressable identifiers to uniquely identify units of data and data units to facilitate the linking of data, and implementing selectively one or more of a unit of data and a data unit as a function of a context of a data access request for a collaborative dataset.
    Type: Application
    Filed: September 2, 2022
    Publication date: April 13, 2023
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
  • Publication number: 20230105459
    Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby data ingestion is performed to form data representing layered data files and data arrangements to facilitate, for example, interrelations among a system of networked collaborative datasets. In some examples, a method may include forming a first layer data file and a second layer data file, assigning addressable identifiers to uniquely identify units of data and data units to facilitate the linking of data, and implementing selectively one or more of a unit of data and a data unit as a function of a context of a data access request for a collaborative dataset.
    Type: Application
    Filed: September 6, 2022
    Publication date: April 6, 2023
    Applicant: data.world, Inc.
    Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
  • Patent number: 11620301
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate development and management of data projects, including implementation of extended computerized query language syntax to analyze, for example, multiple tabular data arrangements in data-driven collaborative projects. For example, a method may include generating data to present a query editor in a data project interface, receiving data representing a first query command to select one or more subsets of data, identifying in the data representing a second query command a subset of datasets from which to extract the data, and applying a query based on a first query command and a second query command.
    Type: Grant
    Filed: May 27, 2021
    Date of Patent: April 4, 2023
    Assignee: data.world, Inc.
    Inventors: David Lee Griffith, Shad William Reynolds, Bryon Kristen Jacob
  • Patent number: 11609680
    Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems, to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more interfaces, such as user interfaces, may be implemented as computerized tools for presenting summarization of dataset attributes to facilitate discovery, formation, and analysis of interrelated collaborative datasets. In some examples, a method may include presenting data representing summary characteristic data in a user interface. This may include user interface elements each specifying a value of a dataset attribute for a collaborative dataset. Also, the method may include presenting aggregated data attributes for a subset of the collaborative dataset associated with the linked atomized datasets.
    Type: Grant
    Filed: February 25, 2021
    Date of Patent: March 21, 2023
    Assignee: data.world, Inc.
    Inventors: Shad William Reynolds, David Lee Griffith, Jon Loyens, Bryon Kristen Jacob
  • Patent number: 11573948
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to implement predict data constraints to validate one or more portions of a dataset, according to at least some examples. For example, a method may include predicting a subset of constraint data to validate a graph-based data arrangement, and analyzing the graph-based data arrangement against a subset of constraint data to determine an action. At least one action may include validating data in a graph-based data arrangement. Also, the method may include integrating graph-based data arrangement into a graph data arrangement responsive to determining data representing a validation.
    Type: Grant
    Filed: January 29, 2021
    Date of Patent: February 7, 2023
    Assignee: data.world, Inc.
    Inventor: David Lee Griffith
  • Publication number: 20230009198
    Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query of a consolidated dataset that may include datasets formatted atomized datasets, analyzing the query to classify portions of the query to form classified query portions, partitioning the query into sub-queries as a function of a classification type for each of the classified query portions, and retrieving data representing a query result from distributed data repositories.
    Type: Application
    Filed: April 25, 2022
    Publication date: January 12, 2023
    Applicant: data.world, Inc.
    Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, John Loyens, Brett A. Hurt, Arthur Albert Keen