Patents Assigned to Data.World, Inc.
-
Publication number: 20190347259
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query of a consolidated dataset that may include datasets formatted as atomized datasets, analyzing the query to classify portions of the query to form classified query portions, partitioning the query into sub-queries as a function of a classification type for each of the classified query portions, and retrieving data representing a query result from distributed data repositories.
Type: Application
Filed: April 25, 2019
Publication date: November 14, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
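The classify-then-partition step described in this abstract can be illustrated with a minimal Python sketch. It is not the claimed method: the `CATALOG` mapping, the repository kinds, the shape of a query portion, and the `executors` callables are all hypothetical names introduced here for illustration.

```python
from collections import defaultdict

# Hypothetical catalog: dataset name -> kind of repository that stores it.
CATALOG = {"sales": "triplestore", "weather": "relational", "census": "triplestore"}


def classify(portion):
    """Classify one query portion by the repository kind holding its dataset."""
    return CATALOG.get(portion["dataset"], "unknown")


def partition(query_portions):
    """Group classified query portions into one sub-query per classification type."""
    sub_queries = defaultdict(list)
    for portion in query_portions:
        sub_queries[classify(portion)].append(portion)
    return dict(sub_queries)


def run(query_portions, executors):
    """Send each sub-query to the executor for its type and merge the results."""
    results = []
    for kind, portions in partition(query_portions).items():
        results.extend(executors[kind](portions))
    return results
```

Here each executor stands in for a distributed data repository; a real system would also need to handle portions classified as `"unknown"` and joins that span repositories.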
-
Publication number: 20190347268
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to identify and match equivalent subsets of data between an ingested dataset, such as in a tabular data arrangement, and one or more graph-based data arrangements, according to at least some examples. For example, a method may include identifying a tabular data arrangement including a subset of data as a column, computing a compressed data representation for a column of data, correlating a compressed data representation to reference compressed data representations, detecting a link between a column of data associated with a correlated compressed data representation to a dataset stored in a graph data arrangement, and forming an expanded tabular data arrangement.
Type: Application
Filed: September 20, 2018
Publication date: November 14, 2019
Applicant: data.world, Inc.
Inventor: David Lee Griffith
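One way to picture "computing a compressed data representation for a column" and correlating it with reference representations is a MinHash signature. The abstract does not name a technique, so MinHash is an assumption swapped in here for illustration; the function names, the 64-slot signature size, and the 0.5 threshold are likewise hypothetical.

```python
import hashlib


def minhash_signature(values, num_hashes=64):
    """Summarize a column's distinct values as a fixed-length list of integers."""
    distinct = {str(v).strip().lower() for v in values}
    if not distinct:
        raise ValueError("cannot summarize an empty column")
    signature = []
    for seed in range(num_hashes):
        # Prefixing the seed simulates a family of independent hash functions.
        signature.append(min(
            int.from_bytes(hashlib.sha1(f"{seed}:{v}".encode()).digest()[:8], "big")
            for v in distinct
        ))
    return signature


def estimated_similarity(sig_a, sig_b):
    """Fraction of matching slots, which estimates the Jaccard similarity."""
    return sum(a == b for a, b in zip(sig_a, sig_b)) / len(sig_a)


def find_links(column, reference_signatures, threshold=0.5):
    """Return names of reference datasets whose signature correlates with the column."""
    sig = minhash_signature(column)
    return [name for name, ref in reference_signatures.items()
            if estimated_similarity(sig, ref) >= threshold]
```

The signature has a fixed size regardless of column length, so an ingested column can be compared against many stored references without rescanning the underlying graph data.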
-
Publication number: 20190347258
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query of a consolidated dataset that may include datasets formatted as atomized datasets, analyzing the query to classify portions of the query to form classified query portions, partitioning the query into sub-queries as a function of a classification type for each of the classified query portions, and retrieving data representing a query result from distributed data repositories.
Type: Application
Filed: April 25, 2019
Publication date: November 14, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
-
Patent number: 10452975
Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema by converting the copy of the query into a triple associated with the query and another triple associated with the access control condition.
Type: Grant
Filed: February 22, 2017
Date of Patent: October 22, 2019
Assignee: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
-
Patent number: 10452677
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset having a data format into a dataset ingestion controller configured to form a collaborative dataset, interpreting data of the dataset against data classifications at an inference engine to derive at least an inferred attribute, associating the data with annotative data identifying the inferred attribute, and converting the dataset at a format converter to form an atomized dataset.
Type: Grant
Filed: June 19, 2016
Date of Patent: October 22, 2019
Assignee: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
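The ingestion flow in this abstract (infer an attribute, annotate the data, convert to an atomized form) can be sketched as follows. This assumes, for illustration only, that "atomized" means subject-predicate-object triples; the `CLASSIFIERS` table, the `inferredAttribute` predicate, and the identifier scheme are hypothetical.

```python
import re

# Hypothetical data classifications: name -> predicate over one cell value.
CLASSIFIERS = {
    "zip_code": lambda v: re.fullmatch(r"\d{5}", v) is not None,
    "year": lambda v: re.fullmatch(r"(19|20)\d{2}", v) is not None,
}


def infer_attribute(values):
    """Return the first classification that every value in the column satisfies."""
    for name, matches in CLASSIFIERS.items():
        if values and all(matches(v) for v in values):
            return name
    return None


def atomize(dataset_id, rows):
    """Convert rows (dicts of strings) into triples, plus annotation triples."""
    triples = []
    columns = list(rows[0].keys()) if rows else []
    for col in columns:
        inferred = infer_attribute([row[col] for row in rows])
        if inferred:
            # Annotative data identifying the inferred attribute of the column.
            triples.append((f"{dataset_id}/column/{col}", "inferredAttribute", inferred))
    for i, row in enumerate(rows):
        for col, value in row.items():
            triples.append((f"{dataset_id}/row/{i}", col, value))
    return triples
```

Storing the annotation as an ordinary triple keeps the inferred attribute queryable alongside the data it describes.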
-
Publication number: 20190317961
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to implement computerized tools to identify data classifications and similar subsets of graph-based data arrangements with which to join, according to at least some examples. For example, a method may include determining a classification type for a subset of data based on a graph data arrangement, and generating presentation data as a first user input to detect selection of the first user input. The method may include predicting a classification type for data. The method may also include generating other presentation data configured to join datasets, such as a column of tabular-formatted data with one or more portions of a graph data arrangement.
Type: Application
Filed: April 25, 2019
Publication date: October 17, 2019
Applicant: data.world, Inc.
Inventors: Sharon Brener, Shad William Reynolds, David Lee Griffith
-
Patent number: 10438013
Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema, and optimizing the rewriting by identifying a database engine to execute the query and including other data converted into another triple associated with an attribute of the query.
Type: Grant
Filed: February 22, 2017
Date of Patent: October 8, 2019
Assignee: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
-
Publication number: 20190272279
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset having a data format into a dataset ingestion controller configured to form a collaborative dataset, interpreting data of the dataset against data classifications at an inference engine to derive at least an inferred attribute, associating the data with annotative data identifying the inferred attribute, and converting the dataset at a format converter to form an atomized dataset.
Type: Application
Filed: February 8, 2019
Publication date: September 5, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
-
Publication number: 20190266155
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query into a collaborative dataset consolidation system, identifying datasets relevant to the query, generating one or more queries to access disparate data repositories, and retrieving data representing query results. In some cases, one or more queries are applied (e.g.
Type: Application
Filed: February 27, 2019
Publication date: August 29, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Brett A. Hurt, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen, Joseph Boutros, Alexander John Zelenak
-
Patent number: 10353911
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more computerized tools may be configured to discover, form, and analyze, for example, via one or more user interface applications, interrelations among a system of networked collaborative datasets. In some examples, a method may include causing transformation of a set of data to an atomized format to form an atomized dataset, monitoring creation of a dataset, and presenting data representing a status of a portion of the creation of the dataset. The status may depict an atomized dataset linked to at least one other dataset.
Type: Grant
Filed: March 9, 2017
Date of Patent: July 16, 2019
Assignee: data.world, Inc.
Inventors: Shad William Reynolds, Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Triet Minh Le, Joseph Boutros
-
Patent number: 10346429
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset and dataset attributes and identifying a first version of the dataset. The method may include identifying data that varies from a first version of the dataset, and generating a second version of the dataset to include a first subset and a second subset of atomized data. The method may include storing subsets of atomized data points as an atomized dataset.
Type: Grant
Filed: June 19, 2016
Date of Patent: July 9, 2019
Assignee: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
-
Patent number: 10324925
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query of a consolidated dataset that may include datasets formatted as atomized datasets, analyzing the query to classify portions of the query to form classified query portions, partitioning the query into sub-queries as a function of a classification type for each of the classified query portions, and retrieving data representing a query result from distributed data repositories.
Type: Grant
Filed: June 19, 2016
Date of Patent: June 18, 2019
Assignee: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
-
Publication number: 20190121807
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, network communications to interface among repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate data projects by providing an interactive, project-centric workspace interface that may include, for example, a unified view in which to identify data sources, generate transformative datasets, and/or disseminate insights to collaborative computing devices and user accounts.
Type: Application
Filed: May 22, 2018
Publication date: April 25, 2019
Applicant: data.world, Inc.
Inventors: Joseph Boutros, Sharon Brener, Alexander John Zelenak, Robert Thomas Grochowicz, Mark Joseph DiMarco, Bryon Kristen Jacob, David Lee Griffith, Shad William Reynolds
-
Publication number: 20190095472
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to determine degrees of similarity between at least a subset of data associated with an ingested dataset and one or more equivalent or similar subsets of data associated with one or more graph-based data arrangements, the degrees of similarity facilitating preferences or priorities in joining one or more graph-based data arrangements to the ingested dataset, according to at least some examples. For example, a method may include generating similarity matrices to join an ingested dataset (e.g., tabular dataset) to one or more graph-based datasets in accordance with determining a degree of similarity indication of a dataset with which to join.
Type: Application
Filed: September 20, 2018
Publication date: March 28, 2019
Applicant: data.world, Inc.
Inventor: David Lee Griffith
-
Publication number: 20190079968
Abstract: Various embodiments relate generally to data science and data analysis and computer software and systems to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to transmute associations between data arrangements of different formats or different data models to facilitate data operations, such as queries, configured to enhance, for example, an ingested dataset via link-formative queries to form, for example, interrelations among a system of networked collaborative datasets. For example, a method may include analyzing a dataset to detect data values with which to query against in a link-formative query, applying a link-formative query to a dataset, identifying results of the link-formative query, and forming an enhanced dataset to include results of the link-formative queries in the dataset.
Type: Application
Filed: April 2, 2018
Publication date: March 14, 2019
Applicant: data.world, Inc.
Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
-
Publication number: 20190066052
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, network communications to interface among repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate data projects by providing an interactive, project-centric workspace interface that may include, for example, a unified view in which to identify data sources, generate transformative datasets, and/or disseminate insights to collaborative computing devices and user accounts.
Type: Application
Filed: May 22, 2018
Publication date: February 28, 2019
Applicant: data.world, Inc.
Inventors: Joseph Boutros, Sharon Brener, Alexander John Zelenak, Robert Thomas Grochowicz, Mark Joseph DiMarco, Bryon Kristen Jacob, David Lee Griffith, Shad William Reynolds
-
Publication number: 20190065567
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate development and management of data projects, including implementation of extended computerized query language syntax to analyze, for example, multiple tabular data arrangements in data-driven collaborative projects. For example, a method may include generating data to present a query editor in a data project interface, receiving data representing a first query command to select one or more subsets of data, identifying in the data representing a second query command a subset of datasets from which to extract the data, and applying a query based on a first query command and a second query command.
Type: Application
Filed: July 16, 2018
Publication date: February 28, 2019
Applicant: data.world, Inc.
Inventors: David Lee Griffith, Shad William Reynolds, Bryon Kristen Jacob
-
Publication number: 20190065569
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, network communications to interface among repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate data projects by providing an interactive, project-centric workspace interface that may include, for example, a unified view in which to identify data sources, generate transformative datasets, and/or form queries over a composite data dictionary coupled to collaborative computing devices and user accounts. For example, a method may include forming a first data dictionary, linking a dataset associated with the first data dictionary to another dataset, which may be associated with a second data dictionary, and forming a dynamic composite data dictionary.
Type: Application
Filed: May 22, 2018
Publication date: February 28, 2019
Applicant: Data.World, Inc.
Inventors: Joseph Boutros, Sharon Brener, Alexander John Zelenak, Robert Thomas Grochowicz, Mark Joseph DiMarco, Bryon Kristen Jacob, David Lee Griffith, Shad William Reynolds
-
Publication number: 20190050445
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby logic is configured to remediate anomalies in a dataset originating in a first format prior to enrichment and conversion into a second format that facilitates forming a collaborative dataset and, for example, interrelations among a system of networked collaborative datasets, whereby, at least in some implementations, data interrelations between different formats may be disposed in one or more data layers (e.g., layered data files and/or data arrangements). In some examples, a method may include analyzing data to detect a non-compliant data attribute, detecting a condition based on the non-compliant data attribute, invoking an action to modify a subset of data, and generating a graph data arrangement linkable to other graph data arrangements to form a collaborative dataset.
Type: Application
Filed: March 20, 2018
Publication date: February 14, 2019
Applicant: data.world, Inc.
Inventors: David Lee Griffith, Bryon Kristen Jacob, Shad William Reynolds
-
Publication number: 20190050459
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and more specifically, to a computing and data storage platform configured to provide one or more computerized tools that facilitate development and management of data projects, including implementation of localized link identifiers to perform implicitly federated queries using, in some examples, extended computerized query language syntax to analyze multiple tabular data arrangements in data-driven collaborative projects.
Type: Application
Filed: July 16, 2018
Publication date: February 14, 2019
Applicant: data.world, Inc.
Inventors: David Lee Griffith, Shad William Reynolds, Bryon Kristen Jacob