Patents Assigned to Data.World, Inc.
-
Publication number: 20200252767
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more computerized tools may be configured to discover, form, and analyze, for example, via one or more layered data files, interrelations among a system of networked collaborative datasets. In some examples, a method may include transforming of a set of data to an atomized format to form an atomized dataset that includes a derived dataset attribute. The method may also include presenting data representing an annotation at the user interface based on the derived dataset attribute. In some examples, the annotation may be associated with a layer file.
Type: Application
Filed: December 31, 2019
Publication date: August 6, 2020
Applicant: data.world, Inc.
Inventors: Shad William Reynolds, David Lee Griffith, Bryon Kristen Jacob
-
Publication number: 20200252766
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more computerized tools may be configured to discover, form, and analyze, for example, via one or more layered data files, interrelations among a system of networked collaborative datasets. In some examples, a method may include transforming of a set of data to an atomized format to form an atomized dataset that includes a derived dataset attribute. The method may also include presenting data representing an annotation at the user interface based on the derived dataset attribute. In some examples, the annotation may be associated with a layer file.
Type: Application
Filed: December 31, 2019
Publication date: August 6, 2020
Applicant: data.world, Inc.
Inventors: Shad William Reynolds, David Lee Griffith, Bryon Kristen Jacob
-
Publication number: 20200218723
Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema, and optimizing the rewriting by identifying a database engine to execute the query and including other data converted into another triple associated with an attribute of the query.
Type: Application
Filed: June 28, 2019
Publication date: July 9, 2020
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
-
Patent number: 10699027
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a system may include an atomized workflow loader configured to receive an atomized dataset to load into a data store, and to determine resource requirements data to describe at least one resource requirement. The atomized workflow loader may be further configured to select a data store type based on a resource requirement, and perform a load operation of the atomized dataset as a function of the data store type.
Type: Grant
Filed: June 19, 2016
Date of Patent: June 30, 2020
Assignee: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
-
Patent number: 10691710
Abstract: Various embodiments relate generally to data science and data analysis, and computer software and systems, to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more interfaces, such as user interfaces, may be implemented as computerized tools for presenting summarization of dataset attributes to facilitate discovery, formation, and analysis of interrelated collaborative datasets. In some examples, a method may include presenting data representing summary characteristic data in a user interface. This may include user interface elements each specifying a value of a dataset attribute for a collaborative dataset. Also, the method may include presenting aggregated data attributes for a subset of the collaborative dataset associated with the linked atomized datasets.
Type: Grant
Filed: March 9, 2017
Date of Patent: June 23, 2020
Assignee: data.world, Inc.
Inventors: Shad William Reynolds, David Lee Griffith, Jon Loyens, Bryon Kristen Jacob
-
Patent number: 10645548
Abstract: A method may relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets. One or more computerized tools may be configured to discover, form, and analyze via one or more layered data files, interrelations among a system of networked collaborative datasets. A method may include transforming of a set of data to an atomized format to form an atomized dataset that includes a derived dataset attribute. The method may also include presenting data representing an annotation at the user interface based on the derived dataset attribute. An annotation may be associated with a layer file.
Type: Grant
Filed: March 9, 2017
Date of Patent: May 5, 2020
Assignee: data.world, Inc.
Inventors: Shad William Reynolds, David Lee Griffith, Bryon Kristen Jacob
-
Publication number: 20200117665
Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema by converting the copy of the query into a triple associated with the query and another triple associated with the access control condition.
Type: Application
Filed: June 28, 2019
Publication date: April 16, 2020
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
-
Publication number: 20200117688
Abstract: A system and method for creating and editing graph schema data structures in a collaborative, real time graphical editing environment is disclosed. The system and method further operates to integrate external sources of data as components of the graph schema. The system and method outputs definition files for the graph schema data structure and mapping files that enable the population of a graph database.
Type: Application
Filed: June 7, 2019
Publication date: April 16, 2020
Applicant: data.world, Inc.
Inventors: Juan Federico Sequeda, Wayne Phillip Heideman, Daniel Paul Miranker
-
Publication number: 20200097504
Abstract: A system and method for creating and editing graph schema data structures in a collaborative, real time graphical editing environment is disclosed. The system and method further operates to integrate external sources of data as components of the graph schema. The system and method outputs definition files for the graph schema data structure and mapping files that enable the population of a graph database.
Type: Application
Filed: August 31, 2019
Publication date: March 26, 2020
Applicant: data.world, Inc.
Inventors: Juan Federico Sequeda, Wayne Phillip Heideman, Daniel Paul Miranker
-
Publication number: 20200074298
Abstract: Various techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving at a dataset access platform a query formatted according to a first data schema, generating a copy of the query, saving the query and the copy to a datastore, parsing the copy of the query in the first schema using an inference engine, determining whether the query comprises data associated with an access control condition associated with accessing the dataset, the access control condition being configured to indicate whether the query is permitted to access the dataset, and rewriting, using a proxy server, the copy of the query in a second schema by converting the copy of the query into a triple associated with the query and another triple associated with the access control condition.
Type: Application
Filed: June 28, 2019
Publication date: March 5, 2020
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
-
Publication number: 20200073865
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving data representing a query into a collaborative dataset consolidation system, identifying datasets relevant to the query, generating one or more queries to access disparate data repositories, and retrieving data representing query results. In some cases, one or more queries are applied (e.g.
Type: Application
Filed: August 31, 2018
Publication date: March 5, 2020
Applicant: Data.world, Inc.
Inventors: Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Brett A. Hurt, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen, Joseph Boutros, Alexander John Zelenak
-
Publication number: 20200034371
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby one or more computerized tools may be configured to discover, form, and analyze, for example, via one or more user interface applications, interrelations among a system of networked collaborative datasets. In some examples, a method may include causing transformation of a set of data to an atomized format to form an atomized dataset, monitoring creation of a dataset, and presenting data representing a status of a portion of the creation of the dataset. The status may depict an atomized dataset linked to at least one other dataset.
Type: Application
Filed: June 28, 2019
Publication date: January 30, 2020
Applicant: data.world, Inc.
Inventors: Shad William Reynolds, Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Triet Minh Le, Joseph Boutros
-
Patent number: 10515085
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a system may include a data ingestion controller configured to format datasets to form a first and a second atomized dataset, the second atomized dataset including the first atomized dataset and one or more other atomized datasets. The system may include a dataset query engine configured to identify a portion of a dataset relevant to a query, and to retrieve query results from at least one of different data repositories.
Type: Grant
Filed: June 19, 2016
Date of Patent: December 24, 2019
Assignee: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Arthur Albert Keen, Alexander John Zelenak, Jon Loyens, Brett A. Hurt, Shad William Reynolds, Joseph Boutros
-
Publication number: 20190370266
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset having a data format into a dataset ingestion controller configured to form a collaborative dataset, interpreting data of the dataset against data classifications at an inference engine to derive at least an inferred attribute, associating the data with annotative data identifying the inferred attribute, and converting the dataset at a format converter to form an atomized dataset.
Type: Application
Filed: March 4, 2019
Publication date: December 5, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
-
Publication number: 20190370262
Abstract: Various techniques are disclosed for computerized tools to discover, form, and analyze dataset interrelations among a system of networked collaborative datasets including a repository configured to receive and store a dataset, and a dataset consolidation system configured to receive data to form a first input to initiate creation of a dataset based on a set of data, to activate a programmatic interface, to transform the set of data from a first format to an atomized format to form an atomized dataset, to monitor the creation of the dataset, to present data representing a status of a portion of the creation of the dataset, to calculate automatically dataset attributes of the linked dataset, to generate a plurality of sub-queries, and to retrieve data representing query results from the at least one of the different data repositories.
Type: Application
Filed: May 31, 2019
Publication date: December 5, 2019
Applicant: data.world, Inc.
Inventors: Shad William Reynolds, Bryon Kristen Jacob, Jon Loyens, David Lee Griffith, Triet Minh Le, Joseph Boutros
-
Publication number: 20190370230
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset and dataset attributes and identifying a first version of the dataset. The method may include identifying data that varies from a first version of the dataset, and generating a second version of the dataset to include a first subset and a second subset of atomized data. The method may include storing subsets of atomized data points as an atomized dataset.
Type: Application
Filed: March 4, 2019
Publication date: December 5, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen
-
Publication number: 20190370481
Abstract: Techniques are described for platform management of integrated access of public and privately-accessible datasets utilizing federated query generation and query schema rewriting optimization, including receiving a query at a dataset access platform, generating a copy of the query, parsing the query to determine a format associated with the dataset and to identify whether an access control condition is required, rewriting, using a proxy server, the copy of the query using data formatted in a triples-based format into an optimized query having the access control condition in the triples-based format, configuring the optimized query to be transmitted to a location at which the dataset is stored, the optimized query being configured to pass the access control condition to gain authorization to retrieve the dataset, converting the dataset to the triples-based format, and rendering the dataset on an interface.
Type: Application
Filed: May 31, 2019
Publication date: December 5, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Shad William Reynolds, Arthur Albert Keen
-
Publication number: 20190361891
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to implement computerized tools to facilitate expedited queries based on query results generated by disparate computing and database architectures, according to at least some examples. For example, a method may include generating multiple results of queries, storing the multiple results of queries in a memory, and linking each of the multiple results of queries to a portion of data stored in a graph. The method can include receiving data representing a query and accessing data representing quiescent data values to form a query result.
Type: Application
Filed: May 6, 2019
Publication date: November 28, 2019
Applicant: data.world, Inc.
Inventors: David Lee Griffith, Shad William Reynolds
-
Publication number: 20190347347
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to interface among repositories of disparate datasets and computing machine-based entities configured to access datasets, and, more specifically, to a computing and data storage platform to implement predict data constraints to validate one or more portions of a dataset, according to at least some examples. For example, a method may include predicting a subset of constraint data to validate a graph-based data arrangement, and analyzing the graph-based data arrangement against a subset of constraint data to determine an action. At least one action may include validating data in a graph-based data arrangement. Also, the method may include integrating graph-based data arrangement into a graph data arrangement responsive to determining data representing a validation.
Type: Application
Filed: September 24, 2018
Publication date: November 14, 2019
Applicant: data.world, Inc.
Inventor: David Lee Griffith
-
Publication number: 20190347244
Abstract: Various embodiments relate generally to data science and data analysis, computer software and systems, and wired and wireless network communications to provide an interface between repositories of disparate datasets and computing machine-based entities that seek access to the datasets, and, more specifically, to a computing and data storage platform that facilitates consolidation of one or more datasets, whereby a collaborative data layer and associated logic facilitate, for example, efficient access to, and implementation of, collaborative datasets. In some examples, a method may include receiving a dataset and dataset attributes and identifying a first version of the dataset. The method may include identifying data that varies from a first version of the dataset, and generating a second version of the dataset to include a first subset and a second subset of atomized data. The method may include storing subsets of atomized data points as an atomized dataset.
Type: Application
Filed: February 8, 2019
Publication date: November 14, 2019
Applicant: data.world, Inc.
Inventors: Bryon Kristen Jacob, David Lee Griffith, Triet Minh Le, Jon Loyens, Brett A. Hurt, Arthur Albert Keen