Patents Assigned to Ab Initio Technology LLC
-
Patent number: 11409545Abstract: A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.Type: GrantFiled: September 18, 2020Date of Patent: August 9, 2022Assignee: Ab Initio Technology LLCInventors: Oded Ravid, Trevor Murphy, Larry Paul Rossi, Joel Gould
-
Publication number: 20220245176Abstract: A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.Type: ApplicationFiled: January 31, 2022Publication date: August 4, 2022Applicant: Ab Initio Technology LLCInventor: Amit Weisman
-
Publication number: 20220245154Abstract: Techniques for storing data entities by a data processing system are described herein. The data processing system may store a plurality of data entity instances generated using a plurality of data entities. The plurality of data entity instances may include a first data entity instance generated using a first data entity and a second data entity instance generated using a second data entity. The first data entity instance may include a first attribute that is configured to inherit its value from a second attribute of the second data entity instance. The data processing system may provide the inherited value of the second attribute of the second data entity instance as the value of the first attribute of the first data entity instance.Type: ApplicationFiled: January 28, 2022Publication date: August 4, 2022Applicant: Ab Initio Technology LLC.Inventors: Halldor Isak Gylfason, Robert Parks, Dusan Radivojevic, Adam Harris Weiss
-
Publication number: 20220245140Abstract: Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.Type: ApplicationFiled: January 28, 2022Publication date: August 4, 2022Applicant: Ab Initio Technology LLCInventors: Halldor Isak Gylfason, Robert Parks
-
Publication number: 20220245125Abstract: A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, operations that access a dataset are implemented by accessing an entry in a dataset catalog for the logical dataset. That entry includes information to access the physical data source storing the logical dataset, including conversion of data from the format of the physical data source to the format of the logical dataset. An entry in the catalog may be created based on registration of a data source with the dataset multiplexer and may be updated automatically based on changes in storage of the dataset. This maintenance of the catalog may be partially or totally automated such that the system automatically adjusts to any changes in storage of the dataset without need for modification of any application.Type: ApplicationFiled: January 31, 2022Publication date: August 4, 2022Applicant: Ab Initio Technology LLCInventors: Amit Weisman, Cory Christopher James Fantasia, Matthew Douglas Becker, Ian Robert Schechter, Edward Alan Bach, Robert Parks
-
Patent number: 11403308Abstract: In a first aspect, a method includes, at a node of a Hadoop cluster, the node storing a first portion of data in HDFS data storage, executing a first instance of a data processing engine capable of receiving data from a data source external to the Hadoop cluster, receiving a computer-executable program by the data processing engine, executing at least part of the program by the first instance of the data processing engine, receiving, by the data processing engine, a second portion of data from the external data source, storing the second portion of data other than in HDFS storage, and performing, by the data processing engine, a data processing operation identified by the program using at least the first portion of data and the second portion of data.Type: GrantFiled: May 4, 2020Date of Patent: August 2, 2022Assignee: Ab Initio Technology LLCInventors: Ian Schechter, Tim Wakeling, Ann M. Wollrath
-
Patent number: 11372631Abstract: A method for generating an updated graph based on a base graph includes identifying differences between a base graph and each of a source and a target graph. The specification of each graph defines nodes representing data processing components defining operations for processing data, and links representing flow of data. The identifying includes: comparing specifications of base and source graphs to identify a first set of differences between the base and source graph, and comparing specifications of base and target graphs to identify a second set of differences between the base and the target graph. The method includes characterizing a correspondence between the differences in the first set and the differences in the second set, determining a set of combined differences based on the correspondence and the first and second sets of differences, and generating the updated graph based on the base graph and the set of combined differences.Type: GrantFiled: March 30, 2021Date of Patent: June 28, 2022Assignee: Ab Initio Technology LLCInventors: Edward Alan Bach, Richard A. Shapiro
-
Patent number: 11354346Abstract: In general, metadata is stored in a data storage system. Summary data identifying one or more characteristics of each of multiple metadata objects stored in the data storage system is computed, and the summary data characterizing a given metadata object in association with the given metadata object is stored. A visual representation is generated of a diagram including nodes representing respective metadata objects and relationships among the nodes. Generating the visual representation includes superimposing a representation of a characteristic identified by the summary data characterizing a given metadata object in proximity to the node representing the given metadata object.Type: GrantFiled: January 23, 2019Date of Patent: June 7, 2022Assignee: Ab Initio Technology LLCInventors: Erik Bator, Joel Gould, Dusan Radivojevic
-
Patent number: 11347484Abstract: A method includes analyzing, by a processor, a first version of a computer program, the analyzing including identifying a first process included in the first version of the computer program, the first process configured to perform an operation on data having a first format; and by a processor, generating a second version of at least a portion of the computer program, including omitting the first process and including in the second version of the at least portion of the computer program one or more second processes configured to perform a second operation on data of a second format different from the first format, wherein the second operation is based on the first operation.Type: GrantFiled: February 25, 2021Date of Patent: May 31, 2022Assignee: Ab Initio Technology LLCInventors: Marshall A. Isman, John Joyce
-
Patent number: 11341116Abstract: According to some aspects, a data processing system is provided, the data processing system comprising at least one computer readable medium comprising processor-executable instructions that, when executed, cause the at least one processor to receive, through at least one user interface, input indicating a data element and one or more data quality metrics, identify, based on relationship information associated with the data element and/or the one or more data quality metrics, one or more datasets, one or more fields of the one or more datasets, and one or more data quality rules, each of the data quality rules being associated with at least one of the one or more fields, and perform an analysis of data quality of the one or more fields based at least in part on the one or more data quality rules associated with the one or more fields.Type: GrantFiled: November 9, 2018Date of Patent: May 24, 2022Assignee: Ab Initio Technology LLCInventor: Hugh F. Pyle
-
Patent number: 11341155Abstract: Mapping data stored in a data storage system for use by a computer system includes processing specifications of dataflow graphs that include nodes representing computations interconnected by links representing flows of data. At least one of the dataflow graphs receives a flow of data from at least one input dataset and at least one of the dataflow graphs provides a flow of data to at least one output dataset. A mapper identifies one or more sets of datasets. Each dataset in a given set matches one or more criteria for identifying different versions of a single dataset. A user interface is provided to receive a mapping between at least two datasets in a given set. The mapping received over the user interface is stored in association with a dataflow graph that provides data to or receives data from the datasets of the mapping.Type: GrantFiled: June 16, 2020Date of Patent: May 24, 2022Assignee: Ab Initio Technology LLCInventors: Tim Wakeling, Adam Weiss
-
Patent number: 11340910Abstract: A method includes receiving a request to process a set of data using a data processing application. The method includes, based on a feature associated with the set of data, selecting between (i) a first mode in which one or more running processes of the data processing application are used to process the set of data and (2) a second mode in which one or more new processes of the data processing application are started up. The method includes causing the data processing application to be executed according to the selected mode to process the set of data.Type: GrantFiled: April 10, 2019Date of Patent: May 24, 2022Assignee: Ab Initio Technology LLCInventor: Ben Fisher
-
Patent number: 11334536Abstract: A method for applying a rule to data includes receiving data from a data source and processing a particular data unit of an ordered number of data units derived at least in part from the received data. The processing includes determining a set of data units including an ordered subset of data units from the ordered number of data units, each data unit of the ordered subset being related to the particular data unit and occurring before the particular data unit in the ordered number of data units. Prior to applying the rule the state variable is updated to a state that would result from iterating in order through the set of data units and, for each data unit of the ordered subset, applying the rule to the data unit. The rule is then applied to the particular data unit including reading the updated value of the state variable.Type: GrantFiled: January 28, 2019Date of Patent: May 17, 2022Assignee: Ab Initio Technology LLCInventors: Scott Studer, Amit Weisman, David Phillimore
-
Patent number: 11308161Abstract: Among other things, we describe a computer-implemented method, performed by a data processing system, of executing a computer program based on a query that is expressed in accordance with a query language applicable to a relational database, the computer program executed based at least in part on data stored in a tangible, non-transitory computer-readable medium, the executing including receiving a SQL query, where the SQL query includes an identifier associated with a resource that is external to the data processing system, generating a computer program based on the SQL query, and executing the computer program, causing transmitting one or more instructions to the resource, the instructions defining operations other than operations of the SQL query, and receiving data from the resource in response to the instructions.Type: GrantFiled: August 16, 2019Date of Patent: April 19, 2022Assignee: Ab Initio Technology LLCInventors: Ian Schechter, Glenn John Allin
-
Patent number: 11301445Abstract: A graph-based program specification includes: a plurality of components, each corresponding to a processing task and including one or more ports for sending or receiving one or more data elements; and one or more links, each connecting an output port of an upstream component of the plurality of components to an input port of a downstream component of the plurality of components. Prepared code is generated representing subsets of the plurality of components, including: identifying a plurality of subset boundaries between components in different subsets based at least in part on characteristics of linked components; forming the subsets based on the identified subset boundaries; and generating prepared code for each formed subset that when used for execution by a runtime system causes processing tasks corresponding to the components in that formed subset to be performed according to information embedded in the prepared code for that formed subset.Type: GrantFiled: December 3, 2019Date of Patent: April 12, 2022Assignee: Ab Initio Technology LLCInventors: Craig W. Stanfill, Richard Shapiro, Stephen A. Kukolich
-
Patent number: 11295049Abstract: A method implemented by a data processing system for processing data items of a stream of data items, including: accessing a specification that represents the executable logic, wherein a state of the specification for a particular value of the key specifies one or more portions of the executable logic that are executable in that state; receiving, over an input device or port, data items of a stream of data; for a first one of the data items of the stream, identifying a first state of the specification for a value of the key associated with that first one of the data items; processing, by the data processing system, the first one of the data items according to one or more portions of executable logic that are represented in the specification as being associated with the first state.Type: GrantFiled: February 3, 2020Date of Patent: April 5, 2022Assignee: Ab Initio Technology LLCInventors: Joel Gould, Scott Studer, Craig W. Stanfill
-
Patent number: 11294874Abstract: An approach to structured record retrieval permits transmission and storage of records in a native concise format, without requiring that the records be interpreted and stored in a tabular form. Such storage of the records in a tabular form might double the space required, and more generally, requires substantially more space in applications in which there are many optional elements. In some embodiments, each message is parsed according to a specification of the message structure (e.g., according to a “grammar” for the message), and during parsing field values in predefined positions in the structure are extracted and added to an index structure that associates record identifiers with the (position, value) pairs.Type: GrantFiled: July 25, 2019Date of Patent: April 5, 2022Assignee: Ab Initio Technology LLCInventor: Taro Ikai
-
Publication number: 20220100719Abstract: Techniques for allowing a user to invoke lifecycle operations on a data store configured in accordance with a data management scheme, without knowledge of the data management scheme. Techniques include receiving user input from a user interface; identifying, in the received user input, an indication of a first lifecycle operation corresponding to a first entry in a template; based on information in the first entry, identifying in the received user input a first configuration value associated with the first lifecycle operation; formatting the first configuration value as at least a portion of at least one argument to a first command; and invoking, via the first command with the at least one argument, a first computer program configured to perform the first lifecycle operation on the data store.Type: ApplicationFiled: September 20, 2021Publication date: March 31, 2022Applicant: Ab Initio Technology LLCInventors: Ian Robert Schechter, David E. Wald
-
Patent number: 11288258Abstract: A method implemented by a data processing system including: executing a dataflow graph that includes the plurality of components and the links, with a given component of the plurality including an input port, an audit port and an output port; processing, by the dataflow graph with the components and the links, the one or more data records representing the transaction, wherein the at least one of the components saves a state specifying one or more input records that are processed by the at least one of the components; when an error occurs during processing of one or more input records by the given component, restoring a state of the at least one of the components to the saved state; and based on the restored state, recovering at least some of the audit data for the given component of the dataflow graph.Type: GrantFiled: September 25, 2019Date of Patent: March 29, 2022Assignee: Ab Initio Technology LLCInventors: Trevor Murphy, Paul Veiser
-
Patent number: 11288290Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for building reports. One of the methods includes creating a model based on relational structured data, the structured data including data structures, each data structure having data elements, each data element having fields, each field having a name. The method includes generating a hierarchy of objects in model, the hierarchy organizing objects the with respect to a starting object according to relationship fields on the objects. The method includes generating a user interface including elements for one or more of the objects in the hierarchy, wherein the user interface enables a user to create a report and filter the report using the new name. The method includes receiving a user selection of an element from the elements. The method also includes generating a report.Type: GrantFiled: May 20, 2015Date of Patent: March 29, 2022Assignee: Ab Initio Technology LLCInventors: Dusan Radivojevic, Peter Moss