Patents Assigned to Ab Initio Technology LLC
  • Patent number: 11409545
    Abstract: A method is described for processing keyed data items that are each associated with a value of a key, the keyed data items being from a plurality of distinct data streams, the processing including collecting the keyed data items, determining, based on contents of at least one of the keyed data items, satisfaction of one or more specified conditions for execution of one or more actions and causing execution of at least one of the one or more actions responsive to the determining.
    Type: Grant
    Filed: September 18, 2020
    Date of Patent: August 9, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Oded Ravid, Trevor Murphy, Larry Paul Rossi, Joel Gould
  • Publication number: 20220245176
    Abstract: A data processing system that receives user input specifying datasets on which operations are performed with user interfaces that enable manipulation of hierarchical groups of datasets. A user interface may enable individual datasets or a previously defined group of datasets to be aggregated into another grouping. The groupings may be scoped, including by persona of users, such that, when a user is prompted to specify one or more datasets as a target of an operation by the data processing system, the available choices are limited to datasets that have a scope encompassing that user. The interfaces may prompt a user to select a grouping within the hierarchy that contains datasets on which the operation can be performed. Upon selection of a grouping with multiple datasets as a target of an operation that is performed on datasets singly, the operation may be performed on each dataset in the selected group.
    Type: Application
    Filed: January 31, 2022
    Publication date: August 4, 2022
    Applicant: Ab Initio Technology LLC
    Inventor: Amit Weisman
  • Publication number: 20220245154
    Abstract: Techniques for storing data entities by a data processing system are described herein. The data processing system may store a plurality of data entity instances generated using a plurality of data entities. The plurality of data entity instances may include a first data entity instance generated using a first data entity and a second data entity instance generated using a second data entity. The first data entity instance may include a first attribute that is configured to inherit its value from a second attribute of the second data entity instance. The data processing system may provide the inherited value of the second attribute of the second data entity instance as the value of the first attribute of the first data entity instance.
    Type: Application
    Filed: January 28, 2022
    Publication date: August 4, 2022
    Applicant: Ab Initio Technology LLC.
    Inventors: Halldor Isak Gylfason, Robert Parks, Dusan Radivojevic, Adam Harris Weiss
  • Publication number: 20220245140
    Abstract: Techniques for obtaining information about data entity instances managed by a data processing system using at least one data store. The techniques include obtaining a query comprising a first portion comprising information for identifying instances of a first data entity stored in at least one data store; and a second portion indicating at least one attribute of the first data entity; generating, from the query, a plurality of executable queries including a first set of one or more executable queries and a second set of one or more executable queries, the generating comprising: generating, using the first portion, the first set of executable queries for identifying instances of the first data entity, and generating, using the second portion, the second set of executable queries for obtaining attribute values for instances of the first data entity; and executing the plurality of executable queries to obtain results for the query.
    Type: Application
    Filed: January 28, 2022
    Publication date: August 4, 2022
    Applicant: Ab Initio Technology LLC
    Inventors: Halldor Isak Gylfason, Robert Parks
  • Publication number: 20220245125
    Abstract: A data processing system with a dataset multiplexer that enables applications to be written to specify access to datasets as operations on logical datasets. During execution of an application by the data processing system, operations that access a dataset are implemented by accessing an entry in a dataset catalog for the logical dataset. That entry includes information to access the physical data source storing the logical dataset, including conversion of data from the format of the physical data source to the format of the logical dataset. An entry in the catalog may be created based on registration of a data source with the dataset multiplexer and may be updated automatically based on changes in storage of the dataset. This maintenance of the catalog may be partially or totally automated such that the system automatically adjusts to any changes in storage of the dataset without need for modification of any application.
    Type: Application
    Filed: January 31, 2022
    Publication date: August 4, 2022
    Applicant: Ab Initio Technology LLC
    Inventors: Amit Weisman, Cory Christopher James Fantasia, Matthew Douglas Becker, Ian Robert Schechter, Edward Alan Bach, Robert Parks
  • Patent number: 11403308
    Abstract: In a first aspect, a method includes, at a node of a Hadoop cluster, the node storing a first portion of data in HDFS data storage, executing a first instance of a data processing engine capable of receiving data from a data source external to the Hadoop cluster, receiving a computer-executable program by the data processing engine, executing at least part of the program by the first instance of the data processing engine, receiving, by the data processing engine, a second portion of data from the external data source, storing the second portion of data other than in HDFS storage, and performing, by the data processing engine, a data processing operation identified by the program using at least the first portion of data and the second portion of data.
    Type: Grant
    Filed: May 4, 2020
    Date of Patent: August 2, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Ian Schechter, Tim Wakeling, Ann M. Wollrath
  • Patent number: 11372631
    Abstract: A method for generating an updated graph based on a base graph includes identifying differences between a base graph and each of a source and a target graph. The specification of each graph defines nodes representing data processing components defining operations for processing data, and links representing flow of data. The identifying includes: comparing specifications of base and source graphs to identify a first set of differences between the base and source graph, and comparing specifications of base and target graphs to identify a second set of differences between the base and the target graph. The method includes characterizing a correspondence between the differences in the first set and the differences in the second set, determining a set of combined differences based on the correspondence and the first and second sets of differences, and generating the updated graph based on the base graph and the set of combined differences.
    Type: Grant
    Filed: March 30, 2021
    Date of Patent: June 28, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Edward Alan Bach, Richard A. Shapiro
  • Patent number: 11354346
    Abstract: In general, metadata is stored in a data storage system. Summary data identifying one or more characteristics of each of multiple metadata objects stored in the data storage system is computed, and the summary data characterizing a given metadata object in association with the given metadata object is stored. A visual representation is generated of a diagram including nodes representing respective metadata objects and relationships among the nodes. Generating the visual representation includes superimposing a representation of a characteristic identified by the summary data characterizing a given metadata object in proximity to the node representing the given metadata object.
    Type: Grant
    Filed: January 23, 2019
    Date of Patent: June 7, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Erik Bator, Joel Gould, Dusan Radivojevic
  • Patent number: 11347484
    Abstract: A method includes analyzing, by a processor, a first version of a computer program, the analyzing including identifying a first process included in the first version of the computer program, the first process configured to perform an operation on data having a first format; and by a processor, generating a second version of at least a portion of the computer program, including omitting the first process and including in the second version of the at least portion of the computer program one or more second processes configured to perform a second operation on data of a second format different from the first format, wherein the second operation is based on the first operation.
    Type: Grant
    Filed: February 25, 2021
    Date of Patent: May 31, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Marshall A. Isman, John Joyce
  • Patent number: 11341116
    Abstract: According to some aspects, a data processing system is provided, the data processing system comprising at least one computer readable medium comprising processor-executable instructions that, when executed, cause the at least one processor to receive, through at least one user interface, input indicating a data element and one or more data quality metrics, identify, based on relationship information associated with the data element and/or the one or more data quality metrics, one or more datasets, one or more fields of the one or more datasets, and one or more data quality rules, each of the data quality rules being associated with at least one of the one or more fields, and perform an analysis of data quality of the one or more fields based at least in part on the one or more data quality rules associated with the one or more fields.
    Type: Grant
    Filed: November 9, 2018
    Date of Patent: May 24, 2022
    Assignee: Ab Initio Technology LLC
    Inventor: Hugh F. Pyle
  • Patent number: 11341155
    Abstract: Mapping data stored in a data storage system for use by a computer system includes processing specifications of dataflow graphs that include nodes representing computations interconnected by links representing flows of data. At least one of the dataflow graphs receives a flow of data from at least one input dataset and at least one of the dataflow graphs provides a flow of data to at least one output dataset. A mapper identifies one or more sets of datasets. Each dataset in a given set matches one or more criteria for identifying different versions of a single dataset. A user interface is provided to receive a mapping between at least two datasets in a given set. The mapping received over the user interface is stored in association with a dataflow graph that provides data to or receives data from the datasets of the mapping.
    Type: Grant
    Filed: June 16, 2020
    Date of Patent: May 24, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Tim Wakeling, Adam Weiss
  • Patent number: 11340910
    Abstract: A method includes receiving a request to process a set of data using a data processing application. The method includes, based on a feature associated with the set of data, selecting between (i) a first mode in which one or more running processes of the data processing application are used to process the set of data and (2) a second mode in which one or more new processes of the data processing application are started up. The method includes causing the data processing application to be executed according to the selected mode to process the set of data.
    Type: Grant
    Filed: April 10, 2019
    Date of Patent: May 24, 2022
    Assignee: Ab Initio Technology LLC
    Inventor: Ben Fisher
  • Patent number: 11334536
    Abstract: A method for applying a rule to data includes receiving data from a data source and processing a particular data unit of an ordered number of data units derived at least in part from the received data. The processing includes determining a set of data units including an ordered subset of data units from the ordered number of data units, each data unit of the ordered subset being related to the particular data unit and occurring before the particular data unit in the ordered number of data units. Prior to applying the rule the state variable is updated to a state that would result from iterating in order through the set of data units and, for each data unit of the ordered subset, applying the rule to the data unit. The rule is then applied to the particular data unit including reading the updated value of the state variable.
    Type: Grant
    Filed: January 28, 2019
    Date of Patent: May 17, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Scott Studer, Amit Weisman, David Phillimore
  • Patent number: 11308161
    Abstract: Among other things, we describe a computer-implemented method, performed by a data processing system, of executing a computer program based on a query that is expressed in accordance with a query language applicable to a relational database, the computer program executed based at least in part on data stored in a tangible, non-transitory computer-readable medium, the executing including receiving a SQL query, where the SQL query includes an identifier associated with a resource that is external to the data processing system, generating a computer program based on the SQL query, and executing the computer program, causing transmitting one or more instructions to the resource, the instructions defining operations other than operations of the SQL query, and receiving data from the resource in response to the instructions.
    Type: Grant
    Filed: August 16, 2019
    Date of Patent: April 19, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Ian Schechter, Glenn John Allin
  • Patent number: 11301445
    Abstract: A graph-based program specification includes: a plurality of components, each corresponding to a processing task and including one or more ports for sending or receiving one or more data elements; and one or more links, each connecting an output port of an upstream component of the plurality of components to an input port of a downstream component of the plurality of components. Prepared code is generated representing subsets of the plurality of components, including: identifying a plurality of subset boundaries between components in different subsets based at least in part on characteristics of linked components; forming the subsets based on the identified subset boundaries; and generating prepared code for each formed subset that when used for execution by a runtime system causes processing tasks corresponding to the components in that formed subset to be performed according to information embedded in the prepared code for that formed subset.
    Type: Grant
    Filed: December 3, 2019
    Date of Patent: April 12, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Craig W. Stanfill, Richard Shapiro, Stephen A. Kukolich
  • Patent number: 11295049
    Abstract: A method implemented by a data processing system for processing data items of a stream of data items, including: accessing a specification that represents the executable logic, wherein a state of the specification for a particular value of the key specifies one or more portions of the executable logic that are executable in that state; receiving, over an input device or port, data items of a stream of data; for a first one of the data items of the stream, identifying a first state of the specification for a value of the key associated with that first one of the data items; processing, by the data processing system, the first one of the data items according to one or more portions of executable logic that are represented in the specification as being associated with the first state.
    Type: Grant
    Filed: February 3, 2020
    Date of Patent: April 5, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Joel Gould, Scott Studer, Craig W. Stanfill
  • Patent number: 11294874
    Abstract: An approach to structured record retrieval permits transmission and storage of records in a native concise format, without requiring that the records be interpreted and stored in a tabular form. Such storage of the records in a tabular form might double the space required, and more generally, requires substantially more space in applications in which there are many optional elements. In some embodiments, each message is parsed according to a specification of the message structure (e.g., according to a “grammar” for the message), and during parsing field values in predefined positions in the structure are extracted and added to an index structure that associates record identifiers with the (position, value) pairs.
    Type: Grant
    Filed: July 25, 2019
    Date of Patent: April 5, 2022
    Assignee: Ab Initio Technology LLC
    Inventor: Taro Ikai
  • Publication number: 20220100719
    Abstract: Techniques for allowing a user to invoke lifecycle operations on a data store configured in accordance with a data management scheme, without knowledge of the data management scheme. Techniques include receiving user input from a user interface; identifying, in the received user input, an indication of a first lifecycle operation corresponding to a first entry in a template; based on information in the first entry, identifying in the received user input a first configuration value associated with the first lifecycle operation; formatting the first configuration value as at least a portion of at least one argument to a first command; and invoking, via the first command with the at least one argument, a first computer program configured to perform the first lifecycle operation on the data store.
    Type: Application
    Filed: September 20, 2021
    Publication date: March 31, 2022
    Applicant: Ab Initio Technology LLC
    Inventors: Ian Robert Schechter, David E. Wald
  • Patent number: 11288258
    Abstract: A method implemented by a data processing system including: executing a dataflow graph that includes the plurality of components and the links, with a given component of the plurality including an input port, an audit port and an output port; processing, by the dataflow graph with the components and the links, the one or more data records representing the transaction, wherein the at least one of the components saves a state specifying one or more input records that are processed by the at least one of the components; when an error occurs during processing of one or more input records by the given component, restoring a state of the at least one of the components to the saved state; and based on the restored state, recovering at least some of the audit data for the given component of the dataflow graph.
    Type: Grant
    Filed: September 25, 2019
    Date of Patent: March 29, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Trevor Murphy, Paul Veiser
  • Patent number: 11288290
    Abstract: Methods, systems, and apparatus, including computer programs encoded on computer storage media, for building reports. One of the methods includes creating a model based on relational structured data, the structured data including data structures, each data structure having data elements, each data element having fields, each field having a name. The method includes generating a hierarchy of objects in model, the hierarchy organizing objects the with respect to a starting object according to relationship fields on the objects. The method includes generating a user interface including elements for one or more of the objects in the hierarchy, wherein the user interface enables a user to create a report and filter the report using the new name. The method includes receiving a user selection of an element from the elements. The method also includes generating a report.
    Type: Grant
    Filed: May 20, 2015
    Date of Patent: March 29, 2022
    Assignee: Ab Initio Technology LLC
    Inventors: Dusan Radivojevic, Peter Moss