Patents Assigned to Ab Initio Technology LLC
  • Publication number: 20150169428
    Abstract: A method includes receiving data indicative of a number of times each of one or more rules was executed by a data processing application during processing of one or more records; based on the number of times each of the rules was executed by the data processing application, determining a content criterion for each of one or more particular fields; generating content for each of the particular fields based on the content criterion; and populating each of the particular fields with the generated content.
    Type: Application
    Filed: December 17, 2014
    Publication date: June 18, 2015
    Applicant: Ab Initio Technology LLC
    Inventors: Marshall A. Isman, Richard Alan Epstein
  • Publication number: 20150169687
    Abstract: In one aspect, in general, a method of generating a dataflow graph representing a database query includes receiving a query plan from a plan generator, the query plan representing operations for executing a database query on at least one input representing a source of data, producing a dataflow graph from the query plan, wherein the dataflow graph includes at least one node that represents at least one operation represented by the query plan, and includes at least one link that represents at least one dataflow associated with the query plan, and altering one or more components of the dataflow graph based on at least one characteristic of the at least one input representing the source of data.
    Type: Application
    Filed: February 23, 2015
    Publication date: June 18, 2015
    Applicant: Ab Initio Technology LLC
    Inventors: Ian Schechter, Glenn John Allin
  • Patent number: 9037589
    Abstract: Received data records, each including one or more values in one or more fields, are processed to identify one or more data clusters. The processing includes: identifying tokens that each include at least one value or fragment of a value in a field or a combination of fields; generating a network representing the identified tokens, with nodes of the network representing tokens and edges of the network each representing a variant relationship between tokens; and generating a graphical representation of the network with different subsets of nodes distinguished based at least in part on values associated with nodes, where a value associated with a particular node quantifies a count of a number of instances of the token represented by that particular node appearing within the received data records.
    Type: Grant
    Filed: November 15, 2012
    Date of Patent: May 19, 2015
    Assignee: Ab Initio Technology LLC
    Inventor: Arlen Anderson
  • Patent number: 9031895
    Abstract: Processing metadata includes storing, in a data storage system, a specification for each of multiple sources, each specification including information identifying one or more data elements of the corresponding source; and processing, in a data processing system coupled to the data storage system, data elements from the sources, including generating a set of rules for each source based on a corresponding one of the stored specifications, and matching data elements of different sources and determining a quality metric characterizing a given match between a first data element of a first source and a second data element of a second source according to the set of rules generated for the first source and the set of rules generated for the second source.
    Type: Grant
    Filed: January 13, 2011
    Date of Patent: May 12, 2015
    Assignee: Ab Initio Technology LLC
    Inventor: Andrew Schon
  • Patent number: 9021299
    Abstract: Techniques are disclosed that include a computer-implemented method, including storing information related to an initial state of a process upon being initialized, wherein execution of the process includes executing at least one execution phase and upon completion of the executing of the execution phase storing information representative of an end state of the execution phase; aborting execution of the process in response to a predetermined event; and resuming execution of the process from one of the saved initial and end states without needing to shut down the process.
    Type: Grant
    Filed: February 18, 2011
    Date of Patent: April 28, 2015
    Assignee: Ab Initio Technology LLC
    Inventors: Bryan Phil Douros, Joseph Skeffington Wholey, III
  • Publication number: 20150106341
    Abstract: Processing data includes profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data. The data is then processed from the data source. This processing includes accessing the stored profile information and processing the data according to the accessed profile information.
    Type: Application
    Filed: October 20, 2014
    Publication date: April 16, 2015
    Applicant: Ab Initio Technology LLC
    Inventors: Joel Gould, Carl Richard Feynman, Paul Bay
  • Patent number: 9002770
    Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.
    Type: Grant
    Filed: August 31, 2012
    Date of Patent: April 7, 2015
    Assignee: Ab Initio Technology LLC
    Inventor: Joel Gould
  • Patent number: 9003084
    Abstract: Systems and techniques are disclosed that include in one aspect a computer implemented method storing a received stream of data elements in a buffer, applying a boundary condition to the data elements stored in the buffer after receiving each individual data element of the stream of data elements, and producing one or more data elements from the buffer based on the boundary condition as an output stream of data elements sorted according to a predetermined order.
    Type: Grant
    Filed: February 18, 2011
    Date of Patent: April 7, 2015
    Assignee: Ab Initio Technology LLC
    Inventors: Craig W. Stanfill, Carl Richard Feynman
  • Patent number: 8996442
    Abstract: Disclosed is a method including receiving a rule having at least one rule case for producing an output value based on one or more input values, generating a transform for receiving data from an input dataset and transforming the data based on the rule including producing a first series of values for at least one output variable in an output dataset, at least one value in the first series of values including a second series of values, and providing an output field corresponding to the at least one output variable in the output dataset for storing the second series of values.
    Type: Grant
    Filed: June 27, 2013
    Date of Patent: March 31, 2015
    Assignee: AB Initio Technology LLC
    Inventors: Joel Gould, Scott Studer
  • Patent number: 8954482
    Abstract: Processing data elements associated with a hierarchy that includes multiple levels includes: forming multiple flows of data elements, with each flow being associated with a level of the hierarchy; and including within at least some of the flows hierarchical structure information associated with a level of the hierarchy. For at least one of the levels, two or more and fewer than all of the flows include hierarchical structure information associated with that level.
    Type: Grant
    Filed: September 19, 2008
    Date of Patent: February 10, 2015
    Assignee: Ab Initio Technology LLC
    Inventors: Craig W. Stanfill, Joseph Skeffington Wholey, III
  • Patent number: 8949189
    Abstract: Managing data by: receiving a group of individually accessible data units, each data unit identified by a key value, with key values determined such that the key value identifying a first data unit received before a second data unit occurs earlier in a sort order than the key value identifying the second data unit; and processing the data units for storage in a data storage system. The processing includes: storing blocks of data, the blocks being generated by combining a plurality of the data units; providing an index with entries that enable location, based on a provided key value, of a block that includes a data unit corresponding to the provided key value; and generating one or more screening data structures associated with the blocks for determining, based on a given key value, whether to search the stored blocks for a data unit corresponding to the given key value.
    Type: Grant
    Filed: July 15, 2013
    Date of Patent: February 3, 2015
    Assignee: Ab Initio Technology LLC
    Inventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
  • Patent number: 8930337
    Abstract: Among other things, one aspect includes receiving one or more mapped relationships between a given output and one or more inputs represented by input variables, at least one of the mapped relationships including a transformational expression, the transformational expression defining an output of a mapped relationship based on at least one input variable mapped to an element of an input dataset; receiving identification of elements of an output dataset mapped to outputs of respective mapped relationships; generating output data according to the transformational expression based on input data from the input dataset associated with the element of the input dataset mapped to the input variable; determining validation information in response to the generated output data based on validation criteria defining one or more characteristics of valid values associated with one or more of the identified elements of the output dataset; and presenting visual feedback based on the determined validation information.
    Type: Grant
    Filed: July 25, 2013
    Date of Patent: January 6, 2015
    Assignee: Ab Initio Technology LLC
    Inventors: Scott Studer, Amit Weisman
  • Publication number: 20140344508
    Abstract: Processing a plurality of data units to generate result information, includes: performing a data operation for each data unit of a first subset of data units from the plurality of data units, and storing information associated with a result of the data operation in a first set of one or more data structures stored in working memory space of a memory device; after an overflow condition on the working memory space is satisfied, storing information in overflow storage space of a storage device; and repeating an overflow processing procedure multiple times during the processing of the plurality of data units, the overflow processing procedure including: updating a new set of one or more data structures stored in the working memory space using at least some information stored in the overflow storage space.
    Type: Application
    Filed: May 16, 2014
    Publication date: November 20, 2014
    Applicant: Ab Initio Technology LLC
    Inventors: Muhammad Arshad Khan, Stephen G. Rybicki, Joel Gould
  • Patent number: 8875145
    Abstract: Processing data includes: receiving units of work that each include one or more work elements, and processing a first unit of work using a first compiled dataflow graph (160) loaded into a data processing system (100) in response to receiving the first unit of work. The processing includes: analysis to determine a characteristic of the first unit of work; identifying one or more compiled dataflow graphs from graphs stored in a data storage system (107) that include at least some that were compiled for processing a unit of work having the determined characteristic; loading one of the identified compiled dataflow graphs into the data processing system (100) as the first compiled dataflow graph (160); and generating one or more output work elements from at least one work element in the first unit of work.
    Type: Grant
    Filed: June 15, 2011
    Date of Patent: October 28, 2014
    Assignee: Ab Initio Technology LLC
    Inventors: Matthew Darcy Atterbury, H. Mark Bromley, Wayne Mesard, Arkadi Popov, Stephen Schmidt, Craig W. Stanfill, Joseph Skeffington Wholey, III
  • Patent number: 8866817
    Abstract: An approach to automatically specifying, or assisting with the specification of, a parallel computation graph involves determining data processing characteristics of the linking elements that couple data processing elements of the graph. The characteristics of the linking elements are determined according to the characteristics of the upstream and/or downstream data processing elements associated with the linking element, for example, to enable computation by the parallel computation graph that is equivalent to computation of an associated serial graph.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: October 21, 2014
    Assignee: Ab Initio Technology LLC
    Inventor: Craig W. Stanfill
  • Patent number: 8868580
    Abstract: Processing data includes profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data. The data is then processed from the data source. This processing includes accessing the stored profile information and processing the data according to the accessed profile information.
    Type: Grant
    Filed: September 15, 2004
    Date of Patent: October 21, 2014
    Assignee: Ab Initio Technology LLC
    Inventors: Joel Gould, Carl Feynman, Paul Bay
  • Publication number: 20140282418
    Abstract: Among other things, a method includes, at a computer system on which one or more computer programs are executing, receiving a specification defining types of state information, receiving an indication that an event associated with at least one of the computer programs has occurred, the event associated with execution of a function of the computer program, collecting state information describing the state of the execution of the computer program when the event occurred, generating an entry corresponding to the event, the entry including elements of the collected state information, the elements of state information formatted according to the specification, and storing the entry. The log can be parsed to generate a visualization of computer program execution.
    Type: Application
    Filed: January 16, 2014
    Publication date: September 18, 2014
    Applicant: Ab Initio Technology LLC
    Inventors: Joseph Stuart Wood, Robert Freundlich
  • Publication number: 20140258651
    Abstract: A system for managing storage of data units includes a data storage system configured to store multiple data blocks, at least some of the data blocks containing multiple data units, and configured to store, for at least some of the data blocks, corresponding historical information about prior removal of one or more data units from that data block, the removal affecting at least some addresses of data units in that data block. The system is configured to perform at least one operation that accesses at least a first data unit stored in a first data block according to address information interpreted based on any stored historical information corresponding to the first data block.
    Type: Application
    Filed: March 6, 2013
    Publication date: September 11, 2014
    Applicant: Ab Initio Technology LLC
    Inventors: Ephraim Meriwether Vishniac, Stephen J. Schmidt
  • Publication number: 20140258652
    Abstract: A system for managing storage of data units includes a data storage system configured to store multiple data blocks, at least some of the data blocks containing multiple data units, with at least a group of the data blocks being stored contiguously, thereby supporting a first read operation that retrieves data units from at least two adjacent data blocks in the group. The system is configured to perform one or more operations with respect to data units, the operations including a delete operation that replaces a first data block containing a data unit to be deleted with a second data block that does not contain the deleted data unit, with the second data block having the same size as the first data block.
    Type: Application
    Filed: March 6, 2013
    Publication date: September 11, 2014
    Applicant: Ab Initio Technology LLC
    Inventors: Ephraim Meriwether Vishniac, Stephen J. Schmidt
  • Patent number: D722434
    Type: Grant
    Filed: September 19, 2013
    Date of Patent: February 17, 2015
    Assignee: Ab Initio Technology LLC
    Inventor: Allen Bowen