Patents Assigned to Ab Initio Technology LLC
  • Patent number: 8489553
    Abstract: Managing data by: receiving a group of individually accessible data units, each data unit identified by a key value, with key values determined such that the key value identifying a first data unit received before a second data unit occurs earlier in a sort order than the key value identifying the second data unit; and processing the data units for storage in a data storage system. The processing includes: storing blocks of data, the blocks being generated by combining a plurality of the data units; providing an index with entries that enable location, based on a provided key value, of a block that includes a data unit corresponding to the provided key value; and generating one or more screening data structures associated with the blocks for determining, based on a given key value, whether to search the stored blocks for a data unit corresponding to the given key value.
    Type: Grant
    Filed: July 2, 2012
    Date of Patent: July 16, 2013
    Assignee: Ab Initio Technology LLC
    Inventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
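The storage scheme in the abstract above (blocks built from key-ordered data units, an index for locating a block by key, and a screening structure consulted before any block search) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the class name `BlockStore`, the fixed block size, and the choice of a small Bloom-style bit set as the screening structure are not taken from the patent.

```python
import bisect
import hashlib


class BlockStore:
    """Toy store: key-ordered units are packed into blocks, with an index and a screening filter."""

    def __init__(self, units, block_size=3, filter_bits=256):
        # units: (key, value) pairs, assumed to arrive in ascending key order.
        units = list(units)
        self.blocks = [units[i:i + block_size] for i in range(0, len(units), block_size)]
        # Index: the first key of each block, used to locate the block for a given key.
        self.first_keys = [block[0][0] for block in self.blocks]
        # Screening structure: a Bloom-style bit set (hypothetical choice).
        self.filter_bits = filter_bits
        self.bits = 0
        for key, _ in units:
            for pos in self._positions(key):
                self.bits |= 1 << pos

    def _positions(self, key):
        # Two bit positions taken from separate slices of a SHA-256 digest of the key.
        digest = hashlib.sha256(repr(key).encode()).digest()
        return [int.from_bytes(digest[i:i + 4], "big") % self.filter_bits for i in (0, 4)]

    def might_contain(self, key):
        # False means the key is definitely absent; True means the blocks must be searched.
        return all((self.bits >> pos) & 1 for pos in self._positions(key))

    def get(self, key):
        if not self.might_contain(key):
            return None  # screened out, no block search needed
        idx = bisect.bisect_right(self.first_keys, key) - 1
        if idx < 0:
            return None
        for k, v in self.blocks[idx]:
            if k == key:
                return v
        return None


store = BlockStore([(1, "a"), (4, "b"), (7, "c"), (9, "d"), (12, "e")])
```

In this sketch `store.get(7)` finds its value through the index, while a lookup for an absent key returns `None`, usually without touching any block because the filter rejects it first.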
  • Patent number: 8484215
    Abstract: A method for clustering data elements stored in a data storage system includes reading data elements from the data storage system. Clusters of data elements are formed with each data element being a member of at least one cluster. At least one data element is associated with two or more clusters. Membership of the data element belonging to respective ones of the two or more clusters is represented by a measure of ambiguity. Information is stored in the data storage system to represent the formed clusters.
    Type: Grant
    Filed: October 23, 2009
    Date of Patent: July 9, 2013
    Assignee: Ab Initio Technology LLC
    Inventor: Arlen Anderson
  • Patent number: 8484159
    Abstract: Determining metadata associated with a graph-based computation includes functionally transforming metadata associated with a first portion of a graph to generate transformed metadata associated with a second portion of the graph; determining a third portion of the graph related to the second portion of the graph; and propagating the transformed metadata from the second portion of the graph to the third portion of the graph.
    Type: Grant
    Filed: December 23, 2010
    Date of Patent: July 9, 2013
    Assignee: Ab Initio Technology LLC
    Inventors: Craig W. Stanfill, J. Skeffington Wholey, Brond Larson, Glenn John Allin
  • Patent number: 8478706
    Abstract: Disclosed is a method including receiving a rule having at least one rule case for producing an output value based on one or more input values, generating a transform for receiving data from an input dataset and transforming the data based on the rule including producing a first series of values for at least one output variable in an output dataset, at least one value in the first series of values including a second series of values, and providing an output field corresponding to the at least one output variable in the output dataset for storing the second series of values.
    Type: Grant
    Filed: January 29, 2010
    Date of Patent: July 2, 2013
    Assignee: Ab Initio Technology LLC
    Inventors: Joel Gould, Scott Studer
  • Patent number: 8447901
    Abstract: Systems and techniques include, in some implementations, a computer implemented method storing a portion of data elements present in a first buffer in a second buffer in response to detecting an overflow condition of the first buffer, wherein the data elements in the first buffer are sorted according to a predetermined order, and inserting a proxy data element in the first buffer to represent the portion of data elements stored to the second buffer.
    Type: Grant
    Filed: February 18, 2011
    Date of Patent: May 21, 2013
    Assignee: Ab Initio Technology LLC
    Inventors: Craig W. Stanfill, Carl Richard Feynman
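The overflow handling described in the abstract above can be sketched as follows. This is only an illustration under stated assumptions: the names `SpillingBuffer`, `REAL`, and `drain`, the policy of spilling the upper half of the ordinary elements, and the use of a merge when reading back are all hypothetical and not drawn from the patent.

```python
import bisect
import heapq

REAL = -1  # tag for an ordinary element; proxies carry a run id >= 0


class SpillingBuffer:
    def __init__(self, capacity):
        self.capacity = capacity
        self.first = []    # sorted (key, tag) entries: ordinary elements and proxies
        self.second = []   # spilled runs, each a sorted list of keys

    def insert(self, key):
        bisect.insort(self.first, (key, REAL))
        if len(self.first) > self.capacity:  # overflow condition
            self._spill()

    def _spill(self):
        reals = [k for k, tag in self.first if tag == REAL]
        proxies = [entry for entry in self.first if entry[1] != REAL]
        half = len(reals) // 2
        keep, run = reals[:half], reals[half:]
        run_id = len(self.second)
        self.second.append(run)
        # The proxy takes the place of the spilled run, keyed by the run's smallest key.
        self.first = sorted([(k, REAL) for k in keep] + proxies + [(run[0], run_id)])

    def drain(self):
        # Merge the remaining ordinary elements with every spilled run, in sorted order.
        reals = [k for k, tag in self.first if tag == REAL]
        return list(heapq.merge(reals, *self.second))


buf = SpillingBuffer(capacity=4)
for k in [9, 3, 7, 1, 8, 2, 6]:
    buf.insert(k)
```

After these inserts the first buffer holds two ordinary elements and two proxies, and `buf.drain()` still returns all seven keys in ascending order. A real implementation would also need a policy for the case where proxies alone fill the first buffer, which this sketch does not address.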
  • Patent number: 8447724
    Abstract: Determining metadata associated with a graph-based computation includes functionally transforming metadata associated with a first portion of a graph to generate transformed metadata associated with a second portion of the graph; determining a third portion of the graph related to the second portion of the graph; and propagating the transformed metadata from the second portion of the graph to the third portion of the graph.
    Type: Grant
    Filed: December 23, 2010
    Date of Patent: May 21, 2013
    Assignee: Ab Initio Technology LLC
    Inventors: Craig W. Stanfill, J. Skeffington Wholey, Brond Larson, Glenn John Allin
  • Patent number: 8386408
    Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: February 26, 2013
    Assignee: Ab Initio Technology LLC
    Inventors: Joel Gould, Joseph Skeffington Wholey, III, Timothy Perkins
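The two ideas in the abstract above, generating a transform from a rule specification and identifying which rule case handles each test case, can be sketched briefly. The representation of a rule case as a (condition, output) pair, the first-match-wins policy, and the example field `miles` are assumptions for illustration only.

```python
def make_transform(rule_cases):
    """Build a transform from ordered rule cases; the first case whose condition holds wins."""
    def transform(record):
        for index, (condition, output) in enumerate(rule_cases):
            if condition(record):
                return index, output(record)
        return None, None  # no rule case applies to this record
    return transform


# Hypothetical rule specification: each rule case pairs a condition with an output.
rule_cases = [
    (lambda r: r["miles"] >= 100000, lambda r: "gold"),
    (lambda r: r["miles"] >= 50000, lambda r: "silver"),
    (lambda r: True, lambda r: "basic"),
]
transform = make_transform(rule_cases)

# For each test case, identify the rule case that generates the output.
test_cases = [{"miles": 120000}, {"miles": 60000}, {"miles": 10}]
fired = [transform(t)[0] for t in test_cases]
outputs = [transform(t)[1] for t in test_cases]
```

Regenerating the transform whenever `rule_cases` is edited, by calling `make_transform` again, mirrors the update step in the abstract in the simplest possible way.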
  • Patent number: 8380651
    Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.
    Type: Grant
    Filed: November 14, 2011
    Date of Patent: February 19, 2013
    Assignee: Ab Initio Technology LLC
    Inventors: Joel Gould, Joseph Skeffington Wholey, III, Timothy Perkins
  • Patent number: 8229902
    Abstract: A method for managing data includes receiving individually accessible data units, each identified by a key value; storing a plurality of blocks of data, each of at least some of the blocks being generated by combining a plurality of the data units; and providing an index that includes an entry for each of the blocks. One or more of the entries enable location, based on a provided key value, of a block that includes data units corresponding to a range of key values that includes the provided key value.
    Type: Grant
    Filed: November 1, 2006
    Date of Patent: July 24, 2012
    Assignee: Ab Initio Technology LLC
    Inventors: Ephraim Meriwether Vishniac, Marshall A. Isman, Paul Bay, H. Mark Bromley, John L. Richardson
  • Patent number: 8214331
    Abstract: Managing data includes: receiving at least one group of individually accessible data units over an input device or port, each data unit identified by a key value, with key values of the received data units being sorted such that the key value identifying a given first data unit that is received before a given second data unit occurs earlier in a sort order than the key value identifying the given second data unit; and processing the data units for storage in a data storage system.
    Type: Grant
    Filed: January 3, 2011
    Date of Patent: July 3, 2012
    Assignee: Ab Initio Technology LLC
    Inventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
  • Patent number: 8205113
    Abstract: Among other aspects disclosed are a method and system for processing a batch of input data in a fault tolerant manner. The method includes reading a batch of input data including a plurality of records from one or more data sources and passing the batch through a dataflow graph. The dataflow graph includes two or more nodes representing components connected by links representing flows of data between the components. At least one but fewer than all of the components includes a checkpoint process for an action performed for each of multiple units of work associated with one or more of the records. The checkpoint process includes opening a checkpoint buffer stored in non-volatile memory at the start of processing for the batch. For each unit of work from the batch, if a result from performing the action for the unit of work was previously saved in the checkpoint buffer, the saved result is used to complete processing of the unit of work without performing the action again.
    Type: Grant
    Filed: July 14, 2009
    Date of Patent: June 19, 2012
    Assignee: Ab Initio Technology LLC
    Inventors: Bryan Phil Douros, Matthew Darcy Atterbury, Tim Wakeling
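The checkpoint behaviour described in the abstract above, reusing a saved result instead of repeating the action for a unit of work, can be sketched as follows. The function name `process_batch`, the JSON-lines file format, and the use of an ordinary file as the non-volatile checkpoint buffer are assumptions made for this example, not details from the patent.

```python
import json
import os


def process_batch(units, action, checkpoint_path):
    """Run `action` once per unit of work, reusing results saved in a checkpoint file."""
    saved = {}
    if os.path.exists(checkpoint_path):
        # Load results saved by an earlier, possibly interrupted, run of the same batch.
        with open(checkpoint_path) as f:
            for line in f:
                entry = json.loads(line)
                saved[entry["id"]] = entry["result"]
    results = []
    with open(checkpoint_path, "a") as f:  # open the checkpoint buffer for the batch
        for unit_id, payload in units:
            key = str(unit_id)
            if key in saved:
                results.append(saved[key])  # reuse the saved result, skip the action
                continue
            result = action(payload)
            f.write(json.dumps({"id": key, "result": result}) + "\n")
            f.flush()
            os.fsync(f.fileno())  # push the record to disk before moving on
            saved[key] = result
            results.append(result)
    return results
```

If a run fails partway through, calling `process_batch` again with the same `checkpoint_path` performs the action only for units that have no saved result. The sketch assumes results are JSON-serializable and does not handle a partially written final line.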
  • Patent number: 8082544
    Abstract: A number of tasks are defined according to a dependency graph. Multiple parameter contexts are maintained, each associated with a different scope of the tasks. A parameter used in a first of the tasks is bound to a value. This binding includes identifying a first of the contexts according to the dependency graph and retrieving the value for the parameter from the identified context.
    Type: Grant
    Filed: December 4, 2008
    Date of Patent: December 20, 2011
    Assignee: Ab Initio Technology LLC
    Inventors: Frank Inchingolo, Craig W. Stanfill
  • Patent number: 8078820
    Abstract: A method, and corresponding system and software, is described for writing data to a plurality of queues, each portion of the data being written to a corresponding one of the queues. The method includes, without requiring concurrent locking of more than one queue, determining if a space is available in each queue for writing a corresponding portion of the data, and if available, reserving the spaces in the queues. The method includes writing each portion of the data to a corresponding one of the queues.
    Type: Grant
    Filed: December 6, 2010
    Date of Patent: December 13, 2011
    Assignee: Ab Initio Technology LLC
    Inventors: Spiro Michaylov, Sanjeev Banerji, Craig W. Stanfill
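The reserve-then-write approach in the abstract above can be sketched as follows. The names `ReservableQueue` and `write_all`, the slot-counting reservation, and the cancel-on-failure policy are hypothetical; the only property taken from the abstract is that no more than one queue is locked at any moment.

```python
import threading
from collections import deque


class ReservableQueue:
    def __init__(self, capacity):
        self.capacity = capacity
        self.items = deque()
        self.reserved = 0              # slots promised to writers but not yet filled
        self.lock = threading.Lock()

    def try_reserve(self):
        with self.lock:
            if len(self.items) + self.reserved < self.capacity:
                self.reserved += 1
                return True
            return False

    def cancel(self):
        with self.lock:
            self.reserved -= 1

    def commit(self, item):
        with self.lock:
            self.reserved -= 1
            self.items.append(item)


def write_all(queues, portions):
    """Write portions[i] to queues[i], all or nothing, holding at most one lock at a time."""
    got = []
    for q in queues:
        if q.try_reserve():
            got.append(q)
        else:
            for r in got:      # space is missing somewhere: release earlier reservations
                r.cancel()
            return False
    for q, portion in zip(queues, portions):
        q.commit(portion)      # reserved space guarantees each write succeeds
    return True


a, b = ReservableQueue(1), ReservableQueue(2)
first_ok = write_all([a, b], ["x1", "y1"])
second_ok = write_all([a, b], ["x2", "y2"])  # queue a is full, so nothing is written
```

Because each method takes and releases a single queue's lock, the writer never holds two locks at once, and a failed attempt leaves every queue as it was.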
  • Patent number: 8069129
    Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.
    Type: Grant
    Filed: April 10, 2007
    Date of Patent: November 29, 2011
    Assignee: Ab Initio Technology LLC
    Inventors: Joel Gould, Joseph Skeffington Wholey, III, Timothy Perkins
  • Patent number: 8059125
    Abstract: An approach to automatically specifying, or assisting with the specification of, a parallel computation graph involves determining data processing characteristics of the linking elements that couple data processing elements of the graph. The characteristics of the linking elements are determined according to the characteristics of the upstream and/or downstream data processing elements associated with the linking element, for example, to enable computation by the parallel computation graph that is equivalent to computation of an associated serial graph.
    Type: Grant
    Filed: June 22, 2004
    Date of Patent: November 15, 2011
    Assignee: Ab Initio Technology LLC
    Inventor: Craig W. Stanfill
  • Patent number: 8046385
    Abstract: In general, a method includes determining metric values associated with data quality for one or more child nodes. Metric values are determined for a parent node based on the metric values of at least some of the child nodes, and relationships between one or more parent nodes and one or more child nodes define a hierarchy. The determination of the metric value for the parent node is repeated for multiple instances.
    Type: Grant
    Filed: June 20, 2008
    Date of Patent: October 25, 2011
    Assignee: Ab Initio Technology LLC
    Inventors: David Wald, Tim Wakeling, Muhammad Arshad Khan
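The hierarchy of metric values in the abstract above can be illustrated with a short sketch. The aggregation rule (an unweighted mean of the child values), the node names, and the idea that each instance is a separate snapshot of leaf metrics are assumptions chosen for the example.

```python
from statistics import mean


def rollup(hierarchy, leaf_metrics, node):
    """Return the metric for `node`: its own value if it is a leaf, else the mean of its children."""
    children = hierarchy.get(node, [])
    if not children:
        return leaf_metrics[node]
    return mean(rollup(hierarchy, leaf_metrics, child) for child in children)


# Hypothetical hierarchy: parent node -> child nodes.
hierarchy = {
    "warehouse": ["orders", "customers"],
    "orders": ["orders.id", "orders.total"],
}

# One set of child metric values per instance (for example, one per load).
instances = [
    {"orders.id": 1.0, "orders.total": 0.5, "customers": 0.25},
    {"orders.id": 1.0, "orders.total": 1.0, "customers": 0.5},
]

# Repeat the parent-node determination for each instance.
history = [rollup(hierarchy, leaf, "warehouse") for leaf in instances]
```

Here `history` holds one parent-level value per instance, which is the kind of series that could be tracked over time.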
  • Patent number: 7949839
    Abstract: A method, and corresponding software and system, is described for paging memory used for one or more sequentially-accessed data structures. The method includes providing a data structure representing an order in which memory pages are to be reused; and maintaining the data structure according to a history of access to a memory page associated with one of the sequentially-accessed data structures. A position of the memory page in the order depends on a transition of sequential access off of the memory page.
    Type: Grant
    Filed: January 15, 2010
    Date of Patent: May 24, 2011
    Assignee: Ab Initio Technology LLC
    Inventor: Craig W. Stanfill
  • Patent number: 7899833
    Abstract: Information related to a first set of one or more data objects of a plurality of data objects in a database is determined. A composition of one or more actions is applied to determine the information related to the first set of one or more data objects. At least one of the actions includes a navigation action that maps a set of data objects to another set of data objects related according to a relationship type determined from a specification of the navigation action. Applying the composition of one or more actions may include recursive application of the navigation action. The actions may include a second action that performs a function based on values associated with the data objects returned by the navigation action, where each value represents an attribute of the associated data object.
    Type: Grant
    Filed: November 2, 2004
    Date of Patent: March 1, 2011
    Assignee: Ab Initio Technology LLC
    Inventors: Daniell Stevens, Donald Beaudry
  • Patent number: 7885932
    Abstract: Managing data includes: receiving at least one group of individually accessible data units over an input device or port, each data unit identified by a key value, with key values of the received data units being sorted such that the key value identifying a given first data unit that is received before a given second data unit occurs earlier in a sort order than the key value identifying the given second data unit; and processing the data units for storage in a data storage system.
    Type: Grant
    Filed: March 10, 2010
    Date of Patent: February 8, 2011
    Assignee: Ab Initio Technology LLC
    Inventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
  • Patent number: 7877350
    Abstract: Determining metadata associated with a graph-based computation includes functionally transforming metadata associated with a first portion of a graph to generate transformed metadata associated with a second portion of the graph; determining a third portion of the graph related to the second portion of the graph; and propagating the transformed metadata from the second portion of the graph to the third portion of the graph.
    Type: Grant
    Filed: June 27, 2005
    Date of Patent: January 25, 2011
    Assignee: Ab Initio Technology LLC
    Inventors: Craig W. Stanfill, J. Skeffington Wholey, Brond Larson, Glenn John Allin