Patents Assigned to Ab Initio Technology LLC
-
Patent number: 8489553Abstract: Managing data by: receiving a group of individually accessible data units, each data unit identified by a key value, with key values determined such that the key value identifying a first data unit received before a second data unit occurs earlier in a sort order than the key value identifying the second data unit; and processing the data units for storage in a data storage system. The processing includes: storing blocks of data, the blocks being generated by combining a plurality of the data units; providing an index with entries that enable location, based on a provided key value, of a block that includes a data unit corresponding to the provided key value; and generating one or more screening data structures associated with the blocks for determining, based on a given key value, whether to search the stored blocks for a data unit corresponding to the given key value.Type: GrantFiled: July 2, 2012Date of Patent: July 16, 2013Assignee: Ab Initio Technology LLCInventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
-
Patent number: 8484215Abstract: A method for clustering data elements stored in a data storage system includes reading data elements from the data storage system. Clusters of data elements are formed with each data element being a member of at least one cluster. At least one data element is associated with two or more clusters. Membership of the data element belonging to respective ones of the two or more clusters is represented by a measure of ambiguity. Information is stored in the data storage system to represent the formed clusters.Type: GrantFiled: October 23, 2009Date of Patent: July 9, 2013Assignee: Ab Initio Technology LLCInventor: Arlen Anderson
-
Patent number: 8484159Abstract: Determining metadata associated with a graph-based computation includes functionally transforming metadata associated with a first portion of a graph to generate transformed metadata associated with a second portion of the graph; determining a third portion of the graph related to the second portion of the graph; and propagating the transformed metadata from the second portion of the graph to the third portion of the graph.Type: GrantFiled: December 23, 2010Date of Patent: July 9, 2013Assignee: Ab Initio Technology LLCInventors: Craig W. Stanfill, J. Skeffington Wholey, Brond Larson, Glenn John Allin
-
Patent number: 8478706Abstract: Disclosed is a method including receiving a rule having at least one rule case for producing an output value based on one or more input values, generating a transform for receiving data from an input dataset and transforming the data based on the rule including producing a first series of values for at least one output variable in an output dataset, at least one value in the first series of values including a second series of values, and providing an output field corresponding to the at least one output variable in the output dataset for storing the second series of values.Type: GrantFiled: January 29, 2010Date of Patent: July 2, 2013Assignee: Ab Initio Technology LLCInventors: Joel Gould, Scott Studer
-
Patent number: 8447901Abstract: Systems and techniques include, in some implementations, a computer implemented method storing a portion of data elements present in a first buffer in a second buffer in response to detecting an overflow condition of the first buffer, wherein the data elements in the first buffer are sorted according to a predetermined order, and inserting a proxy data element in the first buffer to represent the portion of data elements stored to the second buffer.Type: GrantFiled: February 18, 2011Date of Patent: May 21, 2013Assignee: Ab Initio Technology LLCInventors: Craig W. Stanfill, Carl Richard Feynman
-
Patent number: 8447724Abstract: Determining metadata associated with a graph-based computation includes functionally transforming metadata associated with a first portion of a graph to generate transformed metadata associated with a second portion of the graph; determining a third portion of the graph related to the second portion of the graph; and propagating the transformed metadata from the second portion of the graph to the third portion of the graph.Type: GrantFiled: December 23, 2010Date of Patent: May 21, 2013Assignee: Ab Initio Technology LLCInventors: Craig W. Stanfill, J. Skeffington Wholey, Brond Larson, Glenn John Allin
-
Patent number: 8386408Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.Type: GrantFiled: November 14, 2011Date of Patent: February 26, 2013Assignee: Ab Initio Technology LLCInventors: Joel Gould, Joseph Skeffington Wholey, III, Timothy Perkins
-
Patent number: 8380651Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.Type: GrantFiled: November 14, 2011Date of Patent: February 19, 2013Assignee: Ab Initio Technology LLCInventors: Joel Gould, Joseph Skeffington Wholey, III, Timothy Perkins
-
Patent number: 8229902Abstract: A method for managing data includes receiving individually accessible data units, each identified by a key value; storing a plurality of blocks of data, each of at least some of the blocks being generated by combining a plurality of the data units; and providing an index that includes an entry for each of the blocks. One or more of the entries enable location, based on a provided key value, of a block that includes data units corresponding to a range of key values that includes the provided key value.Type: GrantFiled: November 1, 2006Date of Patent: July 24, 2012Assignee: Ab Initio Technology LLCInventors: Ephraim Meriwether Vishniac, Marshall A. Isman, Paul Bay, H. Mark Bromley, John L. Richardson
-
Patent number: 8214331Abstract: Managing data includes: receiving at least one group of individually accessible data units over an input device or port, each data unit identified by a key value, with key values of the received data units being sorted such that the key value identifying a given first data unit that is received before a given second data unit occurs earlier in a sort order than the key value identifying the given second data unit; and processing the data units for storage in a data storage system.Type: GrantFiled: January 3, 2011Date of Patent: July 3, 2012Assignee: Ab Initio Technology LLCInventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
-
Patent number: 8205113Abstract: Among other aspects disclosed are a method and system for processing a batch of input data in a fault tolerant manner. The method includes reading a batch of input data including a plurality of records from one or more data sources and passing the batch through a dataflow graph. The dataflow graph includes two or more nodes representing components connected by links representing flows of data between the components. At least one but fewer than all of the components includes a checkpoint process for an action performed for each of multiple units of work associated with one or more of the records. The checkpoint process includes opening a checkpoint buffer stored in non-volatile memory at the start of processing for the batch. For each unit of work from the batch, if a result from performing the action for the unit of work was previously saved in the checkpoint buffer, the saved result is used to complete processing of the unit of work without performing the action again.Type: GrantFiled: July 14, 2009Date of Patent: June 19, 2012Assignee: Ab Initio Technology LLCInventors: Bryan Phil Douros, Matthew Darcy Atterbury, Tim Wakeling
-
Patent number: 8082544Abstract: A number of tasks are defined according to a dependency graph. Multiple parameter contexts are maintained, each associated with a different scope of the tasks. A parameter used in a first of the tasks is bound to a value. This binding includes identifying a first of the contexts according to the dependency graph and retrieving the value for the parameter from the identified context.Type: GrantFiled: December 4, 2008Date of Patent: December 20, 2011Assignee: Ab Initio Technology LLCInventors: Frank Inchingolo, Craig W. Stanfill
-
Patent number: 8078820Abstract: A method, and corresponding system and software, is described for writing data to a plurality of queues, each portion of the data being written to a corresponding one of the queues. The method includes, without requiring concurrent locking of more than one queue, determining if a space is available in each queue for writing a corresponding portion of the data, and if available, reserving the spaces in the queues. The method includes writing each portion of the data to a corresponding one of the queues.Type: GrantFiled: December 6, 2010Date of Patent: December 13, 2011Assignee: Ab Initio Technology LLCInventors: Spiro Michaylov, Sanjeev Banerji, Craig W. Stanfill
-
Patent number: 8069129Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.Type: GrantFiled: April 10, 2007Date of Patent: November 29, 2011Assignee: Ab Initio Technology LLCInventors: Joel Gould, Joseph Skeffington Wholey, III, Timothy Perkins
-
Patent number: 8059125Abstract: An approach to automatically specifying, or assisting with the specification of, a parallel computation graph involves determining data processing characteristics of the linking elements that couple data processing elements of the graph. The characteristics of the linking elements are determined according to the characteristics of the upstream and/or downstream data processing elements associated with the linking element, for example, to enable computation by the parallel computation graph that is equivalent to computation of an associated serial graph.Type: GrantFiled: June 22, 2004Date of Patent: November 15, 2011Assignee: Ab Initio Technology LLCInventor: Craig W. Stanfill
-
Patent number: 8046385Abstract: In general, a method includes determining metric values associated with data quality for one or more child nodes. Metric values are determined for a parent node based on the metric values of at least some of the child nodes, and relationships between one or more parent nodes and one or more child nodes define a hierarchy. The determination of the metric value for the parent node is repeated for multiple instances.Type: GrantFiled: June 20, 2008Date of Patent: October 25, 2011Assignee: Ab Initio Technology LLCInventors: David Wald, Tim Wakeling, Muhammad Arshad Khan
-
Patent number: 7949839Abstract: A method, and corresponding software and system, is described for paging memory used for one or more sequentially-accessed data structure. The method includes providing a data structure representing an order in which memory pages are to be reused; and maintaining the data structure according to a history of access to a memory page associated with one of the sequentially-accessed data structures. A position of the memory page in the order depends on a transition of sequential access off of the memory page.Type: GrantFiled: January 15, 2010Date of Patent: May 24, 2011Assignee: Ab Initio Technology LLCInventor: Craig W. Stanfill
-
Patent number: 7899833Abstract: Information related to a first set of one or more data objects of a plurality of data objects in a database is determined. A composition of one or more actions is applied to determine the information related to the first set of one or more data objects. At least one of the actions includes a navigation action that maps a set of data objects to another set of data objects related according to a relationship type determined from a specification of the navigation action. Applying the composition of one or more actions may include recursive application of the navigation action. The actions may include a second action that performs a function based values associated with the data objects returned by the navigation action, where each value represents an attribute of the associated data object.Type: GrantFiled: November 2, 2004Date of Patent: March 1, 2011Assignee: Ab Initio Technology LLCInventors: Daniell Stevens, Donald Beaudry
-
Patent number: 7885932Abstract: Managing data includes: receiving at least one group of individually accessible data units over an input device or port, each data unit identified by a key value, with key values of the received data units being sorted such that the key value identifying a given first data unit that is received before a given second data unit occurs earlier in a sort order than the key value identifying the given second data unit; and processing the data units for storage in a data storage system.Type: GrantFiled: March 10, 2010Date of Patent: February 8, 2011Assignee: Ab Initio Technology LLCInventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
-
Patent number: 7877350Abstract: Determining metadata associated with a graph-based computation includes functionally transforming metadata associated with a first portion of a graph to generate transformed metadata associated with a second portion of the graph; determining a third portion of the graph related to the second portion of the graph; and propagating the transformed metadata from the second portion of the graph to the third portion of the graph.Type: GrantFiled: June 27, 2005Date of Patent: January 25, 2011Assignee: Ab Initio Technology LLCInventors: Craig W. Stanfill, J. Skeffington Wholey, Brond Larson, Glenn John Allin