Patents Assigned to Ab Initio Technology LLC
-
Publication number: 20150169428Abstract: A method includes receiving data indicative of a number of times each of one or more rules was executed by a data processing application during processing of one or more records; based on the number of times each of the rules was executed by the data processing application, determining a content criterion for each of one or more particular fields; generating content for each of the particular fields based on the content criterion; and populating each of the particular fields with the generated content.Type: ApplicationFiled: December 17, 2014Publication date: June 18, 2015Applicant: Ab Initio Technology LLCInventors: Marshall A. Isman, Richard Alan Epstein
-
Publication number: 20150169687Abstract: In one aspect, in general, a method of generating a dataflow graph representing a database query includes receiving a query plan from a plan generator, the query plan representing operations for executing a database query on at least one input representing a source of data, producing a dataflow graph from the query plan, wherein the dataflow graph includes at least one node that represents at least one operation represented by the query plan, and includes at least one link that represents at least one dataflow associated with the query plan, and altering one or more components of the dataflow graph based on at least one characteristic of the at least one input representing the source of data.Type: ApplicationFiled: February 23, 2015Publication date: June 18, 2015Applicant: Ab Initio Technology LLCInventors: Ian Schechter, Glenn John Allin
-
Patent number: 9037589Abstract: Received data records, each including one or more values in one or more fields, are processed to identify one or more data clusters. The processing includes: identifying tokens that each include at least one value or fragment of a value in a field or a combination of fields; generating a network representing the identified tokens, with nodes of the network representing tokens and edges of the network each representing a variant relationship between tokens; and generating a graphical representation of the network with different subsets of nodes distinguished based at least in part on values associated with nodes, where a value associated with a particular node quantifies a count of a number of instances of the token represented by that particular node appearing within the received data records.Type: GrantFiled: November 15, 2012Date of Patent: May 19, 2015Assignee: Ab Initio Technology LLCInventor: Arlen Anderson
-
Patent number: 9031895Abstract: Processing metadata includes storing, in a data storage system, a specification for each of multiple sources, each specification including information identifying one or more data elements of the corresponding source; and processing, in a data processing system coupled to the data storage system, data elements from the sources, including generating a set of rules for each source based on a corresponding one of the stored specifications, and matching data elements of different sources and determining a quality metric characterizing a given match between a first data element of a first source and a second data element of a second source according to the set of rules generated for the first source and the set of rules generated for the second source.Type: GrantFiled: January 13, 2011Date of Patent: May 12, 2015Assignee: Ab Initio Technology LLCInventor: Andrew Schon
-
Patent number: 9021299Abstract: Techniques are disclosed that include a computer-implemented method, including storing information related to an initial state of a process upon being initialized, wherein execution of the process includes executing at least one execution phase and upon completion of the executing of the execution phase storing information representative of an end state of the execution phase; aborting execution of the process in response to a predetermined event; and resuming execution of the process from one of the saved initial and end states without needing to shut down the process.Type: GrantFiled: February 18, 2011Date of Patent: April 28, 2015Assignee: Ab Initio Technology LLCInventors: Bryan Phil Douros, Joseph Skeffington Wholey, III
-
Publication number: 20150106341Abstract: Processing data includes profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data. The data is then processed from the data source. This processing includes accessing the stored profile information and processing the data according to the accessed profile information.Type: ApplicationFiled: October 20, 2014Publication date: April 16, 2015Applicant: Ab Initio Technology LLCInventors: Joel Gould, Carl Richard Feynman, Paul Bay
-
Patent number: 9002770Abstract: A component in a graph-based computation having data processing components connected by linking elements representing data flows is updated by receiving a rule specification, generating a transform for transforming data based on the rule specification, associating the transform with a component in the graph-based computation, and in response to determining that a new rule specification has been received or an existing rule specification has been edited, updating the transform associated with the component in the graph-based computation according to the new or edited rule specification. A computation is tested by receiving a rule specification including a set of rule cases, receiving a set of test cases, each test case containing a value for one or more of the potential inputs, and for each test case, identifying one of the rule cases that will generate an output given the input values of the test case.Type: GrantFiled: August 31, 2012Date of Patent: April 7, 2015Assignee: Ab Initio Technology LLCInventor: Joel Gould
-
Patent number: 9003084Abstract: Systems and techniques are disclosed that include in one aspect a computer implemented method storing a received stream of data elements in a buffer, applying a boundary condition to the data elements stored in the buffer after receiving each individual data element of the stream of data elements, and producing one or more data elements from the buffer based on the boundary condition as an output stream of data elements sorted according to a predetermined order.Type: GrantFiled: February 18, 2011Date of Patent: April 7, 2015Assignee: Ab Initio Technology LLCInventors: Craig W. Stanfill, Carl Richard Feynman
-
Patent number: 8996442Abstract: Disclosed is a method including receiving a rule having at least one rule case for producing an output value based on one or more input values, generating a transform for receiving data from an input dataset and transforming the data based on the rule including producing a first series of values for at least one output variable in an output dataset, at least one value in the first series of values including a second series of values, and providing an output field corresponding to the at least one output variable in the output dataset for storing the second series of values.Type: GrantFiled: June 27, 2013Date of Patent: March 31, 2015Assignee: AB Initio Technology LLCInventors: Joel Gould, Scott Studer
-
Patent number: 8954482Abstract: Processing data elements associated with a hierarchy that includes multiple levels includes: forming multiple flows of data elements, with each flow being associated with a level of the hierarchy; and including within at least some of the flows hierarchical structure information associated with a level of the hierarchy. For at least one of the levels, two or more and fewer than all of the flows include hierarchical structure information associated with that level.Type: GrantFiled: September 19, 2008Date of Patent: February 10, 2015Assignee: Ab Initio Technology LLCInventors: Craig W. Stanfill, Joseph Skeffington Wholey, III
-
Patent number: 8949189Abstract: Managing data by: receiving a group of individually accessible data units, each data unit identified by a key value, with key values determined such that the key value identifying a first data unit received before a second data unit occurs earlier in a sort order than the key value identifying the second data unit; and processing the data units for storage in a data storage system. The processing includes: storing blocks of data, the blocks being generated by combining a plurality of the data units; providing an index with entries that enable location, based on a provided key value, of a block that includes a data unit corresponding to the provided key value; and generating one or more screening data structures associated with the blocks for determining, based on a given key value, whether to search the stored blocks for a data unit corresponding to the given key value.Type: GrantFiled: July 15, 2013Date of Patent: February 3, 2015Assignee: Ab Initio Technology LLCInventors: Vrishal Kulkarni, Stephen Schmidt, Craig W. Stanfill, Ephraim Meriwether Vishniac
-
Patent number: 8930337Abstract: Among other things, one aspect includes receiving one or more mapped relationships between a given output and one or more inputs represented by input variables, at least one of the mapped relationships including a transformational expression, the transformational expression defining an output of a mapped relationship based on at least one input variable mapped to an element of an input dataset; receiving identification of elements of an output dataset mapped to outputs of respective mapped relationships; generating output data according to the transformational expression based on input data from the input dataset associated with the element of the input dataset mapped to the input variable; determining validation information in response to the generated output data based on validation criteria defining one or more characteristics of valid values associated with one or more of the identified elements of the output dataset; and presenting visual feedback based on the determined validation information.Type: GrantFiled: July 25, 2013Date of Patent: January 6, 2015Assignee: Ab Initio Technology LLCInventors: Scott Studer, Amit Weisman
-
Publication number: 20140344508Abstract: Processing a plurality of data units to generate result information, includes: performing a data operation for each data unit of a first subset of data units from the plurality of data units, and storing information associated with a result of the data operation in a first set of one or more data structures stored in working memory space of a memory device; after an overflow condition on the working memory space is satisfied, storing information in overflow storage space of a storage device; and repeating an overflow processing procedure multiple times during the processing of the plurality of data units, the overflow processing procedure including: updating a new set of one or more data structures stored in the working memory space using at least some information stored in the overflow storage space.Type: ApplicationFiled: May 16, 2014Publication date: November 20, 2014Applicant: Ab Initio Technology LLCInventors: Muhammad Arshad Khan, Stephen G. Rybicki, Joel Gould
-
Patent number: 8875145Abstract: Processing data includes: receiving units of work that each include one or more work elements, and processing a first unit of work using a first compiled dataflow graph (160) loaded into a data processing system (100) in response to receiving the first unit of work. The processing includes: analysis to determine a characteristic of the first unit of work; identifying one or more compiled dataflow graphs from graphs stored in a data storage system (107) that include at least some that were compiled for processing a unit of work having the determined characteristic; loading one of the identified compiled dataflow graphs into the data processing system (100) as the first compiled dataflow graph (160); and generating one or more output work elements from at least one work element in the first unit of work.Type: GrantFiled: June 15, 2011Date of Patent: October 28, 2014Assignee: Ab Initio Technology LLCInventors: Matthew Darcy Atterbury, H. Mark Bromley, Wayne Mesard, Arkadi Popov, Stephen Schmidt, Craig W. Stanfill, Joseph Skeffington Wholey, III
-
Patent number: 8866817Abstract: An approach to automatically specifying, or assisting with the specification of, a parallel computation graph involves determining data processing characteristics of the linking elements that couple data processing elements of the graph. The characteristics of the linking elements are determined according to the characteristics of the upstream and/or downstream data processing elements associated with the linking element, for example, to enable computation by the parallel computation graph that is equivalent to computation of an associated serial graph.Type: GrantFiled: November 14, 2011Date of Patent: October 21, 2014Assignee: Ab Initio Technology LLCInventor: Craig W. Stanfill
-
Patent number: 8868580Abstract: Processing data includes profiling data from a data source, including reading the data from the data source, computing summary data characterizing the data while reading the data, and storing profile information that is based on the summary data. The data is then processed from the data source. This processing includes accessing the stored profile information and processing the data according to the accessed profile information.Type: GrantFiled: September 15, 2004Date of Patent: October 21, 2014Assignee: Ab Initio Technology LLCInventors: Joel Gould, Carl Feynman, Paul Bay
-
Publication number: 20140282418Abstract: Among other things, a method includes, at a computer system on which one or more computer programs are executing, receiving a specification defining types of state information, receiving an indication that an event associated with at least one of the computer programs has occurred, the event associated with execution of a function of the computer program, collecting state information describing the state of the execution of the computer program when the event occurred, generating an entry corresponding to the event, the entry including elements of the collected state information, the elements of state information formatted according to the specification, and storing the entry. The log can be parsed to generate a visualization of computer program execution.Type: ApplicationFiled: January 16, 2014Publication date: September 18, 2014Applicant: Ab Initio Technology LLCInventors: Joseph Stuart Wood, Robert Freundlich
-
Publication number: 20140258651Abstract: A system for managing storage of data units includes a data storage system configured to store multiple data blocks, at least some of the data blocks containing multiple data units, and configured to store, for at least some of the data blocks, corresponding historical information about prior removal of one or more data units from that data block, the removal affecting at least some addresses of data units in that data block. The system is configured to perform at least one operation that accesses at least a first data unit stored in a first data block according to address information interpreted based on any stored historical information corresponding to the first data block.Type: ApplicationFiled: March 6, 2013Publication date: September 11, 2014Applicant: Ab Initio Technology LLCInventors: Ephraim Meriwether Vishniac, Stephen J. Schmidt
-
Publication number: 20140258652Abstract: A system for managing storage of data units includes a data storage system configured to store multiple data blocks, at least some of the data blocks containing multiple data units, with at least a group of the data blocks being stored contiguously, thereby supporting a first read operation that retrieves data units from at least two adjacent data blocks in the group. The system is configured to perform one or more operations with respect to data units, the operations including a delete operation that replaces a first data block containing a data unit to be deleted with a second data block that does not contain the deleted data unit, with the second data block having the same size as the first data block.Type: ApplicationFiled: March 6, 2013Publication date: September 11, 2014Applicant: Ab Initio Technology LLCInventors: Ephraim Meriwether Vishniac, Stephen J. Schmidt
-
Patent number: D722434Type: GrantFiled: September 19, 2013Date of Patent: February 17, 2015Assignee: Ab Initio Technology LLCInventor: Allen Bowen