Abstract: One embodiment of the present invention sets forth a mechanism for linking data fields across different components in a data pipeline. For a particular output data field in an upstream data component, a corresponding input data field in the downstream data component is identified based on an analysis of data types, string matching and previously created links.