Data lineage is the process of understanding, recording, and visualizing data as it flows from data sources to consumption.