TagSniff: Debugging Model for Distributed Data Processing
TagSniff introduces two simple primitives, tag and sniff, for debugging distributed data processing. Tag marks tuples with metadata; sniff identifies tuples that need attention. Together they enable data breakpoints, lineage tracking, and anomaly detection across Spark and Flink jobs.
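
To make the two primitives concrete, here is a minimal single-process sketch in plain Python. It is not the TagSniff API and does not run on Spark or Flink. The names DebugTuple, tag, and sniff, their signatures, and the "suspect" label are all hypothetical, chosen only for illustration. The sketch also assumes that sniff selects tuples by inspecting their tags.

```python
from dataclasses import dataclass, field
from typing import Any, Callable, Iterable, Iterator, List, Set


@dataclass
class DebugTuple:
    """Hypothetical wrapper: a data value plus the debugging tags attached to it."""
    value: Any
    tags: Set[str] = field(default_factory=set)


def tag(records: Iterable[DebugTuple],
        predicate: Callable[[Any], bool],
        label: str) -> Iterator[DebugTuple]:
    """Attach `label` to every record whose value satisfies `predicate`.

    All records are passed through, tagged or not, so the data flow is unchanged.
    """
    for rec in records:
        if predicate(rec.value):
            rec.tags.add(label)
        yield rec


def sniff(records: Iterable[DebugTuple],
          predicate: Callable[[Set[str]], bool]) -> List[DebugTuple]:
    """Return the records whose tags satisfy `predicate` (assumed semantics)."""
    return [rec for rec in records if predicate(rec.tags)]


# Toy input: negative and missing values are treated as anomalies.
records = [DebugTuple(v) for v in (3, -1, 7, None)]

# Tag step: mark anomalous values with the hypothetical label "suspect".
tagged = tag(records, lambda v: v is None or v < 0, "suspect")

# Sniff step: pick out the tuples that carry the label.
flagged = sniff(tagged, lambda tags: "suspect" in tags)
flagged_values = [rec.value for rec in flagged]
```

In this sketch the sniffed tuples are simply collected in a list. A real debugger could instead pause on them, in the spirit of a data breakpoint, or follow their tags through later operators to trace lineage.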
