Synthetic Data Generator Leaderboard

Comprehensive benchmarking of privacy preservation and data fidelity across leading synthetic data generators

Mind the Gap — Why Trust Our Metrics?

🎯

Source-Level Privacy

We define a novel real-world privacy metric that captures harms people actually care about. Rather than asking if a flow or packet was included in training, our formulation asks whether an individual's traffic contributed to training a model, whether that inclusion can reveal sensitive information such as their location, behavior, or organizational affiliation etc.

⚔️

Attack-Grounded Metric

TraceBleed is not a naïve membership inference attack (MIA) — it's a novel attack-grounded metric tailored to network traces that exploits behavioral fingerprints across flows, rather than statistical artifacts.

🌐

Cross-Vantage Evaluation

We do not simply train attacker and generator on the same dataset. Our leaderboard contains evaluations across datasets collected from diverse vantage points to ensure that the vulnerability is true privacy leakage not distribution-specific anomalies.

✅

Actionable — Good News

We close the loop with a principled, generator-agnostic mitigation, showing the leakage we measure is both real and fixable.

Synthetic Data Generator Rankings

Mind the Gap — Why Trust Our Metrics?