[Paper] IntraShuffler: A Privacy Preserving Framework for Heterogeneous DP Federated Learning

Published: (June 1, 2026 at 01:54 PM EDT)
4 min read
Source: arXiv

Source: arXiv - 2606.02563v1

Overview

The paper introduces IntraShuffler, a middleware that protects client privacy in heterogeneous‑differential‑privacy federated learning (HDP‑FL). By intelligently shuffling model updates within privacy‑compatible groups, it thwarts a new “privacy inference attack” that can otherwise expose client‑level data characteristics—even when the server already knows each client’s privacy budget.

Key Contributions

  • Privacy inference attack model: Demonstrates how an honest‑but‑curious server can combine gradient denoising with surrogate modeling to recover client data distributions and link updates across training rounds.
  • IntraShuffler framework: Proposes a privacy‑aware shuffling mechanism that (1) buckets clients with compatible ε‑budgets and (2) performs parameter‑level shuffling inside each bucket, preserving ε‑aware aggregation while breaking gradient linkability.
  • Empirical validation: Shows >60 % reduction in gradient recoverability and a drop in surrogate inference accuracy from 0.78 → 0.33 across four benchmark datasets, with negligible loss in model utility for common FL aggregation rules (FedAvg, FedProx, etc.).
  • Compatibility with HDP‑FL: Unlike classic shuffle‑model defenses, IntraShuffler works with per‑client ε‑weighting, making it practical for real‑world federated deployments where privacy budgets differ.

Methodology

  1. Threat model: The server knows each client’s declared εᵢ and can observe the aggregated model after each round. It may apply gradient denoising to reduce DP noise and train a surrogate model that maps noisy updates back to data distribution attributes.
  2. Attack evaluation:
    • Surrogate inference accuracy measures how well the server predicts a client’s data distribution (e.g., class proportions).
    • Linkage success quantifies the probability of correctly matching updates from the same client across rounds.
  3. IntraShuffler design:
    • Bucket formation: Clients are grouped so that the variance of εᵢ within a bucket stays below a configurable threshold, ensuring the server can still apply ε‑aware weighting.
    • Parameter‑level shuffling: Within each bucket, the server randomly permutes the order of model parameters (or sub‑vectors) before aggregation, breaking the persistent structure that the attack relies on.
    • Aggregation: After shuffling, the server performs the usual ε‑aware weighted average, then un‑shuffles the global model for distribution back to clients.
  4. Experimental setup: Four heterogeneous datasets (e.g., FEMNIST, Shakespeare, CIFAR‑10 with non‑IID splits) were used. The authors compared three baselines (no defense, classic shuffle‑model, and DP‑only) against IntraShuffler under multiple aggregation rules.

Results & Findings

MetricNo DefenseClassic ShuffleIntraShuffler
Surrogate inference accuracy0.780.550.33
Linkage success (↑)0.710.420.18
Test accuracy (model utility)84.2 %83.7 %84.0 %
Gradient recoverability reduction38 %>60 %
  • Privacy gain: IntraShuffler cuts the attacker’s ability to infer client‑level distributional info by more than half compared to the classic shuffle model.
  • Utility preservation: Model performance remains within 0.2 % of the unprotected baseline, confirming that the shuffling does not materially degrade learning.
  • Scalability: Overhead is modest—bucket formation adds <5 ms per round on a 100‑client simulation, and parameter shuffling incurs negligible compute cost.

Practical Implications

  • Enterprise FL platforms: Companies that must honor different privacy contracts per client (e.g., hospitals with varying HIPAA constraints) can adopt IntraShuffler to keep per‑client ε‑weighting while mitigating inference leakage.
  • Edge‑AI deployments: Mobile or IoT federations often run on limited hardware; IntraShuffler’s lightweight shuffling fits within existing communication pipelines without extra bandwidth.
  • Regulatory compliance: By reducing the risk of linking updates to specific data sources, the framework helps satisfy stricter privacy regulations (GDPR Art. 25 “data protection by design”).
  • Open‑source integration: The middleware can be plugged into popular FL frameworks (TensorFlow Federated, PySyft) as a pre‑aggregation hook, making adoption straightforward for developers.

Limitations & Future Work

  • Bucket granularity trade‑off: Tight ε‑compatibility thresholds improve privacy but may force many small buckets, reducing the shuffling’s effectiveness. Adaptive bucket sizing strategies are needed.
  • Assumed honest‑but‑curious server: The attack model does not consider a malicious server that can tamper with the shuffling process; integrity verification mechanisms would be a logical extension.
  • Evaluation scope: Experiments focus on image/text classification tasks; extending to larger‑scale models (e.g., Transformers) and other modalities (speech, graph data) remains open.
  • Theoretical guarantees: While empirical results are strong, formal privacy bounds that combine HDP and intra‑bucket shuffling are not yet derived. Future work could aim to prove a unified privacy accounting framework.

Authors

  • Farhin Farhad Riya
  • Olivera Kotevska
  • Jinyuan Stella Sun

Paper Information

  • arXiv ID: 2606.02563v1
  • Categories: cs.LG, cs.CR, cs.DC
  • Published: June 1, 2026
  • PDF: Download PDF
0 views
Back to Blog

Related posts

Read more »