[Paper] Clownfish: Scaling DAG-based BFT Consensus via Sparse Edges

Published: (June 3, 2026 at 06:11 AM EDT)
4 min read
Source: arXiv

Source: arXiv - 2606.04687v1

Overview

The paper introduces Clownfish, a new Byzantine Fault‑Tolerant (BFT) consensus protocol that builds on DAG‑based designs but dramatically cuts the communication overhead that has kept those systems from scaling to large networks. By pruning the number of edges each DAG vertex must reference, Clownfish keeps the good‑case latency low while dropping the per‑round communication cost from super‑linear to quadratic, making high‑throughput BFT viable for dozens or even hundreds of nodes.

Key Contributions

  • Sparse‑edge DAG construction – a systematic way to reduce the number of references each vertex stores without sacrificing safety or liveness.
  • Quadratic‑complexity per round – achieves the theoretical communication lower bound for partially synchronous BFT when paired with an optimal consistent broadcast primitive.
  • Optimized round‑advancement rule – lowers extra latency that normally appears when the system experiences failures or view changes.
  • Multi‑leader support per round – spreads the proposal load across several leaders, improving average latency while preserving the sparse‑edge guarantee.
  • Extensive evaluation – experiments on realistic network settings show Clownfish scales orders of magnitude better than prior DAG‑based protocols (e.g., Narwhal/HotStuff, DAG‑BFT).

Methodology

Clownfish retains the familiar DAG‑based workflow: nodes continuously emit vertices (blocks) that reference earlier vertices, forming a partially ordered history. The novelty lies in how many and which previous vertices each new vertex points to:

  1. Selective referencing – instead of linking to all vertices from the previous round (which yields O(n²) edges for n nodes), each vertex only references a carefully chosen subset that is sufficient to guarantee that honest nodes can reconstruct a total order.
  2. Consistent broadcast layer – the protocol builds on a communication‑optimal broadcast primitive that disseminates messages with only O(n) total messages per broadcast, ensuring that the sparse references still reach every honest participant.
  3. Round advancement rule – nodes move to the next round once they have seen enough “support” from the sparse edges, a rule that is provably safe even under Byzantine behavior.
  4. Multiple leaders – each round can have k leaders (k ≪ n). Leaders propose vertices in parallel, and the sparse‑edge rule still guarantees that their proposals can be merged into a single linearizable order.

The authors prove safety (no two honest nodes decide different blocks) and liveness (progress under partial synchrony) under the standard BFT assumption of ≤ f = ⌊(n‑1)/3⌋ Byzantine nodes.

Results & Findings

MetricPrior DAG‑BFT (e.g., Narwhal)Clownfish
Per‑round communicationO(n³) (dense edges)O(n²) (sparse edges)
Throughput (transactions/s) @ 64 nodes~12 kTx/s~45 kTx/s
Latency (good case)~150 ms~120 ms
Latency under failure (view change)+200 ms overhead+80 ms overhead
Scalability limit (nodes before saturation)~30 nodes> 100 nodes

Key takeaways:

  • The quadratic communication bound lets Clownfish maintain high throughput as the validator set grows, whereas dense‑edge DAG protocols hit a network bottleneck quickly.
  • Multi‑leader rounds shave off ~20 % latency on average without re‑introducing the edge explosion.
  • Failure‑case latency is reduced by more than half thanks to the streamlined round‑advancement logic.

Practical Implications

  • Permissioned blockchains & consortium ledgers can now add more validators (e.g., regional data‑centers) without sacrificing performance, enabling truly global, high‑throughput deployments.
  • Layer‑2 scaling solutions that rely on BFT ordering (e.g., rollup sequencers) can adopt Clownfish to lower bandwidth costs, which is critical for cost‑sensitive cloud environments.
  • Edge‑computing and IoT networks often have limited network capacity; the sparse‑edge design makes BFT consensus feasible on such constrained links.
  • Developer ergonomics – the protocol retains the familiar leader‑based proposal model, so existing HotStuff‑style codebases can be extended with relatively modest changes (primarily the edge‑selection logic and multi‑leader scheduler).

Overall, Clownfish bridges the gap between the theoretical appeal of DAG‑based BFT (high parallelism) and the practical need for network‑efficient scaling.

Limitations & Future Work

  • Partial synchrony assumption – the safety proofs rely on eventual network stability; extreme network partitions could still stall progress.
  • Leader selection overhead – while multi‑leader rounds improve latency, they introduce a modest scheduling complexity that may need fine‑tuning for highly dynamic validator sets.
  • Implementation complexity – the sparse‑edge algorithm and consistent broadcast layer are more intricate than classic HotStuff, potentially raising the bar for production‑grade implementations.
  • Future directions suggested by the authors include: extending Clownfish to a fully asynchronous model, exploring adaptive edge‑density based on real‑time network conditions, and integrating cryptographic aggregation (e.g., BLS signatures) to further shrink message sizes.

Authors

  • Feifan Wang
  • Jingfan Yu
  • Zixi Cai
  • Zhixuan Fang

Paper Information

  • arXiv ID: 2606.04687v1
  • Categories: cs.DC
  • Published: June 3, 2026
  • PDF: Download PDF
0 views
Back to Blog

Related posts

Read more »