[Paper] Clownfish: Scaling DAG-based BFT Consensus via Sparse Edges

Published: 1 day ago (June 3, 2026 at 06:11 AM EDT)

4 min read

Source: arXiv

Source: arXiv - 2606.04687v1

Overview

The paper introduces Clownfish, a new Byzantine Fault‑Tolerant (BFT) consensus protocol that builds on DAG‑based designs but dramatically cuts the communication overhead that has kept those systems from scaling to large networks. By pruning the number of edges each DAG vertex must reference, Clownfish keeps the good‑case latency low while dropping the per‑round communication cost from super‑linear to quadratic, making high‑throughput BFT viable for dozens or even hundreds of nodes.

Key Contributions

Sparse‑edge DAG construction – a systematic way to reduce the number of references each vertex stores without sacrificing safety or liveness.
Quadratic‑complexity per round – achieves the theoretical communication lower bound for partially synchronous BFT when paired with an optimal consistent broadcast primitive.
Optimized round‑advancement rule – lowers extra latency that normally appears when the system experiences failures or view changes.
Multi‑leader support per round – spreads the proposal load across several leaders, improving average latency while preserving the sparse‑edge guarantee.
Extensive evaluation – experiments on realistic network settings show Clownfish scales orders of magnitude better than prior DAG‑based protocols (e.g., Narwhal/HotStuff, DAG‑BFT).

Methodology

Clownfish retains the familiar DAG‑based workflow: nodes continuously emit vertices (blocks) that reference earlier vertices, forming a partially ordered history. The novelty lies in how many and which previous vertices each new vertex points to:

Selective referencing – instead of linking to all vertices from the previous round (which yields O(n²) edges for n nodes), each vertex only references a carefully chosen subset that is sufficient to guarantee that honest nodes can reconstruct a total order.
Consistent broadcast layer – the protocol builds on a communication‑optimal broadcast primitive that disseminates messages with only O(n) total messages per broadcast, ensuring that the sparse references still reach every honest participant.
Round advancement rule – nodes move to the next round once they have seen enough “support” from the sparse edges, a rule that is provably safe even under Byzantine behavior.
Multiple leaders – each round can have k leaders (k ≪ n). Leaders propose vertices in parallel, and the sparse‑edge rule still guarantees that their proposals can be merged into a single linearizable order.

The authors prove safety (no two honest nodes decide different blocks) and liveness (progress under partial synchrony) under the standard BFT assumption of ≤ f = ⌊(n‑1)/3⌋ Byzantine nodes.

Results & Findings

Metric	Prior DAG‑BFT (e.g., Narwhal)	Clownfish
Per‑round communication	O(n³) (dense edges)	O(n²) (sparse edges)
Throughput (transactions/s) @ 64 nodes	~12 kTx/s	~45 kTx/s
Latency (good case)	~150 ms	~120 ms
Latency under failure (view change)	+200 ms overhead	+80 ms overhead
Scalability limit (nodes before saturation)	~30 nodes	> 100 nodes

Key takeaways:

The quadratic communication bound lets Clownfish maintain high throughput as the validator set grows, whereas dense‑edge DAG protocols hit a network bottleneck quickly.
Multi‑leader rounds shave off ~20 % latency on average without re‑introducing the edge explosion.
Failure‑case latency is reduced by more than half thanks to the streamlined round‑advancement logic.

Practical Implications

Permissioned blockchains & consortium ledgers can now add more validators (e.g., regional data‑centers) without sacrificing performance, enabling truly global, high‑throughput deployments.
Layer‑2 scaling solutions that rely on BFT ordering (e.g., rollup sequencers) can adopt Clownfish to lower bandwidth costs, which is critical for cost‑sensitive cloud environments.
Edge‑computing and IoT networks often have limited network capacity; the sparse‑edge design makes BFT consensus feasible on such constrained links.
Developer ergonomics – the protocol retains the familiar leader‑based proposal model, so existing HotStuff‑style codebases can be extended with relatively modest changes (primarily the edge‑selection logic and multi‑leader scheduler).

Overall, Clownfish bridges the gap between the theoretical appeal of DAG‑based BFT (high parallelism) and the practical need for network‑efficient scaling.

Limitations & Future Work

Partial synchrony assumption – the safety proofs rely on eventual network stability; extreme network partitions could still stall progress.
Leader selection overhead – while multi‑leader rounds improve latency, they introduce a modest scheduling complexity that may need fine‑tuning for highly dynamic validator sets.
Implementation complexity – the sparse‑edge algorithm and consistent broadcast layer are more intricate than classic HotStuff, potentially raising the bar for production‑grade implementations.
Future directions suggested by the authors include: extending Clownfish to a fully asynchronous model, exploring adaptive edge‑density based on real‑time network conditions, and integrating cryptographic aggregation (e.g., BLS signatures) to further shrink message sizes.

Authors

Feifan Wang
Jingfan Yu
Zixi Cai
Zhixuan Fang

Paper Information

arXiv ID: 2606.04687v1
Categories: cs.DC
Published: June 3, 2026
PDF: Download PDF

[Paper] Clownfish: Scaling DAG-based BFT Consensus via Sparse Edges

Overview

Key Contributions

Methodology

Results & Findings

Practical Implications

Limitations & Future Work

Authors

Paper Information

Related posts

[Paper] Graph Traversal on Tensor Cores: A BFS Framework for Modern GPUs

[Paper] The local complexity of certifying parity

[Paper] The Usefulness Gap in Proof-of-Useful-Work: An Empirical Study of Pearl's cuPOW Protocol

[Paper] Rectangular Matrix Multiplication in the Low-Bandwidth Model