Multi-Agent LLM System for Automated Vulnerability Discovery and Reproduction

Published: May 27, 2026 at 01:42 PM EDT

Source: Hacker News

Abstract

Software vulnerabilities pose critical security threats, with nearly 50,000 CVEs reported in 2025. While Large Language Models (LLMs) show promise for automated vulnerability detection, three key challenges remain:

False positives & reproducibility – LLM‑generated vulnerability reports suffer from high false‑positive rates and lack reproducible verification.
Granularity of localization – Existing LLM‑based approaches use suboptimal granularities: function‑level analysis overlooks bugs when context becomes extensive, while line‑level analysis lacks sufficient context.
Complex reasoning – LLMs struggle to reason about vulnerabilities that involve complex cross‑function dependencies and triggering conditions.

We present FuzzingBrain V2, a multi‑agent system that addresses these gaps through four key contributions:

Fully automated vulnerability analysis built on Google’s OSS‑Fuzz, ensuring all reported vulnerabilities are fuzzer‑reproducible.
Suspicious Point, a novel control‑flow‑based abstraction for precise vulnerability localization at the optimal granularity.
Logic‑driven hierarchical function analysis with dual‑layer fuzzing, enhancing function coverage under resource constraints.
MCP‑based static and dynamic analysis tools with context engineering, improving reasoning about complex vulnerabilities.
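
The first two contributions can be illustrated with a minimal sketch. It assumes each LLM-flagged candidate is a small record tied to a control-flow condition, and that a candidate is reported only if a crashing input is found for it. The names SuspiciousPoint, Report, and confirmed_reports, and all field names, are hypothetical and do not come from the paper. The fuzzer is abstracted as a callable so the sketch stays self-contained.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Optional


@dataclass(frozen=True)
class SuspiciousPoint:
    """Hypothetical record for one candidate location (not the paper's schema)."""
    function: str  # enclosing function name
    branch: str    # control-flow condition under which the bug may trigger
    claim: str     # the model's hypothesis about the bug class


@dataclass(frozen=True)
class Report:
    """A candidate paired with the input that crashed the fuzz target."""
    point: SuspiciousPoint
    reproducer: bytes


# Assumption: the fuzzing backend is reduced to a function that returns a
# crashing input for a candidate, or None if no crash was found.
ReproduceFn = Callable[[SuspiciousPoint], Optional[bytes]]


def confirmed_reports(
    points: Iterable[SuspiciousPoint], reproduce: ReproduceFn
) -> List[Report]:
    """Keep only the candidates for which a crashing input was found."""
    reports = []
    for point in points:
        crash_input = reproduce(point)
        if crash_input is not None:
            reports.append(Report(point, crash_input))
    return reports
```

In a real pipeline the reproduce callable would build and run the project's fuzz target. Here it is left as a parameter, so candidates without a reproducer are dropped rather than reported, which is the property that removes unverified false positives.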

On the AIxCC 2025 Final Competition C/C++ dataset, FuzzingBrain V2 achieved a 90% detection rate (36 of 40 vulnerabilities). In real‑world deployment, it discovered 29 zero‑day vulnerabilities across 12 open‑source projects. All 29 were confirmed and fixed by maintainers, and two were assigned CVE IDs.
