A reproducible study of LLM penetration-testing agents measuring how model backbone, agent scaffold, backend substrate, and scoring integrity affect exploit success.
-
Updated
Jun 25, 2026 - PHP
A reproducible study of LLM penetration-testing agents measuring how model backbone, agent scaffold, backend substrate, and scoring integrity affect exploit success.
Fully autonomous AI hacker to find actual exploits in your web apps. Shannon has achieved a 96.15% success rate on the hint-free, source-aware XBOW Benchmark.
The Ninja Tables plugin for WordPress (versions < 4.1.9) contains a critical vulnerability in the AJAX action ninja_table_force_download.
Add a description, image, and links to the xbow topic page so that developers can more easily learn about it.
To associate your repository with the xbow topic, visit your repo's landing page and select "manage topics."