Question 1

What is pwnkit?

Accepted Answer

pwnkit is an open-source agentic framework for autonomous security research. It uses 4 AI agents (Discover, Attack, Verify, Report) to find and prove vulnerabilities in LLM endpoints, MCP servers, npm packages, source code, and web applications.

Question 2

How does pwnkit eliminate false positives?

Accepted Answer

pwnkit's Verify agent independently re-exploits every finding. If it can't reproduce the vulnerability, the finding is killed as a false positive. Only confirmed vulnerabilities with working proof-of-concept code make it into the final report.

Question 3

How much does pwnkit cost?

Accepted Answer

pwnkit is free and open source (MIT license). The only cost is the AI API usage — quick scans cost ~$0.05, default scans ~$0.15, and deep scans ~$1.00. Free tier models are also supported.

Question 4

What can pwnkit scan?

Accepted Answer

pwnkit scans 5 attack surfaces: LLM endpoints (ChatGPT, Claude, custom chatbots), MCP servers (tool schemas, auth, poisoning), npm packages (supply chain, malicious code), source code (local repos or GitHub URLs), and web applications (SQLi, XSS, SSRF, auth bypass).

Feature	pwnkit	promptfoo (acquired by OpenAI)	garak	nuclei	Semgrep
Autonomous multi-agent	Agentic pipeline	—	—	—	—
Verification (no false positives)	Re-exploits	—	—	—	—
LLM endpoint scanning	✓	✓	✓	—	—
MCP server security	✓	—	—	—	—
npm package audit	✓	—	—	—	Rules
Source code review	AI-powered	—	—	—	Rules
Web/API scanning	✓	—	—	✓	—
AI attack coverage	30+ agentic	Partial	Partial	—	—
Zero config	npx	YAML	Python	Templates	Config
Independent	✓	Acquired	✓	✓	VC-backed
Open source	MIT	OpenAI-owned	OSS	MIT	LGPL

AI writes the code.
pwnkit hacks it.

General-purpose autonomous pentesting.

LLM Endpoints

MCP Servers

npm Packages

Source Code

Web Apps

Just give it a target.

One command, zero config

Zero false positives

$0.05 per CI scan

LLM agnostic

How it compares

Drops into your CI/CD

pwnkit reviews its own source code

Built from real security research

Stop guessing.
Start proving.

AI writes the code.pwnkit hacks it.