E2B RFC/OWASP Auditor

Chat-driven API auditor that uses E2B sandboxes and MCP servers to audit HTTP APIs against RFCs and OWASP Top 10 - without baking any of this complexity into the application itself.

Overview

The Problem

Building an API security auditor typically requires:

Hardcoding security rules that quickly become outdated
Managing complex infrastructure for safe code execution
Building custom integrations for knowledge bases and graph databases
Handling PII/secret sanitization to prevent data leaks

How E2B + MCP Eliminates This Complexity

Traditional Approach	Our Approach with E2B + MCP
Hardcode OWASP rules → maintain forever	MCP Perplexity fetches latest OWASP/RFC specs on demand
Build custom sandboxing → months of work	E2B Sandbox provides secure execution in one API call
Write Memgraph driver code → handle connections, queries	MCP Memgraph exposes Cypher via standard MCP protocol
Build egress filtering → complex middleware	E2B's MCP Gateway centralizes all external calls

Result: Our entire codebase focuses on the audit logic, not infrastructure.

Verifiable Results

Every audit finding includes a "View Source" link that takes you directly to the vulnerable code in GitHub:

/user-leaky → View Source → e2bClient.ts#L171-L186

Click any finding to see the actual code being audited, proving the analysis is grounded in real endpoints.

Architecture Overview

┌─────────────────────────────────────────────────────────────┐
│  Browser                                                     │
│  └── Chat UI + Graph Visualization                          │
└─────────────┬───────────────────────────────────────────────┘
              │
              ▼
┌─────────────────────────────────────────────────────────────┐
│  Next.js API Route (/api/chat)                              │
│  └── Orchestrates audit or answers questions                │
└─────────────┬───────────────────────────────────────────────┘
              │
              ▼
┌─────────────────────────────────────────────────────────────┐
│  E2B Sandbox                                                │
│  ┌─────────────────────────────────────────────────────┐    │
│  │  Sample API (Express)                               │    │
│  │  └── 5 endpoints with intentional vulnerabilities   │    │
│  ├─────────────────────────────────────────────────────┤    │
│  │  HTTP Probe → Sanitizer → Analysis                  │    │
│  ├─────────────────────────────────────────────────────┤    │
│  │  E2B MCP Gateway (built-in)                         │    │
│  │  ├── Perplexity MCP → Live RFC/OWASP lookup        │    │
│  │  └── Memgraph MCP → Knowledge graph storage         │    │
│  └─────────────────────────────────────────────────────┘    │
└─────────────┬───────────────────────────────────────────────┘
              │
              ▼
┌─────────────────────────────────────────────────────────────┐
│  Groq LLM (compound-beta)                                   │
│  └── Analyzes sanitized exchanges + specs → findings        │
└─────────────────────────────────────────────────────────────┘

Why This Architecture Matters

E2B Sandbox - The API runs in complete isolation. No risk of malicious code affecting the host. The sandbox includes a built-in MCP gateway, so we don't manage any MCP server infrastructure.
MCP Perplexity - Instead of hardcoding "OWASP A01 is Broken Access Control", we ask Perplexity at runtime. The knowledge is always current.
MCP Memgraph - The specs we discover get stored in a graph. Over time, this builds a knowledge base of RFC→OWASP relationships without us writing graph driver code.
Groq - Fast inference for analyzing the HTTP exchanges against the discovered specs.

Key Features

Real-time Graph Population

As you chat about RFC or OWASP topics, the knowledge graph grows:

Ask "What is RFC 7231?"
The response mentions RFC 7231, 7230, 7232
These specs are automatically added to Memgraph
The graph visualization updates in real-time

Live Chat on Graph Page

The /graph page includes a chat sidebar. Ask questions while watching the graph populate - this demonstrates the real-time nature of the system.

No Hardcoded Data

Check these files to verify:

packages/auditor-core/src/config/dynamicOwaspFetcher.ts - OWASP categories fetched from Perplexity MCP
packages/auditor-core/src/graphContext.ts - RFC/OWASP extraction from live responses
EMPTY_CATEGORIES and EMPTY_FALLBACK_CATEGORIES - defaults are empty arrays, not hardcoded lists

Source Code Verification

Each audit finding links to the exact lines in the codebase:

Endpoint	Issue	Source
`/user-leaky`	PII Exposure	e2bClient.ts#L171-L186
`/debug-error`	Stack Trace Leak	e2bClient.ts#L188-L209
`/items-injection`	SQL Injection	e2bClient.ts#L211-L231
`/cors-wildcard`	CORS Misconfiguration	e2bClient.ts#L233-L255

Running the Demo

Prerequisites

GitHub Codespaces (recommended) or Docker + Node.js 20+
API keys for: E2B_API_KEY, GROQ_API_KEY, PERPLEXITY_API_KEY

Option 1: GitHub Codespaces (Easiest)

Add secrets to GitHub Settings → Codespaces → Secrets:
- E2B_API_KEY
- GROQ_API_KEY
- PERPLEXITY_API_KEY
Click Code → Codespaces → Create codespace on main
Wait for the container to build (~2-3 minutes). Docker Compose starts only Memgraph + Memgraph MCP inside the Codespace; the web app runs directly in the devcontainer (not as a Docker service).
The web app auto-starts on port 3000 via the devcontainer postStartCommand (pnpm dev from apps/web), so the browser preview just works. There is no pnpm build step in Codespaces—the app stays in dev mode for fast refresh.

Option 2: Local dev (Docker only for Memgraph)

git clone https://github.com/airnub-labs/rfc-refactor
cd rfc-refactor
cp .env.example apps/web/.env.local
# Edit apps/web/.env.local with your API keys (Next.js reads this automatically from that folder)

# Start graph deps only
docker compose -f docker/docker-compose.yml up memgraph memgraph-mcp

# In another terminal, start the app directly (best DX)
pnpm install
pnpm --filter web dev

Open http://localhost:3000. Docker Compose does not read .env.local, so there's no need to copy it to .env; the app reads .env.local when you run pnpm dev.

Note: Next.js only loads environment variables from apps/web/.env.local (or the shell environment). A .env.local in the repo root will not be picked up by the web app.

Demo Walkthrough

1. Run an Audit (2 minutes)

Open the app
Click "Run Audit"
Watch the audit run:
- Sandbox creation
- Endpoint probing
- Spec discovery via MCP
- Compliance analysis
View the results with "View Source" links

2. Test the Graph Population (1 minute)

Click "View Graph" in the header
In the chat sidebar, ask: "What is RFC 7231?"
Watch the graph add RFC 7231 node
Ask: "How does OWASP A03 relate to injection?"
Watch OWASP node appear

3. Verify Source Links (30 seconds)

In the audit results, click "View Source" on any finding
Verify the GitHub code matches the issue description
This proves findings are real, not hallucinated

Technical Highlights

Complexity Avoided

What We Didn't Build	Why
Custom sandboxing solution	E2B provides this
Perplexity API integration	MCP server handles it
Memgraph driver/connection pooling	MCP server handles it
MCP server hosting	E2B's built-in gateway
OWASP/RFC knowledge base	Fetched live via MCP

Security Features

PII Sanitization: @redactpii/node + regex patterns strip sensitive data before egress
Cypher Injection Prevention: All graph queries use proper escaping and ID validation
Header Filtering: Sensitive headers (Authorization, Cookie, etc.) removed from probes

Code Quality

TypeScript throughout
Aspect-oriented design for cross-cutting concerns (sanitization, logging)
Clean separation: auditor-core package is framework-agnostic

Project Structure

apps/web/                    # Next.js app
  src/app/
    page.tsx                 # Chat UI with audit results
    graph/page.tsx           # Graph visualization with chat sidebar
    api/chat/route.ts        # Main API endpoint
    api/graph/route.ts       # Graph data endpoint

packages/auditor-core/       # Core audit logic
  src/
    auditEngine.ts           # Orchestration
    e2bClient.ts             # E2B sandbox + sample API (L158-255)
    graphContext.ts          # Spec discovery + Memgraph upsert
    groqClient.ts            # LLM analysis + source location mapping
    mcpClient.ts             # MCP gateway client
    sanitizeHttp.ts          # HTTP exchange sanitization
    config/
      dynamicOwaspFetcher.ts # Live OWASP fetching via MCP

What Makes This Different

No hardcoded rules - All security knowledge is fetched live
Zero infrastructure management - E2B + MCP handles everything
Verifiable results - Source links prove findings are real
Growing knowledge base - Graph builds as you use it
Real HTTP analysis - Actual requests/responses, not static code

This demonstrates how E2B sandboxes and MCP servers can eliminate months of infrastructure work, letting developers focus on the actual problem they're solving.

Links

Repository: https://github.com/airnub-labs/rfc-refactor
E2B Documentation: https://e2b.dev/docs
MCP Specification: https://modelcontextprotocol.io

Built for the "Build MCP Agents – with Docker, Groq, and E2B" hackathon.

Name		Name	Last commit message	Last commit date
Latest commit History 119 Commits
.devcontainer		.devcontainer
apps/web		apps/web
docker		docker
packages		packages
.env.example		.env.example
.gitignore		.gitignore
AGENTS.md		AGENTS.md
ARCHITECTURE.md		ARCHITECTURE.md
DECISIONS.md		DECISIONS.md
DEMO_SCRIPT.md		DEMO_SCRIPT.md
PROMPT.md		PROMPT.md
Project Idea_ “CompliBot” – An Agent for Protocol Compliance Validation.pdf		Project Idea_ “CompliBot” – An Agent for Protocol Compliance Validation.pdf
README.md		README.md
followup_coding_agent_prompt.md		followup_coding_agent_prompt.md
package.json		package.json
pnpm-lock.yaml		pnpm-lock.yaml
pnpm-workspace.yaml		pnpm-workspace.yaml
tsconfig.json		tsconfig.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

E2B RFC/OWASP Auditor

Overview

The Problem

How E2B + MCP Eliminates This Complexity

Verifiable Results

Architecture Overview

Why This Architecture Matters

Key Features

Real-time Graph Population

Live Chat on Graph Page

No Hardcoded Data

Source Code Verification

Running the Demo

Prerequisites

Option 1: GitHub Codespaces (Easiest)

Option 2: Local dev (Docker only for Memgraph)

Demo Walkthrough

1. Run an Audit (2 minutes)

2. Test the Graph Population (1 minute)

3. Verify Source Links (30 seconds)

Technical Highlights

Complexity Avoided

Security Features

Code Quality

Project Structure

What Makes This Different

Links

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 2

Uh oh!

Languages

airnub-labs/rfc-refactor

Folders and files

Latest commit

History

Repository files navigation

E2B RFC/OWASP Auditor

Overview

The Problem

How E2B + MCP Eliminates This Complexity

Verifiable Results

Architecture Overview

Why This Architecture Matters

Key Features

Real-time Graph Population

Live Chat on Graph Page

No Hardcoded Data

Source Code Verification

Running the Demo

Prerequisites

Option 1: GitHub Codespaces (Easiest)

Option 2: Local dev (Docker only for Memgraph)

Demo Walkthrough

1. Run an Audit (2 minutes)

2. Test the Graph Population (1 minute)

3. Verify Source Links (30 seconds)

Technical Highlights

Complexity Avoided

Security Features

Code Quality

Project Structure

What Makes This Different

Links

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 2

Uh oh!

Languages

Packages