AutoPentester: An LLM Agent-based Framework for Automated Pentesting

Note:

If you are using this work for academic purposes, please cite our paper.
If you find any incorrect / inappropriate / outdated content, please kindly consider opening an issue or a PR.

Updates

AutoPentester now supports claude.
This work has been accepted for the IEEE TrustCom 2025.

Installation

Create a virtual environment. (python3 -m venv myenv, source myenv/bin/activate)
Clone the project and install the requirements.
- git clone <repo_url>
- cd AutoPentester
- Create a virtual environment with Python pip3 3.12.3. Then install the requirements.txt inside it.
- pip3 install -r requirements.txt
- pip3 install -e .
To use OpenAI API
- Ensure that you have link a payment method to your OpenAI account.
- export your API key with export OPENAI_API_KEY="<your key here>"
- optional: override the API endpoint with export OPENAI_BASE_URL="https://your-host/v1"
To use Anthropic Claude models

export your API key with export ANTHROPIC_API_KEY="<your key here>"

To use Google Gemini models
- export your API key with export GEMINI_API_KEY="<your key here>" or export GOOGLE_API_KEY="<your key here>"
- Gemini is selected per agent in pentestgpt/config/agent_config.py
Configure agent models in pentestgpt/config/agent_config.py

Each agent name maps to a provider and model.
Supported providers are openai, anthropic, gemini, azure, gpt4all, titan, and rag.
Update the file to mix OpenAI and Claude per agent based on your preference.

To run the framework, type pentestgpt --login
You will be asked for the key for the configured provider, and then the IP address.
Do you want to continue from previous session? (y/n) -> Press n
Give a pentesting task. You can use a prompt like "I want to test the machine with the IP (targe_IP)"
Only for the first run, it will take 10 minutes to build the vectorbase of the RAG module at the beginning. Please wait until it starts its process.

Demo

Please find the demonstration video in the following link.

Log files

The processed log files are in the processed_log_files directory. The quantitative results were calculated baseed on these log files.

Survey analysis

The analysis of the survey is in the Survey_analysis directory. Run the analysis.py to plot the graphs.

Citations

If you are using this work for academic purposes, please cite our paper.

@inproceedings{ginige2025autopentester,
  title={Autopentester: An llm agent-based framework for automated pentesting},
  author={Ginige, Yasod and Niroshan, Akila and Jain, Sajal and Seneviratne, Suranga},
  booktitle={2025 IEEE 24th International Conference on Trust, Security and Privacy in Computing and Communications (TrustCom)},
  pages={163--174},
  year={2025},
  organization={IEEE}
}

Name		Name	Last commit message	Last commit date
Latest commit History 647 Commits
.devcontainer		.devcontainer
Sublist3r		Sublist3r
Survey_analysis		Survey_analysis
config		config
pentestgpt		pentestgpt
resources		resources
tasks		tasks
tests		tests
.deepsource.toml		.deepsource.toml
.gitignore		.gitignore
LICENSE.md		LICENSE.md
Makefile		Makefile
README.md		README.md
Tools_attach.py		Tools_attach.py
framework.png		framework.png
metasploit_tool_check.py		metasploit_tool_check.py
pyproject.toml		pyproject.toml
rag.py		rag.py
repitition_identifier.py		repitition_identifier.py
requirements.txt		requirements.txt
setup.py		setup.py
test.py		test.py
test_csv_gen.py		test_csv_gen.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AutoPentester: An LLM Agent-based Framework for Automated Pentesting

Updates

Installation

Demo

Log files

Survey analysis

Citations

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AutoPentester: An LLM Agent-based Framework for Automated Pentesting

Updates

Installation

Demo

Log files

Survey analysis

Citations

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages