This guide covers installation, configuration, and running your first experiment.
- Python 3.11 or higher
- OpenRouter API key (for LLM access)
- 8GB+ RAM recommended for batch experiments
git clone https://github.com/stchakwdev/Secret_H_Evals.git
cd Secret_H_Evalspip install -r requirements.txtCreate a .env file in the project root:
OPENROUTER_API_KEY=your_api_key_hereOr export directly:
export OPENROUTER_API_KEY=your_api_key_herepython run_game.py --players 5python run_game.py --batch --games 10 --players 7 --enable-db-loggingIn a separate terminal:
python check_batch_progress.py --watch| Argument | Description | Default |
|---|---|---|
--players |
Number of players (5-10) | 5 |
--batch |
Enable batch mode | False |
--games |
Number of games in batch | 10 |
--model |
LLM model identifier | deepseek/deepseek-v3.2-exp |
--enable-db-logging |
Save to SQLite database | False |
--batch-tag |
Descriptive tag for batch | None |
Supported models via OpenRouter:
# DeepSeek (cost-effective)
python run_game.py --model deepseek/deepseek-v3.2-exp
# Claude
python run_game.py --model anthropic/claude-3.5-sonnet
# GPT-4
python run_game.py --model openai/gpt-4-turbo
# Llama
python run_game.py --model meta-llama/llama-3.1-70b-instructpython -m dashboard.app --port 8050Open http://localhost:8050 in your browser.
python scripts/generate_all_visuals.py --games 5python scripts/export_to_inspect.py --db data/games.db- Running Experiments - Detailed experiment guide
- Analyzing Results - Analysis tools tutorial
- API Reference - Module documentation