Lessons Learned

2026-01-17: Blocking --no-verify in PreToolUse Hooks

Discovery: Regex Pattern Matching vs Commit Message Content

Issue: Initial regex blocked commits where commit MESSAGE contained "--no-verify" as text
Example: git commit -m "block --no-verify" was incorrectly blocked
Root cause: Regex matched anywhere in the command string, including quoted message content

Solution: Strip quoted strings BEFORE running pattern match:

CMD_NO_QUOTES=$(echo "$COMMAND" | sed 's/"[^"]*"//g' | sed "s/'[^']*'//g")

Discovery: HEREDOC Expansion Timing

Issue: When using HEREDOC for commit messages, shell expands before hook sees it
Example: git commit -m "$(cat <<'EOF'\ntext\nEOF)" becomes a single string
Challenge: The expanded text isn't properly quoted in the final command
Workaround: Avoid mentioning blocked patterns literally in commit messages
Better approach: Use alternative phrasing like "hook bypass flags"

Discovery: Two-Stage Hook Processing

Pattern: Guard check should run BEFORE replacement transformation

Flow:

1. Parse JSON input
2. Guard check (block if dangerous) -> EXIT if blocked
3. Replacement transformation (text substitution)
4. Return modified JSON

Benefit: Blocked commands never reach replacement stage, reducing complexity

Best Practice: Testing Hook Changes

When modifying hook scripts, test in this order:

Test blocked patterns are blocked:

echo '{"tool_name":"Bash","tool_input":{"command":"git commit --no-verify"}}' | ~/.claude/hooks/pre_tool_use.sh
# Should return deny decision

Test allowed patterns pass through:

echo '{"tool_name":"Bash","tool_input":{"command":"git commit -m \"test\""}}' | ~/.claude/hooks/pre_tool_use.sh
# Should return original or replacement

Test edge cases (blocked text in message):

echo '{"tool_name":"Bash","tool_input":{"command":"git commit -m \"mention --no-verify\""}}' | ~/.claude/hooks/pre_tool_use.sh
# Should pass through (it's in the message, not a flag)

Pitfall: The Hook Tests Itself

Irony: The hook will block you from committing changes that mention "--no-verify"
Symptom: Commit fails with "BLOCKED: --no-verify bypasses git hooks"
Workaround: Use euphemisms in commit messages ("hook bypass flags" instead of "--no-verify")
Lesson: Be careful with guard patterns that match documentation strings

Debugging Approach That Worked

Initial regex too aggressive (matched message content)
Added sed to strip quoted strings before pattern match
Tested with echo piping JSON through hook script
Verified both blocked and allowed cases work correctly
Used simpler commit message to avoid self-blocking

Recommendation: Separate Guard and Replace Scripts

Current pre_tool_use.sh does two things: guard + replace
Consider splitting into:
- ~/.claude/hooks/git_guard.sh - blocks dangerous commands
- ~/.claude/hooks/text_replace.sh - transforms text via knowledge graph
Would make testing and maintenance easier

2026-01-14: Troubleshooting Silent Hook Failures

Discovery: Fail-Open Design Hides Missing Dependencies

Issue: terraphim hook wasn't working but no errors were shown
Root cause: terraphim-agent binary was not installed at ~/.cargo/bin/terraphim-agent
Hook behavior: Silently exits with success (line 40: [ -z "$AGENT" ] && exit 0)
User experience: Commands execute normally, but text replacement doesn't happen
Lesson: Fail-open design is great for reliability but terrible for debugging

Discovery: Installation Requires Two Steps

Step 1: Install the binary from GitHub releases
Step 2: Build the knowledge graph with terraphim-agent graph --role "Terraphim Engineer"
Common mistake: Installing binary but forgetting to build knowledge graph
Result: Hook runs but replacements don't happen because thesaurus is missing

Pattern:

# Install
gh release download --repo terraphim/terraphim-ai \
  --pattern "terraphim-agent-aarch64-apple-darwin" --dir /tmp
mv /tmp/terraphim-agent-aarch64-apple-darwin ~/.cargo/bin/terraphim-agent
chmod +x ~/.cargo/bin/terraphim-agent

# REQUIRED: Build knowledge graph
cd ~/.config/terraphim
terraphim-agent graph --role "Terraphim Engineer"

Discovery: Hook Testing Requires Exact JSON Format

Issue: Can't just run hook script directly - needs proper JSON input

Solution: Simulate Claude Code's JSON format:

echo '{"tool_name":"Bash","tool_input":{"command":"echo Claude Code"}}' | \
  ~/.claude/hooks/pre_tool_use.sh

Tip: Use 2>/dev/null to suppress stderr warnings in production
Tip: Use jq . to pretty-print JSON output for debugging

Discovery: terraphim-agent Stderr Warnings Are Normal

When running terraphim-agent replace, multiple WARN messages appear:
- embedded_config.json: read failed NotFound
- thesaurus_*.json: read failed NotFound
These are logged to stderr and don't affect functionality
Hook script uses 2>/dev/null to hide them from users
JSON output on stdout is always correct despite warnings

Best Practice: Diagnostic Checklist for Silent Hook Failures

When hooks aren't working, check in this order:

Binary exists:

which terraphim-agent
[ -x "$HOME/.cargo/bin/terraphim-agent" ] && echo "Found" || echo "Missing"

Binary works:
```
~/.cargo/bin/terraphim-agent --version
```
Knowledge graph exists:
```
ls -la ~/.config/terraphim/docs/src/kg/
```

Knowledge graph built:

cd ~/.config/terraphim
~/.cargo/bin/terraphim-agent graph --role "Terraphim Engineer"

Replacement works:

cd ~/.config/terraphim
echo "Claude Code" | ~/.cargo/bin/terraphim-agent replace --role "Terraphim Engineer"

Hook works:

echo '{"tool_name":"Bash","tool_input":{"command":"echo Claude Code"}}' | \
  ~/.claude/hooks/pre_tool_use.sh 2>/dev/null

Pitfall: Version Mismatch Between Binary and Documentation

Previous handover mentioned v1.4.7 from GitHub releases
This session installed v1.3.0 (latest available release)
Always check gh release list --repo terraphim/terraphim-ai for actual versions
Don't assume documentation version numbers are current

Debugging Approach That Worked

User reported "hook not triggered" (but didn't specify error)
Read hook script to understand logic flow
Identified fail-open exit at line 40: [ -z "$AGENT" ] && exit 0
Checked if binary exists: which terraphim-agent (not found)
Installed binary from GitHub releases
Built knowledge graph (required step often forgotten)
Tested each component separately (replace, guard, hook)
Verified end-to-end with full JSON input simulation

Recommendation for Future: Improve Hook Observability

Problem: Silent failures make troubleshooting require deep technical knowledge

Potential solutions:

Add debug mode that logs missing dependencies to a file
Create health check command: terraphim-agent health --check-hooks
Hook could log to ~/.claude/hooks/pre_tool_use.log when agent missing
Claude Code could show hook status in UI (installed vs active vs failing)

2026-01-06: User-Level Hooks Configuration

Discovery: crates.io vs GitHub Releases Version Gap

Issue: cargo install terraphim_agent installs v1.0.0 which lacks hook and guard commands
Solution: Install from GitHub releases (v1.4.7+) which has all features

Pattern:

gh release download --repo terraphim/terraphim-ai \
  --pattern "terraphim-agent-aarch64-apple-darwin" --dir /tmp
mv /tmp/terraphim-agent-aarch64-apple-darwin ~/.cargo/bin/terraphim-agent

Discovery: Claude Code Hook Permission Prompt Timing

Issue: Permission prompt shows ORIGINAL command before PreToolUse hook runs
Reality: The hook transformation happens AFTER approval but BEFORE execution
Example: Prompt shows "Claude Code" but commit will have "Terraphim AI"
User experience: Can be confusing - user sees original, gets transformed result

Discovery: Combined Guard + Replacement Hook Pattern

Best practice: Run guard check FIRST, then replacement

Pattern:

# Step 1: Guard - block if destructive
GUARD_RESULT=$($AGENT guard --json <<< "$COMMAND" 2>/dev/null)
if blocked; then deny; fi

# Step 2: Replacement - only if guard passed
$AGENT hook --hook-type pre-tool-use --json <<< "$INPUT"

Discovery: User-Level vs Project-Level Settings

User-level: ~/.claude/settings.local.json - applies to ALL projects
Project-level: .claude/settings.local.json - applies to specific project
Recommendation: Put hooks at user-level for consistent behavior

Discovery: Skill Permissions in settings.local.json

Issue: Skills may require explicit permission to run without prompts
Solution: Add Skill(plugin-name:skill-name) to permissions allow list

Pattern:

{
  "permissions": {
    "allow": [
      "Skill(terraphim-engineering-skills:disciplined-research)",
      "Bash(terraphim-agent:*)"
    ]
  }
}

Best Practice: Fail-Open Hook Semantics

Always use || exit 0 or || echo '{"decision":"allow"}' in hooks
If terraphim-agent is unavailable, command should pass through
Never block user workflow due to hook infrastructure issues

Debugging Approach: Test Hook Components Separately

Test terraphim-agent guard --json directly with command string
Test terraphim-agent hook --hook-type pre-tool-use --json with full JSON input
Test hook script with echo '{"tool_name":"Bash",...}' | ~/.claude/hooks/pre_tool_use.sh
Only then rely on Claude Code integration

2026-01-03: Terraphim Hooks Setup

Discovery: Knowledge Graph Term Format

Issue: Using underscores in filenames (e.g., bun_install.md) produces underscored output
Solution: Use spaces in filenames (e.g., "bun install.md") for proper output
File naming: The filename (without .md extension) becomes the replacement text, NOT the heading

Discovery: terraphim-agent Requires Working Directory

Issue: terraphim-agent looks for docs/src/kg/ relative to current working directory
Solution: Change to ~/.config/terraphim before running, or create symlinks

Hook pattern:

cd ~/.config/terraphim 2>/dev/null || exit 0
terraphim-agent hook --hook-type pre-tool-use --json <<< "$INPUT" 2>/dev/null

Discovery: Git Hook Path Resolution

Issue: Git hooks receive relative paths that break after cd to another directory

Solution: Convert to absolute path before changing directories:

COMMIT_MSG_FILE="$(cd "$(dirname "$1")" && pwd)/$(basename "$1")"

Bug: terraphim-agent Case and Over-Replacement

Issue #394: terraphim-agent lowercases all replacement output regardless of heading case
Issue #394: Replaces text inside URLs and compound terms aggressively
Workaround: Be specific with synonyms (only exact matches you want replaced)
Status: Bug filed at terraphim/terraphim-ai#394

Best Practice: Installing Released Binaries

# Download specific platform binary from GitHub releases
gh release download v1.3.0 --repo terraphim/terraphim-ai \
  --pattern "terraphim-agent-aarch64-apple-darwin" --dir /tmp
chmod +x /tmp/terraphim-agent-aarch64-apple-darwin
mv /tmp/terraphim-agent-aarch64-apple-darwin ~/.cargo/bin/terraphim-agent

Debugging Approach: Test Components Separately

Test terraphim-agent replace directly from config directory
Test hook script with echo input before integrating
Use --json flag for structured output debugging
Suppress stderr with 2>/dev/null in production hooks

2025-12-30: Claude Code Plugin Marketplace Structure

Discovery: marketplace.json Location

Initial assumption: marketplace.json should be at repository root
Reality: Claude Code expects marketplace.json inside .claude-plugin/ directory
Evidence: Error message explicitly showed path .claude-plugin/marketplace.json

Discovery: Marketplace Directory Naming

When adding a marketplace via claude plugin marketplace add owner/repo:

Claude Code creates directory: ~/.claude/plugins/marketplaces/{owner}-{repo}/
For terraphim/terraphim-skills this becomes terraphim-terraphim-skills
The name field in marketplace.json does NOT determine the directory name
This can cause confusion if marketplace was previously added differently

Pitfall: Documentation vs Reality

The claude-code-guide agent provided information suggesting marketplace.json at root
Always verify with actual error messages and behavior over documentation

Best Practice: Plugin Validation

Always run claude plugin validate . before attempting installation
Validation passing does not guarantee marketplace discovery will work
Marketplace discovery has separate requirements from plugin validation

Debugging Approach That Worked

Check exact error message path
Inspect ~/.claude/plugins/marketplaces/ to see existing installations
Compare expected vs actual directory structures
Trace the naming convention (owner-repo format)

Recommendation for Future

When creating a Claude Code plugin marketplace:

Name your GitHub repo to match desired marketplace name
Or accept that marketplace directory will be {owner}-{repo} format
Consider using local path installation during development:
```
claude plugin marketplace add /path/to/local/plugin
```

2026-01-30: OpenCode Skills vs Agents Architecture

Discovery: Skills and Agents Are Different Concepts in OpenCode

Skills: Reusable prompts/instructions, managed by skills.sh, cross-platform
Agents: Platform-specific orchestration configs with different YAML schema
Key difference: skills.sh only manages skills, NOT agents
Consequence: Reinstalling skills does not fix agent configuration issues

Discovery: OpenCode Agent YAML Format Differs from Claude Code

Claude Code format (in repo agents/):

---
name: disciplined-implementation
description: |
  Phase 3 description...
tools: Read, Write, Edit, Glob, Grep, Bash, TodoWrite, Task
---

OpenCode format (in ~/.config/opencode/agent/):

---
description: Phase 3 description...
mode: primary
temperature: 0.2
tools:
  write: true
  edit: true
  bash: true
permission:
  write: allow
  edit: allow
  bash: allow
---

Tools: String list vs object with booleans
Mode: OpenCode has mode: primary|subagent (subagent = hidden from list)
Model: OpenCode allows model: provider/model-name (can cause errors if invalid)

Discovery: Invalid Model Reference Causes Silent Failure

Issue: model: xai/grok-code-fast in agent config caused OpenCode startup error
Symptom: "Configuration is invalid... expected record, received string tools"
Misleading: Error message mentioned "tools" but root cause was invalid model
Fix: Remove the model: line entirely to use default model

Discovery: OpenCode Path Plural vs Singular Confusion

Old docs: ~/.config/opencode/skill/ (singular) - from oh-my-opencode
New docs: ~/.config/opencode/skills/ (plural) - current OpenCode standard
skills.sh: Uses skills/ (plural) - CORRECT
Lesson: Always verify against current official documentation, not community plugins

Discovery: GitHub Issues Often Have Outdated Resolutions

Issue #54: Reported singular vs plural path discrepancy
Resolution: Closed as "fixed" but OpenCode changed their expectation instead
Lesson: Check if the software changed to match the tool, not vice versa

Best Practice: Debugging OpenCode Agent Issues

Check for startup errors:
```
opencode 2>&1 | grep -i error
```
Verify agent YAML format:
- No name: field (use description: instead)
- tools: must be object, not string
- mode: should be primary to appear in list

Check for invalid model references:

grep "model:" ~/.config/opencode/agent/*.md

Test agent visibility:
- Start OpenCode, press Tab to cycle through agents
- If agent missing, check mode: subagent (hidden)

Pitfall: Fixing Repo Doesn't Fix Local Config

Issue: Fixed skills in repo, but OpenCode agents are in local config
Location of agents: ~/.config/opencode/agent/ (not in repo)
Source: ~/private_agents_settings/opencode/agent/ (separate from skills repo)
Lesson: Know which config files are repo-managed vs locally-managed

Recommendation: Consider Adding OpenCode Agents to Repo

Current state:

Skills: In repo, managed by skills.sh
Agents (Claude Code format): In repo agents/
Agents (OpenCode format): NOT in repo, local only

Potential improvement:

Add opencode-agents/ directory with OpenCode-format agents
Or create conversion script from Claude Code format to OpenCode format
Document the installation process for OpenCode agents separately

2026-02-22: Hook Fixes and Judge Testing

Technical Discoveries

PreToolUse Hook JSON Requirement: Claude Code expects valid JSON output from PreToolUse hooks. When the hook returned exit code 1 with no output, Claude showed "just error" with no details.
set -e and Command Substitution: The hook used set -euo pipefail which caused the script to exit when $AGENT hook returned non-zero. Fix: Add || true to capture output without exiting.
```
AGENT_OUTPUT=$($AGENT hook ... 2>/dev/null) || true
```
terraphim-agent guard Output Format: The guard command outputs text ("BLOCKED: ...") not JSON when blocking. The hook must handle both formats.
Free Model Names: Not all models with "free" in the name exist. Verified free models:
- opencode/gpt-5-nano
- opencode/glm-5-free
- opencode/minimax-m2.5-free
- opencode/trinity-large-preview-free
Pre-push Hook Path Issue: When pre-push-judge.sh is symlinked to .git/hooks/pre-push, SCRIPT_DIR becomes .git/hooks/ but run-judge.sh is in automation/judge/. Fixed by using relative path: ${SCRIPT_DIR}/../../automation/judge/run-judge.sh

Debugging Approaches

Hook Testing: Test hooks directly with echo:

echo '{"tool_name":"Bash","tool_input":{"command":"ls"}}' | ~/.claude/hooks/pre_tool_use.sh

Tmux for Interactive Commands: When stdin is not a TTY, use tmux:

command tmux new-session -d -s test
command tmux send-keys -t test "ssh host" Enter
command tmux capture-pane -t test -p

Bash -x with Redirect: For debugging scripts that process stdin:
```
( bash -x script.sh ) > /tmp/debug.log 2>&1
```

Pitfalls to Avoid

Don't assume kimi-k2.5-free exists - verify model names with opencode models
Don't forget || true when capturing output from commands that may fail
Don't use chmod +x on scripts when hooks block it - use bash script.sh
Don't assume symlinks resolve paths the same as direct execution

Best Practices

Always return JSON from PreToolUse hooks - Claude requires valid JSON
Fail-open design - If terraphim-agent is not available, allow the command
Verify free models - Check opencode models | grep free before configuring
Test with real files - Judge evaluation on actual files reveals issues

FilesExpand file tree

lessons-learned.md

Latest commit

History

lessons-learned.md

File metadata and controls

Lessons Learned

2026-01-17: Blocking --no-verify in PreToolUse Hooks

Discovery: Regex Pattern Matching vs Commit Message Content

Discovery: HEREDOC Expansion Timing

Discovery: Two-Stage Hook Processing

Best Practice: Testing Hook Changes

Pitfall: The Hook Tests Itself

Debugging Approach That Worked

Recommendation: Separate Guard and Replace Scripts

2026-01-14: Troubleshooting Silent Hook Failures

Discovery: Fail-Open Design Hides Missing Dependencies

Discovery: Installation Requires Two Steps

Discovery: Hook Testing Requires Exact JSON Format

Discovery: terraphim-agent Stderr Warnings Are Normal

Best Practice: Diagnostic Checklist for Silent Hook Failures

Pitfall: Version Mismatch Between Binary and Documentation

Debugging Approach That Worked

Recommendation for Future: Improve Hook Observability

2026-01-06: User-Level Hooks Configuration

Discovery: crates.io vs GitHub Releases Version Gap

Discovery: Claude Code Hook Permission Prompt Timing

Discovery: Combined Guard + Replacement Hook Pattern

Discovery: User-Level vs Project-Level Settings

Discovery: Skill Permissions in settings.local.json

Best Practice: Fail-Open Hook Semantics

Debugging Approach: Test Hook Components Separately

2026-01-03: Terraphim Hooks Setup

Discovery: Knowledge Graph Term Format

Discovery: terraphim-agent Requires Working Directory

Discovery: Git Hook Path Resolution

Bug: terraphim-agent Case and Over-Replacement

Best Practice: Installing Released Binaries

Debugging Approach: Test Components Separately

2025-12-30: Claude Code Plugin Marketplace Structure

Discovery: marketplace.json Location

Discovery: Marketplace Directory Naming

Pitfall: Documentation vs Reality

Best Practice: Plugin Validation

Debugging Approach That Worked

Recommendation for Future

2026-01-30: OpenCode Skills vs Agents Architecture

Discovery: Skills and Agents Are Different Concepts in OpenCode

Discovery: OpenCode Agent YAML Format Differs from Claude Code

Discovery: Invalid Model Reference Causes Silent Failure

Discovery: OpenCode Path Plural vs Singular Confusion

Discovery: GitHub Issues Often Have Outdated Resolutions

Best Practice: Debugging OpenCode Agent Issues

Pitfall: Fixing Repo Doesn't Fix Local Config

Recommendation: Consider Adding OpenCode Agents to Repo

2026-02-22: Hook Fixes and Judge Testing

Technical Discoveries

Debugging Approaches

Pitfalls to Avoid

Best Practices