chore(build): parser kind field fix, bump build skill v1.22.0 by anbangr · Pull Request #1445 · garrytan/gstack

anbangr · 2026-05-12T03:03:47Z

Summary

This PR contains housekeeping changes for the build skill:

- Parser fix: Reordered `kind` and `dualImpl` fields in parser.ts object literal (no functional change)
- Version bump: build/SKILL.md.tmpl frontmatter version 1.21.4 → 1.22.0
- Regenerated: build/SKILL.md host docs regenerated from template
- Tests updated: skill-md.test.ts version assertions synced to 1.22.0
- .gitignore: Added `.llm-tmp/` directory

Pre-Landing Review

No issues found.

Test Coverage

Tests: All existing tests pass (1188 pass, 0 fail).

- Update Step M3 monitor launch to use set -o pipefail and ${PIPESTATUS[0]} while teeing output to monitor-output.log
- Add Step M3.5 that scans monitor output for SKILL_FAULT_DETECTED, dedupes by resolved path (readlink), reads fault_investigator_model from configure.cm, and dispatches either GSTACK_FAULT_INVESTIGATOR_COMMAND or one background agent per non-duplicate fault
- Add validation tests for Step M3.5 content in skill-md.test.ts
- Fix pre-existing hardcoded model name in cli.ts comment

…_RUN_ID

The previous Step M3.5 implementation had a critical silent-failure bug:

1. `sed -n 's/.*file:////p'` is a malformed sed expression (4 slashes = bad flag in substitute command). `_FAULT_FILE` was always empty and the `[ -z "$_FAULT_FILE" ] && continue` guard silently skipped every fault.
2. The expression also assumed a `file://` URI format that the monitor never emits. Actual SKILL_FAULT_DETECTED events are JSON lines with a `faults[].sourceFiles[]` array (see build/orchestrator/cli.ts:5739-5741 and build/orchestrator/types.ts:15-23). No investigator would ever spawn.

Switch to jq-based JSON parsing that flattens each event into TSV rows (runId<TAB>category<TAB>file) and pass FAULT_CATEGORY + FAULT_RUN_ID env vars to the investigator alongside FAULT_FILE. Dedupe key now includes (runId, category, resolved-path) so unrelated faults across runs aren't collapsed. Log filename is suffixed with category to avoid collisions.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
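The flatten-and-dedupe logic described in this commit message can be sketched in TypeScript. This is only an illustration of the idea, not the shell/jq code in Step M3.5: the event shape (a top-level `runId` plus `faults[]` entries with `category` and `sourceFiles[]`) is an assumption pieced together from the message, and `flattenFaultEvents`, `dedupeFaults`, and the `resolve` callback (standing in for readlink) are hypothetical names.

```typescript
// Assumed event shape, inferred from the commit message only.
interface FaultEvent {
  runId?: string;
  faults?: { category?: string; sourceFiles?: string[] }[];
}

interface FaultRow {
  runId: string;
  category: string;
  file: string;
}

// Flatten monitor output into one row per (runId, category, file),
// mirroring the TSV rows the message describes.
function flattenFaultEvents(monitorOutput: string): FaultRow[] {
  const rows: FaultRow[] = [];
  for (const line of monitorOutput.split("\n")) {
    if (!line.includes("SKILL_FAULT_DETECTED")) continue;
    let event: FaultEvent;
    try {
      event = JSON.parse(line) as FaultEvent;
    } catch {
      continue; // not a JSON line, skip it
    }
    for (const fault of event.faults ?? []) {
      for (const file of fault.sourceFiles ?? []) {
        rows.push({
          runId: event.runId ?? "",
          category: fault.category ?? "",
          file,
        });
      }
    }
  }
  return rows;
}

// Dedupe on (runId, category, resolved path) so faults from different
// runs or categories are never collapsed into one.
function dedupeFaults(
  rows: FaultRow[],
  resolve: (path: string) => string = (path) => path, // stand-in for readlink
): FaultRow[] {
  const seen = new Set<string>();
  const unique: FaultRow[] = [];
  for (const row of rows) {
    const key = [row.runId, row.category, resolve(row.file)].join("\t");
    if (seen.has(key)) continue;
    seen.add(key);
    unique.push(row);
  }
  return unique;
}
```

Keying on all three fields is what keeps the same file reported by two different runs as two separate faults.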

…atch

Adds test/skill-e2e-build-fault-investigator.test.ts (periodic tier) covering the fault investigator E2E flow: mock gstack-build outputs SKILL_FAULT_DETECTED JSON, Step M3.5 dispatches GSTACK_FAULT_INVESTIGATOR_COMMAND with fault env vars, mock investigator writes report to $FAULT_PRIMARY, assertions verify report exists with PLAN_SYNTHESIS_INVALID and no source files were edited.

Registers build-fault-investigator-e2e in touchfiles.ts, selected when build/SKILL.md, skill-fault-detector.ts, or monitor.ts change.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

1. plan-selection (6 tests): `defaultActiveRunRegistryDir()` hardcoded `~/.gstack/build-state/active-runs` and ignored `GSTACK_BUILD_STATE_DIR`, causing 11 real active-run records to leak into unit tests and inflate candidate counts (turning expected "selected" into "ambiguous"). Fix: honour the env var consistently, the same way `state.ts` already does.
2. integration (3 tests): plan review subprocess called `codex` with `OPENAI_API_KEY` from the inherited `process.env`, triggering a real ~30s API call against the LLM. These tests exercise feature lifecycle, not plan review. Fix: add `--no-plan-review` to each CLI invocation.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
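A minimal sketch of the first fix, assuming the env var replaces the `~/.gstack/build-state` portion of the path and that `active-runs` stays a subdirectory of it. The real function takes no arguments; the `env` and `homeDir` parameters here are hypothetical and exist only to keep the example self-contained.

```typescript
// Hypothetical signature: the real defaultActiveRunRegistryDir() takes no
// arguments. env and homeDir are injected here purely for illustration.
function defaultActiveRunRegistryDir(
  env: Record<string, string | undefined>,
  homeDir: string,
): string {
  const override = env["GSTACK_BUILD_STATE_DIR"];
  // Assumption: the override points at the build-state directory itself.
  const stateDir =
    override !== undefined && override.length > 0
      ? override
      : `${homeDir}/.gstack/build-state`;
  return `${stateDir}/active-runs`;
}
```

With the override set to a temporary directory, unit tests no longer see active-run records from the developer's real home directory.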

…estSpec detection

Four improvements identified during code review of 3e2b8b2:

- Move `extractCoverageTarget` from cli.ts to sub-agents.ts (alongside parseCoveragePercent); re-export via import in cli.ts. Eliminates the circular-import risk when phase-runner.ts calls coverage functions.
- Fix decimal truncation in extractCoverageTarget: `(\d+)` only matched integers, silently returning 80 for targets like ≥90.5%. Changed to `([\d.]+)` + parseFloat.
- Fix `hasTestSpec` detection in buildGeminiTestSpecPrompt: was `phase.body.includes("#### Test Spec")` (fragile string match, false negative when body text differs). Now `phase.testSpecCheckboxLine !== -1` (parser already computes this, zero extra overhead).
- Wire coverage gate in RUN_TESTS handler: after GREEN tests pass and the phase has a test spec (`testSpecCheckboxLine !== -1`), call parseCoveragePercent(result.stdout, testCmd) and compare against extractCoverageTarget(phase.body). Below target → set coverageResult and route to test_fix_running. Unknown framework → log advisory, proceed.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
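The decimal fix can be illustrated with a small sketch. Only the capture group `([\d.]+)`, the use of parseFloat, and the default of 80 come from the commit messages; the rest of the pattern is an assumption based on the "Coverage target: ≥N%" line format mentioned elsewhere in this PR.

```typescript
const DEFAULT_COVERAGE_TARGET = 80;

// Assumed line format: "**Coverage target: ≥90.5%**". The surrounding
// pattern is illustrative; only the ([\d.]+) group is from the commit.
function extractCoverageTarget(phaseBody: string): number {
  const raw = phaseBody.match(/Coverage target:\s*(?:≥|>=)?\s*([\d.]+)\s*%/)?.[1];
  if (raw === undefined) return DEFAULT_COVERAGE_TARGET;
  const value = Number.parseFloat(raw);
  // Guard against captures such as "." that parse to NaN.
  return Number.isFinite(value) ? value : DEFAULT_COVERAGE_TARGET;
}
```

A target written as ≥90.5% now parses as 90.5, where the integer-only group made the function return the default of 80.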

Complete the coverage gate: `injectCoverageFlags(testCmd)` appends the appropriate flag for the detected framework before the GREEN test run, so `parseCoveragePercent` reliably finds coverage data in stdout even when projects don't pre-configure coverage in their test script.

Framework → flag mapping:

- jest → --coverage --coverageReporters text
- vitest → --coverage
- bun test → --coverage
- pytest → --cov --cov-report term-missing
- go test → -cover
- unknown → unchanged (advisory log, gate skips)

Injection is idempotent (no-op if flag already present) and only fires when the phase has a test spec (testSpecCheckboxLine !== -1); VERIFY_RED and legacy phases use the bare test command unchanged.

11 unit tests added covering each framework, idempotency, and unknowns.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
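One way to picture the mapping and the idempotency rule is the table-driven sketch below. The flags are taken from the commit message; the detection and "already present" regexes and the `CoverageRule` type are assumptions made for this example, not the code in the PR.

```typescript
interface CoverageRule {
  detect: RegExp;  // assumed way of recognising the framework in the command
  present: RegExp; // matches when a coverage flag is already in the command
  flags: string;   // flags to append, as listed in the commit message
}

const COVERAGE_RULES: CoverageRule[] = [
  { detect: /\bjest\b/, present: /(^|\s)--coverage(\s|=|$)/, flags: "--coverage --coverageReporters text" },
  { detect: /\bvitest\b/, present: /(^|\s)--coverage(\s|=|$)/, flags: "--coverage" },
  { detect: /\bbun test\b/, present: /(^|\s)--coverage(\s|=|$)/, flags: "--coverage" },
  { detect: /\bpytest\b/, present: /(^|\s)--cov(\s|=|$)/, flags: "--cov --cov-report term-missing" },
  { detect: /\bgo test\b/, present: /(^|\s)-cover(\s|=|$)/, flags: "-cover" },
];

function injectCoverageFlags(testCmd: string): string {
  const rule = COVERAGE_RULES.find((r) => r.detect.test(testCmd));
  if (rule === undefined) return testCmd;        // unknown framework: unchanged
  if (rule.present.test(testCmd)) return testCmd; // idempotent: flag already there
  return `${testCmd} ${rule.flags}`;
}
```

Calling the function twice on the same command yields the same string as calling it once, which is the idempotency property the unit tests are said to cover.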

…p (Bug D1)

Two failing tests document the bug:

1. After CRITICAL verdict, state.planReview must be persisted with status "critical_exit_pending". Currently cli.ts does not persist anything before process.exit(3), so planReview stays undefined on disk.
2. On resume with the sentinel set, the plan-review gate must still fire. The current guard (!state.planReview) is false when planReview is truthy, so the gate is skipped after the sentinel is introduced.

Two GREEN tests confirm baseline behavior: APPROVE verdict suppresses the gate; undefined planReview (first run) fires the gate.

Tests MUST fail until Feature 4 implementation lands.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Before this fix, a CRITICAL plan-review verdict caused process.exit(3) without saving any sentinel to state. On resume, !state.planReview was true → review ran again → CRITICAL again → infinite loop.

Fix:

1. Save state.planReview = { ...verdict, status: "critical_exit_pending" } before releaseLock + process.exit(3) so the sentinel survives on disk.
2. Widen the plan-review gate guard from !state.planReview to !state.planReview || state.planReview.status === "critical_exit_pending" so the gate re-fires on resume when the sentinel is present.

Tests: two new tests in phase-runner.test.ts cover both the sentinel persistence and the widened gate; 90/90 passing.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
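The two parts of this fix can be sketched as a pair of small functions. The guard expression and the "critical_exit_pending" status are quoted from the commit message; the `BuildState` and `PlanReview` types, the function names, and the "approved" status value are hypothetical and used only for illustration.

```typescript
// Hypothetical minimal types; the real state object has more fields.
interface PlanReview {
  status: string;
  summary?: string;
}

interface BuildState {
  planReview?: PlanReview;
}

// Part 1: persist a sentinel before exiting on a CRITICAL verdict.
function markCriticalExit(state: BuildState, verdict: PlanReview): void {
  state.planReview = { ...verdict, status: "critical_exit_pending" };
}

// Part 2: the widened guard. The gate fires on a first run (no planReview)
// and again on resume when the sentinel is present.
function shouldRunPlanReview(state: BuildState): boolean {
  return !state.planReview || state.planReview.status === "critical_exit_pending";
}
```

Under this sketch, a state carrying any other status (for example an approval, here spelled "approved" as an assumption) suppresses the gate.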

…g D2)

Introduces ExitError (errors.ts), thrown instead of process.exit(N) inside try/finally blocks so the finally clause runs cleanup before the process terminates.

Changes:

- errors.ts: new ExitError class (instanceof Error, numeric code field)
- cli.ts: import ExitError; replace critical_exit process.exit(3) with throw new ExitError(3); update main().catch to call process.exit(err.code) when err instanceof ExitError
- phase-runner.test.ts: 5 new tests (ExitError shape, propagation through finally, default and custom messages); 95/95 passing

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
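A minimal sketch of the pattern, under stated assumptions: the default message text and the `runWithCleanup` helper are hypothetical, and the helper returns the exit code instead of calling process.exit so the example can run anywhere.

```typescript
class ExitError extends Error {
  readonly code: number;

  // The default message here is an assumption; the commit only says
  // default and custom messages are tested.
  constructor(code: number, message: string = `exit ${code}`) {
    super(message);
    this.name = "ExitError";
    this.code = code;
    // Keeps instanceof working when compiled to older targets.
    Object.setPrototypeOf(this, ExitError.prototype);
  }
}

// Hypothetical helper: runs work(), always runs cleanup(), and maps an
// ExitError to its numeric code. A real CLI would call
// process.exit(err.code) in the catch branch instead of returning.
function runWithCleanup(work: () => void, cleanup: () => void): number {
  try {
    try {
      work();
    } finally {
      cleanup(); // runs even when work() throws ExitError
    }
    return 0;
  } catch (err) {
    if (err instanceof ExitError) return err.code;
    throw err;
  }
}
```

The point of throwing is visible in the inner `finally`: a direct process.exit inside `work` would end the process before `cleanup` ran.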

…ature 6)

applyResult() now populates phaseState.coverageResult when:

- action is RUN_TESTS
- tests are GREEN (status = "tests_green")
- extra.phaseBody is provided
- parseCoveragePercent() returns a non-null value for the stdout

Coverage below target emits an advisory warning but keeps status "tests_green" (not blocking). The target defaults to 80 when no "**Coverage target: ≥N%**" line appears in the phase body.

6 new tests in phase-runner.test.ts; 101/101 passing.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ics + test assertions

- Add errors.ts to MODULE_TEST_OWNERS in coverage-matrix.test.ts
- Fix analytics logActivity to emit "success" for exit code 13 (FINALIZATION_REQUIRED), which is a success state (pending ship), not a failure
- Fix integration test assertions: --skip-ship correctly exits 13, not 0, when features reach origin_verified (pre-existing test/impl mismatch)

…d [Phase 1.1]

RED phase TDD: 11 tests fail because the parser does not yet stamp kind: "code" on emitted phases, and existing Phase literal construction sites have no kind field (undefined fails the VALID_KINDS.includes runtime assertion).

11 tests pass immediately: direct Phase construction with explicit kind values, and PhaseKind union membership checks (both already exist in types.ts).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Add required kind: PhaseKind field to the parser factory init and to every Phase literal construction site in tests/fixtures. This ensures backward-compatible default of kind: "code" for all existing phases while the type system enforces correctness going forward.

- parser.ts: stamp kind: "code" on every emitted Phase
- state.test.ts, cli.test.ts, phase-runner.test.ts, feature-review.test.ts, cli-guardrails.test.ts, phase-kind.test.ts: add kind: "code" to all helpers and inline literals

…tations

- Fix PHASE_HEADING regex to allow optional [kind] bracket between number and colon
- Add BODY_KIND_PATTERN for HTML comment fallback
- Add IMPL_LABELS_BY_KIND and REVIEW_LABELS_BY_KIND maps for all 5 PhaseKind values
- Parser now stamps kind from heading bracket (primary), body comment (fallback), or defaults to "code"
- Inline kind-comment detection ensures kind is set before checkbox processing
- Add implCheckboxRe/reviewCheckboxRe for kind-specific checkbox matching
- Add 16 new parser tests covering all bracket annotations, HTML fallback, checkbox recognition

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
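The heading-bracket rule can be illustrated with a short sketch. The names `PhaseKind`, `VALID_KINDS`, and `PHASE_HEADING` appear in this PR, and the five kind values are taken from the commit messages; the exact heading syntax (`### Phase N [kind]: Title`), the regex itself, and `parsePhaseHeading` are assumptions made for this example. The body-comment fallback is left out because its format is not shown here.

```typescript
type PhaseKind = "code" | "writing" | "experiment" | "research" | "manual";

const VALID_KINDS: readonly PhaseKind[] = ["code", "writing", "experiment", "research", "manual"];

// Assumed heading shape: "### Phase 2 [writing]: Draft the docs".
// The bracket between the number and the colon is optional.
const PHASE_HEADING = /^#{2,4}\s+Phase\s+(\d+(?:\.\d+)*)\s*(?:\[([^\]]*)\])?\s*:\s*(.+)$/;

interface ParsedHeading {
  number: string;
  kind: PhaseKind;
  title: string;
}

function isPhaseKind(value: string): value is PhaseKind {
  return (VALID_KINDS as readonly string[]).includes(value);
}

function parsePhaseHeading(line: string): ParsedHeading | null {
  const match = PHASE_HEADING.exec(line);
  if (match === null) return null;
  const bracket = (match[2] ?? "").trim().toLowerCase();
  return {
    number: match[1] ?? "",
    // Missing or unrecognised bracket falls back to "code" in this sketch.
    kind: isPhaseKind(bracket) ? bracket : "code",
    title: (match[3] ?? "").trim(),
  };
}
```

The sketch silently defaults an unrecognised bracket to "code"; the commit messages mention a malformed-bracket warning, which is not modelled here.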

- Add IMPL_MARKER_BY_KIND and REVIEW_MARKER_BY_KIND lookup tables
- Update flipPhaseCheckboxes signature to accept optional kind?: PhaseKind
- Derives implMarker/reviewMarker from kind ?? "code" (backward compat)
- Update reconcilePhaseCheckboxes to pass phase.kind
- Update both cli.ts call sites (lines ~3870, ~4282) to pass kind: phase.kind
- Add 9 kind-aware mutator tests covering all 5 kinds + error cases + backward compat

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…(p as any) cast

- Merge origin/main (13 commits behind, real conflict in parser.test.ts)
- Resolve parser.test.ts conflict in favor of main's comprehensive PhaseKind test matrix (writing, experiment, research, manual, malformed bracket warning, HTML comment fallback, checkbox line tests)
- Drop branch's (p as any).kind cast in parser.ts; use typed p.kind ?? code
- Regenerate SKILL.md docs for all hosts after merge

anbangr added 30 commits April 22, 2026 19:18

Add architecture-focused planning review skills

190e6c4

docs: add GStack Playbook for workflow guidance and skill reference

6638051

Merge remote-tracking branch 'upstream/main'

2ad9e73

# Conflicts: # README.md

Merge origin/main into main

946e9f5

feat: add /implement autonomous coding skill

d3b148b

- Adds implement/SKILL.md.tmpl to execute plans in phases - Updates GSTACK_PLAYBOOK.md to include the new workflow

feat(implement): add model routing discipline for gemini and sonnet

7b6bc1b

feat(implement): add living implementation plan synthesis and checkin…

073eee2

…g loop

feat(implement): add feature branching and auto-deploy

c16834b

feat(implement): add opus and codex consensus for ambiguous review ch…

6ed7d95

…oices

feat(implement): process entire plan and use proper plan naming

7039ec0

feat(implement): add verbose state narration and autonomous continuity

b15bdf2

fix(implement): enforce automatic deploy skill invocation without asking

43300a9

feat(implement): use sub-agent delegation to prevent context compaction

87040dc

feat(implement): add iterative github ci/cd checking to sub-agent ins…

318504f

…tructions

fix(implement): explicit bash tool instruction for ship skill invocation

1bacade

feat(implement): mandate autonomous execution of skills via bash tool

4d6a8a2

feat(implement): run both ship and land-and-deploy sequentially

f3c6208

feat(implement): explicitly mandate sonnet model for autonomous skill…

f517f2c

… execution

feat(implement): explicitly set sonnet model for sub-agent skill invo…

5e9df85

…cations

feat(implement): mandate /review and /qa skills during sub-agent phases

193dfa6

feat(implement): mandate agents to fix issues found during QA/review …

e1e051b

…loops

feat(implement): mandate bash tool for autonomous opus and codex deba…

2462748

…te dispatch

feat: replace AskUserQuestion with autonomous Opus/Codex debate acros…

72fc1f7

…s review and ship, add implement reexamine mode

revert(skills): restore AskUserQuestion to review and ship skills

5a0dd78

feat(implement): sync execution status back to original autoplan file

4b524b1

feat(implement): strictly enforce gemini for phase execution via bash…

bb5b1ee

… and sonnet for review/qa

feat(implement): spawn dedicated sonnet subagent for final review, it…

86d7a05

…erative fix, and deployment

feat(implement): execute continuous deployment loop per phase instead…

9b4f9fc

… of at the end

feat(implement): replace sonnet with codex for review and deployment …

d689127

…subagent loop

fix(implement): restore sonnet subagent but instruct it to use /codex…

2a09300

… review instead of /review

anbangr and others added 30 commits May 11, 2026 11:48

qa(build): improve M3.5 path resolution, exit-code persistence, and e…

e368ba0

…nv passing

fix(build): complete M3.5 fault investigator report contract

0e07df2

docs(build): remove startup sweep from README startup gates

1d79ecd

Remove the `--skip-sweep` flag and the unshipped feat/* sweep bullet from the Startup Gates section and flags table. Aligns with the code removal in 3e2b8b2. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

test(e2e): complete build fault investigator test structure

779d79f

- Adds mock configure.cm file to prevent jq from failing in Step M3.5 mock

qa(e2e): fix HOME isolation and report path in fault investigator test

523d7f8

Merge branch 'feat/gstack-gstack-now-i-want-the-virtual-minsky-202605…

4070c04

…11-074503-fabe4c3f-4-e2e-test-touchfile-registration'

chore: bump test phase timeout to 900000ms (suite grew past 5min budget)

412ade4

fix(review): remove dead-code noop in buildCodexReviewBody

4b385a4

`phase.kind !== "code" ? "" : ""` always evaluated to "" regardless of the condition, and was silently filtered by .filter(Boolean). Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore(build): fix parser kind field, bump v1.22.0, regen host SKILL.m…

97e4569

…d, document sweep removal

fix(test): add build/orchestrator/__tests__/ to bun test path for TDD…

e093b14

… loop

feat(cli): Phase 1.4 — buildKindInstructions for kind-specific prompts

0b5388b

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore: regenerate SKILL.md files after Phase 1.2-1.5 template updates

f752e7e

feat(templates): Phase 1.5 — non-coding phase templates, CONTENT_REVI…

8542048

…EW gates, ship gate

Merge remote-tracking branch 'github/main' into feat/gstack-delegated…

71627eb

…-storm-20260511-122548-fabe4c3f-3-commit-housekeeping-parser-fix-readme-regenerated-ski # Conflicts: # test/gen-skill-docs.test.ts # test/helpers/touchfiles.ts

anbangr wants to merge 209 commits into garrytan:main from anbangr:feat/gstack-delegated-storm-20260511-122548-fabe4c3f-3-commit-housekeeping-parser-fix-readme-regenerated-ski
