v1.31.0.1 fix(parser): emit kind field before dualImpl in parsed Phase objects by anbangr · Pull Request #1447 · garrytan/gstack

anbangr · 2026-05-12T03:31:38Z

Summary

Parser fix: Reorders property emission in build/orchestrator/parser.ts so that kind appears before dualImpl in parsed Phase objects. This is a minor structural cleanup with no user-facing behavior change.
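As a rough illustration of why emission order is only a structural matter: JavaScript objects serialize string keys in insertion order, so moving `kind` ahead of `dualImpl` changes the key order in serialized output without changing any values. The `Phase` shape below is a hypothetical, trimmed-down stand-in (the field types are assumptions), not the actual type in build/orchestrator.

```typescript
// Hypothetical, trimmed-down Phase shape; the real type has more fields,
// and the field types here are assumptions for illustration only.
interface Phase {
  number: string;
  kind: string;
  dualImpl: boolean;
}

// Assumed "before" ordering: dualImpl is assigned ahead of kind.
function emitOld(number: string): Phase {
  const phase = { number, dualImpl: false, kind: "code" };
  return phase;
}

// "After" ordering: kind is assigned first, so it precedes dualImpl when serialized.
function emitNew(number: string): Phase {
  return { number, kind: "code", dualImpl: false };
}

const oldJson = JSON.stringify(emitOld("1"));
const newJson = JSON.stringify(emitNew("1"));
```

Both objects carry the same values; only the key order of the serialized form differs.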

Other changes:

Added .llm-tmp/ to .gitignore
Bumped VERSION and package.json to 1.31.0.1
Added CHANGELOG entry for 1.31.0.1

Test Coverage

All new code paths have test coverage. Tests: 805 run, 802 pass, 3 pre-existing failures unrelated to this change.

Pre-Landing Review

No issues found.

Design Review

No frontend files changed — design review skipped.

Eval Results

No prompt-related files changed — evals skipped.

Plan Completion

No plan file detected.

TODOS

No TODO items completed in this PR.

Test plan

Build/orchestrator tests pass (802 pass, 3 pre-existing failures unrelated to this change)

🤖 Generated with Claude Code

1. plan-selection (6 tests): `defaultActiveRunRegistryDir()` hardcoded `~/.gstack/build-state/active-runs` and ignored `GSTACK_BUILD_STATE_DIR`, causing 11 real active-run records to leak into unit tests and inflate candidate counts (turning expected "selected" into "ambiguous"). Fix: honour the env var consistently, the same way `state.ts` already does.

2. integration (3 tests): plan review subprocess called `codex` with `OPENAI_API_KEY` from the inherited `process.env`, triggering a real ~30s API call against the LLM. These tests exercise feature lifecycle, not plan review. Fix: add `--no-plan-review` to each CLI invocation.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
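A minimal sketch of the first fix, assuming that `GSTACK_BUILD_STATE_DIR` replaces the `~/.gstack/build-state` directory as a whole and that `active-runs` is always appended beneath it. The helper name `resolveActiveRunRegistryDir` and its parameters are hypothetical; the environment and home directory are passed in so the logic can be exercised without touching the real `process.env`.

```typescript
// Sketch only: resolveActiveRunRegistryDir is a hypothetical stand-in for the
// logic inside defaultActiveRunRegistryDir(); env and homeDir are parameters
// here purely so the behaviour is testable in isolation.
type Env = Record<string, string | undefined>;

function resolveActiveRunRegistryDir(env: Env, homeDir: string): string {
  // Assumption: GSTACK_BUILD_STATE_DIR overrides ~/.gstack/build-state wholesale.
  const override = env["GSTACK_BUILD_STATE_DIR"];
  const stateDir =
    override !== undefined && override.length > 0
      ? override
      : `${homeDir}/.gstack/build-state`;
  return `${stateDir}/active-runs`;
}

// Without the override, the default location under the home directory is used.
const defaultDir = resolveActiveRunRegistryDir({}, "/home/dev");

// With the override (as a unit test would set it), real records are never seen.
const isolatedDir = resolveActiveRunRegistryDir(
  { GSTACK_BUILD_STATE_DIR: "/tmp/gstack-test" },
  "/home/dev",
);
```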

The branch's living-plan gate visibility feature was already subsumed by origin/main (present as v1.28.0.0-fork). The previous merge at 39750ad2 incorrectly produced a tree identical to 1d79ecd, leaving the branch 5 commits behind origin/main with an empty diff.

This merge:
- Picks up the 5 newer origin/main commits (fe6212f..c1c4907)
- Restores the working tree to a clean state (fixture deletion resolved)
- Makes the branch reviewable again with a non-empty diff surface

Tests: bun run test:build-skill → 1103 pass, 0 fail

Revert CHANGELOG.md to origin/main to undo prose corruption introduced by a markdown autoformatter:
- env vars like LC_ALL, AWS_* rendered as italic/broken
- regex allowlist ^[a-z0-9_-]+$ semantically flipped to ^[a-z0-9*-]+$
- code-block continuation de-dented out of list context

The branch's feature was already released as v1.28.0.0-fork; no CHANGELOG edits were needed here.

…estSpec detection

Four improvements identified during code review of 3e2b8b2:
- Move `extractCoverageTarget` from cli.ts to sub-agents.ts (alongside parseCoveragePercent); re-export via import in cli.ts. Eliminates the circular-import risk when phase-runner.ts calls coverage functions.
- Fix decimal truncation in extractCoverageTarget: `(\d+)` only matched integers, silently returning 80 for targets like ≥90.5%. Changed to `([\d.]+)` + parseFloat.
- Fix `hasTestSpec` detection in buildGeminiTestSpecPrompt: was `phase.body.includes("#### Test Spec")` (fragile string match, false negative when body text differs). Now `phase.testSpecCheckboxLine !== -1` (parser already computes this; zero extra overhead).
- Wire coverage gate in RUN_TESTS handler: after GREEN tests pass and the phase has a test spec (`testSpecCheckboxLine !== -1`), call parseCoveragePercent(result.stdout, testCmd) and compare against extractCoverageTarget(phase.body). Below target → set coverageResult and route to test_fix_running. Unknown framework → log advisory, proceed.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
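The decimal-truncation fix can be sketched as follows. This is not the actual `extractCoverageTarget`; the surrounding pattern text (`Coverage target: ≥N%`) is taken from a later commit message in this PR, and the exact regexes, the helper name `extractWith`, and the sample phase bodies are assumptions. The point is only that an integer-only capture group fails to match a decimal target and falls through to the default of 80.

```typescript
const DEFAULT_COVERAGE_TARGET = 80;

// Assumed shape of the old pattern: the integer-only group cannot consume ".5",
// so the literal "%" never matches and the whole pattern fails.
const OLD_TARGET = /Coverage target:\s*≥\s*(\d+)%/;

// Assumed shape of the fixed pattern: digits and dots are captured together.
const NEW_TARGET = /Coverage target:\s*≥\s*([\d.]+)%/;

// Hypothetical helper that applies either pattern and falls back to the default.
function extractWith(pattern: RegExp, phaseBody: string): number {
  const match = pattern.exec(phaseBody);
  if (!match) return DEFAULT_COVERAGE_TARGET;
  const value = parseFloat(match[1] ?? "");
  return Number.isNaN(value) ? DEFAULT_COVERAGE_TARGET : value;
}

const decimalBody = "**Coverage target: ≥90.5%**";
const integerBody = "**Coverage target: ≥85%**";

const oldDecimal = extractWith(OLD_TARGET, decimalBody); // falls back to the default
const newDecimal = extractWith(NEW_TARGET, decimalBody); // keeps the fraction
const newInteger = extractWith(NEW_TARGET, integerBody); // integers still work
const noTarget = extractWith(NEW_TARGET, "no target line here");
```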

Complete the coverage gate: `injectCoverageFlags(testCmd)` appends the appropriate flag for the detected framework before the GREEN test run, so `parseCoveragePercent` reliably finds coverage data in stdout even when projects don't pre-configure coverage in their test script.

Framework → flag mapping:
  jest     → --coverage --coverageReporters text
  vitest   → --coverage
  bun test → --coverage
  pytest   → --cov --cov-report term-missing
  go test  → -cover
  unknown  → unchanged (advisory log, gate skips)

Injection is idempotent (no-op if flag already present) and only fires when the phase has a test spec (testSpecCheckboxLine !== -1); VERIFY_RED and legacy phases use the bare test command unchanged.

11 unit tests added covering each framework, idempotency, and unknowns.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
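One way the mapping above could be expressed is a small rule table. The flag strings come from the commit message; how the framework is detected in the command and which token counts as "flag already present" are assumptions made for this sketch, and the real `injectCoverageFlags` may differ on both.

```typescript
interface CoverageRule {
  detect: RegExp; // assumed: how the framework is recognised in the command
  marker: string; // assumed: this token means coverage is already enabled
  flags: string;  // flags appended when the marker is absent (from the mapping)
}

const COVERAGE_RULES: CoverageRule[] = [
  { detect: /\bjest\b/, marker: "--coverage", flags: "--coverage --coverageReporters text" },
  { detect: /\bvitest\b/, marker: "--coverage", flags: "--coverage" },
  { detect: /\bbun test\b/, marker: "--coverage", flags: "--coverage" },
  { detect: /\bpytest\b/, marker: "--cov", flags: "--cov --cov-report term-missing" },
  { detect: /\bgo test\b/, marker: "-cover", flags: "-cover" },
];

// Sketch of the behaviour described above, not the project's implementation.
function injectCoverageFlags(testCmd: string): string {
  const rule = COVERAGE_RULES.find((r) => r.detect.test(testCmd));
  if (!rule) return testCmd;                         // unknown framework: unchanged
  if (testCmd.includes(rule.marker)) return testCmd; // idempotent: already present
  return `${testCmd} ${rule.flags}`;
}

const jestCmd = injectCoverageFlags("npx jest");
const bunCmd = injectCoverageFlags("bun test");
const pytestCmd = injectCoverageFlags("pytest -q");
const unknownCmd = injectCoverageFlags("make check");
const twice = injectCoverageFlags(injectCoverageFlags("go test ./..."));
```

Applying the function twice yields the same command as applying it once, which is the idempotency property the unit tests are said to cover.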

Restores the parser fix that adds `kind: (p as any).kind ?? 'code'` to the phases.push call in finalize(). Also brings in the TDD-pin tests from 8135900b that verify the default behavior.

This commit sits on top of the origin/main merge (a7a009e), which restored injectCoverageFlags, buildKindInstructions, extractCoverageTarget, and the --skip-ship exit-13 path.

Fixes P0, P2 from review report.

Refs: 8135900b, 587b058f

…p (Bug D1)

Two failing tests document the bug:
1. After CRITICAL verdict, state.planReview must be persisted with status "critical_exit_pending". Currently cli.ts does not persist anything before process.exit(3), so planReview stays undefined on disk.
2. On resume with the sentinel set, the plan-review gate must still fire. The current guard (!state.planReview) is false when planReview is truthy, so the gate is skipped after the sentinel is introduced.

Two GREEN tests confirm baseline behavior: APPROVE verdict suppresses the gate; undefined planReview (first run) fires the gate.

Tests MUST fail until Feature 4 implementation lands.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Before this fix, a CRITICAL plan-review verdict caused process.exit(3) without saving any sentinel to state. On resume, !state.planReview was true → review ran again → CRITICAL again → infinite loop.

Fix:
1. Save state.planReview = { ...verdict, status: "critical_exit_pending" } before releaseLock + process.exit(3) so the sentinel survives on disk.
2. Widen the plan-review gate guard from !state.planReview to !state.planReview || state.planReview.status === "critical_exit_pending" so the gate re-fires on resume when the sentinel is present.

Tests: two new tests in phase-runner.test.ts cover both the sentinel persistence and the widened gate; 90/90 passing.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
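The widened guard can be illustrated with two small predicates. The two guard expressions are the ones quoted in the commit message; the `BuildState` and `PlanReviewState` shapes, the function names, and the `"approved"` status string are assumptions made for this sketch.

```typescript
// Hypothetical, trimmed-down state shapes for illustration.
interface PlanReviewState {
  status: string;
}
interface BuildState {
  planReview?: PlanReviewState;
}

// Old guard: only fires when no review has ever been recorded.
function shouldReviewOld(state: BuildState): boolean {
  return !state.planReview;
}

// Widened guard: also fires when the critical-exit sentinel is present.
function shouldReviewNew(state: BuildState): boolean {
  return !state.planReview || state.planReview.status === "critical_exit_pending";
}

const firstRun: BuildState = {};
const resumedAfterCritical: BuildState = { planReview: { status: "critical_exit_pending" } };
// "approved" is an assumed status value standing in for an APPROVE verdict.
const approved: BuildState = { planReview: { status: "approved" } };

const oldOnResume = shouldReviewOld(resumedAfterCritical); // gate skipped
const newOnResume = shouldReviewNew(resumedAfterCritical); // gate re-fires
```

With the sentinel persisted, the old guard would skip the gate on resume, which is why both halves of the fix are needed together.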

…g D2)

Introduces ExitError (errors.ts), thrown instead of process.exit(N) inside try/finally blocks so the finally clause runs cleanup before the process terminates.

Changes:
- errors.ts: new ExitError class (instanceof Error, numeric code field)
- cli.ts: import ExitError; replace critical_exit process.exit(3) with throw new ExitError(3); update main().catch to call process.exit(err.code) when err instanceof ExitError
- phase-runner.test.ts: 5 new tests (ExitError shape, propagation through finally, default and custom messages); 95/95 passing

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
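A minimal sketch of the pattern, assuming a default message format and a stand-in `runWithLock` function that are not in the source. Instead of calling `process.exit`, the top-level handler here records the exit code in a variable so the flow can be observed; in the real CLI the commit message says `main().catch` calls `process.exit(err.code)`.

```typescript
// Sketch of an ExitError-style class; the default message is an assumption.
class ExitError extends Error {
  readonly code: number;

  constructor(code: number, message?: string) {
    super(message ?? `exit with code ${code}`);
    this.name = "ExitError";
    this.code = code;
    // Keeps instanceof working when compiled to older targets.
    Object.setPrototypeOf(this, ExitError.prototype);
  }
}

const events: string[] = [];

// Hypothetical stand-in for work done while holding a lock.
function runWithLock(): void {
  events.push("acquire");
  try {
    // A process.exit(3) here would terminate before the finally block ran.
    throw new ExitError(3);
  } finally {
    events.push("release");
  }
}

// Stand-in for the top-level handler: capture the code instead of exiting.
let exitCode = 0;
try {
  runWithLock();
} catch (err) {
  if (err instanceof ExitError) {
    exitCode = err.code;
  } else {
    throw err;
  }
}
```

Throwing lets the `finally` clause release the lock first, and the exit code still reaches the single place that terminates the process.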

…ature 6)

applyResult() now populates phaseState.coverageResult when:
- action is RUN_TESTS
- tests are GREEN (status = "tests_green")
- extra.phaseBody is provided
- parseCoveragePercent() returns a non-null value for the stdout

Coverage below target emits an advisory warning but keeps status "tests_green" (not blocking). The target defaults to 80 when no "**Coverage target: ≥N%**" line appears in the phase body.

6 new tests in phase-runner.test.ts; 101/101 passing.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

…ics + test assertions

- Add errors.ts to MODULE_TEST_OWNERS in coverage-matrix.test.ts
- Fix analytics logActivity to emit "success" for exit code 13 (FINALIZATION_REQUIRED), which is a success state (pending ship), not a failure
- Fix integration test assertions: --skip-ship correctly exits 13, not 0, when features reach origin_verified (pre-existing test/impl mismatch)
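The analytics change amounts to treating one non-zero exit code as a success. A sketch under stated assumptions: the function name `outcomeForExitCode` and the `"failure"` label are hypothetical; only the value 13, its name FINALIZATION_REQUIRED, and the `"success"` label come from the commit message.

```typescript
// Exit code named in the commit message: the build finished and is pending ship.
const FINALIZATION_REQUIRED = 13;

type Outcome = "success" | "failure";

// Hypothetical helper: maps a process exit code to the label analytics records.
function outcomeForExitCode(code: number): Outcome {
  return code === 0 || code === FINALIZATION_REQUIRED ? "success" : "failure";
}

const cleanExit = outcomeForExitCode(0);
const pendingShip = outcomeForExitCode(13);
const criticalExit = outcomeForExitCode(3);
```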

…d [Phase 1.1]

RED phase TDD: 11 tests fail because the parser does not yet stamp kind: "code" on emitted phases, and existing Phase literal construction sites have no kind field (undefined fails the VALID_KINDS.includes runtime assertion).

11 tests pass immediately: direct Phase construction with explicit kind values, and PhaseKind union membership checks (both already exist in types.ts).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

Add a required kind: PhaseKind field to the parser factory init and to every Phase literal construction site in tests/fixtures. This ensures a backward-compatible default of kind: "code" for all existing phases while the type system enforces correctness going forward.

- parser.ts: stamp kind: "code" on every emitted Phase
- state.test.ts, cli.test.ts, phase-runner.test.ts, feature-review.test.ts, cli-guardrails.test.ts, phase-kind.test.ts: add kind: "code" to all helpers and inline literals

…tations

- Fix PHASE_HEADING regex to allow optional [kind] bracket between number and colon
- Add BODY_KIND_PATTERN for HTML comment fallback
- Add IMPL_LABELS_BY_KIND and REVIEW_LABELS_BY_KIND maps for all 5 PhaseKind values
- Parser now stamps kind from heading bracket (primary), body comment (fallback), or defaults to "code"
- Inline kind-comment detection ensures kind is set before checkbox processing
- Add implCheckboxRe/reviewCheckboxRe for kind-specific checkbox matching
- Add 16 new parser tests covering all bracket annotations, HTML fallback, checkbox recognition

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
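The resolution order (heading bracket, then body comment, then the "code" default) might look roughly like this. Everything concrete here is an assumption: the heading shape `### Phase N [kind]: Title`, the comment shape `<!-- kind: x -->`, both regexes, the helper name `resolveKind`, and the kind names `docs` and `research`, which are placeholders rather than the project's actual PhaseKind values.

```typescript
// Assumed heading shape: "### Phase 1.2 [docs]: Title", with the bracket optional.
const PHASE_HEADING = /^#{2,4}\s+Phase\s+(\d+(?:\.\d+)*)\s*(?:\[([a-z-]+)\])?\s*:\s*(.+)$/;

// Assumed fallback shape: an HTML comment somewhere in the phase body.
const BODY_KIND_PATTERN = /<!--\s*kind:\s*([a-z-]+)\s*-->/;

// Hypothetical helper: heading bracket first, body comment second, "code" last.
function resolveKind(heading: string, body: string): string {
  const fromHeading = PHASE_HEADING.exec(heading)?.[2];
  if (fromHeading) return fromHeading;
  const fromBody = BODY_KIND_PATTERN.exec(body)?.[1];
  return fromBody ?? "code";
}

// "docs" and "research" are placeholder kind names for this sketch.
const fromBracket = resolveKind("### Phase 1 [docs]: Write the guide", "");
const fromComment = resolveKind("### Phase 2: Survey options", "<!-- kind: research -->\nNotes");
const fallback = resolveKind("### Phase 3: Build the parser", "No annotation here.");
const bracketWins = resolveKind("### Phase 4 [docs]: Mixed", "<!-- kind: research -->");
```

Unannotated headings still match the pattern, so existing plans would keep resolving to "code".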

- Add IMPL_MARKER_BY_KIND and REVIEW_MARKER_BY_KIND lookup tables
- Update flipPhaseCheckboxes signature to accept optional kind?: PhaseKind
- Derives implMarker/reviewMarker from kind ?? "code" (backward compat)
- Update reconcilePhaseCheckboxes to pass phase.kind
- Update both cli.ts call sites (lines ~3870, ~4282) to pass kind: phase.kind
- Add 9 kind-aware mutator tests covering all 5 kinds + error cases + backward compat

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

anbangr added 30 commits April 22, 2026 19:18

Add architecture-focused planning review skills

190e6c4

docs: add GStack Playbook for workflow guidance and skill reference

6638051

Merge remote-tracking branch 'upstream/main'

2ad9e73

# Conflicts:
#   README.md

Merge origin/main into main

946e9f5

feat: add /implement autonomous coding skill

d3b148b

- Adds implement/SKILL.md.tmpl to execute plans in phases
- Updates GSTACK_PLAYBOOK.md to include the new workflow

feat(implement): add model routing discipline for gemini and sonnet

7b6bc1b

feat(implement): add living implementation plan synthesis and checkin…

073eee2

…g loop

feat(implement): add feature branching and auto-deploy

c16834b

feat(implement): add opus and codex consensus for ambiguous review ch…

6ed7d95

…oices

feat(implement): process entire plan and use proper plan naming

7039ec0

feat(implement): add verbose state narration and autonomous continuity

b15bdf2

fix(implement): enforce automatic deploy skill invocation without asking

43300a9

feat(implement): use sub-agent delegation to prevent context compaction

87040dc

feat(implement): add iterative github ci/cd checking to sub-agent ins…

318504f

…tructions

fix(implement): explicit bash tool instruction for ship skill invocation

1bacade

feat(implement): mandate autonomous execution of skills via bash tool

4d6a8a2

feat(implement): run both ship and land-and-deploy sequentially

f3c6208

feat(implement): explicitly mandate sonnet model for autonomous skill…

f517f2c

… execution

feat(implement): explicitly set sonnet model for sub-agent skill invo…

5e9df85

…cations

feat(implement): mandate /review and /qa skills during sub-agent phases

193dfa6

feat(implement): mandate agents to fix issues found during QA/review …

e1e051b

…loops

feat(implement): mandate bash tool for autonomous opus and codex deba…

2462748

…te dispatch

feat: replace AskUserQuestion with autonomous Opus/Codex debate acros…

72fc1f7

…s review and ship, add implement reexamine mode

revert(skills): restore AskUserQuestion to review and ship skills

5a0dd78

feat(implement): sync execution status back to original autoplan file

4b524b1

feat(implement): strictly enforce gemini for phase execution via bash…

bb5b1ee

… and sonnet for review/qa

feat(implement): spawn dedicated sonnet subagent for final review, it…

86d7a05

…erative fix, and deployment

feat(implement): execute continuous deployment loop per phase instead…

9b4f9fc

… of at the end

feat(implement): replace sonnet with codex for review and deployment …

d689127

…subagent loop

fix(implement): restore sonnet subagent but instruct it to use /codex…

2a09300

… review instead of /review

anbangr and others added 30 commits May 11, 2026 13:24

docs(build): remove startup sweep from README startup gates

1d79ecd

Remove the `--skip-sweep` flag and the unshipped feat/* sweep bullet from the Startup Gates section and flags table. Aligns with the code removal in 3e2b8b2.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

test(e2e): complete build fault investigator test structure

779d79f

- Adds mock configure.cm file to prevent jq from failing in Step M3.5 mock

qa(e2e): fix HOME isolation and report path in fault investigator test

523d7f8

Merge branch 'feat/gstack-gstack-now-i-want-the-virtual-minsky-202605…

4070c04

…11-074503-fabe4c3f-4-e2e-test-touchfile-registration'

chore: bump version and changelog (v1.31.0.1)

9127639

Co-Authored-By: OpenAI Codex <noreply@openai.com>

chore: bump test phase timeout to 900000ms (suite grew past 5min budget)

412ade4

fix(merge): merge origin/main to restore orchestrator features

a7a009e

fix(review): remove dead-code noop in buildCodexReviewBody

4b385a4

`phase.kind !== "code" ? "" : ""` always evaluated to "" regardless of the condition, and was silently filtered by .filter(Boolean).

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
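To see why the removed expression was a no-op: both branches of the ternary produce the empty string, and `.filter(Boolean)` drops empty strings. The surrounding array and the function names below are hypothetical stand-ins for whatever buildCodexReviewBody assembles.

```typescript
// Hypothetical minimal phase shape.
interface PhaseLike {
  kind: string;
}

// Before: the ternary contributes "" for every kind, then filter(Boolean) removes it.
function buildBodyBefore(phase: PhaseLike): string[] {
  return ["header", phase.kind !== "code" ? "" : "", "footer"].filter(Boolean);
}

// After: the dead entry is gone and the result is unchanged.
function buildBodyAfter(_phase: PhaseLike): string[] {
  return ["header", "footer"].filter(Boolean);
}

const beforeCode = buildBodyBefore({ kind: "code" }).join("|");
const beforeOther = buildBodyBefore({ kind: "other" }).join("|");
const after = buildBodyAfter({ kind: "other" }).join("|");
```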

fix(test): add build/orchestrator/__tests__/ to bun test path for TDD…

e093b14

… loop

feat(cli): Phase 1.4 — buildKindInstructions for kind-specific prompts

0b5388b

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

chore: regenerate SKILL.md files after Phase 1.2-1.5 template updates

f752e7e

feat(templates): Phase 1.5 — non-coding phase templates, CONTENT_REVI…

8542048

…EW gates, ship gate

fix(merge): resolve parser.test.ts conflict in favor of main's kind-a…

6e54c4c

…ware tests

- Keep origin/main's comprehensive PhaseKind parsing tests
- Remove duplicate kind field and (p as any) cast in parser.ts
- Restore kind-aware parser, mutator markers, and CLI prompts from main

docs(changelog): rewrite v1.31.0.1 entry to describe branch contribution

23b82f2

Replace stale test-timeout entry (already shipped at merge base) with an honest description of what this branch ships over main.

Merge remote-tracking branch 'github/main' into feat/living-plan-step…

0c99c42

…-visibility

# Conflicts:
#   CHANGELOG.md
#   VERSION
#   package.json
#   test/gen-skill-docs.test.ts
#   test/helpers/touchfiles.ts

anbangr wants to merge 219 commits into garrytan:main from anbangr:feat/living-plan-step-visibility
