feat: voice input, model picker, and multi-form fill by vharkins1 · Pull Request #507 · fireform-core/FireForm

vharkins1 · 2026-05-25T15:56:47Z

Summary

Three user-facing features for the Fill Form workflow, plus supporting infra. Built on top of current main.

🎙️ Voice input (speech-to-text)

New whisper Docker sidecar — faster-whisper engine, small.en model, bundles ffmpeg, multi-arch (runs natively on Apple Silicon).
POST /forms/transcribe streams mic audio to the local Whisper service and returns text. Audio is never persisted — nothing leaves the machine. Connection failures map to 503.
Frontend recorder (Record / Pause / Stop & Transcribe) seeds the input box; Electron grants only the media permission.
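The 503 mapping described above can be sketched independently of any web framework. This is a minimal illustration, not the PR's actual handler: `transcribe` and the injected `send_to_whisper` callable are hypothetical names standing in for the real endpoint and its HTTP call to the Whisper sidecar.

```python
from __future__ import annotations

from collections.abc import Callable


def transcribe(audio: bytes, send_to_whisper: Callable[[bytes], str]) -> tuple[int, dict]:
    """Return (status_code, body) for one transcription request.

    `send_to_whisper` is a hypothetical stand-in for the call to the sidecar.
    """
    try:
        text = send_to_whisper(audio)
    except (ConnectionError, TimeoutError) as exc:
        # Sidecar unreachable: report 503 rather than a generic 500.
        return 503, {"detail": f"Whisper service unavailable: {exc}"}
    # The audio bytes are only held in memory here; nothing is written to disk.
    return 200, {"text": text}


def _sidecar_down(_audio: bytes) -> str:
    # Simulates the sidecar refusing the connection.
    raise ConnectionError("connection refused")


ok_status, ok_body = transcribe(b"\x00\x01", lambda _a: "engine two on scene")
down_status, down_body = transcribe(b"\x00\x01", _sidecar_down)
```

Injecting the transport as a callable keeps the error mapping testable without a running Whisper container, which is roughly what a service-down test needs.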

🤖 Model picker

GET /forms/models lists the local Ollama models, always surfacing the configured default even if not yet pulled.
Per-fill model override on FormFill (runtime hint, not persisted to the DB).
Dropdown in the Fill Form UI to choose the extraction model.
Replaces the hardcoded mistral default with qwen2.5:1.5b across llm.py, the Makefile pull-model target (now OLLAMA_MODEL-configurable), and test_model.py.
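One way to read "always surfacing the configured default even if not yet pulled" is as a merge of the default into whatever Ollama reports. The sketch below assumes that reading; `list_models` is a hypothetical helper, and putting the default first is an assumption, not something the PR states.

```python
from __future__ import annotations

from collections.abc import Iterable

DEFAULT_MODEL = "qwen2.5:1.5b"  # the default named in this PR


def list_models(installed: Iterable[str], default: str = DEFAULT_MODEL) -> list[str]:
    """Return the default model followed by the other installed models, without duplicates.

    Listing the default first is an assumption made for this sketch.
    """
    models = [default]
    for name in installed:
        if name not in models:
            models.append(name)
    return models


# When Ollama is unreachable the caller can pass an empty list,
# so the dropdown still offers the configured default.
fallback = list_models([])
merged = list_models(["mistral", "qwen2.5:1.5b"])
```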

📋 Multi-form fill

Select one or more templates via keyboard-accessible tiles and fill them in a single submit, with per-form outcome reporting.
Drops the manual "Template ID" / "Template Directory" inputs; adds a "Change PDF" control.
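The per-form outcome reporting can be pictured as a loop that records success or failure for each selected template instead of aborting on the first error. The sketch below shows that control flow in Python for illustration only; `FillOutcome`, `fill_many`, and `fill_one` are hypothetical names, and the PR may implement this logic in the frontend.

```python
from __future__ import annotations

from collections.abc import Callable, Iterable
from dataclasses import dataclass


@dataclass
class FillOutcome:
    template_id: int
    ok: bool
    error: str | None = None


def fill_many(
    template_ids: Iterable[int],
    text: str,
    fill_one: Callable[[int, str], None],
) -> list[FillOutcome]:
    """Fill every selected template and collect one outcome per form."""
    outcomes: list[FillOutcome] = []
    for template_id in template_ids:
        try:
            fill_one(template_id, text)
            outcomes.append(FillOutcome(template_id, ok=True))
        except Exception as exc:
            # One failing form should not prevent the remaining forms from being filled.
            outcomes.append(FillOutcome(template_id, ok=False, error=str(exc)))
    return outcomes


def _fake_fill(template_id: int, _text: str) -> None:
    # Simulates a template whose PDF is missing.
    if template_id == 2:
        raise FileNotFoundError("template PDF missing")


outcomes = fill_many([1, 2, 3], "incident notes", _fake_fill)
```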

🛠️ Infra

Persists the SQLite DB in a fireform_db volume so created templates survive container rebuilds.
WHISPER_HOST / OLLAMA_MODEL wiring; app now depends on the whisper service starting.

Testing

make test → 19/19 pass, including 5 new tests covering /transcribe (incl. service-down 503), /models (incl. Ollama-down fallback), and the per-fill model override.

Notes

Commits are split by layer (ui / api / core / infra+tests) because the changes interleave across shared files.

🤖 Generated with Claude Code

Reworks the Fill Form screen:
- Mic recorder (Record / Pause / Stop & Transcribe) that posts audio to the local /forms/transcribe endpoint and seeds the input box. Electron grants the `media` permission; audio only ever goes to the local service.
- Extraction-model dropdown populated from /forms/models.
- Select one or more templates via keyboard-accessible tiles and fill them in a single submit, with per-form outcome reporting.
- Drops the manual "Template ID" and "Template Directory" inputs; adds a "Change PDF" control and drop-zone swap.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

- POST /forms/transcribe forwards uploaded audio to the local Whisper sidecar (WHISPER_HOST) and returns the transcript; audio is streamed through and never persisted. Maps connection failures to 503.
- GET /forms/models lists the Ollama models the local instance reports, always surfacing the configured default even if not yet pulled.
- FormFill gains an optional `model` override, excluded before building the FormSubmission row (it is a runtime hint, not a column).

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
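The "runtime hint, not a column" idea can be sketched with plain dataclasses standing in for the project's real schema and table classes. Only the names `FormFill`, `FormSubmission`, and `model` come from the commit message; the other fields and the `to_submission` helper are assumptions made for illustration.

```python
from __future__ import annotations

from dataclasses import asdict, dataclass


@dataclass
class FormFill:
    """Request payload. Fields other than `model` are assumed for this sketch."""
    template_id: int
    input_text: str
    model: str | None = None  # per-fill override, never stored


@dataclass
class FormSubmission:
    """Stand-in for the persisted row; note there is no `model` field."""
    template_id: int
    input_text: str


def to_submission(payload: FormFill) -> tuple[FormSubmission, str | None]:
    """Split the payload into the row to persist and the runtime model hint."""
    data = asdict(payload)
    model = data.pop("model")  # strip the hint before building the row
    return FormSubmission(**data), model


row, model_hint = to_submission(FormFill(1, "engine two on scene", model="llama3"))
```

Popping the hint before constructing the row avoids passing an unknown keyword to a class that has no such column.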

- LLM accepts an optional `model`, threaded through Controller and FileManipulator, falling back to OLLAMA_MODEL then qwen2.5:1.5b.
- Replaces the hardcoded `mistral` default across llm.py, the Makefile pull-model target (now OLLAMA_MODEL-configurable), and test_model.py.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
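The fallback order described here (explicit `model`, then OLLAMA_MODEL, then qwen2.5:1.5b) can be sketched as a single resolution function. `resolve_model` and its `env` parameter are hypothetical; treating an empty string as "not set" is also an assumption of this sketch.

```python
from __future__ import annotations

import os
from collections.abc import Mapping

FALLBACK_MODEL = "qwen2.5:1.5b"


def resolve_model(model: str | None = None, env: Mapping[str, str] | None = None) -> str:
    """Pick the model for one fill: explicit override, then OLLAMA_MODEL, then the fallback.

    `env` defaults to the process environment and is injectable for testing.
    """
    env = os.environ if env is None else env
    # `or` skips None and empty strings, so a blank override falls through.
    return model or env.get("OLLAMA_MODEL") or FALLBACK_MODEL


explicit = resolve_model("llama3", env={"OLLAMA_MODEL": "mistral"})
from_env = resolve_model(None, env={"OLLAMA_MODEL": "mistral"})
default = resolve_model(None, env={})
```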

- Adds the whisper ASR service (faster-whisper small.en) plus WHISPER_HOST and OLLAMA_MODEL wiring; app now depends on it starting.
- Persists the SQLite DB in a fireform_db volume so created templates survive container rebuilds.
- Adds API tests for the transcribe (incl. service-down 503) and models endpoints and the per-fill model override; gitignores local CLAUDE.md.

Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>


vharkins1 and others added 7 commits May 25, 2026 08:54

fix: 🐛 File Not Found Error when submitting a form whose pdf is not found

5e85793

fix: 🙈 .gitignore update

474879d

refactor: ♻️ Removing outdated Optional[] to new version conventions

a35722c

marcvergees merged commit 63a8f24 into main May 25, 2026
3 checks passed

marcvergees merged 7 commits into main from feature/voice-input-multiform


2 participants
