Skip to content

[Non-record] MHALM V2 non-record submission (1.3477 bpb)#2145

Open
aquemy wants to merge 1 commit into
openai:mainfrom
aquemy:2026-04-29_MHALMv2
Open

[Non-record] MHALM V2 non-record submission (1.3477 bpb)#2145
aquemy wants to merge 1 commit into
openai:mainfrom
aquemy:2026-04-29_MHALMv2

Conversation

@aquemy
Copy link
Copy Markdown

@aquemy aquemy commented May 2, 2026

Multi-Head Atlas Language Model V2: geometric LM with Stäckel coordinate encoders, 5 kernel readout heads, 2-pass FFT SSM + attention. 18.3M params, 13.0 MB artifact, 585s on 8×H100.
−0.107 bpb improvement over V1 (1.4574 → 1.3477).

Blog post: https://quemy.info/2026-04-29-mhalm-v3-lessons.html

PR for V1: #476

Multi-Head Atlas Language Model V2: geometric LM with Stäckel coordinate
encoders, 5 kernel readout heads, 2-pass FFT SSM + attention.
18.3M params, 13.0 MB artifact, 585s on 8×H100.
−0.107 bpb improvement over V1 (1.4574 → 1.3477).

Blog post: https://quemy.info/2026-04-29-mhalm-v3-lessons.html
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant