Skip to content

[Draft]Add ROCM kernel skill#343

Draft
01xjw wants to merge 3 commits intohuggingface:mainfrom
01xjw:add-rocm-kernels-skill
Draft

[Draft]Add ROCM kernel skill#343
01xjw wants to merge 3 commits intohuggingface:mainfrom
01xjw:add-rocm-kernels-skill

Conversation

@01xjw
Copy link

@01xjw 01xjw commented Mar 13, 2026

Add ROCm Triton kernels skill for MI355X/R9700

  • RMSNorm, RoPE 3D, GEGLU, AdaLN kernel patterns
  • Benchmark scripts (micro + e2e for LTX-Video)
  • HuggingFace Kernels integration example
  • Reference docs: optimization guides, templates, troubleshooting

01xjw added 3 commits March 13, 2026 06:43
- RMSNorm, RoPE 3D, GEGLU, AdaLN kernel patterns
- Benchmark scripts (micro + e2e for LTX-Video)
- HuggingFace Kernels integration example
- Reference docs: optimization guides, templates, troubleshooting
@01xjw 01xjw changed the title Add ROCM kernel skill [Draft]Add ROCM kernel skill Mar 13, 2026
@01xjw 01xjw marked this pull request as draft March 13, 2026 11:55
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant