Conversation
This is more like a proposal to add other DSLs; we can discuss it in this thread. @danielfleischer @gbenms
Force-pushed from dea6744 to a917f31.
danielfleischer left a comment:
The HW and DSL generalization is fine.
Where did the 18 new examples in KB/triton/XPU/ come from? Are they synthetic, or are they based on KernelBench?
Where did this come from?
They were extracted from SYCL TLA kernels. I will update this PR with a couple of new ones once you approve the existing ones.
Seems to be working:
Speedup: 23.09x
- I changed the strategy from "pick one supposedly better tile" to a runtime micro-autotuned kernel family, while preserving the proven RowMajor/ColumnMajor layout fix and the BF16->FP32 math path. This directly addresses the stage issue and avoids locking into a regressing shape. The sweep prioritizes 256x256x32 and 128x128x64, also testing 128x256x32 and 256x128x32, and retains the current 256x128x16 as a fallback/reference. I also hardened the reference GEMM stride products to int64. If this still doesn't be…
Total Speedup: 23.10x
Performance: 3.71 → 85.68 TFLOPS
Execution Time: 37.047 ms → 1.604 ms
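For context, the tile sweep described above amounts to a small runtime autotuner: time each candidate tile shape once and keep the fastest. A minimal sketch in Python, assuming a hypothetical `launch_gemm(tile_m, tile_n, tile_k)` helper that runs one configuration and returns its elapsed time (the helper and timing parameters are assumptions, not part of this PR):

```python
import math

# Candidate (tile_m, tile_n, tile_k) shapes from the comment above;
# 256x128x16 is the current configuration, retained as fallback/reference.
CANDIDATE_TILES = [
    (256, 256, 32),
    (128, 128, 64),
    (128, 256, 32),
    (256, 128, 32),
    (256, 128, 16),
]

def autotune(launch_gemm, warmup=2, iters=5):
    """Return the fastest tile shape among the candidates."""
    best_tile, best_time = CANDIDATE_TILES[-1], math.inf
    for tile in CANDIDATE_TILES:
        for _ in range(warmup):        # warm caches / trigger JIT
            launch_gemm(*tile)
        elapsed = min(launch_gemm(*tile) for _ in range(iters))
        if elapsed < best_time:
            best_tile, best_time = tile, elapsed
    return best_tile
```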
Is this from KernelBench?
It is an example of a fully fused kernel.
```python
class SyclOptimizationSignature(dspy.Signature):
```
Maybe it's a good opportunity to have backend-specific agents in dedicated modules, so all the SYCL code lives in one place, likewise Triton, Gluon, etc.
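For reference, the signature under discussion could look something like this (a minimal sketch; only the class name comes from the diff, the field names and descriptions are assumptions):

```python
import dspy

class SyclOptimizationSignature(dspy.Signature):
    """Optimize a SYCL kernel for the target device."""

    kernel_source = dspy.InputField(desc="original SYCL kernel source")
    device_info = dspy.InputField(desc="target device characteristics, e.g. XPU")
    optimized_kernel = dspy.OutputField(desc="optimized SYCL kernel source")
```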
- `DSL` and `DeviceType` enums, abstract `DeviceConfig` with XPU/CUDA subclasses (`dsl_registry.py`); see the sketch after this list
- Device detection (`device_query.py`) dispatching XPU/CUDA
- Device-specific prompts (`prompts/device_prompts.py`)
- Directory layout: `common/`, `triton/{xpu,cuda}/`, `gluon/xpu/`, `sycl/xpu/`
- `--device` and `--dsl` CLI flags
- `.gitignore`
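A minimal sketch of how the pieces in the list above could fit together (names beyond `DSL`, `DeviceType`, and `DeviceConfig` are hypothetical):

```python
from abc import ABC, abstractmethod
from enum import Enum

class DSL(Enum):
    TRITON = "triton"
    GLUON = "gluon"
    SYCL = "sycl"

class DeviceType(Enum):
    XPU = "xpu"
    CUDA = "cuda"

class DeviceConfig(ABC):
    """Abstract per-device configuration; one concrete subclass per backend."""
    device_type: DeviceType

    @abstractmethod
    def torch_device(self) -> str:
        """Device string understood by the runtime."""

class XPUConfig(DeviceConfig):
    device_type = DeviceType.XPU
    def torch_device(self) -> str:
        return "xpu"

class CUDAConfig(DeviceConfig):
    device_type = DeviceType.CUDA
    def torch_device(self) -> str:
        return "cuda"

def get_device_config(device: DeviceType) -> DeviceConfig:
    # Dispatch on the --device flag, as a device_query module might.
    return {DeviceType.XPU: XPUConfig, DeviceType.CUDA: CUDAConfig}[device]()
```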