503 add gpu metric into cluster dashboard by KUASWoodyLIN · Pull Request #504 · otterscale/dashboard

KUASWoodyLIN · 2026-06-10T11:41:50Z

No description provided.

Section A: replace the Pod Density ranking with a GPU Utilization ranking (avg per node). Section B: when the drilled-in node has GPU cards, show four per-card heatmaps (utilisation / memory / power / temperature), gated to GPU nodes so CPU-only nodes stay clean. Cells render on a Canvas layer to keep the mount cheap; reload skips identical-data re-renders and hover caches its plot rect. GPU queries are scoped via a new dcgmNodeSelector helper (exact, escaped match). Adds gpu_power / gpu_temperature and node_chart_gpu_* i18n keys.

…e/ folders

gemini-code-assist

Code Review

This pull request introduces GPU monitoring capabilities to the cluster analytics dashboard, adding translation strings, a GPU heatmap component, and a conditional GPU usage wrapper. It also reorganizes existing analytics components into dedicated subdirectories. Feedback on these changes highlights a critical issue with a hardcoded Prometheus URL in the proxy server that bypasses multi-cluster routing. Additionally, minor performance improvements are suggested in the new GPU heatmap component to replace SvelteSet and SvelteMap with standard JavaScript collections inside derived state blocks to eliminate unnecessary reactive overhead.

Important

The consumer version of Gemini Code Assist on GitHub is being sunset. Starting June 18, 2026, new organization installations will be blocked, and all code review activity will officially cease on July 17, 2026.
For more details on the timeline and next steps, please review the Help Documentation.

woody_lin added 3 commits June 10, 2026 19:27

refactor(cluster): split analytics components into node/ and namespac…

258d052

…e/ folders

chore: remove unused VSCode settings file

896b0d4

KUASWoodyLIN requested review from ZhangEnYao and iasthc as code owners June 10, 2026 11:41

KUASWoodyLIN linked an issue Jun 10, 2026 that may be closed by this pull request

Add GPU metric into cluster dashboard #503

Closed

gemini-code-assist Bot reviewed Jun 10, 2026

View reviewed changes

Comment thread src/lib/server/proxy.ts Outdated

Comment thread src/lib/components/dashboard/cluster/analytics/node/gpu-heatmap.svelte

Comment thread src/lib/components/dashboard/cluster/analytics/node/gpu-heatmap.svelte

fix: remove temporary hardcoded Prometheus proxy target

e521585

ZhangEnYao approved these changes Jun 10, 2026

View reviewed changes

ZhangEnYao merged commit f20dc0f into main Jun 10, 2026
8 of 11 checks passed

ZhangEnYao deleted the 503-add-gpu-metric-into-cluster-dashboard branch June 10, 2026 11:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

503 add gpu metric into cluster dashboard#504

503 add gpu metric into cluster dashboard#504
ZhangEnYao merged 4 commits into
mainfrom
503-add-gpu-metric-into-cluster-dashboard

KUASWoodyLIN commented Jun 10, 2026

Uh oh!

gemini-code-assist Bot left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

KUASWoodyLIN commented Jun 10, 2026

Uh oh!

gemini-code-assist Bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants