Try to make backend configuration a little more robust. #86

xenoscopic · 2025-06-17T16:13:48Z

If there's no change to the backend config, then there's no need to fail the request on an active runner. Also, if the active runner is unused, then we can evict it and then update its config.

If there's no change to the backend config, then there's no need to fail the request on an active runner. Also, if the active runner is unused, then we can evict it and then update its config. Signed-off-by: Jacob Howard <[email protected]>

xenoscopic · 2025-06-17T16:14:24Z

pkg/inference/scheduling/loader.go

+	// If the configuration hasn't changed, then just return.
+	if existingConfig, ok := l.runnerConfigs[runnerId]; ok && reflect.DeepEqual(runnerConfig, existingConfig) {
+		l.log.Infof("Configuration for %s runner for model %s unchanged", backendName, model)
+		return nil
+	}


@silvin-lubecki I think this path should handle your case without eviction (we can look at eviction disablement or configuration in a separate PR).

The logic is good, but I think we should avoid using DeepEqual if possible. Looking at the inference.BackendConfiguration type, it's simple enough we can add an Equal method:

import "slices" func (b *BackendConfiguration) Equals(other *BackendConfiguration) bool { return b.ContextSize == other.ContextSize && slices.Equal(b.RawFlags, other.RawFlags) }

Is there a risk to DeepEqual? It feels like there's a bigger risk if someone forgets to update Equals when they add a new field.

Good point about the maintenance risk! That's definitely a valid concern.

However, I think an explicit Equals method would be beneficial here for a couple of reasons:

Explicitness and readability: With only 2 fields (ContextSize and RawFlags), the comparison logic is clear. For such a simple struct, the Equals method is just 2 lines with slices.Equal.

Type safety: reflect.DeepEqual accepts any types and will silently return false for type mismatches, which could mask bugs during refactoring.

For the maintenance concern, we could add a focused test:

func TestBackendConfigurationEquals(t *testing.T) { base := &BackendConfiguration{ContextSize: 100, RawFlags: []string{"--flag"}} // Test each field difference different := *base different.ContextSize = 200 require.False(t, base.Equals(&different)) different = *base different.RawFlags = []string{"--other"} require.False(t, base.Equals(&different)) }

This way, adding a new field without updating Equals would likely be caught by the test.
That said, DeepEqual works perfectly fine for this use case too - what's your preference?

I'll leave it @p1-0tr to decide since he's sort of the CODEOWNER here 😉. I tend to lean towards less code = less maintenance. I don't see a risk with differing types since everything here is statically typed.

~~Not sure if that helps, but it seems the type can be safely compared with ==.~~

Sorry, looked at the wrong thing. Definitely not comparable with ==.

I dislike reflection as a matter of principle. But in this case I don't mind the DeepEquals, because it keeps the code simple and small, and it will likely disappear once we implement proper runner configuration management.

Just to be clear, I won't block on this 😄 I think I join @p1-0tr on this, I raised it as a matter of principle. And if it's temporary, then LGTM ✅

pkg/inference/scheduling/loader.go

silvin-lubecki

LGTM

context: Detect DD based on the OS and allow WSL2 client

xenoscopic requested review from doringeman, p1-0tr and silvin-lubecki June 17, 2025 16:13

xenoscopic commented Jun 17, 2025

View reviewed changes

github-advanced-security bot found potential problems Jun 17, 2025

View reviewed changes

pkg/inference/scheduling/loader.go Dismissed Show dismissed Hide dismissed

doringeman reviewed Jun 18, 2025

View reviewed changes

pkg/inference/scheduling/loader.go Show resolved Hide resolved

p1-0tr approved these changes Jun 18, 2025

View reviewed changes

silvin-lubecki approved these changes Jun 18, 2025

View reviewed changes

doringeman approved these changes Jun 18, 2025

View reviewed changes

xenoscopic merged commit 24a2a4b into main Jun 18, 2025
4 checks passed

xenoscopic deleted the config-refinement branch June 18, 2025 13:48

doringeman pushed a commit to doringeman/model-runner that referenced this pull request Sep 23, 2025

Do not assume a model always has a tag (docker#86)

5cd7f65

doringeman pushed a commit to doringeman/model-runner that referenced this pull request Sep 24, 2025

Do not assume a model always has a tag (docker#86)

c228907

doringeman added a commit to doringeman/model-runner that referenced this pull request Oct 2, 2025

Merge pull request docker#86 from doringeman/desktop-context

3d333f6

context: Detect DD based on the OS and allow WSL2 client

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Try to make backend configuration a little more robust. #86

Try to make backend configuration a little more robust. #86

Uh oh!

xenoscopic commented Jun 17, 2025

Uh oh!

xenoscopic Jun 17, 2025

Uh oh!

silvin-lubecki Jun 17, 2025

Uh oh!

xenoscopic Jun 17, 2025

Uh oh!

silvin-lubecki Jun 17, 2025

Uh oh!

xenoscopic Jun 17, 2025

Uh oh!

fiam Jun 17, 2025 •

edited

Loading

Uh oh!

p1-0tr Jun 18, 2025

Uh oh!

silvin-lubecki Jun 18, 2025

Uh oh!

Uh oh!

Uh oh!

silvin-lubecki left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

Try to make backend configuration a little more robust. #86

Try to make backend configuration a little more robust. #86

Uh oh!

Conversation

xenoscopic commented Jun 17, 2025

Uh oh!

xenoscopic Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

silvin-lubecki Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

xenoscopic Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

silvin-lubecki Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

xenoscopic Jun 17, 2025

Choose a reason for hiding this comment

Uh oh!

fiam Jun 17, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

p1-0tr Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

silvin-lubecki Jun 18, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

silvin-lubecki left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

6 participants

fiam Jun 17, 2025 •

edited

Loading