Add realtime_trace_jsonl recipe for structured real-time optimization progress streaming by MySweetEden · Pull Request #1177 · scipopt/PySCIPOpt

MySweetEden · 2026-01-24T14:31:47Z

Motivation

PySCIPOpt already has recipe(s) that store optimization progress in memory. However, in-memory traces are not suitable for real-time, external observation (e.g., another process tailing progress, dashboards, log collectors).

This recipe focuses on the missing piece: a stream-friendly, structured output that can be consumed outside the running Python process during solve.

Design Decisions

JSONL format: Designed for streaming writes and partial reads; remains readable even if the run is interrupted or crashes
Real-time external output is the primary value:
- Records progress updates as one JSON object per line
- Flushes on key events so downstream consumers can react immediately
Schema compatibility with setTracefile() (PR #1158, "Add setTracefile() method for structured optimization progress logging"):
- Uses the same JSONL field names (type, time, primalbound, dualbound, gap, nodes, nsol) for consistency across tracing approaches.
In-memory + file output: Keeps model.data["trace"] for convenience/testing, but the recipe is centered on file streaming via path=...
Robust termination signaling for external monitoring:
- Always emits a final run_end record on normal termination, interruption, or exception
- On exceptions, run_end includes structured error metadata (status, exception type, message)
- Flushes run_end to make completion detection reliable
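
The termination-signaling decisions above can be sketched in plain Python. This is a minimal illustration, not the recipe's actual code: `JsonlTraceWriter`, `run_with_trace`, and the `status` strings (`"ok"`, `"interrupted"`, `"error"`) are hypothetical names chosen for the example, and the `solve` callable stands in for the real optimization call.

```python
import json
import os
import tempfile


class JsonlTraceWriter:
    """Hypothetical helper: one JSON object per line, flush on request."""

    def __init__(self, path):
        # Opening with "w" truncates, so each run starts with an empty file.
        self._fh = open(path, "w", encoding="utf-8")

    def write_event(self, type_, fields, flush):
        record = {"type": type_, **fields}
        self._fh.write(json.dumps(record) + "\n")
        if flush:
            self._fh.flush()

    def close(self):
        self._fh.close()


def run_with_trace(path, solve):
    """Run solve(writer) and always emit a final, flushed run_end record."""
    writer = JsonlTraceWriter(path)
    end_fields = {"status": "ok"}  # assumed status value
    try:
        solve(writer)
    except KeyboardInterrupt:
        end_fields = {"status": "interrupted"}  # assumed status value
        raise
    except Exception as exc:
        end_fields = {
            "status": "error",  # assumed status value
            "exception": type(exc).__name__,
            "message": str(exc),
        }
        raise
    finally:
        # Runs on normal termination, interruption, and exception alike.
        writer.write_event("run_end", end_fields, flush=True)
        writer.close()


def failing_solve(writer):
    writer.write_event("bestsol_found", {"primalbound": 10.0}, flush=True)
    raise RuntimeError("solver crashed")


trace_path = os.path.join(tempfile.mkdtemp(), "trace.jsonl")
try:
    run_with_trace(trace_path, failing_solve)
except RuntimeError:
    pass  # the exception still propagates to the caller

with open(trace_path, encoding="utf-8") as fh:
    records = [json.loads(line) for line in fh]
```

Because `run_end` is written in `finally`, an external consumer can treat its presence as the completion signal regardless of how the run ended.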

Events Recorded

bestsol_found: when a new best solution is found
dualbound_improved: when the dual bound improves
run_end: when optimization terminates (also emitted on interrupt/exception)

Fields

type, time, primalbound, dualbound, gap, nodes, nsol (aligned with the JSONL trace schema introduced in PR #1158)
(run_end may additionally include: status, exception, message on failure)
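
For a consumer of the file, a sketch like the following may help. It assumes only the field names listed above; the numeric values in `sample` are made up for illustration, and `parse_trace_lines` / `is_finished` are hypothetical helper names. The last line of the sample is deliberately cut off to mimic a read that races with a write.

```python
import json


def parse_trace_lines(text):
    """Parse complete JSONL lines; return (records, leftover partial line)."""
    records = []
    # Everything after the last newline is an incomplete line (possibly "").
    *complete, leftover = text.split("\n")
    for line in complete:
        if line.strip():
            records.append(json.loads(line))
    return records, leftover


def is_finished(records):
    """True once a run_end record has been seen."""
    return any(r.get("type") == "run_end" for r in records)


# Illustrative data only; not output from a real solve.
sample = (
    '{"type": "bestsol_found", "time": 0.4, "primalbound": 12.0, '
    '"dualbound": 8.0, "gap": 0.5, "nodes": 1, "nsol": 1}\n'
    '{"type": "dualbound_improved", "time": 0.9, "primalbound": 12.0, '
    '"dualbound": 10.0, "gap": 0.2, "nodes": 7, "nsol": 1}\n'
    '{"type": "run_end", "ti'
)

records, leftover = parse_trace_lines(sample)

# Later, the rest of the line arrives and the record becomes parseable.
more_records, leftover_after = parse_trace_lines(leftover + 'me": 1.0}\n')
```

Keeping the leftover fragment and prepending it to the next read is what makes partial reads safe for a tailing process.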

…tracking This update introduces a comprehensive docstring for the _TraceRun class, detailing its purpose, arguments, return values, and usage examples. This enhancement improves code documentation and usability for future developers.

…racking with JSONL output This commit introduces the realtime_trace_jsonl recipe, which allows for real-time tracking of optimization progress and outputs the data in JSONL format. Additionally, the CHANGELOG has been updated to reflect this new feature.

Copilot

Pull request overview

Adds a new PySCIPOpt recipe to stream structured optimization progress in real time using JSONL, enabling external tailing/monitoring while the solver runs.

Changes:

Introduces realtime_trace_jsonl recipe with optimizeTrace() / optimizeNogilTrace() to record selected SCIP events into model.data["trace"] and optionally a JSONL file.
Records bestsol_found, dualbound_improved, and a final run_end event with flushing intended for real-time consumption.
Adds tests covering in-memory tracing, file output, and interrupt handling; updates changelog.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 7 comments.

| File | Description |
| --- | --- |
| src/pyscipopt/recipes/realtime_trace_jsonl.py | Implements the real-time JSONL tracing recipe and event handling. |
| tests/test_recipe_realtime_trace_jsonl.py | Adds tests for in-memory traces, JSONL file output, and interruption behavior. |
| CHANGELOG.md | Documents the addition of the new recipe. |

Copilot · 2026-01-28T14:50:19Z

src/pyscipopt/recipes/realtime_trace_jsonl.py

+        self._handler = _TraceEventhdlr()
+        self.model.includeEventhdlr(
+            self._handler, "realtime_trace_jsonl", "Realtime trace jsonl handler"
+        )


includeEventhdlr() registers an event handler plugin permanently (there is no corresponding remove/uninclude API). Calling optimizeTrace()/optimizeNogilTrace() multiple times on the same model will attempt to include another handler with the same name (realtime_trace_jsonl), which can raise a SCIP error and/or leave multiple live handlers capturing closed file handles and old _TraceRun instances. Refactor to include the handler at most once per model (e.g., stash/reuse it in model.data), and make the handler read its current sink (trace list / file handle) from mutable attributes rather than a closure over a per-run object.

Copilot · 2026-01-28T14:50:20Z

src/pyscipopt/recipes/realtime_trace_jsonl.py

+                    self._write_event(
+                        "dualbound_improved", fields=snapshot, flush=False
+                    )


For a recipe marketed as “real-time JSONL streaming”, not flushing dualbound_improved events can delay visibility for external consumers tailing the file. Consider flushing here as well (or making flushing policy configurable), especially since dualbound_improved is one of the primary progress signals you record.
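
A configurable flushing policy, as floated in this comment, might be sketched as follows. Everything here is an assumption for illustration: `FlushPolicy`, its `always` and `min_interval` parameters, and `CountingStream` do not exist in the recipe. Timestamps are passed in explicitly so the behavior is deterministic; real code would more likely read a monotonic clock.

```python
import json


class FlushPolicy:
    """Hypothetical policy: always flush some event types, throttle the rest."""

    def __init__(self, always=("bestsol_found", "run_end"), min_interval=1.0):
        self.always = set(always)
        self.min_interval = min_interval  # seconds between throttled flushes
        self._last_flush = float("-inf")

    def should_flush(self, event_type, now):
        due = now - self._last_flush >= self.min_interval
        if event_type in self.always or due:
            self._last_flush = now
            return True
        return False


class CountingStream:
    """Minimal file-like object that counts flush() calls."""

    def __init__(self):
        self.lines = []
        self.flush_count = 0

    def write(self, text):
        self.lines.append(text)

    def flush(self):
        self.flush_count += 1


def write_event(stream, policy, event_type, now, **fields):
    record = {"type": event_type, "time": now, **fields}
    stream.write(json.dumps(record) + "\n")
    if policy.should_flush(event_type, now):
        stream.flush()


stream = CountingStream()
policy = FlushPolicy(min_interval=1.0)

# Four frequent events inside 1.5 s, followed by the final record.
for t in (0.0, 0.1, 0.2, 1.5):
    write_event(stream, policy, "dualbound_improved", t, dualbound=t)
write_event(stream, policy, "run_end", 1.6)
```

With a time-based throttle, frequent `dualbound_improved` events cost at most one flush per interval while still bounding the delay seen by a tailing consumer.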

MySweetEden · 2026-01-29T16:47:19Z

I’ll address the comments over the weekend and push updates soon.

MySweetEden · 2026-01-31T20:01:00Z

Addressed

1. `dropEvent()` refcount underflow prevention

SCIP does not provide a dedicated "solve finished" event, so we cannot rely on an event-driven shutdown callback (like eventexit()) for cleanup. Therefore, cleanup is performed in __exit__, with guards to avoid invalid dropEvent() calls.

Implementation:

Added self._caught_events (set) to track which events were successfully caught in eventinit()
Modified __exit__ to drop only the events that were actually caught
This prevents incorrect dropEvent() calls if eventinit() partially fails
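
The guard described above can be illustrated with a small stand-alone sketch. `FakeModel` is a stand-in that raises when an event is dropped without a matching catch; it is not PySCIPOpt, the event names are placeholder strings, and `eventinit()` is called by hand here rather than by the solver.

```python
class FakeModel:
    """Stand-in that counts catch/drop calls per event type; NOT PySCIPOpt."""

    def __init__(self, fail_on=None):
        self.refcount = {}
        self.fail_on = fail_on  # event type whose catch should fail

    def catchEvent(self, eventtype, handler):
        if eventtype == self.fail_on:
            raise RuntimeError(f"cannot catch {eventtype}")
        self.refcount[eventtype] = self.refcount.get(eventtype, 0) + 1

    def dropEvent(self, eventtype, handler):
        if self.refcount.get(eventtype, 0) == 0:
            raise RuntimeError(f"drop without catch: {eventtype}")
        self.refcount[eventtype] -= 1


class TraceRun:
    """Hypothetical context manager that drops only what it caught."""

    EVENTS = ("BESTSOLFOUND", "DUALBOUNDIMPROVED")  # placeholder names

    def __init__(self, model):
        self.model = model
        self._caught_events = set()

    def eventinit(self):
        for eventtype in self.EVENTS:
            self.model.catchEvent(eventtype, self)
            # Recorded only after the catch succeeded.
            self._caught_events.add(eventtype)

    def __enter__(self):
        return self

    def __exit__(self, exc_type, exc, tb):
        for eventtype in list(self._caught_events):
            self.model.dropEvent(eventtype, self)
            self._caught_events.discard(eventtype)
        return False  # never swallow exceptions


model = FakeModel(fail_on="DUALBOUNDIMPROVED")
with TraceRun(model) as run:
    try:
        run.eventinit()
    except RuntimeError:
        pass  # partial failure: only BESTSOLFOUND was caught
```

Dropping every entry in `EVENTS` unconditionally would have raised for the event that was never caught; iterating over the tracked set avoids that.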

2. Trace data initialization

Changed from setdefault() to explicit assignment:

self.model.data["trace"] = []

Rationale: Ensures each traced run starts with a fresh list, preventing data from previous runs from mixing and keeping in-memory behavior consistent with file output (which always truncates).
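
The difference between the two forms is easy to see on a plain dictionary. The sketch below uses an ordinary `dict` in place of `model.data` and a made-up leftover record.

```python
# Leftover record from an earlier run (illustrative data).
data = {"trace": [{"type": "run_end"}]}

# setdefault() returns the existing list, so old records leak into the new run.
reused = data.setdefault("trace", [])
reused_len = len(reused)

# Explicit assignment always starts the run with an empty list.
data["trace"] = []
fresh_len = len(data["trace"])
```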

3. Event handler parameter naming

Changed the first parameter from s to hdlr to improve readability while avoiding a collision with the outer self of _TraceRun.
Note: hdlr is a conventional abbreviation commonly used in the PySCIPOpt/SCIP codebase.

4. Exception handling and flush comments

Added explanatory comments to exception handling and flush behavior for clarity.

Not Addressed

1. `includeEventhdlr()` multiple invocation issue

The concern about permanent handler registration is valid:

includeEventhdlr() registers handlers permanently with no removal API
dropEvent() only unsubscribes from events, not the handler itself

Scope: Refactoring to a handler-reuse pattern would require architectural changes and will be addressed separately. The current implementation assumes single-run usage.

2. `dualbound_improved` flush policy

dualbound_improved events are intentionally not flushed:

Frequency asymmetry: dualbound_improved fires hundreds to thousands of times during optimization, while bestsol_found fires only a few dozen times at most; flushing on every dual bound update would accumulate significant I/O overhead
OS buffering suffices: Events naturally flush within seconds via OS buffering, providing adequate real-time visibility
Context: Optimizations typically run for minutes to hours, making second-scale buffering delays negligible

Discussion: I'm open to reconsidering the flush policy if there are use cases where immediate flushing of dualbound_improved events is valuable (e.g., sub-minute monitoring). Would making it configurable be useful, or is the current approach acceptable?

MySweetEden · 2026-02-04T17:45:20Z

I addressed the actionable review items and all checks are green. A couple of higher-level/trade-off points are intentionally left open for discussion. Could you take another look when you have time?

Joao-Dionisio · 2026-02-05T23:28:07Z

Hey @MySweetEden, yes I will have a look! I will try to lay low for a little bit, for my own sake, but this should get merged, don't worry :)

MySweetEden and others added 13 commits January 20, 2026 03:57

Add trace_run recipe for structured trace output

08308d7

Refine trace_run recipe behavior

408b6da

Add tests for trace_run recipe

868022f

Rename tests for trace_run recipe

9d2dfac

Refactor trace_run recipe to enhance event handling and optimize func…

4ab23b2

…tion naming

Enhance _TraceRun class to include snapshot logging and improve event…

1f1bead

… handling; rename optimize_with_trace to optimizeTrace for clarity

Refactor _TraceRun class to replace snapshot logging methods with a u…

2c9444c

…nified event writing method, improving clarity and consistency in event handling.

Refactor tests for trace_run recipe to use parameterized optimize fun…

211359a

…ction, enhancing test coverage for both optimizeTrace and optimizeNogilTrace. Update assertions for trace data consistency.

Update usage examples in _TraceRun class docstring to use keyword arg…

6bf7355

…uments for clarity

Merge branch 'master' into realtime-trace-jsonl

cc41cad

Merge branch 'master' into realtime-trace-jsonl

f646e51

Joao-Dionisio requested review from Joao-Dionisio and Copilot and removed request for Copilot January 28, 2026 14:41

Copilot started reviewing on behalf of Joao-Dionisio January 28, 2026 14:42

Copilot AI reviewed Jan 28, 2026

MySweetEden added 3 commits February 1, 2026 03:37

Refactor event handling in _TraceRun class to improve clarity and mai…

b822dbc

…ntainability. Introduced a set to track caught events, ensuring proper cleanup during event execution. Updated event initialization and execution methods for consistency.

Merge remote-tracking branch 'origin/master' into realtime-trace-jsonl

7fd53cf

Enhance comments in _TraceRun class for clarity on event handling and…

987251f

… cleanup process. Added note regarding flushing behavior for dualbound_improved events.

MySweetEden and others added 4 commits February 2, 2026 19:50

Merge remote-tracking branch 'origin/master' into realtime-trace-jsonl

3478efb

Merge branch 'master' into realtime-trace-jsonl

fc869bc

Merge remote-tracking branch 'origin/master' into realtime-trace-jsonl

c7d36b9

Merge remote-tracking branch 'origin/master' into realtime-trace-jsonl

7e81977

MySweetEden wants to merge 20 commits into scipopt:master from MySweetEden:realtime-trace-jsonl
