Add checkpoint_name_prefix to RL training Config #192

pourion · 2025-12-19T22:10:16Z

Summary

Adds an optional checkpoint_name_prefix field to the RL training Config that prefixes all saved checkpoint names.

Motivation

When running multiple experiments, checkpoints saved to Tinker are difficult to identify because they're named only by batch number (e.g., 000042). This change allows prefixing with an experiment identifier (e.g., my_exp_dec19_000042), making it easy to match checkpoints to WandB runs.

Changes

- Added checkpoint_name_prefix: str | None = None to the Config class
- Updated save_checkpoint_and_get_sampling_client to accept and use the prefix
- Updated all checkpoint save call sites to pass the prefix
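
As a rough illustration of the naming rule described above, here is a minimal sketch. The helper names `build_checkpoint_name` and `batch_checkpoint_name` are hypothetical and not part of this PR. The six-digit zero padding and the underscore separator are inferred from the example names in this description (000042, my_exp_dec19_000042). Treating an empty-string prefix the same as None is an assumption.

```python
from __future__ import annotations


def build_checkpoint_name(base_name: str, prefix: str | None = None) -> str:
    """Return base_name unchanged, or "<prefix>_<base_name>" when a prefix is set.

    Hypothetical helper for illustration; the real logic lives inside
    save_checkpoint_and_get_sampling_client and may differ in detail.
    """
    if prefix:  # assumption: None and "" both mean "no prefix"
        return f"{prefix}_{base_name}"
    return base_name


def batch_checkpoint_name(i_batch: int, prefix: str | None = None) -> str:
    """Format a batch index as a six-digit, zero-padded name, then apply the prefix."""
    return build_checkpoint_name(f"{i_batch:06d}", prefix)


unprefixed = batch_checkpoint_name(42)
prefixed = batch_checkpoint_name(42, "my_exp_dec19")
```

With these definitions, `unprefixed` keeps the existing batch-number-only name and `prefixed` carries the experiment identifier in front of it.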

Usage

    Config(
        checkpoint_name_prefix="exp_rl_dec19",
        wandb_name="exp_rl_dec19",
        ...
    )

Backward Compatibility

Fully backward compatible - the field defaults to None, preserving existing behavior.
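
To make the compatibility claim concrete, the sketch below uses a stand-in class, `ConfigSketch`, which is hypothetical and carries only the two fields named in this PR. The real Config has more fields, may not be a plain dataclass, and the None default shown for `wandb_name` is an assumption.

```python
from __future__ import annotations

from dataclasses import dataclass


@dataclass
class ConfigSketch:
    # Stand-in for the real Config; field set and wandb_name default are assumptions.
    wandb_name: str | None = None
    # New optional field: defaults to None, so existing configs need no changes.
    checkpoint_name_prefix: str | None = None


# An existing config that never mentions the new field still constructs fine.
old_style = ConfigSketch(wandb_name="exp_rl_dec19")

# A new config opts in by setting the prefix explicitly.
new_style = ConfigSketch(
    wandb_name="exp_rl_dec19",
    checkpoint_name_prefix="exp_rl_dec19",
)
```

Because the field is optional and defaults to None, callers that omit it see the same checkpoint names as before.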

Copilot

Pull request overview

This PR adds an optional checkpoint_name_prefix field to the RL training Config class to improve checkpoint identification when running multiple experiments. The prefix is prepended to the batch number when saving checkpoints, making it easier to match checkpoints with their corresponding experiments.

- Added checkpoint_name_prefix field to the Config dataclass with a default value of None
- Updated checkpoint saving logic to use the prefix when constructing checkpoint names
- Updated all call sites to pass the prefix parameter through the call chain

Copilot · 2025-12-19T22:13:35Z

tinker_cookbook/rl/train.py

    # Get a sampling client using the new weights
    sampling_client, checkpoint_metrics = await save_checkpoint_and_get_sampling_client(
-        training_client, i_batch, log_path, save_every
+        training_client, i_batch, log_path, save_every, checkpoint_name_prefix=checkpoint_name_prefix


Inconsistent parameter passing style. This call uses a keyword argument for checkpoint_name_prefix, while other calls to save_checkpoint_and_get_sampling_client in this PR use positional arguments (see lines 332, 911, 959, 982). For consistency and clarity, consider using the same style throughout. Either use positional arguments or keyword arguments consistently for the new parameter.

Suggested change

-        training_client, i_batch, log_path, save_every, checkpoint_name_prefix=checkpoint_name_prefix
+        training_client, i_batch, log_path, save_every, checkpoint_name_prefix
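
One alternative way to settle the style question, offered only as a sketch, is to declare the new parameter keyword-only so that every call site has to name it. The function `save_checkpoint_sketch` below is a hypothetical stub, not the real save_checkpoint_and_get_sampling_client: its body only builds a name, and the keyword-only marker is a suggestion that is not part of this PR.

```python
from __future__ import annotations

import asyncio


async def save_checkpoint_sketch(
    training_client,
    i_batch: int,
    log_path: str,
    save_every: int,
    *,  # everything after this marker must be passed by keyword
    checkpoint_name_prefix: str | None = None,
) -> str:
    """Hypothetical stub: returns the checkpoint name instead of saving anything."""
    base = f"{i_batch:06d}"
    return f"{checkpoint_name_prefix}_{base}" if checkpoint_name_prefix else base


# Keyword style is the only accepted form for the new parameter.
name = asyncio.run(
    save_checkpoint_sketch(
        None, 42, "/tmp/logs", 20, checkpoint_name_prefix="exp_rl_dec19"
    )
)
```

Passing the prefix as a fifth positional argument to this stub raises TypeError, which would make mixed call styles impossible rather than merely discouraged. The reviewer's suggestion of positional arguments everywhere is equally consistent; the trade-off is readability at the call site versus brevity.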

Adds an optional `checkpoint_name_prefix` field to the Config class that prefixes all checkpoint names. This makes it easier to identify checkpoints on the Tinker platform by experiment/run name.

Example usage:

    Config(
        checkpoint_name_prefix="my_experiment_dec19",
        ...
    )

Results in checkpoints named like:
- my_experiment_dec19_000020
- my_experiment_dec19_000040
- my_experiment_dec19_final

Instead of:
- 000020
- 000040
- final

Copilot AI review requested due to automatic review settings December 19, 2025 22:10

Copilot started reviewing on behalf of pourion December 19, 2025 22:10

Copilot AI reviewed Dec 19, 2025

pourion force-pushed the feature/checkpoint-name-prefix branch from ad82e0b to 14e71dc on December 19, 2025 22:15

pourion force-pushed the feature/checkpoint-name-prefix branch from 14e71dc to 778ffd4 on December 19, 2025 22:18
