fix: adapt PushJobAgent to new time management mechanism #8526

Merged
fstagni merged 1 commit into DIRACGrid:integration from aldbr:fix-pja-time-left
May 5, 2026

Conversation

@aldbr
Contributor

@aldbr aldbr commented May 4, 2026

#8416 streamlined the time management mechanism within allocations.
Problem: the PushJobAgent (PJA) was relying on a bug, namely that the Watchdog was not killing payloads that reached the end of their allocation.
When we push jobs, we don't set any CPUTimeLeft, so it is now seen as 0 and the jobs are killed immediately.
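
To illustrate the failure mode described above, here is a minimal toy sketch in Python. It is not DIRAC code: the function names `time_left_seen_by_watchdog` and `should_kill`, and the simple "consumed >= budget" rule, are assumptions made up for this illustration. It only shows why an unset CPUTimeLeft that is read as 0 leads to an immediate kill once the end-of-allocation check is actually enforced.

```python
from typing import Optional


def time_left_seen_by_watchdog(cpu_time_left: Optional[float]) -> float:
    """Hypothetical helper: an unset CPUTimeLeft is read as 0 seconds."""
    return 0.0 if cpu_time_left is None else float(cpu_time_left)


def should_kill(cpu_consumed: float, cpu_time_left: Optional[float]) -> bool:
    """Hypothetical check: kill the payload once it has used up its budget."""
    return cpu_consumed >= time_left_seen_by_watchdog(cpu_time_left)


# Pushed job: no CPUTimeLeft was set, so the budget is seen as 0
# and the payload is killed before it has consumed anything.
pushed_job_killed = should_kill(cpu_consumed=0.0, cpu_time_left=None)

# Job with an explicit budget: it survives until the budget is exhausted.
budgeted_job_killed = should_kill(cpu_consumed=10.0, cpu_time_left=3600.0)

print(pushed_job_killed, budgeted_job_killed)
```

The sketch covers only the problem, not the fix; how the PushJobAgent was adapted is in the commit itself and is not shown in this conversation.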

BEGINRELEASENOTES
*WorkloadManagement
FIX: adapt PushJobAgent to new time management mechanism
ENDRELEASENOTES

@aldbr aldbr requested review from atsareg and fstagni as code owners May 4, 2026 12:29
@aldbr aldbr force-pushed the fix-pja-time-left branch from 0940da1 to 1a6e5a3 Compare May 4, 2026 13:05
@aldbr
Contributor Author

aldbr commented May 4, 2026

Tested in LHCbDIRAC: jobs no longer fail immediately.

@fstagni fstagni merged commit 1738e7c into DIRACGrid:integration May 5, 2026
23 checks passed
@DIRACGridBot DIRACGridBot added the sweep:ignore Prevent sweeping from being ran for this PR label May 5, 2026
3 participants