Skip to content

AP-25628: add checkpoint/restore (CRaC support in executor)#88

Open
bernd-wiswedel wants to merge 1 commit intomasterfrom
todo/AP-25628-c-ra-c-po-c-for-faster-executor-statup
Open

AP-25628: add checkpoint/restore (CRaC support in executor)#88
bernd-wiswedel wants to merge 1 commit intomasterfrom
todo/AP-25628-c-ra-c-po-c-for-faster-executor-statup

Conversation

@bernd-wiswedel
Copy link
Copy Markdown
Member

AP-25628 (PoC: "CRaC" for faster executor startup (suspend VM after start))

@bernd-wiswedel bernd-wiswedel requested a review from a team as a code owner March 18, 2026 10:16
@bernd-wiswedel bernd-wiswedel requested review from Copilot and knime-ghub-bot and removed request for a team March 18, 2026 10:16
Copy link
Copy Markdown
Contributor

Copilot AI left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Pull request overview

Adds a CRaC (Coordinated Restore at Checkpoint) hook to the KNIME Python gateway tracking layer so Python processes are terminated before the JVM is checkpointed, aiming to improve executor startup/restore behavior.

Changes:

  • Register a PhasedInit callback in PythonGatewayTracker to run cleanup before checkpointing.
  • Reuse existing gateway cleanup logic (clear()) to forcefully terminate tracked Python gateways/processes.

Comment thread org.knime.python3/src/main/java/org/knime/python3/PythonGatewayTracker.java Outdated
Comment thread org.knime.python3/src/main/java/org/knime/python3/PythonGatewayTracker.java Outdated
Comment thread org.knime.python3/src/main/java/org/knime/python3/PythonGatewayTracker.java Outdated
@bernd-wiswedel bernd-wiswedel force-pushed the todo/AP-25628-c-ra-c-po-c-for-faster-executor-statup branch 3 times, most recently from dd8bd9d to ed41788 Compare April 4, 2026 12:29
@bernd-wiswedel bernd-wiswedel force-pushed the todo/AP-25628-c-ra-c-po-c-for-faster-executor-statup branch from ed41788 to 4c40520 Compare April 12, 2026 13:19
AP-25628 (PoC: "CRaC" for faster executor startup (suspend VM after start))
@bernd-wiswedel bernd-wiswedel force-pushed the todo/AP-25628-c-ra-c-po-c-for-faster-executor-statup branch from 4c40520 to 0c39b45 Compare April 16, 2026 11:24
@sonarqubecloud
Copy link
Copy Markdown

Quality Gate Failed Quality Gate failed

Failed conditions
61.5% Coverage on New Code (required ≥ 85%)

See analysis details on SonarQube Cloud

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants