Skip to content

add patches to hwloc v2.12.1 to better handle ACPI CPCC, fixes performance regression in MPI startup time (MPI_Init)#25043

Merged
Micket merged 2 commits intoeasybuilders:developfrom
Thyre:20260113105823_new_pr_hwloc2121
Jan 13, 2026
Merged

add patches to hwloc v2.12.1 to better handle ACPI CPCC, fixes performance regression in MPI startup time (MPI_Init)#25043
Micket merged 2 commits intoeasybuilders:developfrom
Thyre:20260113105823_new_pr_hwloc2121

Conversation

@Thyre
Copy link
Copy Markdown
Collaborator

@Thyre Thyre commented Jan 13, 2026

(created using eb --new-pr)

See also open-mpi/hwloc#756.
Marked as draft to first test on our Zen 2 systems.

@Thyre Thyre marked this pull request as draft January 13, 2026 09:58
@github-actions github-actions Bot added 2025b issues & PRs related to 2025b common toolchains change labels Jan 13, 2026
@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Jan 13, 2026

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 3 secs) (1 easyconfigs in total)
jrlogin10.jureca - Linux Rocky Linux 9.6, x86_64, AMD EPYC 7742 64-Core Processor (zen2), 2 x NVIDIA Quadro RTX 8000, 580.95.05, Python 3.9.21
See https://gist.github.com/Thyre/8fdfdb5f99d55fafddb23dd56902561a for a full test report.

@Thyre Thyre marked this pull request as ready for review January 13, 2026 10:04
@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Jan 13, 2026

@boegelbot please test @ jsc-zen3
EB_ARGS="--installpath /tmp/$USER/ecpr-25043"

@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Jan 13, 2026

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 33 secs) (1 easyconfigs in total)
ZAM054 - Linux Zorin OS 18, x86_64, 12th Gen Intel(R) Core(TM) i7-1260P (skylake), 1 x NVIDIA NVIDIA GeForce MX550, 580.95.05, Python 3.12.3
See https://gist.github.com/Thyre/7989028ea48c082bac031c02a40d9efb for a full test report.

@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Jan 13, 2026

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (total: 58 secs) (1 easyconfigs in total)
jwlogin21.juwels - Linux Rocky Linux 9.6, x86_64, AMD EPYC 7402 24-Core Processor (zen2), Python 3.9.21
See https://gist.github.com/Thyre/7d6b210fd8546d872cdcad470bf565a5 for a full test report.

@boegelbot
Copy link
Copy Markdown
Collaborator

@Thyre: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=25043 EB_ARGS="--installpath /tmp/$USER/ecpr-25043" EB_CONTAINER= EB_REPO=easybuild-easyconfigs EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_25043 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 9377

Test results coming soon (I hope)...

Details

- notification for comment with ID 3743461308 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Jan 13, 2026

Test report by @Thyre
SUCCESS
Build succeeded for 1 out of 1 (total: 19 secs) (1 easyconfigs in total)
jpbl-s01-01 - Linux RHEL 9.6, AArch64, ARM UNKNOWN (neoverse_v2), 1 x NVIDIA NVIDIA GH200 480GB, 570.133.20, Python 3.9.21
See https://gist.github.com/Thyre/8f8187dafcd745a731e983e01849c4ed for a full test report.

@boegelbot
Copy link
Copy Markdown
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 22 secs) (1 easyconfigs in total)
jsczen3c1.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.7, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.23
See https://gist.github.com/boegelbot/9ce89bb2411ad87a0d7a592425348208 for a full test report.

@Thyre Thyre added bug fix and removed change labels Jan 13, 2026
Copy link
Copy Markdown
Contributor

@Micket Micket left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@Micket
Copy link
Copy Markdown
Contributor

Micket commented Jan 13, 2026

Test report by @Micket
SUCCESS
Build succeeded for 1 out of 1 (total: 1 min 3 secs) (1 easyconfigs in total)
vera-icelake-build - Linux Rocky Linux 9.6, x86_64, Intel(R) Xeon(R) Silver 4316 CPU @ 2.30GHz, Python 3.9.21
See https://gist.github.com/Micket/5cd103638b2d809266602ae7ce39df1a for a full test report.

@Micket Micket merged commit 9c1f56e into easybuilders:develop Jan 13, 2026
8 checks passed
@Micket Micket added this to the next release (5.2.1?) milestone Jan 13, 2026
@boegel
Copy link
Copy Markdown
Member

boegel commented Jan 14, 2026

@Thyre Mention during EB conf call: does same problem exist for hwloc in previous easyconfig generations as well?

If so, please open an issue to follow-up, or a PR (we can't keep track of TODOs in merged PRs)

@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Jan 14, 2026

@Thyre Mention during EB conf call: does same problem exist for hwloc in previous easyconfig generations as well?

If so, please open an issue to follow-up, or a PR (we can't keep track of TODOs in merged PRs)

2025a is also affected. Will open a follow up PR at latest tomorrow.

@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Jan 14, 2026

@boegel done in #25063

@boegel boegel changed the title add patches to hwloc v2.12.1 to better handle ACPI CPCC add patches to hwloc v2.12.1 to better handle ACPI CPCC, fixes performance regression in MPI startup time (time required for MPI_Init) Jan 15, 2026
@boegel boegel changed the title add patches to hwloc v2.12.1 to better handle ACPI CPCC, fixes performance regression in MPI startup time (time required for MPI_Init) add patches to hwloc v2.12.1 to better handle ACPI CPCC, fixes performance regression in MPI startup time (time required for MPI_Init) Jan 15, 2026
@boegel boegel changed the title add patches to hwloc v2.12.1 to better handle ACPI CPCC, fixes performance regression in MPI startup time (time required for MPI_Init) add patches to hwloc v2.12.1 to better handle ACPI CPCC, fixes performance regression in MPI startup time (MPI_Init) Feb 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

2025b issues & PRs related to 2025b common toolchains bug fix performance

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants