Skip to content

fix ELPA easyblock to not pass sm80 configure flag for newer NVIDIA GPUs#3886

Merged
smoors merged 2 commits intoeasybuilders:developfrom
Thyre:20250813113459_new_pr_elpa
Aug 18, 2025
Merged

fix ELPA easyblock to not pass sm80 configure flag for newer NVIDIA GPUs#3886
smoors merged 2 commits intoeasybuilders:developfrom
Thyre:20250813113459_new_pr_elpa

Conversation

@Thyre
Copy link
Copy Markdown
Collaborator

@Thyre Thyre commented Aug 13, 2025

(created using eb --new-pr)

Fixes build failure seen on GH200 due to sm_90 being used:

configure: error: You specified --enable-nvidia-sm80-gpu, but not --with-NVIDIA-GPU-compute-capability=sm_80 (or higher)Aborting...

One could also pass sm_80, but our CUDA check in Framework really won't like finding that architecture as well, if we only specify --cuda-compute-capabilities=sm_90 as the EasyBuild option.

Comment thread easybuild/easyblocks/e/elpa.py Outdated
@Thyre Thyre added the bug fix label Aug 13, 2025
@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Aug 13, 2025

@boegelbot please test @ jsc-zen3-a100
EB_ARGS="ELPA-2024.05.001-foss-2024a-CUDA-12.6.0.eb --installpath=/tmp/$USER/ebpr-3886"

@boegelbot
Copy link
Copy Markdown

@Thyre: Request for testing this PR well received on jsczen3l1.int.jsc-zen3.fz-juelich.de

PR test command 'if [[ develop != 'develop' ]]; then EB_BRANCH=develop ./easybuild_develop.sh 2> /dev/null 1>&2; EB_PREFIX=/home/boegelbot/easybuild/develop source init_env_easybuild_develop.sh; fi; EB_PR=3886 EB_ARGS="ELPA-2024.05.001-foss-2024a-CUDA-12.6.0.eb --installpath=/tmp/$USER/ebpr-3886" EB_REPO=easybuild-easyblocks EB_BRANCH=develop /opt/software/slurm/bin/sbatch --job-name test_PR_3886 --ntasks=8 ~/boegelbot/eb_from_pr_upload_jsc-zen3.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 7573

Test results coming soon (I hope)...

Details

- notification for comment with ID 3183224252 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@boegelbot
Copy link
Copy Markdown

Test report by @boegelbot

Overview of tested easyconfigs (in order)

  • SUCCESS ELPA-2024.05.001-foss-2024a-CUDA-12.6.0.eb

Build succeeded for 1 out of 1 (1 easyconfigs in total)
jsczen3c2.int.jsc-zen3.fz-juelich.de - Linux Rocky Linux 9.6, x86_64, AMD EPYC-Milan Processor (zen3), Python 3.9.21
See https://gist.github.com/boegelbot/5279be72693e2ec9959a392ffe4cfe72 for a full test report.

@Thyre
Copy link
Copy Markdown
Collaborator Author

Thyre commented Aug 13, 2025

Success with sm_90

easybuilders/easybuild-easyconfigs#23626 (comment)

This really looks like a bug in their configure script. They mention sm_80 or greater, but fail for an arch actually being greater than 8.0

@smoors smoors added this to the next release (5.1.2) milestone Aug 18, 2025
Copy link
Copy Markdown
Contributor

@smoors smoors left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@smoors
Copy link
Copy Markdown
Contributor

smoors commented Aug 18, 2025

Going in, thanks @Thyre!

@smoors smoors merged commit 5e54586 into easybuilders:develop Aug 18, 2025
17 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants