Skip to content

add patch to fix failing file system cache test for jax 0.2.19 on recent Linux kernels#14067

Merged
SebastianAchilles merged 2 commits intoeasybuilders:developfrom
Flamefire:20210924181305_new_pr_jax0219
Sep 25, 2021
Merged

add patch to fix failing file system cache test for jax 0.2.19 on recent Linux kernels#14067
SebastianAchilles merged 2 commits intoeasybuilders:developfrom
Flamefire:20210924181305_new_pr_jax0219

Conversation

@Flamefire
Copy link
Copy Markdown
Contributor

(created using eb --new-pr)

@SebastianAchilles
Copy link
Copy Markdown
Member

@boegelbot please test @ generoso
CORE_CNT="16"

@boegelbot
Copy link
Copy Markdown
Collaborator

@SebastianAchilles: Request for testing this PR well received on login1

PR test command 'EB_PR=14067 EB_ARGS= /opt/software/slurm/bin/sbatch --job-name test_PR_14067 --ntasks="16" ~/boegelbot/eb_from_pr_upload_generoso.sh' executed!

  • exit code: 0
  • output:
Submitted batch job 6989

Test results coming soon (I hope)...

Details

- notification for comment with ID 926765902 processed

Message to humans: this is just bookkeeping information for me,
it is of no use to you (unless you think I have a bug, which I don't).

@Flamefire
Copy link
Copy Markdown
Contributor Author

Test report by @Flamefire
FAILED
Build succeeded for 0 out of 1 (1 easyconfigs in total)
taurusi8007 - Linux centos linux 7.9.2009, x86_64, AMD EPYC 7352 24-Core Processor (zen2), Python 2.7.5
See https://gist.github.com/8c7ee8c3afef6a1c8bc5cfcdfed55406 for a full test report.

@Flamefire
Copy link
Copy Markdown
Contributor Author

Test report by @Flamefire
FAILED
Build succeeded for 0 out of 2 (2 easyconfigs in total)
taurusml12 - Linux RHEL 7.6, POWER, 8335-GTX (power9le), Python 2.7.5
See https://gist.github.com/e78cd9d48537b0645152ea979d214301 for a full test report.

@boegelbot
Copy link
Copy Markdown
Collaborator

Test report by @boegelbot
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
cnx1 - Linux rocky linux 8.4, x86_64, Intel(R) Xeon(R) CPU E5-2690 v3 @ 2.60GHz (haswell), Python 3.6.8
See https://gist.github.com/12cbc4576a674b170f65f956e4319751 for a full test report.

@Flamefire
Copy link
Copy Markdown
Contributor Author

@SebastianAchilles Our Epyc cluster seemingly cannot handle 10 parallel tests while I've seen it succeed with 5. Hence reduced that. The failures on PPC are kinda expected. Got to work on that later. Ready to merge from my side

@SebastianAchilles
Copy link
Copy Markdown
Member

Test report by @SebastianAchilles
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
jsfc061 - Linux centos linux 7.9.2009, x86_64, AMD EPYC 7742 64-Core Processor, Python 3.6.8
See https://gist.github.com/8fd99350b30f49208321519bdbddd67d for a full test report.

@boegel boegel changed the title Fix jax test failures on recent kernels add patch to fix failing file system cache test for jax on recent Linux kernels Sep 25, 2021
@SebastianAchilles
Copy link
Copy Markdown
Member

@SebastianAchilles Our Epyc cluster seemingly cannot handle 10 parallel tests while I've seen it succeed with 5. Hence reduced that. The failures on PPC are kinda expected. Got to work on that later. Ready to merge from my side

Thanks for the update 👍

Copy link
Copy Markdown
Member

@SebastianAchilles SebastianAchilles left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@SebastianAchilles
Copy link
Copy Markdown
Member

Going in, thanks @Flamefire!

@SebastianAchilles SebastianAchilles merged commit a207eb4 into easybuilders:develop Sep 25, 2021
@boegel boegel changed the title add patch to fix failing file system cache test for jax on recent Linux kernels add patch to fix failing file system cache test for jax 0.2.19 on recent Linux kernels Sep 25, 2021
@Flamefire
Copy link
Copy Markdown
Contributor Author

Test report by @Flamefire
SUCCESS
Build succeeded for 2 out of 2 (2 easyconfigs in total)
taurusa7 - Linux centos linux 7.7.1908, x86_64, Intel(R) Xeon(R) CPU E5-2603 v4 @ 1.70GHz (broadwell), Python 2.7.5
See https://gist.github.com/b1c47f63c73812ddfa12b556e2c19030 for a full test report.

@Flamefire Flamefire deleted the 20210924181305_new_pr_jax0219 branch September 27, 2021 08:03
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants