Add OSU with CUDA #1401

Open
casparvl wants to merge 1 commit into EESSI:main from casparvl:osu_cuda

Conversation

@casparvl
Collaborator

No description provided.

@casparvl
Collaborator Author

bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/intel/icelake,accel=nvidia/cc80
bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf for:arch=x86_64/amd/zen4,accel=nvidia/cc90
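The two bot commands above follow a `key:value` layout, with the `for:` value holding comma-separated `key=value` pairs. As a rough illustration only, the sketch below splits such a line into its parts. The helper name `parse_bot_build_command` is hypothetical and the grammar is inferred from these two lines alone; it is not the bot's actual parser.

```python
def parse_bot_build_command(line: str) -> dict:
    """Split a 'bot: build key:value ...' line into a dict (hypothetical helper)."""
    prefix = "bot: build"
    if not line.startswith(prefix):
        raise ValueError(f"not a bot build command: {line!r}")

    fields = {}
    for token in line[len(prefix):].split():
        # Split on the first ':' only, so values may themselves contain ':'.
        key, _, value = token.partition(":")
        fields[key] = value

    # Assumption: the 'for' value is a comma-separated list of key=value pairs.
    target = fields.get("for", "")
    fields["for"] = dict(item.split("=", 1) for item in target.split(",") if item)
    return fields


cmd = (
    "bot: build repo:eessi.io-2025.06-software instance:eessi-bot-surf "
    "for:arch=x86_64/intel/icelake,accel=nvidia/cc80"
)
parsed = parse_bot_build_command(cmd)
print(parsed)
```

Splitting on the first separator only keeps the sketch tolerant of values that contain further `:` or `=` characters.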

@eessi-bot-surf

eessi-bot-surf bot commented Feb 19, 2026

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: intel-icelake and accelerator nvidia/cc80
Building for: x86_64/intel/icelake and accelerator nvidia/cc80
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2026.02/pr_1401/19727277

| date | job status | comment |
| --- | --- | --- |
| Feb 19 22:27:34 UTC 2026 | submitted | job id 19727277 will be eligible to start in about 20 seconds |
| Feb 19 22:27:45 UTC 2026 | received | job awaits launch by Slurm scheduler |
| Feb 19 22:28:03 UTC 2026 | running | job 19727277 is running |
| Feb 19 22:34:50 UTC 2026 | finished | 😢 FAILURE (click triangle for details) |
Details
✅ job output file slurm-19727277.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-intel-icelake-accel-nvidia-cc80-17715404340.tar.zst
size: 0 MiB (831648 bytes)
entries: 52
modules under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/modules/all
UCX-CUDA/1.16.0-GCCcore-13.3.0-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/software
UCX-CUDA/1.16.0-GCCcore-13.3.0-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80/reprod
UCX-CUDA/1.16.0-GCCcore-13.3.0-CUDA-12.6.0/20260219_223130UTC
other under 2025.06/software/linux/x86_64/intel/icelake/accel/nvidia/cc80
no other files in tarball
| Feb 19 22:34:50 UTC 2026 | test result | 😁 SUCCESS (click triangle for details) |
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-19727277.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case
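The build status in the bot comment above is derived from a list of patterns that must or must not appear in the job output. A minimal sketch of that kind of check, assuming the patterns are treated as regular expressions and taken verbatim from the first Details list, could look as follows. The names `CHECKS`, `evaluate_log`, and `build_succeeded` are made up for illustration; this is not the bot's own code.

```python
import re

# (pattern, expected_present) pairs copied from the Details list above.
# Assumption: each pattern is used as a regular expression.
CHECKS = [
    (r"FATAL:", False),
    (r"ERROR:", False),
    (r"FAILED:", False),
    (r"required modules missing:", False),
    (r"No missing installations", True),
    (r".tar.* created!", True),
]


def evaluate_log(text: str) -> dict:
    """Return {pattern: passed} for every check against the job output."""
    results = {}
    for pattern, expected_present in CHECKS:
        found = re.search(pattern, text) is not None
        results[pattern] = found == expected_present
    return results


def build_succeeded(text: str) -> bool:
    """A build counts as successful only if every single check passes."""
    return all(evaluate_log(text).values())


# Made-up job output resembling the failure reported above.
sample = "ERROR: something went wrong\n/tmp/example.tar.zst created!\n"
for pattern, passed in evaluate_log(sample).items():
    print("OK  " if passed else "FAIL", pattern)
```

Under these assumptions a single `ERROR:` line is enough to mark the build as failed even when a tarball was created, which matches the mix of ✅ and ❌ entries in the report.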

@eessi-bot-surf

eessi-bot-surf bot commented Feb 19, 2026

New job on instance eessi-bot-surf for repository eessi.io-2025.06-software
Building on: amd-zen4 and accelerator nvidia/cc90
Building for: x86_64/amd/zen4 and accelerator nvidia/cc90
Job dir: /projects/eessibot/eessi-bot-surf/jobs/2026.02/pr_1401/19727411

| date | job status | comment |
| --- | --- | --- |
| Feb 19 22:27:40 UTC 2026 | submitted | job id 19727411 will be eligible to start in about 20 seconds |
| Feb 19 22:27:49 UTC 2026 | received | job awaits launch by Slurm scheduler |
| Feb 19 22:28:18 UTC 2026 | running | job 19727411 is running |
| Feb 19 22:34:26 UTC 2026 | finished | 😢 FAILURE (click triangle for details) |
Details
✅ job output file slurm-19727411.out
✅ no message matching FATAL:
❌ found message matching ERROR:
❌ found message matching FAILED:
❌ found message matching required modules missing:
❌ no message matching No missing installations
✅ found message matching .tar.* created!
Artefacts
eessi-2025.06-software-linux-x86_64-amd-zen4-accel-nvidia-cc90-17715404080.tar.zst
size: 0 MiB (832048 bytes)
entries: 52
modules under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/modules/all
UCX-CUDA/1.16.0-GCCcore-13.3.0-CUDA-12.6.0.lua
software under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/software
UCX-CUDA/1.16.0-GCCcore-13.3.0-CUDA-12.6.0
reprod directories under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90/reprod
UCX-CUDA/1.16.0-GCCcore-13.3.0-CUDA-12.6.0/20260219_223108UTC
other under 2025.06/software/linux/x86_64/amd/zen4/accel/nvidia/cc90
no other files in tarball
| Feb 19 22:34:26 UTC 2026 | test result | 😁 SUCCESS (click triangle for details) |
ReFrame Summary
[ PASSED ] Ran 0/0 test case(s) from 0 check(s) (0 failure(s), 0 skipped, 0 aborted)
Details
✅ job output file slurm-19727411.out
❌ found message matching ERROR:
✅ no message matching [\s*FAILED\s*].*Ran .* test case

@bedroge
Collaborator

bedroge commented Feb 20, 2026

ERROR: /cvmfs/software.eessi.io/versions/2025.06/compat/linux/x86_64/lib/nvidia is a symlink pointing to /cvmfs/software.eessi.io/defaults/nvidia, which is a symlink pointing to /dev/null

Looks like the variant symlinks need to be configured for the Surf bot?
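To check on a given host whether a path ends up at `/dev/null` through a chain of symlinks, as the error above describes, something like the following sketch could be used. The function names `symlink_chain` and `resolves_to_dev_null` are hypothetical, and the only path taken from this thread is the one in the error message.

```python
import os


def symlink_chain(path: str, max_hops: int = 16) -> list[str]:
    """Follow symlinks one hop at a time and return every path visited."""
    chain = [path]
    for _ in range(max_hops):
        if not os.path.islink(path):
            break
        target = os.readlink(path)
        # Relative targets are resolved against the link's own directory;
        # absolute targets replace the path entirely.
        path = os.path.normpath(os.path.join(os.path.dirname(path), target))
        chain.append(path)
    return chain


def resolves_to_dev_null(path: str) -> bool:
    """True if the last hop of the symlink chain is /dev/null."""
    return symlink_chain(path)[-1] == "/dev/null"


# Path copied from the error message above; on a host without this CVMFS
# repository mounted, the chain simply contains the path itself.
nvidia_dir = "/cvmfs/software.eessi.io/versions/2025.06/compat/linux/x86_64/lib/nvidia"
print(" -> ".join(symlink_chain(nvidia_dir)))
print("points to /dev/null:", resolves_to_dev_null(nvidia_dir))
```

Walking one hop at a time, rather than calling `os.path.realpath`, keeps the intermediate link visible, which helps show which symlink in the chain is the one left unconfigured.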

@casparvl
Collaborator Author

ah, yeah, same issue you had on the jsc bot...
