This workshop is designed to familiarize you with the capabilities of the LUMI supercomputer for artificial intelligence applications.
This two-day, in-person workshop takes place in Amsterdam, the Netherlands. Those unable to attend in person can join the lectures online.
The interactive hands-on exercises and personalized support for implementing your own workflows are exclusive to in-person attendees. Remote participants can follow the lectures, which are streamed live from the workshop.
Start: May 27 2025 09:00
End: May 28 2025 16:30
Learning outcomes
By attending the workshop, you will gain an understanding of the LUMI-G architecture as it applies to AI training, including an introduction to Slurm, ROCm, the Lustre and LUMI-O storage systems, and the Slingshot 11 interconnect. Specifically, you will:
- Learn to use existing AI containers on LUMI and build your own with the container build tool cotainr
- Learn to distribute AI workloads across multiple GPUs within a single LUMI-G node
- Explore strategies for scaling AI workloads across numerous GPUs distributed over several LUMI-G nodes
- Gain insight into advanced topics for optimizing AI training processes on the LUMI supercomputer