EuroCC@Greece announced the 9th Course of HPC Training Series with the subject “Running LLMs on HPC: Transformers, Inference & Deployment”, that took place online on January 17th, 2025.
Presentation languages: Greek and English
Audience:
- Data scientists and machine learning engineers.
- NLP researchers and practitioners.
- HPC system administrators and engineers.
- Developers exploring Hugging Face Transformers and RAG.
- Academic researchers working on language modeling projects.
- Professionals interested in training or deploying LLMs on HPC.
- Organizations planning to adopt HPC for AI workloads.
Description: This course focused on Large Language Models running on High-Performance Computing systems. Participants gained a foundational understanding of the Hugging Face Transformers library, embeddings’ models, and of Retrieval-Augmented Generation. They discovered how to effectively set up an inference server on HPC systems as well as a deployment process and limitations. Training of the Greek LLM Meltemi was also be presented. This seminar included hands-on sessions where users were able to run the provided code.
The Course’s presentation material can be found here.
You may find the Course’s available recordings in the dedicated playlist here.