EuroCC@Greece HPC Training Series – Course 9 “Running LLMs on HPC: Transformers, Inference & Deployment”, on January 17th, 2025

EuroCC@Greece announced the 9th Course of HPC Training Series with the subject “Running LLMs on HPC: Transformers, Inference & Deployment”, that took place online on January 17th, 2025.

Presentation languages: Greek and English

Audience:

  • Data scientists and machine learning engineers.
  • NLP researchers and practitioners.
  • HPC system administrators and engineers.
  • Developers exploring Hugging Face Transformers and RAG.
  • Academic researchers working on language modeling projects.
  • Professionals interested in training or deploying LLMs on HPC.
  • Organizations planning to adopt HPC for AI workloads.

Description: This course focused on Large Language Models running on High-Performance Computing systems. Participants gained a foundational understanding of the Hugging Face Transformers library, embeddings’ models, and of Retrieval-Augmented Generation. They discovered how to effectively set up an inference server on HPC systems as well as a deployment process and limitations. Training of the Greek LLM Meltemi was also be presented. This seminar included hands-on sessions where users were able to run the provided code.

The Course’s presentation material can be found here.

You may find the Course’s available recordings in the dedicated playlist here.