LLM Fine-Tuning TutorialsΒΆ

These tutorials provide an introductory guide to using AgileRL for fine-tuning LLMs.

GRPO - Fine-Tuning

GRPO - Fine-Tuning with HPO

SFT & DPO - Fine-Tuning

Multiturn - LLMPPO, LLMREINFORCE, GRPO