hugging-face-model-trainer
// This skill should be used when users want to train or fine-tune language models using TRL (Transformer Reinforcement Learning) on Hugging Face Jobs infrastructure. Covers SFT, DPO, GRPO and reward modeling training methods, plus GGUF conversion for local deployment. Includes guidance on the TRL Jobs
$ git log --oneline --stat
stars:8,109
forks:1.5k
updated:March 4, 2026
SKILL.md
Този skill няма публичен SKILL.md файл.
Разгледайте в GitHub