LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework
Updated Jan 28, 2026 - Python
Codebase of GRPO: Implementations and Resources of GRPO and Its Variants