dapo

Star

Here are 7 public repositories matching this topic...

opendilab / LightRFT

Star

LightRFT: Light, Efficient, Omni-modal & Reward-model Driven Reinforcement Fine-Tuning Framework

reinforcement-learning multi-modal vlm rft llm reward-model llm-training grpo dapo

Updated Jan 28, 2026
Python

WangJingyao07 / Awesome-GRPO

Star

Codebase of GRPO: Implementations and Resources of GRPO and Its Variants

reinforcement-learning transformers papers reasoning llm grpo dapo

Updated Dec 6, 2025
Python

saikiranrallabandi / inframind

Star

InfraMind: Fine-tuning toolkit for training SLMs on Infrastructure-as-Code using GRPO/DAPO. Achieves 97.3% accuracy on IaC generation.

rl grpo dapo

Updated Dec 15, 2025
Python

AchoWu / GCPO

Star

Group Contrastive Policy Optimazation. Read the paper on arXiv: 👉 https://arxiv.org/abs/2510.07790

rl llm rlhf qwen grpo dapo

Updated Oct 12, 2025
Python

teilomillet / materl

Star

modular reinforcement-learning mojo rl vapo grpo dapo materl

Updated Jul 12, 2025
Python

mapi-developer / dapo

Star

Simple, zero-dependency tabular data manipulation and analysis for Python.

python data dapo

Updated Dec 1, 2025
Python

VocabVictor / verl-plus

Star

增加verl ascend适配；做一些小的改进

ppo dpo grpo dapo

Updated Nov 29, 2025
Python

Improve this page

Add a description, image, and links to the dapo topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the dapo topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

dapo

Here are 7 public repositories matching this topic...

opendilab / LightRFT

WangJingyao07 / Awesome-GRPO

saikiranrallabandi / inframind

AchoWu / GCPO

teilomillet / materl

mapi-developer / dapo

VocabVictor / verl-plus

Improve this page

Add this topic to your repo