On Policy Distillation checkpoints Loading

Does anybody would like to help me solve this problem.

When I finsh the Teacher Model pretraining bash examples/algorithms/dapo/run_dapo_aime_qwen3_4b_aime.sh and obtain ckpts/dapo_qwen3_4b_base/global_step_90. It contains data.pt, trainer_state.pt and policy folder, where policy folder includes huggingface folder (extra_state_world_size_8_rank_{0-8}.pt, model_world_size_8_rank_{0-7}, optim_world_size_8_rank_{0-8}.pt and fsdp_config.json)

when I set TEACHER_MODEL as "ckpts/dapo_qwen3_4b_base/global_step_90/" and execute "bash examples/on_policy_distillation/run_on_policy_distill_math_qwen3_4b.sh", I got the following error:

raise OSError(
OSError: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory /personal/jiayu2026/ckpts/dapo_qwen3_4b_base/global_step_150/.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

On Policy Distillation checkpoints Loading #895

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

On Policy Distillation checkpoints Loading #895

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions