Skip to content

On Policy Distillation checkpoints Loading #895

@White1973

Description

@White1973

Does anybody would like to help me solve this problem.

When I finsh the Teacher Model pretraining bash examples/algorithms/dapo/run_dapo_aime_qwen3_4b_aime.sh and obtain ckpts/dapo_qwen3_4b_base/global_step_90. It contains data.pt, trainer_state.pt and policy folder, where policy folder includes huggingface folder (extra_state_world_size_8_rank_{0-8}.pt, model_world_size_8_rank_{0-7}, optim_world_size_8_rank_{0-8}.pt and fsdp_config.json)

when I set TEACHER_MODEL as "ckpts/dapo_qwen3_4b_base/global_step_90/" and execute "bash examples/on_policy_distillation/run_on_policy_distill_math_qwen3_4b.sh", I got the following error:

raise OSError(
OSError: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory /personal/jiayu2026/ckpts/dapo_qwen3_4b_base/global_step_150/.

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions