-
Notifications
You must be signed in to change notification settings - Fork 233
Description
Does anybody would like to help me solve this problem.
When I finsh the Teacher Model pretraining bash examples/algorithms/dapo/run_dapo_aime_qwen3_4b_aime.sh and obtain ckpts/dapo_qwen3_4b_base/global_step_90. It contains data.pt, trainer_state.pt and policy folder, where policy folder includes huggingface folder (extra_state_world_size_8_rank_{0-8}.pt, model_world_size_8_rank_{0-7}, optim_world_size_8_rank_{0-8}.pt and fsdp_config.json)
when I set TEACHER_MODEL as "ckpts/dapo_qwen3_4b_base/global_step_90/" and execute "bash examples/on_policy_distillation/run_on_policy_distill_math_qwen3_4b.sh", I got the following error:
raise OSError(
OSError: Error no file named pytorch_model.bin, model.safetensors, tf_model.h5, model.ckpt.index or flax_model.msgpack found in directory /personal/jiayu2026/ckpts/dapo_qwen3_4b_base/global_step_150/.