fix(tj): finetune code #399

tAnGjIa520 · 2025-08-12T07:35:43Z

atari_unizero_multitask_segment_ddp_config_debug_naive.py 单任务从0微调
atari_unizero_multitask_segment_ddp_config_finetune_SpaceInvaders_full.py 全量微调
atari_unizero_multitask_segment_ddp_config_finetune_SpaceInvaders_head_back_encoder_lora.py 微调head+encoder(lora)+backbone(lora)
atari_unizero_multitask_segment_ddp_config_finetune_SpaceInvaders_head_back_lora.py 微调head+backbone(lora)
atari_unizero_multitask_segment_ddp_config_finetune_SpaceInvaders_head.py 微调(head)

puyuan1996 · 2025-08-13T09:19:25Z

zoo/atari/config/atari_unizero_multitask_segment_ddp_config_debug_naive.py

+    # finetune_components = ['transformer'] # load-enc-trans_finetune-trans-head
+    finetune_components = [] # load-enc-trans_finetune-encoder-head
+
+    for seed in [3]:


scalezero加载ckpt全量调整和scalezero从零训的版本都指定为seed0

puyuan1996 · 2025-08-13T09:20:20Z

zoo/atari/config/atari_unizero_multitask_segment_ddp_config_debug_naive.py

+    n_episode = 8
+    evaluator_env_num = 3
+    # num_simulations = 50
+    num_simulations = 25


是需要改成collect_num_simulations为25， eval_num_simulations为50。全部改成25，eval的性能是会下降的

puyuan1996 · 2025-08-13T09:21:42Z

zoo/atari/config/atari_unizero_multitask_segment_ddp_config_finetune_SpaceInvaders_full.py

+    num_segments = collector_env_num
+    n_episode = 8
+    evaluator_env_num = 3
+    num_simulations = 25


这里也是需要按上面的修改

finetune v1

4b2268f

puyuan1996 requested changes Aug 13, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

fix(tj): finetune code #399

fix(tj): finetune code #399

Uh oh!

tAnGjIa520 commented Aug 12, 2025 •

edited

Loading

Uh oh!

puyuan1996 Aug 13, 2025

Uh oh!

tAnGjIa520 Aug 13, 2025

Uh oh!

puyuan1996 Aug 13, 2025 •

edited

Loading

Uh oh!

tAnGjIa520 Aug 13, 2025

Uh oh!

puyuan1996 Aug 13, 2025

Uh oh!

tAnGjIa520 Aug 13, 2025

Uh oh!

Uh oh!

fix(tj): finetune code #399

Are you sure you want to change the base?

fix(tj): finetune code #399

Uh oh!

Conversation

tAnGjIa520 commented Aug 12, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

puyuan1996 Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

tAnGjIa520 Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

puyuan1996 Aug 13, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

tAnGjIa520 Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

puyuan1996 Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

tAnGjIa520 Aug 13, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

tAnGjIa520 commented Aug 12, 2025 •

edited

Loading

puyuan1996 Aug 13, 2025 •

edited

Loading