Thanks for your great work. I find that when I adjust your code to train wan-animate, the result is bad than without training rcm.
https://github.com/user-attachments/assets/a7ad2ef8-4a74-4dd0-b5b8-f1906391666d
directly inference 4 step without training
https://github.com/user-attachments/assets/7f105535-e6b1-4d43-a30d-fbef16dc4a4d
train 15 step
https://github.com/user-attachments/assets/fb18c653-8852-4d25-b7d4-cd893589a65d
train 1500step
I set warmup to 100 and train in 8*A800 gpu, context_parallel_size=8 and state_t=8.
can you introduce me some advice? thanks in advance
Thanks for your great work. I find that when I adjust your code to train wan-animate, the result is bad than without training rcm.
https://github.com/user-attachments/assets/a7ad2ef8-4a74-4dd0-b5b8-f1906391666d
directly inference 4 step without training
https://github.com/user-attachments/assets/7f105535-e6b1-4d43-a30d-fbef16dc4a4d
train 15 step
https://github.com/user-attachments/assets/fb18c653-8852-4d25-b7d4-cd893589a65d
train 1500step
I set warmup to 100 and train in 8*A800 gpu, context_parallel_size=8 and state_t=8.
can you introduce me some advice? thanks in advance