-
|
So what's that and why isn't it listed in improvements over original? The settings look so complicated, there's no way to use it without a guide. |
Beta Was this translation helpful? Give feedback.
Replies: 1 comment 4 replies
-
|
Hey @Seedmanc , that is true. I'm just not good about updating documentation, the fork is mainly used by me and a smaller group that I tend to communicate with and keep up to speed on the details. It's based on https://arxiv.org/abs/2312.02696. Here are some reasonable defaults, with a quick explainer: edm2_loss_weighting = "True" edm2_loss_weighting_optimizer = "LoraEasyCustomOptimizer.fmarscrop.FMARSCropV2ExMachina" edm2_loss_weighting_optimizer_lr = "2e-2" edm2_loss_weighting_optimizer_args = "{'update_strategy':'cautious', 'gamma':0.0, 'betas':(0.99,0.9999,0.999), 'adaptive_clip':0}" edm2_loss_weighting_lr_scheduler = "True" edm2_loss_weighting_lr_scheduler_warmup_percent = "0.1" edm2_loss_weighting_lr_scheduler_constant_percent = "0.9" edm2_loss_weighting_max_grad_norm = "0" edm2_loss_weighting_generate_graph_output_dir = "D:/ai/training/loss_weighting" edm2_loss_weighting_generate_graph_every_x_steps = "10" edm2_loss_weighting_generate_graph = "True" edm2_loss_weighting_num_channels = "448" edm2_loss_weighting_generate_graph_y_limit = "5" edm2_loss_weighting_initial_weights = "" |
Beta Was this translation helpful? Give feedback.
@Seedmanc validation loss and EDM2 loss weighting are different things.
Validation loss simply calculates loss for reporting and analysis purposes, based on a percentage of images that are withheld from training and only used for validation. The intent being to see how well it can generalize past images it is explicitly trained on, which can help determine if it is learning effectively or overfitting.
EDM2 loss weighting basically brings up the loss for timesteps where it is lower, so that it can learn more effectively.