You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
In the training script, it seems that the next token embedding was not used to predict the next next token during the multi-step prediction calculation of loss.