Thank you for your outstanding work.
I’d like to ask if the current code is capable of running the GPQA evaluation. I saw it reported in the paper. Are there any points I should pay attention to?
Thanks