You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I pulled the test data linked in the README, and I am noticing within each category there is basically never an even 25% split between A, B, C, and D..
The most imbalanced category is high school statistics, for which 47% of the answers are D.
I have two Qs: Is my analysis correct? I was using the test data downloadable from the main repo. Furthermore, if my analysis is correct wouldn't random baseline not be a fair comparison, since majority vote would do much better?