AI & ML interests
None yet
Organizations
ericsun153/mmlu_cleaned_dataset
Viewer
• Updated • 300 • 36
ericsun153/arc_question_shift_test
Viewer
• Updated • 1.17k • 39
ericsun153/mathqa_question_shift_test
Viewer
• Updated • 2.99k • 58
ericsun153/mmlu_question_shift_test
Viewer
• Updated • 14k • 99
ericsun153/mmlu_shuffle_question_test
Viewer
• Updated • 14k • 124
ericsun153/mathqa_shuffle_question_test
Viewer
• Updated • 2.99k • 232
ericsun153/arc_shuffle_question_test
Viewer
• Updated • 1.17k • 65
ericsun153/arc_question_token_drop_test
Viewer
• Updated • 1.17k • 10
ericsun153/mathqa_question_token_drop_test
Viewer
• Updated • 2.99k • 9
ericsun153/mmlu_question_token_drop_test
Viewer
• Updated • 14k • 13
ericsun153/mathqa_question_change_test
Viewer
• Updated • 2.99k • 12
• 1
ericsun153/mmlu_question_change_test
Viewer
• Updated • 14k • 80
ericsun153/arc_challenge_question_change_test
Viewer
• Updated • 1.17k • 8
ericsun153/mathqa_confusing_options_contamination_test
Viewer
• Updated • 2.98k • 4
ericsun153/mathqa_test_dataset
Viewer
• Updated • 5.97k • 112
• 1
ericsun153/arc_confusing_options_contamination_test
Viewer
• Updated • 1.17k • 4
ericsun153/mmlu_confusing_options_contamination_test
Viewer
• Updated • 14k • 139
ericsun153/llama3_mmlu_shuffle_choices_with_drop
Viewer
• Updated • 14k • 5
ericsun153/llama3_mmlu_shuffle_question_with_drop
Viewer
• Updated • 14k • 6
ericsun153/llama3_mmlu_shuffle_all_with_drop
Viewer
• Updated • 14k • 3
ericsun153/llama3_mmlu_shuffle_all
Viewer
• Updated • 14k • 37
ericsun153/llama3_mmlu_shuffle_choices
Viewer
• Updated • 14k • 13
ericsun153/llama3_mmlu_shuffle_question
Viewer
• Updated • 14k • 75
ericsun153/mmlu_shuffle_drop_10percent_all
Viewer
• Updated • 14k • 7
ericsun153/mmlu_shuffle_drop_10percent_choices
Viewer
• Updated • 14k • 6
ericsun153/mmlu_shuffle_drop_10percent_questions
Viewer
• Updated • 14k • 6
ericsun153/mmlu_drop_10percent_questions
Viewer
• Updated • 14k • 61
ericsun153/mmlu_shuffled_choices_tinyllama
Viewer
• Updated • 14k • 26
ericsun153/mmlu_shuffled_all
Viewer
• Updated • 14k • 40
ericsun153/mmlu_shuffled_choices
Viewer
• Updated • 14k • 51