benchmark-evaluation
allenai/ai2_arc • Updated Dec 21, 2023 • 7.79k • 427k • 334
Rowan/hellaswag • Updated Jul 10, 2025 • 60k • 303k • 172
ybisk/piqa • Updated Jan 18, 2024 • 57.9k • 104
EleutherAI/lambada_openai • Updated Jul 10, 2025 • 30.9k • 91.6k • 49
jonathanjordan21/SmolLM2-135M-Instruct-sentiment-finetuned Text Classification • 0.1B • Updated Nov 11, 2024 • 3
jonathanjordan21/donut_fine_tuning_food_composition_id Document Question Answering • 0.2B • Updated Nov 5, 2024 • 12 • 1
jonathanjordan21/paraphrase-multilingual-MiniLM-L12-v2-helpfulness Sentence Similarity • 0.1B • Updated Nov 4, 2024 • 2