🇪🇪 Estonian LLM Evaluation
A collection of resources for evaluation of LLM capabilities in the Estonian language.
Viewer • Updated • 541 • 14Note Manually translated and culturally adapted IFEval.
tartuNLP/winogrande_et
Viewer • Updated • 8.36k • 120 • 1Note Manually translated and culturally adapted WinoGrande test set.
-
Estonian WinoGrande Dataset: Comparative Analysis of LLM Performance on Human and Machine Translation
Paper • 2511.17290 • Published • 1
TalTechNLP/EstQA
Viewer • Updated • 1.38k • 29 • 2Note Manually annotated extractive question answering.
tartuNLP/EstNER
Viewer • Updated • 46.1k • 333Note Manually annotated named entity recognition.
tartuNLP/EstCOPA
Viewer • Updated • 2k • 61 • 2Note Machine-translated and post-edited choice of plausible alternatives.
TalTechNLP/grammar_et
Viewer • Updated • 8.94k • 17 • 1Note Grammatical error correction. Text with errors → corrected text.
TalTechNLP/ERRnews
Updated • 41 • 1Note ERR news story transcripts generated by an ASR pipeline paired with the human written summary.
kardosdrur/estonian-valence
Viewer • Updated • 4.09k • 177 • 1Note Newspaper article sentiment.
TalTechNLP/trivia_et
Viewer • Updated • 800 • 10 • 1Note Trivia dataset extracted from the "Eesti Mäng" board game.
TalTechNLP/exam_et
Viewer • Updated • 1.61k • 19 • 1Note Exam questions on several subjects in multiple choice format.
TalTechNLP/inflection_et
Viewer • Updated • 1.4k • 25 • 1Note Noun phrase inflection.
tartuNLP/smugri-mt-bench
Viewer • Updated • 500 • 26 • 2Note Extension of the MT-Bench dataset including Estonian, Livonian, Komi, and Võro.
TalTechNLP/word_meanings_et
Viewer • Updated • 54.6k • 27 • 1Note Matching words with their dictionary definitions.
-
mrlbenchmarks/global-piqa-nonparallel
Viewer • Updated • 13.5k • 10.9k • 35 -
tartuNLP/Estonian_Subjectivity
Viewer • Updated • 1k • 1.11k
alexandrainst/multi-wiki-qa
Viewer • Updated • 1.22M • 2.97k • 23Note Multilingual extractive question answering with Wikipedia articles as context, includes Estonian. Questions are generated with Gemini-1.5-pro.
TalTechNLP/MMLU_et
Viewer • Updated • 14k • 102 • 2Note Machine-translated MMLU.
-
TalTechNLP/M_MMLU_et
Viewer • Updated • 15k • 27
TalTechNLP/MMLU_Pro_et
Viewer • Updated • 12.1k • 16Note Machine-translated MMLU-Pro.
LumiOpen/arc_challenge_mt
Viewer • Updated • 54.4k • 807 • 2Note Multilingual machine-translated version of the `arc_challenge` dataset, includes Estonian.
LumiOpen/opengpt-x_gsm8kx
Viewer • Updated • 28.7k • 157 • 2Note Multilingual machine-translated version of the `gsm8k` dataset, includes Estonian.
LumiOpen/opengpt-x_truthfulqax
Viewer • Updated • 32.7k • 177 • 1Note Multilingual machine-translated version of the `truthfulqa` dataset, includes Estonian.
LumiOpen/opengpt-x_hellaswagx
Viewer • Updated • 203k • 394Note Multilingual machine-translated version of the `hellaswag` dataset, includes Estonian.
LumiOpen/opengpt-x_mmlux
Viewer • Updated • 314k • 3k • 1Note Multilingual machine-translated version of the `mmlu` dataset, includes Estonian.
openlanguagedata/flores_plus
Viewer • Updated • 893k • 11.2k • 133Note Includes Estonian as one of the languages. English sources are translated to 200 languages. Any translation direction is possible potentially.
cambridgeltl/xcopa
Viewer • Updated • 12.6k • 9.61k • 22Note Machine-translated multilingual COPA. Includes validation and test for Estonian.
google/wmt24pp
Viewer • Updated • 54.9k • 8.51k • 89Note Includes English to Estonian translation direction.
tartuNLP/flores-smugri-pairs
Viewer • Updated • 16.5k • 316Note Extension of the original Flores dataset adding additional translation directions, such as Estonian to Võro, Estonian to Komi-Zyrian, and some others.
mteb/sib200
Viewer • Updated • 387k • 12.3k • 4Note Includes Estonian as one of the languages, classification labels remain in English.
facebook/belebele
Viewer • Updated • 110k • 39.7k • 127Note Multiple choice reading comprehension. Includes Estonian as one of the languages.
jumelet/multiblimp
Viewer • Updated • 121k • 7.7k • 17Note Linguistic acceptability dataset derived from Universal Dependencies with heuristics. Includes Estonian as one of the languages.
-
CohereLabs/include-base-44
Viewer • Updated • 23k • 13.2k • 49