| The Belebele Benchmark: a Parallel Reading Comprehension Dataset in 122 Language Variants | Aug 31, 2023 | BelebeleCross-Lingual Transfer | CodeCode Available | 2 | 5 |
| MEXA: Multilingual Evaluation of English-Centric LLMs via Cross-Lingual Alignment | Oct 8, 2024 | ARCBelebele | CodeCode Available | 1 | 5 |
| OpenBA: An Open-sourced 15B Bilingual Asymmetric seq2seq Model Pre-trained from Scratch | Sep 19, 2023 | BelebeleMMLU | CodeCode Available | 1 | 5 |
| Can you map it to English? The Role of Cross-Lingual Alignment in Multilingual Performance of LLMs | Apr 13, 2025 | BelebeleMachine Translation | CodeCode Available | 0 | 5 |
| NaijaRC: A Multi-choice Reading Comprehension Dataset for Nigerian Languages | Aug 18, 2023 | BelebeleCross-Lingual Transfer | CodeCode Available | 0 | 5 |
| From Multiple-Choice to Extractive QA: A Case Study for English and Arabic | Apr 26, 2024 | BelebeleExtractive Question-Answering | CodeCode Available | 0 | 5 |
| Marco-LLM: Bridging Languages via Massive Multilingual Training for Cross-Lingual Enhancement | Dec 5, 2024 | BelebeleMachine Translation | —Unverified | 0 | 0 |
| DNA 1.0 Technical Report | Jan 18, 2025 | BelebeleGSM8K | —Unverified | 0 | 0 |
| Elastic Weight Consolidation for Full-Parameter Continual Pre-Training of Gemma2 | May 9, 2025 | ARCBelebele | —Unverified | 0 | 0 |
| 2M-BELEBELE: Highly Multilingual Speech and American Sign Language Comprehension Dataset | Dec 11, 2024 | BelebeleReading Comprehension | —Unverified | 0 | 0 |
| Multi-lingual Functional Evaluation for Large Language Models | Jun 25, 2025 | BelebeleInstruction Following | —Unverified | 0 | 0 |