| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| What can 5.17 billion regression fits tell us about artificial models of the human visual system? | Oct 12, 2021 | Benchmarking | —Unverified | 0 |
| What cleaves? Is proteasomal cleavage prediction reaching a ceiling? | Oct 24, 2022 | BenchmarkingDenoising | —Unverified | 0 |
| What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs | May 15, 2025 | AllBenchmarking | —Unverified | 0 |
| What Emotions Make One or Five Stars? Understanding Ratings of Online Product Reviews by Sentiment Analysis and XAI | Feb 29, 2020 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus | Sep 17, 2020 | BenchmarkingTerm Extraction | —Unverified | 0 |
| Alexpaca: Learning Factual Clarification Question Generation Without Examples | Oct 17, 2023 | BenchmarkingChatbot | —Unverified | 0 |
| What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts | Aug 1, 2021 | BenchmarkingBinary Classification | —Unverified | 0 |
| Towards Self-adaptive Mutation in Evolutionary Multi-Objective Algorithms | Mar 8, 2023 | BenchmarkingEvolutionary Algorithms | —Unverified | 0 |
| What Will it Take to Fix Benchmarking in Natural Language Understanding? | Apr 5, 2021 | BenchmarkingNatural Language Understanding | —Unverified | 0 |