| MEETING DELEGATE: Benchmarking LLMs on Attending Meetings on Our Behalf | Feb 5, 2025 | BenchmarkingScheduling | —Unverified | 0 | 0 |
| Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset | Feb 26, 2024 | BenchmarkingCross-Lingual Transfer | —Unverified | 0 | 0 |
| MegaCOIN: Enhancing Medium-Grained Color Perception for Vision-Language Models | Dec 5, 2024 | BenchmarkingDomain Generalization | —Unverified | 0 | 0 |
| Benchmarking Large Language Model Capabilities for Conditional Generation | Jun 29, 2023 | BenchmarkingFew-Shot Learning | —Unverified | 0 | 0 |
| Benchmarking Language Models for Cyberbullying Identification and Classification from Social-media Texts | Jun 1, 2022 | BenchmarkingBinary Classification | —Unverified | 0 | 0 |
| MEGAVERSE: Benchmarking Large Language Models Across Languages, Modalities, Models and Tasks | Nov 13, 2023 | Benchmarking | —Unverified | 0 | 0 |
| MELABenchv1: Benchmarking Large Language Models against Smaller Fine-Tuned Models for Low-Resource Maltese NLP | Jun 4, 2025 | BenchmarkingLanguage Modelling | —Unverified | 0 | 0 |
| Benchmarking Lane-changing Decision-making for Deep Reinforcement Learning | Sep 22, 2021 | Autonomous DrivingBenchmarking | —Unverified | 0 | 0 |
| MeltpoolNet: Melt pool Characteristic Prediction in Metal Additive Manufacturing Using Machine Learning | Jan 26, 2022 | ArticlesBenchmarking | —Unverified | 0 | 0 |
| Benchmarking Knowledge-Enhanced Commonsense Question Answering via Knowledge-to-Text Transformation | Jan 4, 2021 | BenchmarkingQuestion Answering | —Unverified | 0 | 0 |
| MERGE -- A Bimodal Audio-Lyrics Dataset for Static Music Emotion Recognition | Jul 8, 2024 | BenchmarkingDeep Learning | —Unverified | 0 | 0 |
| Towards Explainable Network Intrusion Detection using Large Language Models | Aug 8, 2024 | BenchmarkingIntrusion Detection | —Unverified | 0 | 0 |
| Benchmarking KAZE and MCM for Multiclass Classification | May 20, 2015 | BenchmarkingClassification | —Unverified | 0 | 0 |
| What cleaves? Is proteasomal cleavage prediction reaching a ceiling? | Oct 24, 2022 | BenchmarkingDenoising | —Unverified | 0 | 0 |
| Benchmarking Joint Lexical and Syntactic Analysis on Multiword-Rich Data | Apr 1, 2017 | BenchmarkingDependency Parsing | —Unverified | 0 | 0 |
| Benchmarking Joint Face Spoofing and Forgery Detection with Visual and Physiological Cues | Aug 10, 2022 | BenchmarkingDeepFake Detection | —Unverified | 0 | 0 |
| Metaethical Perspectives on 'Benchmarking' AI Ethics | Apr 11, 2022 | BenchmarkingEthics | —Unverified | 0 | 0 |
| Towards Fair Machine Learning Software: Understanding and Addressing Model Bias Through Counterfactual Thinking | Feb 16, 2023 | Benchmarkingcounterfactual | —Unverified | 0 | 0 |
| Benchmarking Japanese Speech Recognition on ASR-LLM Setups with Multi-Pass Augmented Generative Error Correction | Aug 29, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| A deep convolutional neural network model for rapid prediction of fluvial flood inundation | Jun 20, 2020 | BenchmarkingComputational Efficiency | —Unverified | 0 | 0 |
| Meta learning to classify intent and slot labels with noisy few shot examples | Nov 30, 2020 | Benchmarkingintent-classification | —Unverified | 0 | 0 |
| Benchmarking Invertible Architectures on Inverse Problems | Jan 26, 2021 | Benchmarking | —Unverified | 0 | 0 |
| Benchmarking inverse statistical approaches for protein structure and design with exactly solvable models | Nov 15, 2016 | Benchmarking | —Unverified | 0 | 0 |
| Metastatic Cancer Outcome Prediction with Injective Multiple Instance Pooling | Mar 9, 2022 | BenchmarkingManagement | —Unverified | 0 | 0 |
| Benchmarking in Optimization: Best Practice and Open Issues | Jul 7, 2020 | Benchmarking | —Unverified | 0 | 0 |