| Benchmarking for Metaheuristic Black-Box Optimization: Perspectives and Open Challenges | Jul 1, 2020 | BenchmarkingMetaheuristic Optimization | —Unverified | 0 | 0 |
| GuideBench: Benchmarking Domain-Oriented Guideline Following for LLM Agents | May 16, 2025 | BenchmarkingInstruction Following | —Unverified | 0 | 0 |
| Towards Personalized Federated Learning | Mar 1, 2021 | BenchmarkingFederated Learning | —Unverified | 0 | 0 |
| MolMiner: Towards Controllable, 3D-Aware, Fragment-Based Molecular Design | Nov 10, 2024 | 3D geometryBenchmarking | —Unverified | 0 | 0 |
| Towards Private Learning on Decentralized Graphs with Local Differential Privacy | Jan 23, 2022 | BenchmarkingGraph Learning | —Unverified | 0 | 0 |
| MOLTR: Multiple Object Localisation, Tracking, and Reconstruction from Monocular RGB Videos | Dec 9, 2020 | BenchmarkingObject | —Unverified | 0 | 0 |
| Benchmarking for Bayesian Reinforcement Learning | Sep 14, 2015 | Benchmarkingreinforcement-learning | —Unverified | 0 | 0 |
| Towards Productionizing Subjective Search Systems | Mar 31, 2020 | BenchmarkingLanguage Modelling | —Unverified | 0 | 0 |
| Momentum Contrastive Pre-training for Question Answering | Dec 12, 2022 | BenchmarkingContrastive Learning | —Unverified | 0 | 0 |
| Benchmarking Floworks against OpenAI & Anthropic: A Novel Framework for Enhanced LLM Function Calling | Oct 23, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types | Jul 17, 2023 | Benchmarking | —Unverified | 0 | 0 |
| MorisienMT: A Dataset for Mauritian Creole Machine Translation | Jun 6, 2022 | BenchmarkingMachine Translation | —Unverified | 0 | 0 |
| Morphing Attack Detection -- Database, Evaluation Platform and Benchmarking | Jun 11, 2020 | BenchmarkingFace Recognition | —Unverified | 0 | 0 |
| MORSE: Semantic-ally Drive-n MORpheme SEgment-er | Feb 7, 2017 | Benchmarking | —Unverified | 0 | 0 |
| MotionBench: Benchmarking and Improving Fine-grained Video Motion Understanding for Vision Language Models | Jan 6, 2025 | BenchmarkingFeature Compression | —Unverified | 0 | 0 |
| Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level | Nov 15, 2024 | Benchmarkingcounterfactual | —Unverified | 0 | 0 |
| A Dataset for Benchmarking Image-Based Localization | Jul 1, 2017 | BenchmarkingImage-Based Localization | —Unverified | 0 | 0 |
| Movie Description | May 12, 2016 | Benchmarking | —Unverified | 0 | 0 |
| MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning | Jun 4, 2023 | BenchmarkingContrastive Learning | —Unverified | 0 | 0 |
| Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking | Dec 2, 2022 | BenchmarkingInformation Retrieval | —Unverified | 0 | 0 |
| MozzaVID: Mozzarella Volumetric Image Dataset | Dec 6, 2024 | BenchmarkingComputed Tomography (CT) | —Unverified | 0 | 0 |
| MPCLeague: Robust MPC Platform for Privacy-Preserving Machine Learning | Dec 26, 2021 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 | 0 |
| MRAnnotator: multi-Anatomy and many-Sequence MRI segmentation of 44 structures | Feb 1, 2024 | AnatomyBenchmarking | —Unverified | 0 | 0 |
| MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization | Nov 16, 2021 | Benchmarkingdialogue summary | —Unverified | 0 | 0 |
| Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room | Apr 22, 2025 | BenchmarkingFairness | —Unverified | 0 | 0 |