| MSC-Bench: Benchmarking and Analyzing Multi-Sensor Corruption for Driving Perception | Jan 2, 2025 | 3D Object DetectionAutonomous Driving | —Unverified | 0 | 0 |
| Benchmarking five global optimization approaches for nano-optical shape optimization and parameter reconstruction | Sep 18, 2018 | Bayesian OptimizationBenchmarking | —Unverified | 0 | 0 |
| MS MARCO: Benchmarking Ranking Models in the Large-Data Regime | May 9, 2021 | Benchmarking | —Unverified | 0 | 0 |
| MSQA: Benchmarking LLMs on Graduate-Level Materials Science Reasoning and Knowledge | May 29, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Towards Robust and Generalizable Gerchberg Saxton based Physics Inspired Neural Networks for Computer Generated Holography: A Sensitivity Analysis Framework | Apr 30, 2025 | BenchmarkingLearning Theory | —Unverified | 0 | 0 |
| Benchmarking federated strategies in Peer-to-Peer Federated learning for biomedical data | Feb 15, 2024 | BenchmarkingFederated Learning | —Unverified | 0 | 0 |
| MTG: A Benchmarking Suite for Multilingual Text Generation | Oct 16, 2021 | BenchmarkingQuestion Generation | —Unverified | 0 | 0 |
| Benchmarking Federated Machine Unlearning methods for Tabular Data | Apr 1, 2025 | BenchmarkingComputational Efficiency | —Unverified | 0 | 0 |
| MTLens: Machine Translation Output Debugging | Jun 1, 2022 | BenchmarkingMachine Translation | —Unverified | 0 | 0 |
| MTOP: A Comprehensive Multilingual Task-Oriented Semantic Parsing Benchmark | Aug 21, 2020 | BenchmarkingSemantic Parsing | —Unverified | 0 | 0 |
| Towards Robust Evaluation: A Comprehensive Taxonomy of Datasets and Metrics for Open Domain Question Answering in the Era of Large Language Models | Jun 19, 2024 | BenchmarkingOpen-Domain Question Answering | —Unverified | 0 | 0 |
| Benchmarking FedAvg and FedCurv for Image Classification Tasks | Mar 31, 2023 | BenchmarkingClassification | —Unverified | 0 | 0 |
| Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models | May 16, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Muffin or Chihuahua? Challenging Multimodal Large Language Models with Multipanel VQA | Jan 29, 2024 | BenchmarkingImage Comprehension | —Unverified | 0 | 0 |
| Mukayese: Turkish NLP Strikes Back | Nov 16, 2021 | BenchmarkingLanguage Modeling | —Unverified | 0 | 0 |
| Benchmarking features from different radiomics toolkits / toolboxes using Image Biomarkers Standardization Initiative | Jun 23, 2020 | Benchmarking | —Unverified | 0 | 0 |
| Benchmarking Feature Extractors for Reinforcement Learning-Based Semiconductor Defect Localization | Nov 18, 2023 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Benchmarking Expressive Japanese Character Text-to-Speech with VITS and Style-BERT-VITS2 | May 22, 2025 | BenchmarkingDialogue Generation | —Unverified | 0 | 0 |
| Multicalibration for Confidence Scoring in LLMs | Apr 6, 2024 | BenchmarkingQuestion Answering | —Unverified | 0 | 0 |
| Multi-Camera Action Dataset for Cross-Camera Action Recognition Benchmarking | Jul 21, 2016 | Action RecognitionBenchmarking | —Unverified | 0 | 0 |
| Multi-channel deep convolutional neural networks for multi-classifying thyroid disease | Mar 6, 2022 | BenchmarkingBinary Classification | —Unverified | 0 | 0 |
| Benchmarking Explanatory Models for Inertia Forecasting using Public Data of the Nordic Area | Jul 14, 2023 | BenchmarkingTime Series | —Unverified | 0 | 0 |
| Multiclass Optimal Classification Trees with SVM-splits | Nov 16, 2021 | BenchmarkingClassification | —Unverified | 0 | 0 |
| Benchmarking Evolutionary Community Detection Algorithms in Dynamic Networks | Dec 21, 2023 | BenchmarkingCommunity Detection | —Unverified | 0 | 0 |
| Multi-Dimensional Insights: Benchmarking Real-World Personalization in Large Multimodal Models | Dec 17, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Benchmarking Model Predictive Control Algorithms in Building Optimization Testing Framework (BOPTEST) | Jan 31, 2023 | BenchmarkingModel Predictive Control | —Unverified | 0 | 0 |
| Multifactorial Cellular Genetic Algorithm (MFCGA): Algorithmic Design, Performance Comparison and Genetic Transferability Analysis | Mar 24, 2020 | BenchmarkingTransfer Learning | —Unverified | 0 | 0 |
| Multi-Fidelity Methods for Optimization: A Survey | Feb 15, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 | 0 |
| Benchmarking Evolutionary Algorithms For Single Objective Real-valued Constrained Optimization - A Critical Review | Jun 12, 2018 | BenchmarkingEvolutionary Algorithms | —Unverified | 0 | 0 |
| Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition | Nov 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans | Jun 25, 2025 | Action DetectionBenchmarking | —Unverified | 0 | 0 |
| Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030 | May 12, 2025 | BenchmarkingEthics | —Unverified | 0 | 0 |
| Multi-input Multi-output Loewner Framework for Vibration-based Damage Detection on a Trainer Jet | Oct 26, 2024 | BenchmarkingCantilever Beam | —Unverified | 0 | 0 |
| Benchmarking Estimators for Natural Experiments: A Novel Dataset and a Doubly Robust Algorithm | Sep 6, 2024 | Benchmarkingregression | —Unverified | 0 | 0 |
| Multi-label classification for biomedical literature: an overview of the BioCreative VII LitCovid Track for COVID-19 literature topic annotations | Apr 20, 2022 | ArticlesBenchmarking | —Unverified | 0 | 0 |
| Benchmarking energy consumption and latency for neuromorphic computing in condensed matter and particle physics | Sep 21, 2022 | Anomaly DetectionBenchmarking | —Unverified | 0 | 0 |
| Multilingual European Language Models: Benchmarking Approaches and Challenges | Feb 18, 2025 | BenchmarkingQuestion Answering | —Unverified | 0 | 0 |
| Multilingual Large Language Models Are Not (Yet) Code-Switchers | May 23, 2023 | BenchmarkingLanguage Identification | —Unverified | 0 | 0 |
| Multilingual Protest News Detection - Shared Task 1, CASE 2021 | Aug 1, 2021 | BenchmarkingDecision Making | —Unverified | 0 | 0 |
| Benchmarking Energy-Conserving Neural Networks for Learning Dynamics from Data | Dec 3, 2020 | BenchmarkingInductive Bias | —Unverified | 0 | 0 |
| Benchmarking Energy and Latency in TinyML: A Novel Method for Resource-Constrained AI | May 21, 2025 | Benchmarking | —Unverified | 0 | 0 |
| MultiMed: Massively Multimodal and Multitask Medical Understanding | Aug 22, 2024 | BenchmarkingMedical Question Answering | —Unverified | 0 | 0 |
| Multimodal ArXiv: A Dataset for Improving Scientific Comprehension of Large Vision-Language Models | Mar 1, 2024 | BenchmarkingMathematical Reasoning | —Unverified | 0 | 0 |
| A Data-Driven Method to Identify IBRs with Dominant Participation in Sub-Synchronous Oscillations | May 20, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Towards Sentiment Analysis of Tobacco Products’ Usage in Social Media | Sep 1, 2021 | BenchmarkingSentiment Analysis | —Unverified | 0 | 0 |
| Multimodal Deep Learning for Scientific Imaging Interpretation | Sep 21, 2023 | ArticlesBenchmarking | —Unverified | 0 | 0 |
| Multimodal Deep Reinforcement Learning for Portfolio Optimization | Dec 23, 2024 | ArticlesBenchmarking | —Unverified | 0 | 0 |
| Multi-Modal Explainable Medical AI Assistant for Trustworthy Human-AI Collaboration | May 11, 2025 | BenchmarkingDescriptive | —Unverified | 0 | 0 |
| Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms | Jul 3, 2024 | BenchmarkingCPU | —Unverified | 0 | 0 |
| Benchmarking End-to-end Learning of MIMO Physical-Layer Communication | May 19, 2020 | Benchmarking | —Unverified | 0 | 0 |