| Reinforcement Learning Based Handwritten Digit Recognition with Two-State Q-Learning | Jun 28, 2020 | Benchmarking, Handwritten Digit Recognition | —Unverified | 0 | 0 |
| A Survey of Predictive Maintenance Methods: An Analysis of Prognostics via Classification and Regression | Jun 25, 2025 | Benchmarking, Management | —Unverified | 0 | 0 |
| Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research | Jul 22, 2024 | Benchmarking | —Unverified | 0 | 0 |
| Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse | Feb 20, 2025 | Benchmarking, Graph Attention | —Unverified | 0 | 0 |
| Reinforcing Competitive Multi-Agents for Playing So Long Sucker | Nov 17, 2024 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering | Mar 23, 2025 | Benchmarking, Chart Question Answering | —Unverified | 0 | 0 |
| Relative Afferent Pupillary Defect Screening through Transfer Learning | Aug 6, 2019 | Benchmarking, Object Recognition | —Unverified | 0 | 0 |
| A Survey of Parameters Associated with the Quality of Benchmarks in NLP | Oct 14, 2022 | Benchmarking | —Unverified | 0 | 0 |
| Reliable validation of Reinforcement Learning Benchmarks | Mar 2, 2022 | Benchmarking, Data Compression | —Unverified | 0 | 0 |
| Why every GBDT speed benchmark is wrong | Oct 24, 2018 | Benchmarking | —Unverified | 0 | 0 |
| REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models | Jun 9, 2025 | Benchmarking, Decision Making | —Unverified | 0 | 0 |
| A Survey of Model Compression and Acceleration for Deep Neural Networks | Oct 23, 2017 | Benchmarking, Knowledge Distillation | —Unverified | 0 | 0 |
| A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing | Oct 10, 2022 | Benchmarking, Data Augmentation | —Unverified | 0 | 0 |
| Removal of Ocular Artifacts in EEG Using Deep Learning | Sep 24, 2022 | Benchmarking, Deep Learning | —Unverified | 0 | 0 |
| A Comparative Analysis of Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) as Dimensionality Reduction Techniques | Jun 20, 2025 | Benchmarking, Dimensionality Reduction | —Unverified | 0 | 0 |
| Removing Multiple Hybrid Adverse Weather in Video via a Unified Model | Mar 8, 2025 | Benchmarking, Video Restoration | —Unverified | 0 | 0 |
| A survey of benchmarking frameworks for reinforcement learning | Nov 27, 2020 | Benchmarking, reinforcement-learning | —Unverified | 0 | 0 |
| Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | Oct 28, 2024 | Benchmarking, Language Modeling | —Unverified | 0 | 0 |
| REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning | May 17, 2019 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| A Collection of Challenging Optimization Problems in Science, Engineering and Economics | Apr 9, 2015 | Benchmarking | —Unverified | 0 | 0 |
| A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews | Jun 13, 2023 | Benchmarking, Keyword Extraction | —Unverified | 0 | 0 |
| Why is the winner the best? | Mar 30, 2023 | Benchmarking, Multi-Task Learning | —Unverified | 0 | 0 |
| A Study on Neuro-Symbolic Artificial Intelligence: Healthcare Perspectives | Mar 23, 2025 | Benchmarking, Common Sense Reasoning | —Unverified | 0 | 0 |
| Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering | Apr 19, 2025 | Benchmarking, Dataset Generation | —Unverified | 0 | 0 |
| Reproducible evaluation of classification methods in Alzheimer's disease: framework and application to MRI and PET data | Aug 20, 2018 | Benchmarking, Classification | —Unverified | 0 | 0 |
| Repurposing Foundation Model for Generalizable Medical Time Series Classification | Oct 3, 2024 | Benchmarking, Diagnostic | —Unverified | 0 | 0 |
| Reradiation and Scattering from a Reconfigurable Intelligent Surface: A General Macroscopic Model | Jul 27, 2021 | Benchmarking | —Unverified | 0 | 0 |
| UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI | Dec 30, 2024 | Benchmarking, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| ResBench: Benchmarking LLM-Generated FPGA Designs with Resource Awareness | Mar 11, 2025 | Benchmarking, Code Generation | —Unverified | 0 | 0 |
| ResearchArena: Benchmarking LLMs' Ability to Collect and Organize Information as Research Agents | Jun 13, 2024 | Benchmarking, Survey | —Unverified | 0 | 0 |
| ResearchBench: Benchmarking LLMs in Scientific Discovery via Inspiration-Based Task Decomposition | Mar 27, 2025 | Benchmarking, scientific discovery | —Unverified | 0 | 0 |
| ResearchCodeAgent: An LLM Multi-Agent System for Automated Codification of Research Methodologies | Apr 28, 2025 | Benchmarking, Data Augmentation | —Unverified | 0 | 0 |
| ResearchCodeBench: Benchmarking LLMs on Implementing Novel Machine Learning Research Code | Jun 2, 2025 | Benchmarking, Code Generation | —Unverified | 0 | 0 |
| Reservoir Computing with a Single Oscillating Gas Bubble: Emphasizing the Chaotic Regime | Mar 25, 2025 | Benchmarking, Learning Theory | —Unverified | 0 | 0 |
| Resistive Neural Hardware Accelerators | Sep 8, 2021 | Benchmarking | —Unverified | 0 | 0 |
| Resource-efficient Medical Image Analysis with Self-adapting Forward-Forward Networks | Jun 20, 2024 | Benchmarking, Medical Image Analysis | —Unverified | 0 | 0 |
| UnsafeBench: Benchmarking Image Safety Classifiers on Real-World and AI-Generated Images | May 6, 2024 | Benchmarking | —Unverified | 0 | 0 |
| RESPONSE: Benchmarking the Ability of Language Models to Undertake Commonsense Reasoning in Crisis Situation | Mar 14, 2025 | Benchmarking | —Unverified | 0 | 0 |
| Restoring Images Captured in Arbitrary Hybrid Adverse Weather Conditions in One Go | May 17, 2023 | Benchmarking, Image Restoration | —Unverified | 0 | 0 |
| A Strong Sustainability Paradigm Based Analytical Hierarchy Process (SSP-AHP) Method to Evaluate Sustainable Healthcare Systems | May 13, 2023 | Benchmarking | —Unverified | 0 | 0 |
| AstroMLab 2: AstroLLaMA-2-70B Model and Benchmarking Specialised LLMs for Astronomy | Sep 29, 2024 | Astronomy, Benchmarking | —Unverified | 0 | 0 |
| AstroMLab 1: Who Wins Astronomy Jeopardy!? | Jul 15, 2024 | Astronomy, Benchmarking | —Unverified | 0 | 0 |
| TaskEval: Assessing Difficulty of Code Generation Tasks for Large Language Models | Jul 30, 2024 | Benchmarking, Code Completion | —Unverified | 0 | 0 |
| AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets | Jan 3, 2024 | Astronomy, Benchmarking | —Unverified | 0 | 0 |
| A Statistical Framework to Investigate the Optimality of Signal-Reconstruction Methods | Mar 18, 2022 | Benchmarking | —Unverified | 0 | 0 |
| Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks | Feb 18, 2022 | Benchmarking, Deep Learning | —Unverified | 0 | 0 |
| Unsupervised Deep Epipolar Flow for Stationary or Dynamic Scenes | Apr 8, 2019 | Benchmarking, Deep Learning | —Unverified | 0 | 0 |
| Unsupervised Feature Learning for Environmental Sound Classification Using Weighted Cycle-Consistent Generative Adversarial Network | Apr 8, 2019 | Benchmarking, Classification | —Unverified | 0 | 0 |
| A Statistical Analysis for Per-Instance Evaluation of Stochastic Optimizers: How Many Repeats Are Enough? | Mar 20, 2025 | Benchmarking | —Unverified | 0 | 0 |
| A Standardized Benchmark Set of Clustering Problem Instances for Comparing Black-Box Optimizers | May 14, 2025 | Benchmarking, Clustering | —Unverified | 0 | 0 |