SOTAVerified

Benchmarking

Papers

Showing 4401-4425 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Reinforcement Learning Based Handwritten Digit Recognition with Two-State Q-Learning | | 0 |
| A Survey of Predictive Maintenance Methods: An Analysis of Prognostics via Classification and Regression | | 0 |
| Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research | | 0 |
| Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse | | 0 |
| Reinforcing Competitive Multi-Agents for Playing So Long Sucker | | 0 |
| Unmasking Deceptive Visuals: Benchmarking Multimodal Large Language Models on Misleading Chart Question Answering | | 0 |
| Relative Afferent Pupillary Defect Screening through Transfer Learning | | 0 |
| A Survey of Parameters Associated with the Quality of Benchmarks in NLP | | 0 |
| Reliable validation of Reinforcement Learning Benchmarks | | 0 |
| Why every GBDT speed benchmark is wrong | | 0 |
| REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models | | 0 |
| A Survey of Model Compression and Acceleration for Deep Neural Networks | | 0 |
| A Survey of Methods for Addressing Class Imbalance in Deep-Learning Based Natural Language Processing | | 0 |
| Removal of Ocular Artifacts in EEG Using Deep Learning | | 0 |
| A Comparative Analysis of Principal Component Analysis (PCA) and Singular Value Decomposition (SVD) as Dimensionality Reduction Techniques | | 0 |
| Removing Multiple Hybrid Adverse Weather in Video via a Unified Model | | 0 |
| A survey of benchmarking frameworks for reinforcement learning | | 0 |
| Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training | | 0 |
| REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning | | 0 |
| A Collection of Challenging Optimization Problems in Science, Engineering and Economics | | 0 |
| A Cloud-based Machine Learning Pipeline for the Efficient Extraction of Insights from Customer Reviews | | 0 |
| Why is the winner the best? | | 0 |
| A Study on Neuro-Symbolic Artificial Intelligence: Healthcare Perspectives | | 0 |
| Unreal Robotics Lab: A High-Fidelity Robotics Simulator with Advanced Physics and Rendering | | 0 |
| Reproducible evaluation of classification methods in Alzheimer's disease: framework and application to MRI and PET data | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |