SOTAVerified

Benchmarking

Papers

Showing 38013825 of 5548 papers

TitleStatusHype
Real-time Webcam Heart-Rate and Variability Estimation with Clean Ground Truth for Evaluation0
One-Shot Real-to-Sim via End-to-End Differentiable Simulation and Rendering0
Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms0
Real-World fNIRS-Based Brain-Computer Interfaces: Benchmarking Deep Learning and Classical Models in Interactive Gaming0
Rearrangement: A Challenge for Embodied AI0
Reasoning as a Resource: Optimizing Fast and Slow Thinking in Code Generation Models0
Re-assessing ImageNet: How aligned is its single-label assumption with its multi-label nature?0
RECipe: Does a Multi-Modal Recipe Knowledge Graph Fit a Multi-Purpose Recommendation System?0
Recommendations for Baselines and Benchmarking Approximate Gaussian Processes0
Reconstructing antibody repertoires from error-prone immunosequencing datasets0
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research0
Refer to Anything with Vision-Language Prompts0
Regularization of ML models for Earth systems by using longer model timesteps0
Reinforcement Learning Based Handwritten Digit Recognition with Two-State Q-Learning0
Reinforcement Learning with Graph Attention for Routing and Wavelength Assignment with Lightpath Reuse0
Reinforcing Competitive Multi-Agents for Playing So Long Sucker0
Relative Afferent Pupillary Defect Screening through Transfer Learning0
Reliable validation of Reinforcement Learning Benchmarks0
REMoH: A Reflective Evolution of Multi-objective Heuristics approach via Large Language Models0
Removal of Ocular Artifacts in EEG Using Deep Learning0
Removing Multiple Hybrid Adverse Weather in Video via a Unified Model0
Rephrasing natural text data with different languages and quality levels for Large Language Model pre-training0
REPLAB: A Reproducible Low-Cost Arm Benchmark Platform for Robotic Learning0
Reproducible evaluation of classification methods in Alzheimer's disease: framework and application to MRI and PET data0
Repurposing Foundation Model for Generalizable Medical Time Series Classification0
Show:102550
← PrevPage 153 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified