SOTAVerified

Benchmarking

Papers

Showing 36763700 of 5548 papers

TitleStatusHype
pix2pockets: Shot Suggestions in 8-Ball Pool from a Single Image in the Wild0
PKLot-A robust dataset for parking lot classification0
PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI0
Plant in Cupboard, Orange on Rably, Inat Aphone. Benchmarking Incremental Learning of Situation and Language Model using a Text-Simulated Situated Environment0
Point Cloud Compression and Objective Quality Assessment: A Survey0
Point Cloud Objective Quality: Benchmarking Features and Quality Evaluation0
Polarization and Index Modulations: a Theoretical and Practical Perspective0
Policy Entropy for Out-of-Distribution Classification0
Polyp-E: Benchmarking the Robustness of Deep Segmentation Models via Polyp Editing0
Portfolio Benchmarking under Drawdown Constraint and Stochastic Sharpe Ratio0
PoseBench: Benchmarking the Robustness of Pose Estimation Models under Corruptions0
Pose Estimation for Non-Cooperative Spacecraft Rendezvous Using Convolutional Neural Networks0
Position: AI Competitions Provide the Gold Standard for Empirical Rigor in GenAI Evaluation0
Position: Benchmarking is Limited in Reinforcement Learning Research0
Position: Graph Learning Will Lose Relevance Due To Poor Benchmarks0
Position: There are no Champions in Long-Term Time Series Forecasting0
Post-FEC BER Benchmarking for Bit-Interleaved Coded Modulation with Probabilistic Shaping0
Post-hoc labeling of arbitrary EEG recordings for data-efficient evaluation of neural decoding methods0
Deep Neural Operator Driven Real Time Inference for Nuclear Systems to Enable Digital Twin Solutions0
PowerGraph: A power grid benchmark dataset for graph neural networks0
Power Line Communication vs. Talkative Power Conversion: A Benchmarking Study0
Practical Design and Benchmarking of Generative AI Applications for Surgical Billing and Coding0
Practical, Fast and Robust Point Cloud Registration for 3D Scene Stitching and Object Localization0
Precise Model Benchmarking with Only a Few Observations0
Predicting credit default probabilities using machine learning techniques in the face of unequal class distributions0
Show:102550
← PrevPage 148 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified