SOTAVerified

Benchmarking

Papers

Showing 751760 of 5548 papers

TitleStatusHype
Towards Sim-to-Real Industrial Parts Classification with Synthetic DatasetCode1
Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation ModelCode1
AgentQuest: A Modular Benchmark Framework to Measure Progress and Improve LLM AgentsCode1
PARIS3D: Reasoning-based 3D Part Segmentation Using Large Multimodal ModelCode1
Outlier-Efficient Hopfield Layers for Large Transformer-Based ModelsCode1
Benchmarking Large Language Models for Persian: A Preliminary Study Focusing on ChatGPTCode1
PREGO: online mistake detection in PRocedural EGOcentric videosCode1
Atom-Level Optical Chemical Structure Recognition with Limited SupervisionCode1
Benchmarking Counterfactual Image GenerationCode1
Benchmarking the Robustness of Temporal Action Detection Models Against Temporal CorruptionsCode1
Show:102550
← PrevPage 76 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified