SOTAVerified

Benchmarking

Papers

Showing 23112320 of 5548 papers

TitleStatusHype
A Metadata-Driven Approach to Understand Graph Neural Networks0
FixCLR: Negative-Class Contrastive Learning for Semi-Supervised Domain Generalization0
BenchMARL: Benchmarking Multi-Agent Reinforcement Learning0
BAGELS: Benchmarking the Automated Generation and Extraction of Limitations from Scholarly Text0
ACT-Bench: Towards Action Controllable World Models for Autonomous Driving0
Fine-tuning LLaMA 2 interference: a comparative study of language implementations for optimal efficiency0
Benchmarks as Microscopes: A Call for Model Metrology0
FineText: Text Classification via Attention-based Language Model Fine-tuning0
Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge0
FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets0
Show:102550
← PrevPage 232 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified