SOTAVerified

Benchmarking

Papers

Showing 9911000 of 5548 papers

TitleStatusHype
Prompt Tuned Embedding Classification for Multi-Label Industry Sector AllocationCode1
Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic PlanningCode1
Codabench: Flexible, Easy-to-Use and Reproducible Benchmarking PlatformCode1
MultiRes-NetVLAD: Augmenting Place Recognition Training with Low-Resolution ImageryCode1
CodeS: Natural Language to Code Repository via Multi-Layer SketchCode1
Benchmarking for Biomedical Natural Language Processing Tasks with a Domain Specific ALBERTCode1
MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection BenchmarkCode1
MuSe-GNN: Learning Unified Gene Representation From Multimodal Biological Graph DataCode1
CommonPower: A Framework for Safe Data-Driven Smart Grid ControlCode1
Working Memory Capacity of ChatGPT: An Empirical StudyCode1
Show:102550
← PrevPage 100 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified