SOTAVerified

Benchmarking

Papers

Showing 1071–1080 of 5548 papers

Title | Status | Hype
fseval: A Benchmarking Framework for Feature Selection and Feature Ranking Algorithms | Code | 1
Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs | Code | 1
Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering | Code | 1
FragXsiteDTI: Revealing Responsible Segments in Drug-Target Interaction with Transformer-Driven Interpretation | Code | 1
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding | Code | 1
Foundation Model of Electronic Medical Records for Adaptive Risk Estimation | Code | 1
FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions | Code | 1
FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation | Code | 1
GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection | Code | 1
Benchmarking emergency department triage prediction models with machine learning and large public electronic health records | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | - | Unverified