SOTAVerified

Benchmarking

Papers

Showing 14711480 of 5548 papers

TitleStatusHype
Benchmarking Graph Neural Networks on Dynamic Link PredictionCode1
Benchmarking Graph Neural Networks for FMRI analysisCode1
MatTools: Benchmarking Large Language Models for Materials Science ToolsCode1
Boosting Healthcare LLMs Through Retrieved ContextCode1
Beyond neural scaling laws: beating power law scaling via data pruningCode1
Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm OptimizationCode1
Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language ModelsCode1
DependEval: Benchmarking LLMs for Repository Dependency UnderstandingCode1
Labelling unlabelled videos from scratch with multi-modal self-supervisionCode1
LogLead -- Fast and Integrated Log Loader, Enhancer, and Anomaly DetectorCode1
Show:102550
← PrevPage 148 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified