SOTAVerified

Benchmarking

Papers

Showing 49714980 of 5548 papers

TitleStatusHype
Evaluating SAT and SMT Solvers on Large-Scale Sudoku PuzzlesCode0
NbBench: Benchmarking Language Models for Comprehensive Nanobody TasksCode0
NCAdapt: Dynamic adaptation with domain-specific Neural Cellular Automata for continual hippocampus segmentationCode0
A Systematic Review of Green AICode0
Evaluating LLP Methods: Challenges and ApproachesCode0
Evaluating Feature Attribution Methods in the Image DomainCode0
NegBio: a high-performance tool for negation and uncertainty detection in radiology reportsCode0
A Comprehensive Comparison of Multi-Dimensional Image Denoising MethodsCode0
NeMig -- A Bilingual News Collection and Knowledge Graph about MigrationCode0
NengoDL: Combining deep learning and neuromorphic modelling methodsCode0
Show:102550
← PrevPage 498 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified