SOTAVerified

Benchmarking

Papers

Showing 44714480 of 5548 papers

TitleStatusHype
Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm OptimizationCode1
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style TransferCode1
A Probabilistic Framework for Lexicon-based Keyword Spotting in Handwritten Text Images0
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam0
BERT-based Chinese Text Classification for Emergency Domain with a Novel Loss Function0
Dynabench: Rethinking Benchmarking in NLP0
Efficient and Accurate In-Database Machine Learning with SQL Code Generation in Python0
Robust Semantic Interpretability: Revisiting Concept Activation VectorsCode1
CBench: Towards Better Evaluation of Question Answering Over Knowledge GraphsCode1
What Will it Take to Fix Benchmarking in Natural Language Understanding?0
Show:102550
← PrevPage 448 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified