SOTAVerified

Benchmarking

Papers

Showing 25512560 of 5548 papers

TitleStatusHype
Coherent Feed Forward Quantum Neural Network0
We're Not Using Videos Effectively: An Updated Domain Adaptive Video Segmentation BaselineCode1
Benchmarking Transferable Adversarial AttacksCode1
Benchmarking Sensitivity of Continual Graph Learning for Skeleton-Based Action Recognition0
I Think, Therefore I am: Benchmarking Awareness of Large Language Models Using AwareBenchCode4
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation dataCode0
Explainable Benchmarking for Iterative Optimization HeuristicsCode1
Category-wise Fine-Tuning: Resisting Incorrect Pseudo-Labels in Multi-Label Image Classification with Partial LabelsCode1
Planning, Creation, Usage: Benchmarking LLMs for Comprehensive Tool Utilization in Real-World Complex ScenariosCode2
ToPro: Token-Level Prompt Decomposition for Cross-Lingual Sequence Labeling TasksCode0
Show:102550
← PrevPage 256 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified