SOTAVerified

Benchmarking

Papers

Showing 43014325 of 5548 papers

TitleStatusHype
VATr++: Choose Your Words Wisely for Handwritten Text Generation0
Vec2Face: Unveil Human Faces from their Blackbox Features in Face Recognition0
VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment0
VeriContaminated: Assessing LLM-Driven Verilog Coding for Data Contamination0
VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts0
Verifiable Format Control for Large Language Model Generations0
VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity0
VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models0
VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution0
ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations0
Benchmarking Badminton Action Recognition with a New Fine-Grained Dataset0
VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos0
VidLBEval: Benchmarking and Mitigating Language Bias in Video-Involved LVLMs0
Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground0
Village-Net Clustering: A Rapid approach to Non-linear Unsupervised Clustering of High-Dimensional Data0
VIPPrint: A Large Scale Dataset of Printed and Scanned Images for Synthetic Face Images Detection and Source Linking0
Virus-MNIST: Machine Learning Baseline Calculations for Image Classification0
VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning0
VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning0
VisImages: A Fine-Grained Expert-Annotated Visualization Dataset0
WebCode2M: A Real-World Dataset for Code Generation from Webpage Designs0
Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information0
Vision-Based Power Line Cables and Pylons Detection for Low Flying Aircraft0
VisionKG: Unleashing the Power of Visual Datasets via Knowledge Graph0
Vision Learners Meet Web Image-Text Pairs0
Show:102550
← PrevPage 173 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified