| VATr++: Choose Your Words Wisely for Handwritten Text Generation | Feb 16, 2024 | Benchmarking, Text Generation | —Unverified | 0 |
| Vec2Face: Unveil Human Faces from their Blackbox Features in Face Recognition | Mar 16, 2020 | Benchmarking, Face Recognition | —Unverified | 0 |
| VELOCITI: Benchmarking Video-Language Compositional Reasoning with Strict Entailment | Jun 16, 2024 | Action Understanding, Benchmarking | —Unverified | 0 |
| VeriContaminated: Assessing LLM-Driven Verilog Coding for Data Contamination | Mar 17, 2025 | Benchmarking, Code Generation | —Unverified | 0 |
| VeriFact: Enhancing Long-Form Factuality Evaluation with Refined Fact Extraction and Reference Facts | May 14, 2025 | Benchmarking, Form | —Unverified | 0 |
| Verifiable Format Control for Large Language Model Generations | Feb 6, 2025 | Benchmarking, Instruction Following | —Unverified | 0 |
| VERIFY: A Benchmark of Visual Explanation and Reasoning for Investigating Multimodal Reasoning Fidelity | Mar 14, 2025 | Benchmarking, Decision Making | —Unverified | 0 |
| VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models | May 21, 2025 | Benchmarking, Reinforcement Learning (RL) | —Unverified | 0 |
| VFHQ: A High-Quality Dataset and Benchmark for Video Face Super-Resolution | May 6, 2022 | Benchmarking, Speaker Identification | —Unverified | 0 |
| ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations | May 20, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking Badminton Action Recognition with a New Fine-Grained Dataset | Mar 19, 2024 | Action Recognition, Benchmarking | —Unverified | 0 |
| VideoMathQA: Benchmarking Mathematical Reasoning via Multimodal Understanding in Videos | Jun 5, 2025 | Benchmarking, Mathematical Reasoning | —Unverified | 0 |
| VidLBEval: Benchmarking and Mitigating Language Bias in Video-Involved LVLMs | Feb 23, 2025 | Benchmarking | —Unverified | 0 |
| Views Are My Own, but Also Yours: Benchmarking Theory of Mind Using Common Ground | Mar 4, 2024 | Benchmarking | —Unverified | 0 |
| Village-Net Clustering: A Rapid approach to Non-linear Unsupervised Clustering of High-Dimensional Data | Jan 16, 2025 | Benchmarking, Clustering | —Unverified | 0 |
| VIPPrint: A Large Scale Dataset of Printed and Scanned Images for Synthetic Face Images Detection and Source Linking | Feb 1, 2021 | Benchmarking, Image Manipulation | —Unverified | 0 |
| Virus-MNIST: Machine Learning Baseline Calculations for Image Classification | Nov 3, 2021 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |
| VisAidMath: Benchmarking Visual-Aided Mathematical Reasoning | Oct 30, 2024 | Benchmarking, Hallucination | —Unverified | 0 |
| VISCO: Benchmarking Fine-Grained Critique and Correction Towards Self-Improvement in Visual Reasoning | Dec 3, 2024 | Benchmarking, Visual Reasoning | —Unverified | 0 |
| VisImages: A Fine-Grained Expert-Annotated Visualization Dataset | Jul 9, 2020 | Benchmarking | —Unverified | 0 |
| WebCode2M: A Real-World Dataset for Code Generation from Webpage Designs | Apr 9, 2024 | Benchmarking, Code Generation | —Unverified | 0 |
| Vision-Based Deep Reinforcement Learning of UAV Autonomous Navigation Using Privileged Information | Dec 9, 2024 | Autonomous Navigation, Benchmarking | —Unverified | 0 |
| Vision-Based Power Line Cables and Pylons Detection for Low Flying Aircraft | Jul 19, 2024 | Benchmarking, Transfer Learning | —Unverified | 0 |
| VisionKG: Unleashing the Power of Visual Datasets via Knowledge Graph | Sep 24, 2023 | Benchmarking, Knowledge Graphs | —Unverified | 0 |
| Vision Learners Meet Web Image-Text Pairs | Jan 17, 2023 | Benchmarking, Self-Supervised Learning | —Unverified | 0 |
| Vision Transformer for Efficient Chest X-ray and Gastrointestinal Image Classification | Apr 23, 2023 | Benchmarking, Data Augmentation | —Unverified | 0 |
| Visual Attention on the Sun: What Do Existing Models Actually Predict? | Nov 25, 2018 | Benchmarking, Deep Attention | —Unverified | 0 |
| Visual Fidelity Index for Generative Semantic Communications with Critical Information Embedding | May 15, 2025 | Benchmarking, Semantic Communication | —Unverified | 0 |
| Visual Object Tracking on Multi-modal RGB-D Videos: A Review | Jan 23, 2022 | Benchmarking, Object | —Unverified | 0 |
| Visual Place Recognition for Large-Scale UAV Applications | Jul 20, 2025 | Benchmarking, Diversity | —Unverified | 0 |
| VITAL: A New Dataset for Benchmarking Pluralistic Alignment in Healthcare | Feb 19, 2025 | Benchmarking, Diversity | —Unverified | 0 |
| VoiceWukong: Benchmarking Deepfake Voice Detection | Sep 10, 2024 | Benchmarking, Face Swapping | —Unverified | 0 |
| V-STaR: Benchmarking Video-LLMs on Video Spatio-Temporal Reasoning | Mar 14, 2025 | Benchmarking, Relational Reasoning | —Unverified | 0 |
| v-SVR Polynomial Kernel for Predicting the Defect Density in New Software Projects | Dec 15, 2018 | Benchmarking, regression | —Unverified | 0 |
| Vulnerability of Face Morphing Attacks: A Case Study on Lookalike and Identical Twins | Mar 24, 2023 | Benchmarking, Face Recognition | —Unverified | 0 |
| From Attack to Protection: Leveraging Watermarking Attack Network for Advanced Add-on Watermarking | Aug 14, 2020 | Benchmarking | —Unverified | 0 |
| Ward: Provable RAG Dataset Inference via LLM Watermarks | Oct 4, 2024 | Benchmarking, RAG | —Unverified | 0 |
| Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation | Dec 12, 2023 | Benchmarking, Columns Property Annotation | —Unverified | 0 |
| WebVision Challenge: Visual Learning and Understanding With Web Data | May 16, 2017 | Benchmarking, image-classification | —Unverified | 0 |
| WelQrate: Defining the Gold Standard in Small Molecule Drug Discovery Benchmarking | Nov 14, 2024 | Benchmarking, Drug Discovery | —Unverified | 0 |
| WER We Stand: Benchmarking Urdu ASR Models | Sep 17, 2024 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | —Unverified | 0 |
| What can 5.17 billion regression fits tell us about artificial models of the human visual system? | Oct 12, 2021 | Benchmarking | —Unverified | 0 |
| What cleaves? Is proteasomal cleavage prediction reaching a ceiling? | Oct 24, 2022 | Benchmarking, Denoising | —Unverified | 0 |
| What Does Neuro Mean to Cardio? Investigating the Role of Clinical Specialty Data in Medical LLMs | May 15, 2025 | All, Benchmarking | —Unverified | 0 |
| What Emotions Make One or Five Stars? Understanding Ratings of Online Product Reviews by Sentiment Analysis and XAI | Feb 29, 2020 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |
| What if we had no Wikipedia? Domain-independent Term Extraction from a Large News Corpus | Sep 17, 2020 | Benchmarking, Term Extraction | —Unverified | 0 |
| Alexpaca: Learning Factual Clarification Question Generation Without Examples | Oct 17, 2023 | Benchmarking, Chatbot | —Unverified | 0 |
| What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts | Aug 1, 2021 | Benchmarking, Binary Classification | —Unverified | 0 |
| Towards Self-adaptive Mutation in Evolutionary Multi-Objective Algorithms | Mar 8, 2023 | Benchmarking, Evolutionary Algorithms | —Unverified | 0 |
| What Will it Take to Fix Benchmarking in Natural Language Understanding? | Apr 5, 2021 | Benchmarking, Natural Language Understanding | —Unverified | 0 |