ViDDAR: Vision Language Model-Based Task-Detrimental Content Detection for Augmented Reality Jan 22, 2025 Language Modeling Language Modelling
VideoBERT: A Joint Model for Video and Language Representation Learning Apr 3, 2019 Action Classification General Classification
Video (language) modeling: a baseline for generative models of natural videos Dec 20, 2014 Language Modeling Language Modelling
video-SALMONN: Speech-Enhanced Audio-Visual Large Language Models Jun 22, 2024 Diversity Language Modeling
VILA: Learning Image Aesthetics from User Comments with Vision-Language Pretraining Mar 24, 2023 Decoder Language Modelling
Enhancing Visual Grounding and Generalization: A Multi-Task Cycle Training Approach for Vision-Language Models Nov 21, 2023 Image Segmentation Language Modelling
ViLP: Knowledge Exploration using Vision, Language, and Pose Embeddings for Video Action Recognition Aug 7, 2023 Action Recognition Language Modeling
ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation May 21, 2025 Decision Making Language Modeling
Virology Capabilities Test (VCT): A Multimodal Virology Q&A Benchmark Apr 21, 2025 Language Modeling Language Modelling
VirusT5: Harnessing Large Language Models to Predicting SARS-CoV-2 Evolution Dec 20, 2024 Language Modeling Language Modelling
Vision Conformer: Incorporating Convolutions into Vision Transformer Layers Apr 27, 2023 Inductive Bias Language Modeling
Vision-Language and Large Language Model Performance in Gastroenterology: GPT, Claude, Llama, Phi, Mistral, Gemma, and Quantized Models Aug 25, 2024 Language Modeling Language Modelling
Vision-Language In-Context Learning Driven Few-Shot Visual Inspection Model Feb 13, 2025 In-Context Learning Language Modeling
Vision-Language Pre-Training for Boosting Scene Text Detectors Apr 29, 2022 Contrastive Learning Language Modeling
VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning Jul 17, 2025 Language Modeling Language Modelling
ViSoBERT: A Pre-Trained Language Model for Vietnamese Social Media Text Processing Oct 17, 2023 Language Modeling Language Modelling
VIS-Shepherd: Constructing Critic for LLM-based Data Visualization Generation Jun 16, 2025 Data Visualization Language Modeling
Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model May 28, 2024 Language Modeling Language Modelling
Visually-Aware Context Modeling for News Image Captioning Aug 16, 2023 Articles Image Captioning
Visually Dehallucinative Instruction Generation Feb 13, 2024 Hallucination Language Modeling
Visually Dehallucinative Instruction Generation: Know What You Don't Know Feb 15, 2024 Hallucination Language Modeling
Visual Re-ranking with Natural Language Understanding for Text Spotting Oct 29, 2018 Language Modeling Language Modelling
VIXEN: Visual Text Comparison Network for Image Difference Captioning Feb 29, 2024 Language Modeling Language Modelling
VLTP: Vision-Language Guided Token Pruning for Task-Oriented Segmentation Sep 13, 2024 Decoder Language Modelling
VL-Uncertainty: Detecting Hallucination in Large Vision-Language Model via Uncertainty Estimation Nov 18, 2024 Hallucination Language Modeling
Vocabulary-free Image Classification and Semantic Segmentation Apr 16, 2024 Classification image-classification
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation Aug 15, 2017 Language Modeling Language Modelling
VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration May 26, 2025 Language Modeling Language Modelling
Demonstration of an Adversarial Attack Against a Multimodal Vision Language Model for Pathology Imaging Jan 4, 2024 Adversarial Attack Domain Adaptation
V-Zen: Efficient GUI Understanding and Precise Grounding With A Novel Multimodal LLM May 24, 2024 Language Modelling Large Language Model
Walk Extraction Strategies for Node Embeddings with RDF2Vec in Knowledge Graphs Sep 9, 2020 Knowledge Graphs Language Modelling
Wanda++: Pruning Large Language Models via Regional Gradients Mar 6, 2025 Decoder GPU
WatChat: Explaining perplexing programs by debugging mental models Mar 8, 2024 counterfactual Language Modelling
Watch What You Just Said: Image Captioning with Text-Conditional Attention Jun 15, 2016 Image Captioning Language Modeling
Watermark under Fire: A Robustness Evaluation of LLM Watermarking Nov 20, 2024 Language Modeling Language Modelling
Scaling Capability in Token Space: An Analysis of Large Vision Language Model Dec 24, 2024 Language Modeling Language Modelling
We are what we repeatedly do: Inducing and deploying habitual schemas in persona-based responses Oct 10, 2023 Dialogue Generation Language Modelling
Web Page Classification using LLMs for Crawling Support May 11, 2025 Classification Language Modeling
We're Calling an Intervention: Exploring Fundamental Hurdles in Adapting Language Models to Nonstandard Text Apr 10, 2024 Language Modeling Language Modelling
WET: Overcoming Paraphrasing Vulnerabilities in Embeddings-as-a-Service with Linear Transformation Watermarks Aug 29, 2024 Language Modeling Language Modelling
What a neural language model tells us about spatial relations Jun 1, 2019 Image Description Language Modeling
What BERT is not: Lessons from a new suite of psycholinguistic diagnostics for language models Jul 31, 2019 Language Modeling Language Modelling
What Does BERT Look At? An Analysis of BERT's Attention Jun 11, 2019 Language Modeling Language Modelling
What Do Llamas Really Think? Revealing Preference Biases in Language Model Representations Nov 30, 2023 Language Modeling Language Modelling
What Do Recurrent Neural Network Grammars Learn About Syntax? Nov 17, 2016 Constituency Parsing Dependency Parsing
What makes a language easy to deep-learn? Deep neural networks and humans similarly benefit from compositional structure Feb 23, 2023 Language Modeling Language Modelling
What Makes Pre-trained Language Models Better Zero-shot Learners? Sep 30, 2022 Language Modelling Prompt Learning
What's in a Name? Evaluating Assembly-Part Semantic Knowledge in Language Models through User-Provided Names in CAD Files Apr 25, 2023 Language Modelling
What's the Difference? Supporting Users in Identifying the Effects of Prompt and Model Changes Through Token Patterns Apr 22, 2025 Language Modeling Language Modelling
When Babies Teach Babies: Can student knowledge sharing outperform Teacher-Guided Distillation on small datasets? Nov 25, 2024 Knowledge Distillation Language Modeling