Self-supervised Video Representation Learning Using Inter-intra Contrastive Framework Aug 6, 2020 Action Recognition In Videos Contrastive Learning
Code Code Available 1Temporal Context Aggregation for Video Retrieval with Contrastive Learning Aug 4, 2020 Contrastive Learning Representation Learning
Code Code Available 1Memory-augmented Dense Predictive Coding for Video Representation Learning Aug 3, 2020 Action Classification Action Recognition
Code Code Available 1The End-of-End-to-End: A Video Understanding Pentathlon Challenge (2020) Aug 3, 2020 Natural Language Queries Retrieval
Code Code Available 1Multi-modal Transformer for Video Retrieval Jul 21, 2020 Natural Language Queries Retrieval
Code Code Available 1Generalized Few-Shot Video Classification with Video Retrieval and Feature Generation Jul 9, 2020 Few-Shot Image Classification Few-Shot Learning
Code Code Available 1Video Playback Rate Perception for Self-supervised Spatio-Temporal Representation Learning Jun 20, 2020 Action Recognition Decoder
Code Code Available 1AVLnet: Learning Audio-Visual Language Representations from Instructional Videos Jun 16, 2020 Automatic Speech Recognition Automatic Speech Recognition (ASR)
Code Code Available 1Delta Descriptors: Change-Based Place Representation for Robust Visual Localization Jun 10, 2020 Autonomous Driving Image Retrieval
Code Code Available 1Hysia: Serving DNN-Based Video-to-Retail Applications in Cloud Jun 9, 2020 GPU Video Retrieval
Code Code Available 1Searching for Actions on the Hyperbole Jun 1, 2020 Action Recognition Video Retrieval
Code Code Available 1Video Playback Rate Perception for Self-Supervised Spatio-Temporal Representation Learning Jun 1, 2020 Action Recognition Decoder
Code Code Available 1Condensed Movies: Story Based Retrieval with Contextual Embeddings May 8, 2020 Retrieval Text to Video Retrieval
Code Code Available 1HERO: Hierarchical Encoder for Video+Language Omni-representation Pre-training May 1, 2020 Language Modeling Language Modelling
Code Code Available 1Targeted Attack for Deep Hashing based Retrieval Apr 15, 2020 Deep Hashing Image Retrieval
Code Code Available 1SpeedNet: Learning the Speediness in Videos Apr 13, 2020 Action Recognition Binary Classification
Code Code Available 1UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation Feb 15, 2020 Action Segmentation Decoder
Code Code Available 1TVR: A Large-Scale Dataset for Video-Subtitle Moment Retrieval Jan 24, 2020 Moment Retrieval Retrieval
Code Code Available 1Video Cloze Procedure for Self-Supervised Spatio-Temporal Learning Jan 2, 2020 Action Recognition Representation Learning
Code Code Available 1End-to-End Learning of Visual Representations from Uncurated Instructional Videos Dec 13, 2019 Action Localization Action Recognition
Code Code Available 1Multimedia Retrieval Through Unsupervised Hypergraph-Based Manifold Ranking Dec 1, 2019 Content-Based Image Retrieval Retrieval
Code Code Available 1ViSiL: Fine-grained Spatio-Temporal Video Similarity Learning Aug 20, 2019 ISVR Retrieval
Code Code Available 1Use What You Have: Video Retrieval Using Representations From Collaborative Experts Jul 31, 2019 Natural Language Queries Retrieval
Code Code Available 1HowTo100M: Learning a Text-Video Embedding by Watching Hundred Million Narrated Video Clips Jun 7, 2019 Action Localization Long Video Retrieval (Background Removed)
Code Code Available 1IMAE for Noise-Robust Learning: Mean Absolute Error Does Not Treat Examples Equally and Gradient Magnitude's Variance Matters Mar 28, 2019 image-classification Image Classification
Code Code Available 1Learning a Text-Video Embedding from Incomplete and Heterogeneous Data Apr 7, 2018 Retrieval Text Retrieval
Code Code Available 1Dense-Captioning Events in Videos May 2, 2017 Dense Captioning Retrieval
Code Code Available 1Q2E: Query-to-Event Decomposition for Zero-Shot Multilingual Text-to-Video Retrieval Jun 11, 2025 Retrieval Text to Video Retrieval
— Unverified 0MAGMaR Shared Task System Description: Video Retrieval with OmniEmbed Jun 11, 2025 Retrieval Video Retrieval
— Unverified 0From Play to Replay: Composed Video Retrieval for Temporally Fine-Grained Videos Jun 5, 2025 Action Classification Composed Video Retrieval (CoVR)
Code Code Available 0Leveraging Auxiliary Information in Text-to-Video Retrieval: A Review May 29, 2025 Retrieval Text to Video Retrieval
— Unverified 0Learning World Models for Interactive Video Generation May 28, 2025 In-Context Learning Retrieval
— Unverified 0A Challenge to Build Neuro-Symbolic Video Agents May 20, 2025 Scene Classification Video Retrieval
Code Code Available 0Contrastive Alignment with Semantic Gap-Aware Corrections in Text-Video Retrieval May 18, 2025 Contrastive Learning Retrieval
Code Code Available 0CMAWRNet: Multiple Adverse Weather Removal via a Unified Quaternion Neural Architecture May 3, 2025 Autonomous Driving Benchmarking
— Unverified 0Empowering Agentic Video Analytics Systems with Video Language Models May 1, 2025 Knowledge Graphs RAG
— Unverified 0ReSpec: Relevance and Specificity Grounded Online Filtering for Learning on Video-Text Data Streams Apr 21, 2025 Informativeness Low-latency processing
Code Code Available 0Prototypes are Balanced Units for Efficient and Effective Partially Relevant Video Retrieval Apr 17, 2025 Partially Relevant Video Retrieval Retrieval
— Unverified 0Towards Efficient Partially Relevant Video Retrieval with Active Moment Discovering Apr 15, 2025 Partially Relevant Video Retrieval Retrieval
Code Code Available 0Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking Apr 11, 2025 Moment Retrieval Question Answering
— Unverified 0TC-MGC: Text-Conditioned Multi-Grained Contrastive Learning for Text-Video Retrieval Apr 7, 2025 Contrastive Learning Retrieval
Code Code Available 0Leveraging Modality Tags for Enhanced Cross-Modal Video Retrieval Apr 2, 2025 cross-modal alignment Retrieval
— Unverified 0Video-ColBERT: Contextualized Late Interaction for Text-to-Video Retrieval Mar 24, 2025 Retrieval Text to Video Retrieval
— Unverified 0Enhancing Subsequent Video Retrieval via Vision-Language Models (VLMs) Mar 21, 2025 Representation Learning Retrieval
Code Code Available 0Long-VMNet: Accelerating Long-Form Video Understanding via Fixed Memory Mar 17, 2025 Form GPU
— Unverified 0Quality Over Quantity? LLM-Based Curation for a Data-Efficient Audio-Video Foundation Model Mar 12, 2025 AudioCaps Contrastive Learning
— Unverified 0Narrating the Video: Boosting Text-Video Retrieval via Comprehensive Utilization of Frame-Level Captions Mar 7, 2025 Retrieval Video Retrieval
— Unverified 0LLaVE: Large Language and Vision Embedding Models with Hardness-Weighted Contrastive Learning Mar 4, 2025 Contrastive Learning Image-text Retrieval
— Unverified 0Learning to Generate Long-term Future Narrations Describing Activities of Daily Living Mar 3, 2025 Action Anticipation Decision Making
— Unverified 0TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba Feb 21, 2025 image-classification Image Classification
— Unverified 0