| DeSPITE: Exploring Contrastive Deep Skeleton-Pointcloud-IMU-Text Embeddings for Advanced Point Cloud Human Activity Understanding | Jun 16, 2025 | Activity Recognition, Human Activity Recognition | Unverified | 0 |
| Retrieval Augmented Generation Evaluation for Health Documents | May 7, 2025 | Moment Retrieval, RAG | Unverified | 0 |
| Grounding-MD: Grounded Video-language Pre-training for Open-World Moment Detection | Apr 20, 2025 | Action Detection, Decoder | Unverified | 0 |
| Towards Efficient and Robust Moment Retrieval System: A Unified Framework for Multi-Granularity Models and Temporal Reranking | Apr 11, 2025 | Moment Retrieval, Question Answering | Unverified | 0 |
| TimeLoc: A Unified End-to-End Framework for Precise Timestamp Localization in Long Videos | Mar 9, 2025 | Action Localization, Boundary Detection | Code Available | 1 |
| MomentSeeker: A Task-Oriented Benchmark For Long-Video Moment Retrieval | Feb 18, 2025 | Action Recognition, Moment Retrieval | Unverified | 0 |
| Moment of Untruth: Dealing with Negative Queries in Video Moment Retrieval | Feb 12, 2025 | Avg, Moment Retrieval | Code Available | 0 |
| Multi-modal Fusion and Query Refinement Network for Video Moment Retrieval and Highlight Detection | Jan 18, 2025 | Avg, Highlight Detection | Unverified | 0 |
| LD-DETR: Loop Decoder DEtection TRansformer for Video Moment Retrieval and Highlight Detection | Jan 18, 2025 | Contrastive Learning, Decoder | Code Available | 1 |
| Zero-shot Video Moment Retrieval via Off-the-shelf Multimodal Large Language Models | Jan 14, 2025 | Moment Retrieval, Retrieval | Unverified | 0 |
| The Devil is in the Spurious Correlation: Boosting Moment Retrieval via Temporal Dynamic Learning | Jan 13, 2025 | Moment Retrieval, Retrieval | Unverified | 0 |
| A Flexible and Scalable Framework for Video Moment Search | Jan 9, 2025 | Moment Retrieval, Re-Ranking | Code Available | 1 |
| Watch Video, Catch Keyword: Context-aware Keyword Attention for Moment Retrieval and Highlight Detection | Jan 5, 2025 | Contrastive Learning, Highlight Detection | Code Available | 1 |
| DTOS: Dynamic Time Object Sensing with Large Multimodal Model | Jan 1, 2025 | Moment Retrieval, Referring Video Object Segmentation | Code Available | 0 |
| Anchor-Aware Similarity Cohesion in Target Frames Enables Predicting Temporal Moment Boundaries in 2D | Jan 1, 2025 | Moment Retrieval, Semantic Similarity | Code Available | 0 |
| Length-Aware DETR for Robust Moment Retrieval | Dec 30, 2024 | Information Retrieval, Moment Retrieval | Code Available | 1 |
| DAVE: Diverse Atomic Visual Elements Dataset with High Representation of Vulnerable Road Users in Complex and Unpredictable Environments | Dec 28, 2024 | Action Localization, Action Recognition | Unverified | 0 |
| Query-centric Audio-Visual Cognition Network for Moment Retrieval, Segmentation and Step-Captioning | Dec 18, 2024 | Moment Retrieval, Multi-Task Learning | Unverified | 0 |
| FlashVTG: Feature Layering and Adaptive Score Handling Network for Video Temporal Grounding | Dec 18, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 |
| Agent-based Video Trimming | Dec 12, 2024 | Highlight Detection, Moment Retrieval | Unverified | 0 |
| VideoLights: Feature Refinement and Cross-Task Alignment Transformer for Joint Video Highlight Detection and Moment Retrieval | Dec 2, 2024 | Highlight Detection, Moment Retrieval | Code Available | 1 |
| Vid-Morp: Video Moment Retrieval Pretraining from Unlabeled Videos in the Wild | Dec 1, 2024 | Moment Retrieval, Retrieval | Code Available | 1 |
| LLaVA-MR: Large Language-and-Vision Assistant for Video Moment Retrieval | Nov 21, 2024 | Moment Retrieval, Natural Language Moment Retrieval | Code Available | 0 |
| Number it: Temporal Grounding Videos like Flipping Manga | Nov 15, 2024 | Highlight Detection, Moment Retrieval | Code Available | 2 |
| TimeSuite: Improving MLLMs for Long Video Understanding via Grounded Tuning | Oct 25, 2024 | EgoSchema, Hallucination | Code Available | 2 |