| Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction | Apr 19, 2025 | Denoising, Image Generation | Unverified | 0 |
| DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction | Apr 14, 2025 | Computational Efficiency, Saliency Prediction | Unverified | 0 |
| Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues | Feb 1, 2025 | Action Classification, Action Localization | Unverified | 0 |
| Relevance-guided Audio Visual Fusion for Video Saliency Prediction | Nov 18, 2024 | Prediction, Saliency Prediction | Unverified | 0 |
| AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Sep 23, 2024 | Saliency Detection, Saliency Prediction | Code Available | 1 |
| CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion | Aug 21, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| SalFoM: Dynamic Saliency Prediction with Video Foundation Models | Apr 3, 2024 | Decoder, Prediction | Unverified | 0 |
| Transformer-based Video Saliency Prediction with High Temporal Dimension Decoding | Jan 15, 2024 | Decoder, Saliency Prediction | Unverified | 0 |
| UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection | Sep 15, 2023 | Decoder, object-detection | Unverified | 0 |
| Spherical Vision Transformer for 360-degree Video Saliency Prediction | Aug 24, 2023 | Prediction, Saliency Prediction | Code Available | 1 |
| CASP-Net: Rethinking Video Saliency Prediction from an Audio-Visual Consistency Perceptual Perspective | Mar 11, 2023 | Decoder, Saliency Prediction | Unverified | 0 |
| TinyHD: Efficient Video Saliency Prediction with Heterogeneous Decoders using Hierarchical Maps Distillation | Jan 11, 2023 | Knowledge Distillation, Prediction | Code Available | 1 |
| CASP-Net: Rethinking Video Saliency Prediction From an Audio-Visual Consistency Perceptual Perspective | Jan 1, 2023 | Decoder, Saliency Prediction | Unverified | 0 |
| GASP: Gated Attention For Saliency Prediction | Jun 9, 2022 | Prediction, Saliency Prediction | Code Available | 1 |
| Spatio-Temporal Self-Attention Network for Video Saliency Prediction | Aug 24, 2021 | Prediction, Saliency Prediction | Code Available | 1 |
| Noise-Aware Video Saliency Prediction | Apr 16, 2021 | Prediction, Saliency Prediction | Code Available | 0 |
| ViNet: Pushing the limits of Visual Modality for Audio-Visual Saliency Prediction | Dec 11, 2020 | Action Recognition, Decoder | Code Available | 1 |
| Hierarchical Domain-Adapted Feature Learning for Video Saliency Prediction | Oct 2, 2020 | Domain Adaptation, Saliency Detection | Code Available | 1 |
| Video Saliency Prediction Using Enhanced Spatiotemporal Alignment Network | Jan 2, 2020 | Prediction, Saliency Prediction | Code Available | 1 |
| Simple vs complex temporal recurrences for video saliency prediction | Jul 3, 2019 | Prediction, Saliency Prediction | Code Available | 0 |
| Learning to Explore Intrinsic Saliency for Stereoscopic Video | Jun 1, 2019 | Saliency Detection, Saliency Prediction | Unverified | 0 |
| DAVE: A Deep Audio-Visual Embedding for Dynamic Saliency Prediction | May 25, 2019 | Decoder, Prediction | Code Available | 0 |
| Model-guided Multi-path Knowledge Aggregation for Aerial Saliency Prediction | Nov 14, 2018 | Aerial Video Saliency Prediction, Prediction | Unverified | 0 |
| DeepVS: A Deep Learning Based Video Saliency Prediction Approach | Sep 1, 2018 | Deep Learning, Prediction | Code Available | 0 |
| Temporal Saliency Adaptation in Egocentric Videos | Aug 28, 2018 | Saliency Prediction, Video Saliency Prediction | Code Available | 0 |