SOTAVerified

Saliency Prediction

A saliency map represents where people are likely to fixate in a visual scene. Saliency prediction is informed by the human visual attention mechanism and estimates, for each position in the scene, the probability that the human eye will fixate there.
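As a rough illustration of the definition above, a saliency map can be treated as a probability distribution over image locations. The sketch below builds such a map from recorded fixation points by placing a Gaussian at each fixation and normalizing the result. The function name `fixation_density`, the `sigma` value, and the sample fixations are hypothetical and chosen only for illustration; real datasets and models use their own conventions.

```python
import math


def fixation_density(fixations, height, width, sigma=2.0):
    """Turn (row, col) fixation points into a map whose values sum to 1.

    Assumption: each fixation contributes an isotropic Gaussian bump;
    sigma is an illustrative blur width in pixels.
    """
    density = [[0.0] * width for _ in range(height)]
    for fr, fc in fixations:
        for r in range(height):
            for c in range(width):
                d2 = (r - fr) ** 2 + (c - fc) ** 2
                density[r][c] += math.exp(-d2 / (2.0 * sigma ** 2))
    total = sum(sum(row) for row in density)
    if total == 0.0:
        raise ValueError("at least one fixation is required")
    # Normalize so the map can be read as a probability distribution.
    return [[v / total for v in row] for row in density]


# Hypothetical fixations on an 8x8 scene: two close together, one isolated.
fixations = [(2, 3), (2, 4), (6, 1)]
saliency = fixation_density(fixations, height=8, width=8, sigma=1.0)

# Location with the highest estimated fixation probability.
peak = max(
    (v, (r, c)) for r, row in enumerate(saliency) for c, v in enumerate(row)
)[1]
print("most salient location (row, col):", peak)
```

A prediction model would output a map of the same shape directly from the image, and the fixation-based map would serve as the reference it is compared against.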

Papers

Showing 1-50 of 268 papers

| Title | Status | Hype |
|---|---|---|
| CASE: Contrastive Activation for Saliency Estimation | Code | 0 |
| Modeling Saliency Dataset Bias | | 0 |
| CSPENet: Contour-Aware and Saliency Priors Embedding Network for Infrared Small Target Detection | Code | 0 |
| LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs | | 0 |
| Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction | | 0 |
| DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction | | 0 |
| Uncertainty Guided Refinement for Fine-Grained Salient Object Detection | Code | 1 |
| Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes | Code | 1 |
| A Deep Learning Framework for Visual Attention Prediction and Analysis of News Interfaces | Code | 0 |
| EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues | | 0 |
| Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues | | 0 |
| CASP: Consistency-aware Audio-induced Saliency Prediction Model for Omnidirectional Video | | 0 |
| Explainable Saliency: Articulating Reasoning with Contextual Prioritization | | 0 |
| Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D Graphics | Code | 0 |
| Relevance-guided Audio Visual Fusion for Video Saliency Prediction | | 0 |
| Correlation of Object Detection Performance with Visual Saliency and Depth Estimation | Code | 0 |
| AIM 2024 Challenge on Video Saliency Prediction: Methods and Results | Code | 1 |
| Data Augmentation via Latent Diffusion for Saliency Prediction | Code | 1 |
| Adapting Segment Anything Model to Multi-modal Salient Object Detection with Semantic Feature Fusion Guidance | Code | 1 |
| CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion | | 0 |
| How Does Audio Influence Visual Attention in Omnidirectional Videos? Database and Model | Code | 0 |
| Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study | | 0 |
| SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillation | Code | 0 |
| Shap-Mix: Shapley Value Guided Mixing for Long-Tailed Skeleton Based Action Recognition | Code | 1 |
| SUM: Saliency Unification through Mamba for Visual Attention Modeling | Code | 2 |
| Alignment-Free RGBT Salient Object Detection: Semantics-guided Asymmetric Correlation Network and A Unified Benchmark | Code | 1 |
| MDS-ViTNet: Improving saliency prediction for Eye-Tracking with Vision Transformer | Code | 1 |
| Enhancing Saliency Prediction in Monitoring Tasks: The Role of Visual Highlights | | 0 |
| Bridging the Gap Between Saliency Prediction and Image Quality Assessment | Code | 0 |
| Salient Object Detection From Arbitrary Modalities | Code | 0 |
| Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models | | 0 |
| Shifting Focus with HCEye: Exploring the Dynamics of Visual Highlighting and Cognitive Load on User Attention and Saliency Prediction | | 0 |
| Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method | | 0 |
| How is Visual Attention Influenced by Text Guidance? Database and Model | Code | 0 |
| SalFoM: Dynamic Saliency Prediction with Video Foundation Models | | 0 |
| GreenSaliency: A Lightweight and Efficient Image Saliency Detection Method | | 0 |
| Selective Attention-based Modulation for Continual Learning | Code | 0 |
| DiffGaze: A Diffusion Model for Continuous Gaze Sequence Generation on 360° Images | | 0 |
| Learning User Embeddings from Human Gaze for Personalised Saliency Prediction | | 0 |
| A Modified Word Saliency-Based Adversarial Attack on Text Classification Models | | 0 |
| Brand Visibility in Packaging: A Deep Learning Approach for Logo Detection, Saliency-Map Prediction, and Logo Placement Analysis | Code | 1 |
| DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction | | 0 |
| Transformer-based Video Saliency Prediction with High Temporal Dimension Decoding | | 0 |
| Learning Saliency From Fixations | | 0 |
| Audio-visual Saliency for Omnidirectional Videos | | 0 |
| XAI-CLASS: Explanation-Enhanced Text Classification with Extremely Weak Supervision | | 0 |
| What Do Deep Saliency Models Learn about Visual Attention? | Code | 0 |
| UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection | | 0 |
| Spherical Vision Transformer for 360-degree Video Saliency Prediction | Code | 1 |
| Attention for Robot Touch: Tactile Saliency Prediction for Robust Sim-to-Real Tactile Control | | 0 |

Benchmark Results

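| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TranSalNet | KL | 0.87 | | Unverified |
| 2 | TempSAL | KL | 0.71 | | Unverified |
| 3 | SSwin transformer | KL | 0.65 | | Unverified |
| 4 | ECT-SAL | KL | 0.58 | | Unverified |
| 5 | SUM | KL | 0.47 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SUM | AUC | 0.88 | | Unverified |
| 2 | SalNAS-XL + Self-KD | AUC | 0.87 | | Unverified |
| 3 | TempSAL | AUC | 0.87 | | Unverified |
| 4 | MDS-ViTNet | AUC | 0.87 | | Unverified |
| 5 | TranSalNet | AUC | 0.87 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SUM | AUC-Judd | 0.91 | | Unverified |
| 2 | TranSalNet | AUC-Judd | 0.87 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | SUM | KL | 0.27 | | Unverified |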
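
For readers unfamiliar with the KL metric reported above, the following is a minimal sketch of how a Kullback-Leibler divergence between a predicted saliency map and a ground-truth fixation density could be computed. It assumes both maps are non-negative 2D lists of the same size and normalizes each to sum to 1; the function name `kl_divergence` and the `eps` stabilizer are illustrative choices. The exact evaluation protocol behind the listed scores is not stated here and may differ.

```python
import math


def kl_divergence(pred, target, eps=1e-12):
    """KL divergence of a predicted map from a ground-truth map.

    Assumption: both inputs are non-negative 2D lists of equal size.
    Each is normalized to a probability distribution before comparison.
    """
    p = [v for row in pred for v in row]
    t = [v for row in target for v in row]
    if len(p) != len(t):
        raise ValueError("maps must have the same number of pixels")
    sum_p, sum_t = sum(p), sum(t)
    if sum_p <= 0 or sum_t <= 0:
        raise ValueError("maps must contain positive mass")
    p = [v / sum_p for v in p]
    t = [v / sum_t for v in t]
    # eps keeps the logarithm finite where either map is zero.
    return sum(ti * math.log(eps + ti / (pi + eps)) for pi, ti in zip(p, t))


# Hypothetical 2x2 maps: all ground-truth mass on one pixel,
# compared against a uniform prediction and against itself.
target = [[1.0, 0.0], [0.0, 0.0]]
uniform = [[1.0, 1.0], [1.0, 1.0]]

print("uniform prediction:", kl_divergence(uniform, target))
print("perfect prediction:", kl_divergence(target, target))
```

Lower KL values indicate a prediction that is closer to the ground-truth distribution, whereas for AUC-style metrics higher values are better.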