SOTAVerified

Saliency Prediction

A saliency map is a model that predicts eye fixations on a visual scene. Saliency prediction is informed by the human visual attention mechanism and predicts the possibility of the human eyes to stay in a certain position in the scene.

Papers

Showing 51100 of 268 papers

TitleStatusHype
CASE: Contrastive Activation for Saliency EstimationCode0
CSPENet: Contour-Aware and Saliency Priors Embedding Network for Infrared Small Target DetectionCode0
Modeling Saliency Dataset Bias0
LMME3DHF: Benchmarking and Evaluating Multimodal 3D Human Face Generation with LMMs0
Text-Audio-Visual-conditioned Diffusion Model for Video Saliency Prediction0
DTFSal: Audio-Visual Dynamic Token Fusion for Video Saliency Prediction0
A Deep Learning Framework for Visual Attention Prediction and Analysis of News InterfacesCode0
EditIQ: Automated Cinematic Editing of Static Wide-Angle Videos via Dialogue Interpretation and Saliency Cues0
Minimalistic Video Saliency Prediction via Efficient Decoder & Spatio Temporal Action Cues0
Explainable Saliency: Articulating Reasoning with Contextual Prioritization0
CASP: Consistency-aware Audio-induced Saliency Prediction Model for Omnidirectional Video0
Textured Mesh Saliency: Bridging Geometry and Texture for Human Perception in 3D GraphicsCode0
Relevance-guided Audio Visual Fusion for Video Saliency Prediction0
Correlation of Object Detection Performance with Visual Saliency and Depth EstimationCode0
CaRDiff: Video Salient Object Ranking Chain of Thought Reasoning for Saliency Prediction with Diffusion0
How Does Audio Influence Visual Attention in Omnidirectional Videos? Database and ModelCode0
Exploiting Change Blindness for Video Coding: Perspectives from a Less Promising User Study0
SalNAS: Efficient Saliency-prediction Neural Architecture Search with self-knowledge distillationCode0
Enhancing Saliency Prediction in Monitoring Tasks: The Role of Visual Highlights0
Bridging the Gap Between Saliency Prediction and Image Quality AssessmentCode0
Salient Object Detection From Arbitrary ModalitiesCode0
Unified Dynamic Scanpath Predictors Outperform Individually Trained Neural Models0
Shifting Focus with HCEye: Exploring the Dynamics of Visual Highlighting and Cognitive Load on User Attention and Saliency Prediction0
How is Visual Attention Influenced by Text Guidance? Database and ModelCode0
Separated Attention: An Improved Cycle GAN Based Under Water Image Enhancement Method0
SalFoM: Dynamic Saliency Prediction with Video Foundation Models0
GreenSaliency: A Lightweight and Efficient Image Saliency Detection Method0
Selective Attention-based Modulation for Continual LearningCode0
DiffGaze: A Diffusion Model for Continuous Gaze Sequence Generation on 360° Images0
Learning User Embeddings from Human Gaze for Personalised Saliency Prediction0
A Modified Word Saliency-Based Adversarial Attack on Text Classification Models0
DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction0
Transformer-based Video Saliency Prediction with High Temporal Dimension Decoding0
Learning Saliency From Fixations0
Audio-visual Saliency for Omnidirectional Videos0
XAI-CLASS: Explanation-Enhanced Text Classification with Extremely Weak Supervision0
What Do Deep Saliency Models Learn about Visual Attention?Code0
UniST: Towards Unifying Saliency Transformer for Video Saliency Prediction and Detection0
Attention for Robot Touch: Tactile Saliency Prediction for Robust Sim-to-Real Tactile Control0
Few-shot Personalized Saliency Prediction Based on Inter-personnel Gaze Patterns0
Teaching AI to Teach: Leveraging Limited Human Salience Data Into Unlimited Saliency-Based Training0
ViDaS Video Depth-aware Saliency Network0
A positive feedback method based on F-measure value for Salient Object DetectionCode0
MRGAN360: Multi-stage Recurrent Generative Adversarial Network for 360 Degree Image Saliency Prediction0
CASP-Net: Rethinking Video Saliency Prediction from an Audio-VisualConsistency Perceptual Perspective0
Deep Saliency Mapping for 3D Meshes and ApplicationsCode0
Semantic Segmentation Enhanced Transformer Model for Human Attention Prediction0
Co-Salient Object Detection With Uncertainty-Aware Group Exchange-Masking0
CASP-Net: Rethinking Video Saliency Prediction From an Audio-Visual Consistency Perceptual Perspective0
FBLNet: FeedBack Loop Network for Driver Attention Prediction0
Show:102550
← PrevPage 2 of 6Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1TransalnetKL0.87Unverified
2TempSALKL0.71Unverified
3SSwin transformerKL0.65Unverified
4ECT-SALKL0.58Unverified
5SUMKL0.47Unverified
#ModelMetricClaimedVerifiedStatus
1SUMAUC0.88Unverified
2SalNAS-XL + Self-KDAUC0.87Unverified
3TempSALAUC0.87Unverified
4MDS-ViTNetAUC0.87Unverified
5TranSalNetAUC0.87Unverified
#ModelMetricClaimedVerifiedStatus
1SUMAUC-Judd0.91Unverified
2TranSalNetAUC-Judd0.87Unverified
#ModelMetricClaimedVerifiedStatus
1SUMKL0.27Unverified