SOTAVerified

Scene Classification

Scene Classification is a task in which scenes from photographs are categorically classified. Unlike object classification, which focuses on classifying prominent objects in the foreground, Scene Classification uses the layout of objects within the scene, in addition to the ambient context, for classification.

Source: Scene classification with Convolutional Neural Networks

Papers

Showing 51–100 of 453 papers

| Title | Status | Hype |
|---|---|---|
| A Label Propagation Strategy for CutMix in Multi-Label Remote Sensing Image Classification | | 0 |
| Data-Efficient Low-Complexity Acoustic Scene Classification in the DCASE 2024 Challenge | Code | 0 |
| Dual-Branch Network for Portrait Image Quality Assessment | Code | 1 |
| Deep Space Separable Distillation for Lightweight Acoustic Scene Classification | | 0 |
| A Toolchain for Comprehensive Audio/Video Analysis Using Deep Learning Based Multimodal Approach (A use case of riot or violent context detection) | | 0 |
| Visual and audio scene classification for detecting discrepancies in video: a baseline method and experimental protocol | Code | 0 |
| Pretraining Billion-scale Geospatial Foundational Models on Frontier | | 0 |
| Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification | | 0 |
| Bridging Remote Sensors with Multisensor Geospatial Foundation Models | Code | 2 |
| VHM: Versatile and Honest Vision Language Model for Remote Sensing Image Analysis | Code | 2 |
| RSMamba: Remote Sensing Image Classification with State Space Model | Code | 3 |
| Neural Embedding Compression For Efficient Multi-Task Earth Observation Modelling | Code | 0 |
| Leveraging feature communication in federated learning for remote sensing image classification | | 0 |
| MTP: Advancing Remote Sensing Foundation Model via Multi-Task Pretraining | Code | 3 |
| Leveraging Self-Supervised Learning for Scene Classification in Child Sexual Abuse Imagery | | 0 |
| Comparing Importance Sampling Based Methods for Mitigating the Effect of Class Imbalance | Code | 0 |
| Efficient Multi-Resolution Fusion for Remote Sensing Data with Label Uncertainty | Code | 0 |
| Description on IEEE ICME 2024 Grand Challenge: Semi-supervised Acoustic Scene Classification under Domain Shift | Code | 1 |
| Deep Semantic-Visual Alignment for Zero-Shot Remote Sensing Image Scene Classification | Code | 1 |
| Knowledge-Aware Neuron Interpretation for Scene Classification | | 0 |
| Bayesian adaptive learning to latent variables via Variational Bayes and Maximum a Posteriori | | 0 |
| Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems | | 0 |
| A Volumetric Saliency Guided Image Summarization for RGB-D Indoor Scene Classification | | 0 |
| Generic Knowledge Boosted Pre-training For Remote Sensing Images | Code | 1 |
| Kronecker Product Feature Fusion for Convolutional Neural Network in Remote Sensing Scene Classification | | 0 |
| SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing | Code | 2 |
| Language-Assisted 3D Scene Understanding | | 0 |
| CartoMark: a benchmark dataset for map pattern recognition and map content retrieval with machine intelligence | | 0 |
| GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Code | 2 |
| Cascade Learning Localises Discriminant Features in Visual Scene Classification | | 0 |
| AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning | Code | 0 |
| SpectralGPT: Spectral Remote Sensing Foundation Model | Code | 2 |
| CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision | Code | 1 |
| Unlocking the capabilities of explainable fewshot learning in remote sensing | | 0 |
| Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification | Code | 1 |
| Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification | Code | 0 |
| Locality-preserving Directions for Interpreting the Latent Space of Satellite Image GANs | | 0 |
| MVP: Meta Visual Prompt Tuning for Few-Shot Remote Sensing Image Scene Classification | | 0 |
| Decoupling Common and Unique Representations for Multimodal Self-supervised Learning | Code | 1 |
| SOAR: Scene-debiasing Open-set Action Recognition | Code | 1 |
| Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models | Code | 1 |
| Visual Saliency Detection in Advanced Driver Assistance Systems | | 0 |
| Do humans and Convolutional Neural Networks attend to similar areas during scene classification: Effects of task and image type | | 0 |
| EnTri: Ensemble Learning with Tri-level Representations for Explainable Scene Recognition | | 0 |
| TbExplain: A Text-based Explanation Method for Scene Classification Models with the Statistical Prediction Correction | | 0 |
| In-Domain Self-Supervised Learning Improves Remote Sensing Image Scene Classification | | 0 |
| On Frequency-Wise Normalizations for Better Recording Device Generalization in Audio Spectrogram Transformers | | 0 |
| Domain Information Control at Inference Time for Acoustic Scene Classification | Code | 0 |
| Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration | | 0 |
| Efficient Multi-Task Scene Analysis with RGB-D Transformers | Code | 1 |

Benchmark Results

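| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | µ2Net+ (ViT-L/16) | Accuracy (%) | 100 | | Unverified |
| 2 | AGOS | Accuracy (%) | 99.88 | | Unverified |
| 3 | LSE-Net | Accuracy (%) | 99.78 | | Unverified |
| 4 | ResNet50 | Accuracy (%) | 99.61 | | Unverified |
| 5 | MSMatch | Accuracy (%) | 98.33 | | Unverified |
| 6 | MIDC-Net | Accuracy (%) | 97.4 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | iSQRT-COV-Net (ResNet-50) | Top 1 Error | 43.68 | | Unverified |
| 2 | WaveMix | Top 1 Error | 43.55 | | Unverified |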
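
The two metrics reported above can be related with a short sketch. It assumes that "Top 1 Error" is expressed in percent and equals 100 minus top-1 accuracy; the page itself does not define either metric. The function names and the toy labels are hypothetical and are not taken from any listed paper or benchmark.

```python
from typing import Sequence


def top1_accuracy(predicted: Sequence[int], actual: Sequence[int]) -> float:
    """Percentage of samples whose predicted scene label equals the true label."""
    if len(predicted) != len(actual) or len(actual) == 0:
        raise ValueError("label sequences must be non-empty and of equal length")
    correct = sum(p == a for p, a in zip(predicted, actual))
    return 100.0 * correct / len(actual)


def top1_error(predicted: Sequence[int], actual: Sequence[int]) -> float:
    """Assumed definition: top-1 error (%) = 100 - top-1 accuracy (%)."""
    return 100.0 - top1_accuracy(predicted, actual)


# Hypothetical labels for five images (0 = beach, 1 = forest, 2 = kitchen).
predicted_labels = [0, 1, 2, 2, 1]
true_labels = [0, 1, 2, 0, 1]

accuracy = top1_accuracy(predicted_labels, true_labels)
error = top1_error(predicted_labels, true_labels)
print(f"accuracy = {accuracy:.1f}%, top-1 error = {error:.1f}%")
```

Under this assumption, a higher accuracy and a lower top-1 error both indicate a better model, so the two tables are ranked in opposite numeric directions.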