SOTAVerified

Scene Classification

Scene Classification is the task of assigning a category to the scene shown in a photograph. Unlike object classification, which focuses on prominent objects in the foreground, Scene Classification uses the layout of objects within the scene, together with the ambient context, to decide the category.

Source: Scene classification with Convolutional Neural Networks

Papers

Showing 76–100 of 453 papers

Title | Status | Hype
SkyScript: A Large and Semantically Diverse Vision-Language Dataset for Remote Sensing | Code | 2
Language-Assisted 3D Scene Understanding | - | 0
CartoMark: a benchmark dataset for map pattern recognition and map content retrieval with machine intelligence | - | 0
GeoChat: Grounded Large Vision-Language Model for Remote Sensing | Code | 2
Cascade Learning Localises Discriminant Features in Visual Scene Classification | - | 0
AudioLog: LLMs-Powered Long Audio Logging with Hybrid Token-Semantic Contrastive Learning | Code | 0
SpectralGPT: Spectral Remote Sensing Foundation Model | Code | 2
CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision | Code | 1
Unlocking the capabilities of explainable fewshot learning in remote sensing | - | 0
Audio Event-Relational Graph Representation Learning for Acoustic Scene Classification | Code | 1
Bringing the Discussion of Minima Sharpness to the Audio Domain: a Filter-Normalised Evaluation for Acoustic Scene Classification | Code | 0
Locality-preserving Directions for Interpreting the Latent Space of Satellite Image GANs | - | 0
MVP: Meta Visual Prompt Tuning for Few-Shot Remote Sensing Image Scene Classification | - | 0
Decoupling Common and Unique Representations for Multimodal Self-supervised Learning | Code | 1
SOAR: Scene-debiasing Open-set Action Recognition | Code | 1
Universal Adversarial Defense in Remote Sensing Based on Pre-trained Denoising Diffusion Models | Code | 1
Visual Saliency Detection in Advanced Driver Assistance Systems | - | 0
Do humans and Convolutional Neural Networks attend to similar areas during scene classification: Effects of task and image type | - | 0
EnTri: Ensemble Learning with Tri-level Representations for Explainable Scene Recognition | - | 0
TbExplain: A Text-based Explanation Method for Scene Classification Models with the Statistical Prediction Correction | - | 0
In-Domain Self-Supervised Learning Improves Remote Sensing Image Scene Classification | - | 0
On Frequency-Wise Normalizations for Better Recording Device Generalization in Audio Spectrogram Transformers | - | 0
Domain Information Control at Inference Time for Acoustic Scene Classification | Code | 0
Acoustic Scene Clustering Using Joint Optimization of Deep Embedding Learning and Clustering Iteration | - | 0
Efficient Multi-Task Scene Analysis with RGB-D Transformers | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | µ2Net+ (ViT-L/16) | Accuracy (%) | 100 | - | Unverified
2 | AGOS | Accuracy (%) | 99.88 | - | Unverified
3 | LSE-Net | Accuracy (%) | 99.78 | - | Unverified
4 | ResNet50 | Accuracy (%) | 99.61 | - | Unverified
5 | MSMatch | Accuracy (%) | 98.33 | - | Unverified
6 | MIDC-Net | Accuracy (%) | 97.4 | - | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | iSQRT-COV-Net (ResNet-50) | Top 1 Error | 43.68 | - | Unverified
2 | WaveMix | Top 1 Error | 43.55 | - | Unverified