SOTAVerified

Scene Classification

Scene Classification is a task in which scenes from photographs are categorically classified. Unlike object classification, which focuses on classifying prominent objects in the foreground, Scene Classification uses the layout of objects within the scene, in addition to the ambient context, for classification.

Source: Scene classification with Convolutional Neural Networks

Papers

Showing 51100 of 453 papers

TitleStatusHype
A Two-Stage Approach to Device-Robust Acoustic Scene ClassificationCode1
ApproxDet: Content and Contention-Aware Approximate Object Detection for MobilesCode1
What Can You Learn from Your Muscles? Learning Visual Representation from Human InteractionsCode1
MLRSNet: A Multi-label High Spatial Resolution Remote Sensing Dataset for Semantic Scene UnderstandingCode1
DCASENET: A joint pre-trained deep neural network for detecting and classifying acoustic scenes and eventsCode1
Understanding the Role of Individual Units in a Deep Neural NetworkCode1
Scene-Graph Augmented Data-Driven Risk Assessment of Autonomous Vehicle DecisionsCode1
CITISEN: A Deep Learning-Based Speech Signal-Processing Mobile ApplicationCode1
Device-Robust Acoustic Scene Classification Based on Two-Stage Categorization and Data AugmentationCode1
On Creating Benchmark Dataset for Aerial Image Interpretation: Reviews, Guidances and Million-AIDCode1
Emergent Properties of Foveated Perceptual SystemsCode1
Multi-Temporal Scene Classification and Scene Change Detection with Correlation based FusionCode1
Vision-based Fight Detection from Surveillance CamerasCode1
Emotion and Theme Recognition in Music with Frequency-Aware RF-Regularized CNNsCode1
Receptive-field-regularized CNN variants for acoustic scene classificationCode1
The Receptive Field as a Regularizer in Deep Convolutional Neural Networks for Acoustic Scene ClassificationCode1
Deep-Learning-Based Aerial Image Classification for Emergency Response Applications Using Unmanned Aerial VehiclesCode1
SEN12MS -- A Curated Dataset of Georeferenced Multi-Spectral Sentinel-1/2 Imagery for Deep Learning and Data FusionCode1
Deep CNNs Meet Global Covariance Pooling: Better Representation and GeneralizationCode1
Remote Sensing Image Scene Classification: Benchmark and State of the ArtCode1
AID: A Benchmark Dataset for Performance Evaluation of Aerial Scene ClassificationCode1
Towards Scalable and Generalizable Earth Observation Data Mining via Foundation Model Composition0
A Challenge to Build Neuro-Symbolic Video AgentsCode0
EarthSynth: Generating Informative Earth Observation with Diffusion Models0
Energy efficiency analysis of Spiking Neural Networks for space applications0
Minimizing Risk Through Minimizing Model-Data Interaction: A Protocol For Relying on Proxy Tasks When Designing Child Sexual Abuse Imagery Detection Models0
Low-Complexity Acoustic Scene Classification with Device Information in the DCASE 2025 ChallengeCode0
FrogDogNet: Fourier frequency Retained visual prompt Output Guidance for Domain Generalization of CLIP in Remote SensingCode0
FlexiMo: A Flexible Remote Sensing Foundation Model0
Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments0
Vanishing Depth: A Depth Adapter with Positional Depth Encoding for Generalized Image EncodersCode0
What Time Tells Us? An Explorative Study of Time Awareness Learned from Static Images0
Improving Acoustic Scene Classification with City Features0
Optimal Transport Adapter Tuning for Bridging Modality Gaps in Few-Shot Remote Sensing Scene Classification0
GeoRSMLLM: A Multimodal Large Language Model for Vision-Language Tasks in Geoscience and Remote Sensing0
MEET: A Million-Scale Dataset for Fine-Grained Geospatial Scene Classification with Zoom-Free Remote Sensing Imagery0
Creating a Good Teacher for Knowledge Distillation in Acoustic Scene Classification0
Dual Classification Head Self-training Network for Cross-scene Hyperspectral Image Classification0
Variational Bayesian Adaptive Learning of Deep Latent Variables for Acoustic Knowledge Transfer0
A Vision-Language Framework for Multispectral Scene Representation Using Language-Grounded Features0
Quantum-Enhanced Transformers for Robust Acoustic Scene Classification in IoT Environments0
Multi-Label Scene Classification in Remote Sensing Benefits from Image Super-Resolution0
Vision-Language Models for Autonomous Driving: CLIP-Based Dynamic Scene Understanding0
Enhancing Scene Classification in Cloudy Image Scenarios: A Collaborative Transfer Method with Information Regulation Mechanism using Optical Cloud-Covered and SAR Remote Sensing ImagesCode0
FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models0
Improving Acoustic Scene Classification in Low-Resource Conditions0
UniRS: Unifying Multi-temporal Remote Sensing Tasks through Vision Language ModelsCode0
How Certain are Uncertainty Estimates? Three Novel Earth Observation Datasets for Benchmarking Uncertainty Quantification in Machine Learning0
Multimodal Remote Sensing Scene Classification Using VLMs and Dual-Cross Attention NetworksCode0
Aerial Flood Scene Classification Using Fine-Tuned Attention-based Architecture for Flood-Prone Countries in South Asia0
Show:102550
← PrevPage 2 of 10Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1µ2Net+ (ViT-L/16)Accuracy (%)100Unverified
2AGOSAccuracy (%)99.88Unverified
3LSE-NetAccuracy (%)99.78Unverified
4ResNet50Accuracy (%)99.61Unverified
5MSMatchAccuracy (%)98.33Unverified
6MIDC-NetAccuracy (%)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1iSQRT-COV-Net (ResNet-50)Top 1 Error43.68Unverified
2WaveMixTop 1 Error43.55Unverified