SOTAVerified

Scene Classification

Scene Classification is a task in which scenes from photographs are categorically classified. Unlike object classification, which focuses on classifying prominent objects in the foreground, Scene Classification uses the layout of objects within the scene, in addition to the ambient context, for classification.

Source: Scene classification with Convolutional Neural Networks

Papers

Showing 251275 of 453 papers

TitleStatusHype
Sample Dropout for Audio Scene Classification Using Multi-Scale Dense Connected Convolutional Neural Network0
Satellite Image Scene Classification via ConvNet with Context Aggregation0
Scalable Multitask Representation Learning for Scene Classification0
Scenarios: A New Representation for Complex Scene Understanding0
Scene Categorization from Contours: Medial Axis Based Salience Measures0
Scene Classification in Indoor Environments for Robots using Context Based Word Embeddings0
Scene Classification With Semantic Fisher Vectors0
Scene Recognition with Objectness, Attribute and Category Learning0
Scene Retrieval for Contextual Visual Mapping0
SeasoNet: A Seasonal Scene Classification, segmentation and Retrieval dataset for satellite Imagery over Germany0
Second-Order Constrained Parametric Proposals and Sequential Search-Based Structured Prediction for Semantic Segmentation in RGB-D Images0
See More for Scene: Pairwise Consistency Learning for Scene Classification0
Self-Supervised In-Domain Representation Learning for Remote Sensing Image Scene Classification0
Self-supervised Learning of Audio Representations from Audio-Visual Data using Spatial Alignment0
Semantic Fisher Scores for Task Transfer: Using Objects to Classify Scenes0
Sensing Urban Land-Use Patterns By Integrating Google Tensorflow And Scene-Classification Models0
SGMNet: Scene Graph Matching Network for Few-Shot Remote Sensing Scene Classification0
Shape Distributions of Nonlinear Dynamical Systems for Video-based Inference0
Short-Term Memory Convolutions0
SOAT: A Scene- and Object-Aware Transformer for Vision-and-Language Navigation0
Solving Visual Madlibs with Multiple Cues0
Sparse Output Coding for Large-Scale Visual Recognition0
Spatio-Temporal Attention Pooling for Audio Scene Classification0
SpecAugment++: A Hidden Space Data Augmentation Method for Acoustic Scene Classification0
Squeeze-Excitation Convolutional Recurrent Neural Networks for Audio-Visual Scene Classification0
Show:102550
← PrevPage 11 of 19Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1µ2Net+ (ViT-L/16)Accuracy (%)100Unverified
2AGOSAccuracy (%)99.88Unverified
3LSE-NetAccuracy (%)99.78Unverified
4ResNet50Accuracy (%)99.61Unverified
5MSMatchAccuracy (%)98.33Unverified
6MIDC-NetAccuracy (%)97.4Unverified
#ModelMetricClaimedVerifiedStatus
1iSQRT-COV-Net (ResNet-50)Top 1 Error43.68Unverified
2WaveMixTop 1 Error43.55Unverified