| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene ClassificationBuilding change detection for remote sensing images | CodeCode Available | 2 | 5 |
| On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Nov 9, 2023 | Autonomous DrivingCommon Sense Reasoning | CodeCode Available | 2 | 5 |
| Omnivore: A Single Model for Many Visual Modalities | Jan 20, 2022 | Action ClassificationAction Recognition | CodeCode Available | 2 | 5 |
| A Study of Face Obfuscation in ImageNet | Mar 10, 2021 | AttributeObject | CodeCode Available | 1 | 5 |
| NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research | Nov 15, 2022 | Continual LearningDiversity | CodeCode Available | 1 | 5 |
| MultiScene: A Large-scale Dataset and Benchmark for Multi-scene Recognition in Single Aerial Images | Apr 7, 2021 | Learning with noisy labelsScene Recognition | CodeCode Available | 1 | 5 |
| BORM: Bayesian Object Relation Model for Indoor Scene Recognition | Aug 1, 2021 | ObjectRelation | CodeCode Available | 1 | 5 |
| Where in the World is this Image? Transformer-based Geo-localization in the Wild | Apr 29, 2022 | Diversitygeo-localization | CodeCode Available | 1 | 5 |
| Deep Attentional Structured Representation Learning for Visual Recognition | May 14, 2018 | Representation LearningScene Recognition | CodeCode Available | 1 | 5 |
| When CNNs Meet Random RNNs: Towards Multi-Level Analysis for RGB-D Object and Scene Recognition | Apr 26, 2020 | Object RecognitionScene Recognition | CodeCode Available | 1 | 5 |
| Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition | May 18, 2020 | Scene Recognition | CodeCode Available | 1 | 5 |
| Object-to-Scene: Learning to Transfer Object Knowledge to Indoor Scene Recognition | Aug 1, 2021 | ObjectScene Recognition | CodeCode Available | 1 | 5 |
| NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations | Dec 11, 2023 | Autonomous DrivingDescriptive | CodeCode Available | 1 | 5 |
| Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-Grained Recognition | Jan 1, 2021 | Material RecognitionScene Recognition | CodeCode Available | 1 | 5 |
| PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning | May 16, 2024 | Image-text RetrievalRepresentation Learning | CodeCode Available | 1 | 5 |
| Unsupervised Model Personalization while Preserving Privacy and Scalability: An Open Problem | Mar 30, 2020 | Continual LearningDomain Adaptation | CodeCode Available | 1 | 5 |
| Visual Memorability for Robotic Interestingness via Unsupervised Online Learning | May 18, 2020 | Decision MakingIncremental Learning | CodeCode Available | 1 | 5 |
| NarrativeXL: A Large-scale Dataset For Long-Term Memory Models | May 23, 2023 | Multiple-choiceReading Comprehension | CodeCode Available | 1 | 5 |
| CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets | Feb 13, 2023 | Contrastive LearningRepresentation Learning | CodeCode Available | 1 | 5 |
| Indoor Scene Recognition in 3D | Feb 28, 2020 | 3D geometryMulti-Task Learning | CodeCode Available | 1 | 5 |
| A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval | Oct 27, 2023 | Cross-Modal RetrievalImage-text Retrieval | CodeCode Available | 1 | 5 |
| Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics | Aug 31, 2020 | Action RecognitionRepresentation Learning | CodeCode Available | 1 | 5 |
| MovieCLIP: Visual Scene Recognition in Movies | Oct 20, 2022 | Genre classificationScene Recognition | CodeCode Available | 1 | 5 |
| Bidirectional Projection Network for Cross Dimension Scene Understanding | Mar 26, 2021 | 2D Semantic Segmentation3D Semantic Segmentation | CodeCode Available | 1 | 5 |
| A multiple-instance densely-connected ConvNet for aerial scene classification | Mar 3, 2020 | Aerial Scene ClassificationClassification | CodeCode Available | 0 | 5 |
| Aerial Scene Understanding in The Wild: Multi-Scene Recognition via Prototype-based Memory Networks | Apr 22, 2021 | RetrievalScene Recognition | CodeCode Available | 0 | 5 |
| Ambiguity-Aware Multi-Object Pose Optimization for Visually-Assisted Robot Manipulation | Nov 2, 2022 | 6D Pose Estimation using RGBObject | CodeCode Available | 0 | 5 |
| Advancing ALS Applications with Large-Scale Pre-training: Dataset Development and Downstream Assessment | Jan 9, 2025 | Scene RecognitionSelf-Supervised Learning | CodeCode Available | 0 | 5 |
| Local semantic enhanced convnet for aerial scene recognition | Jul 8, 2021 | Aerial Scene ClassificationImage Classification | CodeCode Available | 0 | 5 |
| Object Detectors Emerge in Deep Scene CNNs | Dec 22, 2014 | General ClassificationObject | CodeCode Available | 0 | 5 |
| All Grains, One Scheme (AGOS): Learning Multi-grain Instance Representation for Aerial Scene Classification | May 6, 2022 | Aerial Scene ClassificationAll | CodeCode Available | 0 | 5 |
| A Retention-Centric Framework for Continual Learning with Guaranteed Model Developmental Safety | Oct 4, 2024 | Autonomous DrivingContinual Learning | CodeCode Available | 0 | 5 |
| Knowledge Guided Disambiguation for Large-Scale Scene Classification with Multi-Resolution CNNs | Oct 4, 2016 | General ClassificationScene Classification | CodeCode Available | 0 | 5 |
| HalluciNet-ing Spatiotemporal Representations Using a 2D-CNN | Dec 10, 2019 | Action AnticipationAction Classification | CodeCode Available | 0 | 5 |
| Learning From Less Data: A Unified Data Subset Selection and Active Learning Framework for Computer Vision | Jan 3, 2019 | Active LearningBIG-bench Machine Learning | CodeCode Available | 0 | 5 |
| Empirical Analysis of Foundational Distinctions in Linked Open Data | Mar 26, 2018 | Common Sense ReasoningNatural Language Understanding | CodeCode Available | 0 | 5 |
| CNN Features off-the-shelf: an Astounding Baseline for Recognition | Mar 23, 2014 | AttributeGeneral Classification | CodeCode Available | 0 | 5 |
| Classifying Variable-Length Audio Files with All-Convolutional Networks and Masked Global Pooling | Jul 11, 2016 | Acoustic Scene ClassificationAll | CodeCode Available | 0 | 5 |
| AGA: Attribute-Guided Augmentation | Jul 1, 2017 | AttributeData Augmentation | CodeCode Available | 0 | 5 |
| From Volcano to Toyshop: Adaptive Discriminative Region Discovery for Scene Recognition | Jul 23, 2018 | Attributeimage-classification | CodeCode Available | 0 | 5 |
| DFPENet-geology: A Deep Learning Framework for High Precision Recognition and Segmentation of Co-seismic Landslides | Aug 28, 2019 | DecoderScene Recognition | CodeCode Available | 0 | 5 |
| DisasterNets: Embedding Machine Learning in Disaster Mapping | Jun 16, 2023 | AttributeChange Detection | CodeCode Available | 0 | 5 |
| Learning image representations tied to ego-motion | May 8, 2015 | Autonomous DrivingScene Recognition | CodeCode Available | 0 | 5 |
| Deep Filter Banks for Texture Recognition and Segmentation | Jun 1, 2015 | Material RecognitionScene Recognition | CodeCode Available | 0 | 5 |
| An embarrassingly simple comparison of machine learning algorithms for indoor scene classification | Sep 25, 2021 | BIG-bench Machine LearningClassification | CodeCode Available | 0 | 5 |
| InstaIndoor and Multi-modal Deep Learning for Indoor Scene Recognition | Dec 23, 2021 | BenchmarkingDeep Learning | CodeCode Available | 0 | 5 |
| Capsule Networks as Generative Models | Sep 6, 2022 | Scene Recognition | CodeCode Available | 0 | 5 |
| AGA: Attribute Guided Augmentation | Dec 8, 2016 | AttributeData Augmentation | CodeCode Available | 0 | 5 |
| Counting Manatee Aggregations using Deep Neural Networks and Anisotropic Gaussian Kernel | Nov 4, 2023 | Crowd CountingScene Recognition | CodeCode Available | 0 | 5 |
| Depth CNNs for RGB-D scene recognition: learning from scratch better than transferring from RGB-CNNs | Jan 21, 2018 | Scene Recognition | CodeCode Available | 0 | 5 |