| On the Road with GPT-4V(ision): Early Explorations of Visual-Language Model on Autonomous Driving | Nov 9, 2023 | Autonomous Driving, Common Sense Reasoning | Code Available | 2 |
| Omnivore: A Single Model for Many Visual Modalities | Jan 20, 2022 | Action Classification, Action Recognition | Code Available | 2 |
| An Empirical Study of Remote Sensing Pretraining | Apr 6, 2022 | Aerial Scene Classification, Building change detection for remote sensing images | Code Available | 2 |
| A Study of Face Obfuscation in ImageNet | Mar 10, 2021 | Attribute, Object | Code Available | 1 |
| MultiScene: A Large-scale Dataset and Benchmark for Multi-scene Recognition in Single Aerial Images | Apr 7, 2021 | Learning with noisy labels, Scene Recognition | Code Available | 1 |
| Cross-Task Transfer for Geotagged Audiovisual Aerial Scene Recognition | May 18, 2020 | Scene Recognition | Code Available | 1 |
| Stochastic Partial Swap: Enhanced Model Generalization and Interpretability for Fine-Grained Recognition | Jan 1, 2021 | Material Recognition, Scene Recognition | Code Available | 1 |
| PIR: Remote Sensing Image-Text Retrieval with Prior Instruction Representation Learning | May 16, 2024 | Image-text Retrieval, Representation Learning | Code Available | 1 |
| A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval | Oct 27, 2023 | Cross-Modal Retrieval, Image-text Retrieval | Code Available | 1 |
| Visual Memorability for Robotic Interestingness via Unsupervised Online Learning | May 18, 2020 | Decision Making, Incremental Learning | Code Available | 1 |
| NuScenes-MQA: Integrated Evaluation of Captions and QA for Autonomous Driving Datasets using Markup Annotations | Dec 11, 2023 | Autonomous Driving, Descriptive | Code Available | 1 |
| BORM: Bayesian Object Relation Model for Indoor Scene Recognition | Aug 1, 2021 | Object, Relation | Code Available | 1 |
| Indoor Scene Recognition in 3D | Feb 28, 2020 | 3D geometry, Multi-Task Learning | Code Available | 1 |
| Bidirectional Projection Network for Cross Dimension Scene Understanding | Mar 26, 2021 | 2D Semantic Segmentation, 3D Semantic Segmentation | Code Available | 1 |
| NarrativeXL: A Large-scale Dataset For Long-Term Memory Models | May 23, 2023 | Multiple-choice, Reading Comprehension | Code Available | 1 |
| MovieCLIP: Visual Scene Recognition in Movies | Oct 20, 2022 | Genre classification, Scene Recognition | Code Available | 1 |
| Object-to-Scene: Learning to Transfer Object Knowledge to Indoor Scene Recognition | Aug 1, 2021 | Object, Scene Recognition | Code Available | 1 |
| NEVIS'22: A Stream of 100 Tasks Sampled from 30 Years of Computer Vision Research | Nov 15, 2022 | Continual Learning, Diversity | Code Available | 1 |
| Unsupervised Model Personalization while Preserving Privacy and Scalability: An Open Problem | Mar 30, 2020 | Continual Learning, Domain Adaptation | Code Available | 1 |
| Where in the World is this Image? Transformer-based Geo-localization in the Wild | Apr 29, 2022 | Diversity, geo-localization | Code Available | 1 |
| CoMAE: Single Model Hybrid Pre-training on Small-Scale RGB-D Datasets | Feb 13, 2023 | Contrastive Learning, Representation Learning | Code Available | 1 |
| When CNNs Meet Random RNNs: Towards Multi-Level Analysis for RGB-D Object and Scene Recognition | Apr 26, 2020 | Object Recognition, Scene Recognition | Code Available | 1 |
| Self-supervised Video Representation Learning by Uncovering Spatio-temporal Statistics | Aug 31, 2020 | Action Recognition, Representation Learning | Code Available | 1 |
| Deep Attentional Structured Representation Learning for Visual Recognition | May 14, 2018 | Representation Learning, Scene Recognition | Code Available | 1 |
| A Bag of Visual Words Approach for Symbols-Based Coarse-Grained Ancient Coin Classification | Apr 23, 2013 | General Classification, Scene Recognition | Unverified | 0 |
| Attentional Graph Convolutional Network for Structure-aware Audio-Visual Scene Classification | Dec 31, 2022 | Scene Classification, Scene Recognition | Unverified | 0 |
| A Survey of Vision Transformers in Autonomous Driving: Current Trends and Future Directions | Mar 12, 2024 | Autonomous Driving, Decoder | Unverified | 0 |
| Amodal Completion and Size Constancy in Natural Scenes | Sep 27, 2015 | Object, object-detection | Unverified | 0 |
| Audio Event and Scene Recognition: A Unified Approach using Strongly and Weakly Labeled Data | Nov 12, 2016 | Scene Recognition, Weakly-supervised Learning | Unverified | 0 |
| Bags of Spacetime Energies for Dynamic Scene Recognition | Jun 1, 2014 | General Classification, Scene Recognition | Unverified | 0 |
| Effective Data Augmentation with Multi-Domain Learning GANs | Dec 25, 2019 | Data Augmentation, General Classification | Unverified | 0 |
| A Supervised Neural Autoregressive Topic Model for Simultaneous Image Classification and Annotation | May 23, 2013 | General Classification, image-classification | Unverified | 0 |
| Digital Divides in Scene Recognition: Uncovering Socioeconomic Biases in Deep Learning Systems | Jan 23, 2024 | Scene Classification, Scene Recognition | Unverified | 0 |
| Adaptive Active Learning for Image Classification | Jun 1, 2013 | Active Learning, Classification | Unverified | 0 |
| A Robust Indoor Scene Recognition Method based on Sparse Representation | Aug 24, 2017 | Scene Recognition | Unverified | 0 |
| Developing efficient transfer learning strategies for robust scene recognition in mobile robotics using pre-trained convolutional neural networks | Jul 23, 2021 | Data Augmentation, Inference Optimization | Unverified | 0 |
| Discriminative Multi-Modal Feature Fusion for RGBD Indoor Scene Recognition | Jun 1, 2016 | Image Segmentation, Object Recognition | Unverified | 0 |
| Effectiveness of Function Matching in Driving Scene Recognition | Aug 20, 2022 | Autonomous Driving, image-classification | Unverified | 0 |
| ConceptLearner: Discovering Visual Concepts from Weakly Labeled Image Collections | Nov 19, 2014 | object-detection, Object Detection | Unverified | 0 |
| A Novel Locally Linear KNN Model for Visual Recognition | Jun 1, 2015 | Action Recognition, Density Estimation | Unverified | 0 |
| ALA: Naturalness-aware Adversarial Lightness Attack | Jan 16, 2022 | Adversarial Attack, Denoising | Unverified | 0 |
| Coarse-to-fine Task-driven Inpainting for Geoscience Images | Nov 20, 2022 | Data Augmentation, Decoder | Unverified | 0 |
| CNN-LTE: a Class of 1-X Pooling Convolutional Neural Networks on Label Tree Embeddings for Audio Scene Recognition | Jul 8, 2016 | Scene Recognition | Unverified | 0 |
| Combining Multiple Cues for Visual Madlibs Question Answering | Nov 1, 2016 | Attribute, General Classification | Unverified | 0 |
| A Novel Feature Extraction Method for Scene Recognition Based on Centered Convolutional Restricted Boltzmann Machines | Jun 24, 2015 | Object Recognition, Scene Recognition | Unverified | 0 |
| Context-Aware Multi-Task Learning for Traffic Scene Recognition in Autonomous Vehicles | Apr 3, 2020 | Autonomous Vehicles, Multi-Task Learning | Unverified | 0 |
| Contrastive Visual Data Augmentation | Feb 24, 2025 | Data Augmentation, Novel Concepts | Unverified | 0 |
| ConvNets vs. Transformers: Whose Visual Representations are More Transferable? | Aug 11, 2021 | Classification, Depth Estimation | Unverified | 0 |
| ASK: Adaptively Selecting Key Local Features for RGB-D Scene Recognition | Oct 14, 2021 | feature selection, Scene Classification | Unverified | 0 |
| A Discriminative Representation of Convolutional Features for Indoor Scene Recognition | Jun 17, 2015 | Object, Object Recognition | Unverified | 0 |