Learning what and where to attend | May 22, 2018 | Diagnostic Image Categorization | Code Available | 1
Wavelet Convolutional Neural Networks | May 20, 2018 | General Classification image-classification | Code Available | 1
Dynamic Few-Shot Visual Learning without Forgetting | Apr 25, 2018 | Few-Shot Image Classification Few-Shot Learning | Code Available | 1
DeepScores -- A Dataset for Segmentation, Detection and Classification of Tiny Objects | Mar 27, 2018 | General Classification Object | Code Available | 1
Generalizable Data-free Objective for Crafting Universal Adversarial Perturbations | Jan 24, 2018 | Adversarial Attack Depth Estimation | Code Available | 1
Relation Networks for Object Detection | Nov 30, 2017 | Object object-detection | Code Available | 1
Distributed Deep Neural Networks over the Cloud, the Edge and End Devices | Sep 6, 2017 | Distributed Computing Object Recognition | Code Available | 1
Multiple Instance Detection Network with Online Instance Classifier Refinement | Apr 1, 2017 | Multiple Instance Learning Object | Code Available | 1
Evolving Deep Neural Networks | Mar 1, 2017 | Deep Learning Image Captioning | Code Available | 1
Densely Connected Convolutional Networks | Aug 25, 2016 | Breast Tumour Classification Classification | Code Available | 1
Deep Predictive Coding Networks for Video Prediction and Unsupervised Learning | May 25, 2016 | Object Recognition Video Prediction | Code Available | 1
Domain Generalization for Object Recognition with Multi-task Autoencoders | Aug 31, 2015 | Denoising Domain Generalization | Code Available | 1
Training Deep Neural Networks on Noisy Labels with Bootstrapping | Dec 20, 2014 | Emotion Recognition Object Recognition | Code Available | 1
Deep Gaze I: Boosting Saliency Prediction with Feature Maps Trained on ImageNet | Nov 4, 2014 | Object Recognition Point Processes | Code Available | 1
Going Deeper with Convolutions | Sep 17, 2014 | General Classification Image Classification | Code Available | 1
ImageNet Large Scale Visual Recognition Challenge | Sep 1, 2014 | General Classification image-classification | Code Available | 1
3D ShapeNets: A Deep Representation for Volumetric Shapes | Jun 22, 2014 | 3D Point Cloud Classification 3D Shape Representation | Code Available | 1
Microsoft COCO: Common Objects in Context | May 1, 2014 | Instance Segmentation Object | Code Available | 1
OverFeat: Integrated Recognition, Localization and Detection using Convolutional Networks | Dec 21, 2013 | General Classification Image Classification | Code Available | 1
Describing Textures in the Wild | Nov 14, 2013 | Material Recognition Object Recognition | Code Available | 1
Improving neural networks by preventing co-adaptation of feature detectors | Jul 3, 2012 | Image Classification Object Recognition | Code Available | 1
GeoMag: A Vision-Language Model for Pixel-level Fine-Grained Remote Sensing Image Parsing | Jul 8, 2025 | Language Modeling Language Modelling | Unverified | 0
Out-of-distribution detection in 3D applications: a review | Jul 1, 2025 | Autonomous Driving Navigate | Unverified | 0
SASep: Saliency-Aware Structured Separation of Geometry and Feature for Open Set Learning on Point Clouds | Jun 16, 2025 | 3D Object Recognition Object Recognition | Code Available | 0
Continual Hyperbolic Learning of Instances and Classes | Jun 12, 2025 | Continual Learning Object Recognition | Unverified | 0
DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects | Jun 11, 2025 | Depth Completion Depth Estimation | Unverified | 0
Aligning Text, Images, and 3D Structure Token-by-Token | Jun 9, 2025 | 3D Object Recognition Instruction Following | Unverified | 0
Feature-Based Lie Group Transformer for Real-World Applications | Jun 5, 2025 | Object Object Recognition | Unverified | 0
EV-Flying: an Event-based Dataset for In-The-Wild Recognition of Flying Objects | Jun 4, 2025 | Event-based vision Object Recognition | Unverified | 0
Explicitly Modeling Subcortical Vision with a Neuro-Inspired Front-End Improves CNN Robustness | Jun 3, 2025 | Data Augmentation Object Recognition | Unverified | 0
Efficient Estimation of Regularized Tyler's M-Estimator Using Approximate LOOCV | May 30, 2025 | Face Recognition Object Recognition | Unverified | 0
TrackVLA: Embodied Visual Tracking in the Wild | May 29, 2025 | Language Modeling Language Modelling | Unverified | 0
SHTOcc: Effective 3D Occupancy Prediction with Sparse Head and Tail Voxels | May 28, 2025 | Autonomous Driving GPU | Code Available | 0
ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting | May 26, 2025 | NeRF object-detection | Unverified | 0
Detailed Evaluation of Modern Machine Learning Approaches for Optic Plastics Sorting | May 22, 2025 | Instance Segmentation Object Recognition | Unverified | 0
Refining Neural Activation Patterns for Layer-Level Concept Discovery in Neural Network-Based Receivers | May 21, 2025 | Clustering Object Recognition | Unverified | 0
RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | May 21, 2025 | GPU Natural Language Queries | Unverified | 0
PLAICraft: Large-Scale Time-Aligned Vision-Speech-Action Dataset for Embodied AI | May 19, 2025 | Benchmarking Minecraft | Unverified | 0
Model alignment using inter-modal bridges | May 18, 2025 | Image Generation model | Unverified | 0
ViEEG: Hierarchical Neural Coding with Cross-Modal Progressive Enhancement for EEG-Based Visual Decoding | May 18, 2025 | Brain Decoding Contrastive Learning | Unverified | 0
A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision | May 16, 2025 | Large Language Model Navigate | Unverified | 0
AW-GATCN: Adaptive Weighted Graph Attention Convolutional Network for Event Camera Data Joint Denoising and Object Recognition | May 16, 2025 | Denoising Event Segmentation | Unverified | 0
MIRAGE: A Multi-modal Benchmark for Spatial Perception, Reasoning, and Intelligence | May 15, 2025 | Attribute Object | Unverified | 0
Improving Unsupervised Task-driven Models of Ventral Visual Stream via Relative Position Predictivity | May 13, 2025 | Contrastive Learning Object | Code Available | 0
Topology-Guided Knowledge Distillation for Efficient Point Cloud Processing | May 12, 2025 | 3D Object Recognition Autonomous Driving | Code Available | 0
Visually Interpretable Subtask Reasoning for Visual Question Answering | May 12, 2025 | Attribute Object Recognition | Code Available | 0
ArtRAG: Retrieval-Augmented Generation with Structured Context for Visual Art Understanding | May 9, 2025 | Image Captioning Object Recognition | Unverified | 0
Beyond Recognition: Evaluating Visual Perspective Taking in Vision Language Models | May 3, 2025 | Diagnostic Object Recognition | Unverified | 0
Transferable Adversarial Attacks on Black-Box Vision-Language Models | May 2, 2025 | Image Captioning Object Recognition | Unverified | 0
Zoomer: Adaptive Image Focus Optimization for Black-box MLLM | Apr 30, 2025 | Image Captioning Object Recognition | Unverified | 0