Lightweight Pixel Difference Networks for Efficient Visual Representation Learning Feb 1, 2024 Edge Detection Object Recognition
Code Code Available 4RTMDet: An Empirical Study of Designing Real-Time Object Detectors Dec 14, 2022 GPU Instance Segmentation
Code Code Available 4Detectron2 Object Detection & Manipulating Images using Cartoonization Aug 1, 2021 Autonomous Vehicles Data Visualization
Code Code Available 4Interactive Medical Image Segmentation: A Benchmark Dataset and Baseline Nov 19, 2024 Image Segmentation Interactive Segmentation
Code Code Available 3UniBench: Visual Reasoning Requires Rethinking Vision-Language Beyond Scaling Aug 9, 2024 GPU Language Modeling
Code Code Available 3pix2gestalt: Amodal Segmentation by Synthesizing Wholes Jan 25, 2024 3D Reconstruction Object Recognition
Code Code Available 3DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models Feb 8, 2022 Diagnostic Image Captioning
Code Code Available 3Datasets: A Community Library for Natural Language Processing Sep 7, 2021 Image Classification Object Recognition
Code Code Available 3InstructSAM: A Training-Free Framework for Instruction-Oriented Remote Sensing Object Recognition May 21, 2025 Earth Observation Object
Code Code Available 2Taccel: Scaling Up Vision-based Tactile Robotics via High-performance GPU Simulation Apr 17, 2025 GPU Object Recognition
Code Code Available 2P2Object: Single Point Supervised Object Detection and Instance Segmentation Apr 10, 2025 Instance Segmentation Multiple Instance Learning
Code Code Available 2NUDT4MSTAR: A Large Dataset and Benchmark Towards Remote Sensing Object Recognition in the Wild Jan 23, 2025 Earth Observation Object Recognition
Code Code Available 2MG-LLaVA: Towards Multi-Granularity Visual Instruction Tuning Jun 25, 2024 Object Object Recognition
Code Code Available 2StableSemantics: A Synthetic Language-Vision Dataset of Semantic Representations in Naturalistic Images Jun 19, 2024 Object Recognition Scene Understanding
Code Code Available 2Is CLIP the main roadblock for fine-grained open-world perception? Apr 4, 2024 Autonomous Driving Novel Concepts
Code Code Available 2Lifting Multi-View Detection and Tracking to the Bird's Eye View Mar 19, 2024 3D Object Recognition Multi-Object Tracking
Code Code Available 2Local Feature Matching Using Deep Learning: A Survey Jan 31, 2024 3D Reconstruction Deep Learning
Code Code Available 2Seeing the roads through the trees: A benchmark for modeling spatial dependencies with aerial imagery Jan 12, 2024 Object Recognition Road Segmentation
Code Code Available 2Towards Language Models That Can See: Computer Vision Through the LENS of Natural Language Jun 28, 2023 Descriptive Language Modeling
Code Code Available 2Roboflow 100: A Rich, Multi-Domain Object Detection Benchmark Nov 24, 2022 2D Object Detection Image Retrieval
Code Code Available 2The Equalization Losses: Gradient-Driven Training for Long-tailed Object Recognition Oct 11, 2022 image-classification Image Classification
Code Code Available 2Patchwork++: Fast and Robust Ground Segmentation Solving Partial Under-Segmentation Using 3D Point Cloud Jul 25, 2022 Object Recognition Segmentation
Code Code Available 2Omni3D: A Large Benchmark and Model for 3D Object Detection in the Wild Jul 21, 2022 3D Object Detection 3D Object Detection From Monocular Images
Code Code Available 2HAKE: A Knowledge Engine Foundation for Human Activity Understanding Feb 14, 2022 Action Recognition Human-Object Interaction Detection
Code Code Available 2A Simple Episodic Linear Probe Improves Visual Recognition in the Wild Jan 1, 2022 Fine-Grained Image Classification Image Classification
Code Code Available 2Learning Transferable Visual Models From Natural Language Supervision Feb 26, 2021 Action Recognition Benchmarking
Code Code Available 2Sparse R-CNN: End-to-End Object Detection with Learnable Proposals Nov 25, 2020 2D Object Detection Object
Code Code Available 2A Simple Framework for Contrastive Learning of Visual Representations Feb 13, 2020 Contrastive Learning Image Classification
Code Code Available 2SceneGraphNet: Neural Message Passing for 3D Indoor Scene Augmentation Jul 25, 2019 3D Object Recognition Object Recognition
Code Code Available 2GCNet: Non-local Networks Meet Squeeze-Excitation Networks and Beyond Apr 25, 2019 Instance Segmentation Object Detection
Code Code Available 2Hypergraph Neural Networks Sep 25, 2018 Object Recognition Representation Learning
Code Code Available 2Some Improvements on Deep Convolutional Neural Network Based Image Classification Dec 19, 2013 Classification General Classification
Code Code Available 2STSBench: A Spatio-temporal Scenario Benchmark for Multi-modal Large Language Models in Autonomous Driving Jun 6, 2025 Autonomous Driving Autonomous Vehicles
Code Code Available 1Benchmarking Multimodal Mathematical Reasoning with Explicit Visual Dependency Apr 24, 2025 Benchmarking Math
Code Code Available 1Spatial457: A Diagnostic Benchmark for 6D Spatial Reasoning of Large Multimodal Models Feb 12, 2025 Attribute Diagnostic
Code Code Available 1Comprehensive Multi-Modal Prototypes are Simple and Effective Classifiers for Vast-Vocabulary Object Detection Dec 23, 2024 object-detection Object Detection
Code Code Available 1CREST: An Efficient Conjointly-trained Spike-driven Framework for Event-based Object Detection Exploiting Spatiotemporal Dynamics Dec 17, 2024 Object object-detection
Code Code Available 1WiseAD: Knowledge Augmented End-to-End Autonomous Driving with Vision-Language Model Dec 13, 2024 Autonomous Driving Decision Making
Code Code Available 1Expanding Event Modality Applications through a Robust CLIP-Based Encoder Dec 4, 2024 Few-Shot Learning Object Recognition
Code Code Available 1LRSAA: Large-scale Remote Sensing Image Target Recognition and Automatic Annotation Nov 24, 2024 Ensemble Learning Object
Code Code Available 1Leveraging MLLM Embeddings and Attribute Smoothing for Compositional Zero-Shot Learning Nov 18, 2024 Attribute Compositional Zero-Shot Learning
Code Code Available 1Large-scale Remote Sensing Image Target Recognition and Automatic Annotation Nov 12, 2024 Ensemble Learning Object
Code Code Available 1MomentumSMoE: Integrating Momentum into Sparse Mixture of Experts Oct 18, 2024 Language Modeling Language Modelling
Code Code Available 1DaWin: Training-free Dynamic Weight Interpolation for Robust Adaptation Oct 3, 2024 Multi-Task Learning Object Recognition
Code Code Available 1CSIM: A Copula-based similarity index sensitive to local changes for Image quality assessment Oct 2, 2024 Astronomy Image Quality Assessment
Code Code Available 1Category-Prompt Refined Feature Learning for Long-Tailed Multi-Label Image Classification Aug 15, 2024 image-classification Image Classification
Code Code Available 1On the Element-Wise Representation and Reasoning in Zero-Shot Image Recognition: A Systematic Survey Aug 9, 2024 Object Recognition
Code Code Available 1MarvelOVD: Marrying Object Recognition and Vision-Language Models for Robust Open-Vocabulary Object Detection Jul 31, 2024 Language Modelling Object
Code Code Available 1Dual-Hybrid Attention Network for Specular Highlight Removal Jul 17, 2024 highlight removal Object Recognition
Code Code Available 1PartImageNet++ Dataset: Scaling up Part-based Models for Robust Recognition Jul 15, 2024 Adversarial Robustness Inductive Bias
Code Code Available 1