| Enhancing Novel Object Detection via Cooperative Foundational Models | Nov 19, 2023 | Novel Class DiscoveryNovel Object Detection | CodeCode Available | 1 |
| Expanding Scene Graph Boundaries: Fully Open-vocabulary Scene Graph Generation via Visual-Concept Alignment and Retention | Nov 18, 2023 | Concept AlignmentGraph Generation | CodeCode Available | 1 |
| Point Cloud Self-supervised Learning via 3D to Multi-view Masked Autoencoder | Nov 17, 2023 | 3D Object Classification3D Object Detection | CodeCode Available | 1 |
| Overcoming Data Scarcity in Biomedical Imaging with a Foundational Multi-Task Model | Nov 16, 2023 | Multi-Task Learningobject-detection | CodeCode Available | 1 |
| CD-COCO: A Versatile Complex Distorted COCO Database for Scene-Context-Aware Computer Vision | Nov 12, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| Florence-2: Advancing a Unified Representation for a Variety of Vision Tasks | Nov 10, 2023 | DiversityMulti-Task Learning | CodeCode Available | 1 |
| Linear Gaussian Bounding Box Representation and Ring-Shaped Rotated Convolution for Oriented Object Detection | Nov 9, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| DAMEX: Dataset-aware Mixture-of-Experts for visual understanding of mixture-of-datasets | Nov 8, 2023 | Mixture-of-Expertsobject-detection | CodeCode Available | 1 |
| Meta-Adapter: An Online Few-shot Learner for Vision-Language Model | Nov 7, 2023 | Few-Shot Learningimage-classification | CodeCode Available | 1 |
| Instruct Me More! Random Prompting for Visual In-Context Learning | Nov 7, 2023 | Foreground SegmentationIn-Context Learning | CodeCode Available | 1 |
| Inner-IoU: More Effective Intersection over Union Loss with Auxiliary Bounding Box | Nov 6, 2023 | Object Detectionregression | CodeCode Available | 1 |
| NeuSyRE: Neuro-Symbolic Visual Understanding and Reasoning Framework based on Scene Graph Enrichment | Nov 5, 2023 | Caption GenerationCommon Sense Reasoning | CodeCode Available | 1 |
| Adapting Segment Anything Model (SAM) through Prompt-based Learning for Enhanced Protein Identification in Cryo-EM Micrographs | Nov 4, 2023 | Image Segmentationobject-detection | CodeCode Available | 1 |
| Proposal-Level Unsupervised Domain Adaptation for Open World Unbiased Detector | Nov 4, 2023 | Domain AdaptationIncremental Learning | CodeCode Available | 1 |
| Bridging the Gap between Multi-focus and Multi-modal: A Focused Integration Framework for Multi-modal Image Fusion | Nov 3, 2023 | Depth Estimationobject-detection | CodeCode Available | 1 |
| Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection | Nov 3, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Patch-based Selection and Refinement for Early Object Detection | Nov 3, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Effective Human-AI Teams via Learned Natural Language Rules and Onboarding | Nov 2, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images | Nov 2, 2023 | Anomaly DetectionImage Classification | CodeCode Available | 1 |
| Ultra-Efficient On-Device Object Detection on AI-Integrated Smart Glasses with TinyissimoYOLO | Nov 2, 2023 | BenchmarkingEdge-computing | CodeCode Available | 1 |
| Recognize Any Regions | Nov 2, 2023 | object-detectionObject Detection | CodeCode Available | 1 |
| TPSeNCE: Towards Artifact-Free Realistic Rain Generation for Deraining and Object Detection in Rain | Nov 1, 2023 | Contrastive LearningImage-to-Image Translation | CodeCode Available | 1 |
| Re-Scoring Using Image-Language Similarity for Few-Shot Object Detection | Nov 1, 2023 | ClassificationFew-Shot Object Detection | CodeCode Available | 1 |
| HEDNet: A Hierarchical Encoder-Decoder Network for 3D Object Detection in Point Clouds | Oct 31, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Oct 30, 2023 | DenoisingGPU | CodeCode Available | 1 |
| RGB-X Object Detection via Scene-Specific Fusion Modules | Oct 30, 2023 | Autonomous VehiclesMultispectral Object Detection | CodeCode Available | 1 |
| A High-Resolution Dataset for Instance Detection with Multi-View Instance Capture | Oct 30, 2023 | 8kObject | CodeCode Available | 1 |
| HSIC-based Moving WeightAveraging for Few-Shot Open-Set Object Detection | Oct 27, 2023 | Few Shot Open Set Object Detectionobject-detection | CodeCode Available | 1 |
| IndustReal: A Dataset for Procedure Step Recognition Handling Execution Errors in Egocentric Videos in an Industrial-Like Setting | Oct 26, 2023 | Action RecognitionObject Detection | CodeCode Available | 1 |
| Navigating Data Heterogeneity in Federated Learning A Semi-Supervised Federated Object Detection | Oct 26, 2023 | Autonomous DrivingFederated Learning | CodeCode Available | 1 |
| LP-OVOD: Open-Vocabulary Object Detection by Linear Probing | Oct 26, 2023 | Objectobject-detection | CodeCode Available | 1 |
| CoDet: Co-Occurrence Guided Region-Word Alignment for Open-Vocabulary Object Detection | Oct 25, 2023 | Objectobject-detection | CodeCode Available | 1 |
| GUPNet++: Geometry Uncertainty Propagation Network for Monocular 3D Object Detection | Oct 24, 2023 | 3D Object DetectionMonocular 3D Object Detection | CodeCode Available | 1 |
| Salient Object Detection in RGB-D Videos | Oct 24, 2023 | AttributeObject | CodeCode Available | 1 |
| Safe Navigation: Training Autonomous Vehicles using Deep Reinforcement Learning in CARLA | Oct 23, 2023 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| OV-VG: A Benchmark for Open-Vocabulary Visual Grounding | Oct 22, 2023 | Novel Conceptsobject-detection | CodeCode Available | 1 |
| Multimodal Transformer Using Cross-Channel attention for Object Detection in Remote Sensing Images | Oct 21, 2023 | Earth ObservationObject | CodeCode Available | 1 |
| ScalableMap: Scalable Map Learning for Online Long-Range Vectorized HD Map Construction | Oct 20, 2023 | 3D Lane Detectionobject-detection | CodeCode Available | 1 |
| EarlyBird: Early-Fusion for Multi-View Tracking in the Bird's Eye View | Oct 20, 2023 | 3D Object DetectionMulti-Object Tracking | CodeCode Available | 1 |
| Zone Evaluation: Revealing Spatial Bias in Object Detection | Oct 20, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Multi‑camera trajectory matching based on hierarchical clustering and constraints | Oct 19, 2023 | AttributeAutonomous Driving | CodeCode Available | 1 |
| MonoSKD: General Distillation Framework for Monocular 3D Object Detection via Spearman Correlation Coefficient | Oct 17, 2023 | 3D Object DetectionGPU | CodeCode Available | 1 |
| Towards Generalizable Multi-Camera 3D Object Detection via Perspective Debiasing | Oct 17, 2023 | 3D Object DetectionDomain Adaptation | CodeCode Available | 1 |
| RoboLLM: Robotic Vision Tasks Grounded on Multimodal Large Language Models | Oct 16, 2023 | Instance SegmentationModel Selection | CodeCode Available | 1 |
| Open-CRB: Towards Open World Active Learning for 3D Object Detection | Oct 16, 2023 | 3D Object DetectionActive Learning | CodeCode Available | 1 |
| RefConv: Re-parameterized Refocusing Convolution for Powerful ConvNets | Oct 16, 2023 | image-classificationImage Classification | CodeCode Available | 1 |
| Rank-DETR for High Quality Object Detection | Oct 13, 2023 | Objectobject-detection | CodeCode Available | 1 |
| Relational Prior Knowledge Graphs for Detection and Instance Segmentation | Oct 11, 2023 | Instance SegmentationKnowledge Graphs | CodeCode Available | 1 |
| Dual Radar: A Multi-modal Dataset with Dual 4D Radar for Autonomous Driving | Oct 11, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| EViT: An Eagle Vision Transformer with Bi-Fovea Self-Attention | Oct 10, 2023 | Computational Efficiencyimage-classification | CodeCode Available | 1 |