| Weak-to-Strong Compositional Learning from Generative Models for Language-based Object Detection | Jul 21, 2024 | Contrastive Learningobject-detection | —Unverified | 0 |
| Multiple Object Detection and Tracking in Panoramic Videos for Cycling Safety Analysis | Jul 21, 2024 | Multiple Object Trackingobject-detection | CodeCode Available | 1 |
| RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies | Jul 20, 2024 | 2D Object Detection3D Object Detection | —Unverified | 0 |
| A New Lightweight Hybrid Graph Convolutional Neural Network -- CNN Scheme for Scene Classification using Object Detection Inference | Jul 19, 2024 | Autonomous Vehiclesobject-detection | CodeCode Available | 0 |
| MLMT-CNN for Object Detection and Segmentation in Multi-layer and Multi-spectral Images | Jul 19, 2024 | object-detectionObject Detection | —Unverified | 0 |
| EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition | Jul 19, 2024 | Emotion Recognitionimage-classification | —Unverified | 0 |
| Bucketed Ranking-based Losses for Efficient Training of Object Detectors | Jul 19, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Enhancing Layout Hotspot Detection Efficiency with YOLOv8 and PCA-Guided Augmentation | Jul 19, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Evaluating and Enhancing Trustworthiness of LLMs in Perception Tasks | Jul 18, 2024 | Hallucinationobject-detection | —Unverified | 0 |
| SUSTechGAN: Image Generation for Object Detection in Adverse Conditions of Autonomous Driving | Jul 18, 2024 | Autonomous DrivingImage Generation | CodeCode Available | 0 |
| Learning Visual Grounding from Generative Vision and Language Model | Jul 18, 2024 | AttributeLanguage Modeling | —Unverified | 0 |
| DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection | Jul 18, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| General Geometry-aware Weakly Supervised 3D Object Detection | Jul 18, 2024 | 3D Object DetectionObject | CodeCode Available | 1 |
| Enhancing Source-Free Domain Adaptive Object Detection with Low-confidence Pseudo Label Distillation | Jul 18, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Jul 18, 2024 | DenoisingObject | —Unverified | 0 |
| Learning Camouflaged Object Detection from Noisy Pseudo Label | Jul 18, 2024 | Camouflaged Object SegmentationMemorization | —Unverified | 0 |
| GroupMamba: Efficient Group-Based Visual State Space Model | Jul 18, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| ColorMAE: Exploring data-independent masking strategies in Masked AutoEncoders | Jul 17, 2024 | Image ClassificationInstance Segmentation | CodeCode Available | 0 |
| AdaLog: Post-Training Quantization for Vision Transformers with Adaptive Logarithm Quantizer | Jul 17, 2024 | Instance Segmentationobject-detection | CodeCode Available | 1 |
| Weighting Pseudo-Labels via High-Activation Feature Index Similarity and Object Detection for Semi-Supervised Segmentation | Jul 17, 2024 | object-detectionObject Detection | CodeCode Available | 0 |
| Toward INT4 Fixed-Point Training via Exploring Quantization Error for Gradients | Jul 17, 2024 | image-classificationImage Classification | —Unverified | 0 |
| Exploring Deeper! Segment Anything Model with Depth Perception for Camouflaged Object Detection | Jul 17, 2024 | Knowledge Distillationobject-detection | CodeCode Available | 1 |
| CerberusDet: Unified Multi-Dataset Object Detection | Jul 17, 2024 | Objectobject-detection | CodeCode Available | 1 |
| GLARE: Low Light Image Enhancement via Generative Latent Feature based Codebook Retrieval | Jul 17, 2024 | DecoderImage Enhancement | CodeCode Available | 2 |
| Enhancing Wrist Fracture Detection with YOLO | Jul 17, 2024 | Anomaly DetectionFracture detection | CodeCode Available | 0 |
| Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation | Jul 17, 2024 | Dataset GenerationDeep Learning | —Unverified | 0 |
| Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection | Jul 17, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Generative AI Driven Task-Oriented Adaptive Semantic Communications | Jul 16, 2024 | Instance Segmentationobject-detection | —Unverified | 0 |
| PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Jul 16, 2024 | 2D Object DetectionComputational Efficiency | —Unverified | 0 |
| LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Jul 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection | Jul 16, 2024 | Knowledge Distillationobject-detection | CodeCode Available | 1 |
| Monocular pose estimation of articulated surgical instruments in open surgery | Jul 16, 2024 | 6D Pose EstimationDomain Adaptation | —Unverified | 0 |
| Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Jul 16, 2024 | 2D Object Detectionobject-detection | CodeCode Available | 3 |
| Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Jul 16, 2024 | Human Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| TCFormer: Visual Recognition via Token Clustering Transformer | Jul 16, 2024 | Clusteringimage-classification | CodeCode Available | 3 |
| The object detection method aids in image reconstruction evaluation and clinical interpretation of meniscal abnormalities | Jul 16, 2024 | Anomaly DetectionImage Reconstruction | —Unverified | 0 |
| MaskVD: Region Masking for Efficient Video Object Detection | Jul 16, 2024 | Objectobject-detection | —Unverified | 0 |
| AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs | Jul 16, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Jul 16, 2024 | Objectobject-detection | —Unverified | 0 |
| OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models | Jul 15, 2024 | Graph Generationobject-detection | CodeCode Available | 1 |
| OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer | Jul 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Anticipating Future Object Compositions without Forgetting | Jul 15, 2024 | AttributeCompositional Zero-Shot Learning | —Unverified | 0 |
| Interpreting Hand gestures using Object Detection and Digits Classification | Jul 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Jul 15, 2024 | 3D Lane Detection3D Object Detection | CodeCode Available | 1 |
| OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jul 15, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| Backdoor Attacks against Image-to-Image Networks | Jul 15, 2024 | Backdoor AttackDenoising | —Unverified | 0 |
| FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Jul 14, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Jul 14, 2024 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 0 |
| Augmented Neural Fine-Tuning for Efficient Backdoor Purification | Jul 14, 2024 | Action RecognitionData Augmentation | CodeCode Available | 1 |
| LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Jul 14, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 1 |