| Close the Sim2real Gap via Physically-based Structured Light Synthetic Data Simulation | Jul 17, 2024 | Dataset GenerationDeep Learning | —Unverified | 0 |
| Embracing Events and Frames with Hierarchical Feature Refinement Network for Object Detection | Jul 17, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Generative AI Driven Task-Oriented Adaptive Semantic Communications | Jul 16, 2024 | Instance Segmentationobject-detection | —Unverified | 0 |
| PADRe: A Unifying Polynomial Attention Drop-in Replacement for Efficient Vision Transformer | Jul 16, 2024 | 2D Object DetectionComputational Efficiency | —Unverified | 0 |
| LaMI-DETR: Open-Vocabulary Detection with Language Model Instruction | Jul 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Bridge Past and Future: Overcoming Information Asymmetry in Incremental Object Detection | Jul 16, 2024 | Knowledge Distillationobject-detection | CodeCode Available | 1 |
| Monocular pose estimation of articulated surgical instruments in open surgery | Jul 16, 2024 | 6D Pose EstimationDomain Adaptation | —Unverified | 0 |
| Relation DETR: Exploring Explicit Position Relation Prior for Object Detection | Jul 16, 2024 | 2D Object Detectionobject-detection | CodeCode Available | 3 |
| Crowd-SAM: SAM as a Smart Annotator for Object Detection in Crowded Scenes | Jul 16, 2024 | Human Instance SegmentationInstance Segmentation | CodeCode Available | 2 |
| TCFormer: Visual Recognition via Token Clustering Transformer | Jul 16, 2024 | Clusteringimage-classification | CodeCode Available | 3 |
| The object detection method aids in image reconstruction evaluation and clinical interpretation of meniscal abnormalities | Jul 16, 2024 | Anomaly DetectionImage Reconstruction | —Unverified | 0 |
| MaskVD: Region Masking for Efficient Video Object Detection | Jul 16, 2024 | Objectobject-detection | —Unverified | 0 |
| AFIDAF: Alternating Fourier and Image Domain Adaptive Filters as an Efficient Alternative to Attention in ViTs | Jul 16, 2024 | object-detectionObject Detection | —Unverified | 0 |
| Improving Unsupervised Video Object Segmentation via Fake Flow Generation | Jul 16, 2024 | Objectobject-detection | —Unverified | 0 |
| OpenPSG: Open-set Panoptic Scene Graph Generation via Large Multimodal Models | Jul 15, 2024 | Graph Generationobject-detection | CodeCode Available | 1 |
| OVLW-DETR: Open-Vocabulary Light-Weighted Detection Transformer | Jul 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Anticipating Future Object Compositions without Forgetting | Jul 15, 2024 | AttributeCompositional Zero-Shot Learning | —Unverified | 0 |
| Interpreting Hand gestures using Object Detection and Digits Classification | Jul 15, 2024 | object-detectionObject Detection | —Unverified | 0 |
| RepVF: A Unified Vector Fields Representation for Multi-task 3D Perception | Jul 15, 2024 | 3D Lane Detection3D Object Detection | CodeCode Available | 1 |
| OPEN: Object-wise Position Embedding for Multi-view 3D Object Detection | Jul 15, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| Backdoor Attacks against Image-to-Image Networks | Jul 15, 2024 | Backdoor AttackDenoising | —Unverified | 0 |
| FSD-BEV: Foreground Self-Distillation for Multi-view 3D Object Detection | Jul 14, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Shape2Scene: 3D Scene Representation Learning Through Pre-training on Shape Data | Jul 14, 2024 | 3D Object Detection3D Semantic Segmentation | CodeCode Available | 0 |
| Augmented Neural Fine-Tuning for Efficient Backdoor Purification | Jul 14, 2024 | Action RecognitionData Augmentation | CodeCode Available | 1 |
| LabelDistill: Label-guided Cross-modal Knowledge Distillation for Camera-based 3D Object Detection | Jul 14, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 1 |