| What is YOLOv5: A deep look into the internal features of the popular object detector | Jul 30, 2024 | Object, object-detection | Unverified | 0 |
| MEVDT: Multi-Modal Event-Based Vehicle Detection and Tracking Dataset | Jul 29, 2024 | Event-based vision, Object | Unverified | 0 |
| Practical Video Object Detection via Feature Selection and Aggregation | Jul 29, 2024 | feature selection, GPU | Code Available | 3 |
| ClickDiff: Click to Induce Semantic Contact Map for Controllable Grasp Generation with Diffusion Models | Jul 28, 2024 | Controllable Grasp Generation, Grasp Generation | Code Available | 0 |
| Progressive Domain Adaptation for Thermal Infrared Object Tracking | Jul 28, 2024 | Domain Adaptation, Object | Unverified | 0 |
| Rapid Object Annotation | Jul 26, 2024 | Object | Unverified | 0 |
| SHIC: Shape-Image Correspondences with no Keypoint Supervision | Jul 26, 2024 | Keypoint Detection, Object | Unverified | 0 |
| Floating No More: Object-Ground Reconstruction from a Single Image | Jul 26, 2024 | 3D Object Reconstruction, 3D Reconstruction | Unverified | 0 |
| XS-VID: An Extremely Small Video Object Detection Dataset | Jul 25, 2024 | Diversity, Object | Unverified | 0 |
| ReCorD: Reasoning and Correcting Diffusion for HOI Generation | Jul 25, 2024 | Human-Object Interaction Generation, Image Generation | Code Available | 1 |
| Guided Latent Slot Diffusion for Object-Centric Learning | Jul 25, 2024 | Conditional Image Generation, Decoder | Unverified | 0 |
| Move and Act: Enhanced Object Manipulation and Background Integrity for Image Editing | Jul 25, 2024 | Object, Position | Code Available | 1 |
| PEEKABOO: Hiding parts of an image for unsupervised object localization | Jul 24, 2024 | Object, object-detection | Code Available | 0 |
| Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model | Jul 24, 2024 | Image Inpainting, Object | Code Available | 3 |
| AI-based Density Recognition | Jul 24, 2024 | Object, Object Recognition | Unverified | 0 |
| What Matters in Range View 3D Object Detection | Jul 23, 2024 | 3D Object Detection, Object | Code Available | 1 |
| PartGLEE: A Foundation Model for Recognizing and Parsing Any Objects | Jul 23, 2024 | Instance Segmentation, Object | Code Available | 2 |
| ESOD: Efficient Small Object Detection on High-Resolution Images | Jul 23, 2024 | GPU, Object | Code Available | 2 |
| Understanding Impacts of Electromagnetic Signal Injection Attacks on Object Detection | Jul 23, 2024 | Autonomous Driving, Object | Unverified | 0 |
| FoRA: Low-Rank Adaptation Model beyond Multimodal Siamese Network | Jul 23, 2024 | Object, object-detection | Code Available | 1 |
| MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object Detection | Jul 23, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| On Feasibility of Intent Obfuscating Attacks | Jul 22, 2024 | Object | Code Available | 0 |
| Local Occupancy-Enhanced Object Grasping with Multiple Triplanar Projection | Jul 22, 2024 | Object, Pose Estimation | Unverified | 0 |
| Affordance Labeling and Exploration: A Manifold-Based Approach | Jul 22, 2024 | Classification, Clustering | Unverified | 0 |
| CarFormer: Self-Driving with Learned Object-Centric Representations | Jul 22, 2024 | Object | Unverified | 0 |
| Towards Open-World Object-based Anomaly Detection via Self-Supervised Outlier Synthesis | Jul 22, 2024 | Anomaly Detection, Object | Code Available | 0 |
| Disentangling spatio-temporal knowledge for weakly supervised object detection and segmentation in surgical video | Jul 22, 2024 | Disentanglement, Knowledge Distillation | Code Available | 0 |
| SS-SFR: Synthetic Scenes Spatial Frequency Response on Virtual KITTI and Degraded Automotive Simulations for Object Detection | Jul 22, 2024 | Object, object-detection | Unverified | 0 |
| Flow as the Cross-Domain Manipulation Interface | Jul 21, 2024 | Object | Unverified | 0 |
| Hybrid PHD-PMB Trajectory Smoothing Using Backward Simulation | Jul 20, 2024 | Object | Unverified | 0 |
| RayFormer: Improving Query-Based Multi-Camera 3D Object Detection via Ray-Centric Strategies | Jul 20, 2024 | 2D Object Detection, 3D Object Detection | Unverified | 0 |
| EmoCAM: Toward Understanding What Drives CNN-based Emotion Recognition | Jul 19, 2024 | Emotion Recognition, image-classification | Unverified | 0 |
| OCTrack: Benchmarking the Open-Corpus Multi-Object Tracking | Jul 19, 2024 | Benchmarking, Multi-Object Tracking | Unverified | 0 |
| Investigating the Indirect Object Identification circuit in Mamba | Jul 19, 2024 | Mamba, Object | Code Available | 0 |
| Interior Object Geometry via Fitted Frames | Jul 19, 2024 | Object | Unverified | 0 |
| PD-APE: A Parallel Decoding Framework with Adaptive Position Encoding for 3D Visual Grounding | Jul 19, 2024 | 3D visual grounding, Attribute | Unverified | 0 |
| Learning Visual Grounding from Generative Vision and Language Model | Jul 18, 2024 | Attribute, Language Modeling | Unverified | 0 |
| The Art of Imitation: Learning Long-Horizon Manipulation Tasks from Few Demonstrations | Jul 18, 2024 | Imitation Learning, Inductive Bias | Code Available | 1 |
| Attention Based Simple Primitives for Open World Compositional Zero-Shot Learning | Jul 18, 2024 | Attribute, Compositional Zero-Shot Learning | Code Available | 0 |
| OAT: Object-Level Attention Transformer for Gaze Scanpath Prediction | Jul 18, 2024 | Decoder, Object | Code Available | 0 |
| FocusDiffuser: Perceiving Local Disparities for Camouflaged Object Detection | Jul 18, 2024 | Denoising, Object | Unverified | 0 |
| General Geometry-aware Weakly Supervised 3D Object Detection | Jul 18, 2024 | 3D Object Detection, Object | Code Available | 1 |
| Learning Camouflaged Object Detection from Noisy Pseudo Label | Jul 18, 2024 | Camouflaged Object Segmentation, Memorization | Unverified | 0 |
| DFMSD: Dual Feature Masking Stage-wise Knowledge Distillation for Object Detection | Jul 18, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| Data-driven Verification of DNNs for Object Recognition | Jul 17, 2024 | Image Segmentation, Object | Unverified | 0 |
| Strawberry detection and counting based on YOLOv7 pruning and information based tracking algorithm | Jul 17, 2024 | Multiple Object Tracking, Object | Unverified | 0 |
| NL2Contact: Natural Language Guided 3D Hand-Object Contact Modeling with Diffusion Model | Jul 17, 2024 | Descriptive, Grasp Generation | Unverified | 0 |
| HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects | Jul 17, 2024 | Benchmarking, Human-Object Interaction Detection | Unverified | 0 |
| CerberusDet: Unified Multi-Dataset Object Detection | Jul 17, 2024 | Object, object-detection | Code Available | 1 |
| Object-Aware Query Perturbation for Cross-Modal Image-Text Retrieval | Jul 17, 2024 | Image-text Retrieval, Object | Code Available | 0 |