| Multiview Scene Graph | Oct 15, 2024 | DecoderObject | CodeCode Available | 2 |
| Open World Object Detection: A Survey | Oct 15, 2024 | Incremental LearningObject | CodeCode Available | 2 |
| Visual-Geometric Collaborative Guidance for Affordance Learning | Oct 15, 2024 | Human-Object Interaction DetectionObject | CodeCode Available | 0 |
| UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Oct 14, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| DINTR: Tracking via Diffusion-based Interpolation | Oct 14, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Oct 14, 2024 | Adversarial RobustnessObject | CodeCode Available | 0 |
| MagicEraser: Erasing Any Objects via Semantics-Aware Control | Oct 14, 2024 | Image InpaintingObject | CodeCode Available | 1 |
| Data-Driven Approaches for Modelling Target Behaviour | Oct 14, 2024 | Gaussian ProcessesObject | —Unverified | 0 |
| High-Precision Dichotomous Image Segmentation via Probing Diffusion Capacity | Oct 14, 2024 | DenoisingDichotomous Image Segmentation | CodeCode Available | 2 |
| LoLI-Street: Benchmarking Low-Light Image Enhancement and Beyond | Oct 13, 2024 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 |
| Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked Autoencoders | Oct 13, 2024 | ObjectPosition regression | —Unverified | 0 |
| VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking | Oct 11, 2024 | Multi-Object TrackingObject | —Unverified | 0 |
| VideoSAM: Open-World Video Segmentation | Oct 11, 2024 | Autonomous DrivingDecoder | —Unverified | 0 |
| FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction | Oct 10, 2024 | 3D ReconstructionCommon Sense Reasoning | —Unverified | 0 |
| SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | Oct 10, 2024 | Object | —Unverified | 0 |
| RegionGrasp: A Novel Task for Contact Region Controllable Hand Grasp Generation | Oct 10, 2024 | Grasp GenerationObject | —Unverified | 0 |
| HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Oct 10, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Structured Spatial Reasoning with Open Vocabulary Object Detectors | Oct 9, 2024 | ObjectObject Rearrangement | —Unverified | 0 |
| Progressive Multi-Modal Fusion for Robust 3D Object Detection | Oct 9, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Self-Supervised Learning for Real-World Object Detection: a Survey | Oct 9, 2024 | Objectobject-detection | —Unverified | 0 |
| Towards Interpreting Visual Information Processing in Vision-Language Models | Oct 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation | Oct 9, 2024 | Human-Object Interaction DetectionHuman-Object Interaction Generation | —Unverified | 0 |
| First experimental study of multiple orientation muon tomography, with image optimization in sparse data environments | Oct 8, 2024 | Object | —Unverified | 0 |
| Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Oct 8, 2024 | Instance SegmentationObject | —Unverified | 0 |
| Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions | Oct 8, 2024 | Autonomous VehiclesObject | —Unverified | 0 |
| Believing is Seeing: Unobserved Object Detection using Generative Models | Oct 8, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga | Oct 8, 2024 | ColorizationData Augmentation | —Unverified | 0 |
| Toward General Object-level Mapping from Sparse Views with 3D Diffusion Priors | Oct 7, 2024 | Object | CodeCode Available | 1 |
| Next state prediction gives rise to entangled, yet compositional representations of objects | Oct 7, 2024 | Object | —Unverified | 0 |
| Improving Object Detection via Local-global Contrastive Learning | Oct 7, 2024 | Contrastive LearningImage-to-Image Translation | —Unverified | 0 |
| StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting | Oct 6, 2024 | Autonomous DrivingNovel View Synthesis | —Unverified | 0 |
| Deformable NeRF using Recursively Subdivided Tetrahedra | Oct 6, 2024 | NeRFNovel View Synthesis | —Unverified | 0 |
| Multimodal 3D Fusion and In-Situ Learning for Spatially Aware AI | Oct 6, 2024 | 3D ReconstructionObject | CodeCode Available | 1 |
| STONE: A Submodular Optimization Framework for Active 3D Object Detection | Oct 4, 2024 | 3D Object DetectionActive Learning | CodeCode Available | 0 |
| Learning Object Properties Using Robot Proprioception via Differentiable Robot-Object Interaction | Oct 4, 2024 | Object | —Unverified | 0 |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Oct 4, 2024 | counterfactualData Augmentation | CodeCode Available | 0 |
| Task-Decoupled Image Inpainting Framework for Class-specific Object Remover | Oct 3, 2024 | Image InpaintingObject | —Unverified | 0 |
| Multi-Scale Fusion for Object Representation | Oct 2, 2024 | Object | CodeCode Available | 1 |
| Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Oct 2, 2024 | Autonomous DrivingComputational Efficiency | CodeCode Available | 0 |
| Open3DTrack: Towards Open-Vocabulary 3D Multi-Object Tracking | Oct 2, 2024 | 3D Multi-Object TrackingAutonomous Driving | CodeCode Available | 1 |
| Simplified priors for Object-Centric Learning | Oct 1, 2024 | Continual LearningObject | —Unverified | 0 |
| ARPOV: Expanding Visualization of Object Detection in AR with Panoramic Mosaic Stitching | Oct 1, 2024 | Objectobject-detection | —Unverified | 0 |
| Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Oct 1, 2024 | 3D Object DetectionObject | —Unverified | 0 |
| SuperPose: Improved 6D Pose Estimation with Robust Tracking and Mask-Free Initialization | Sep 30, 2024 | 6D Pose EstimationObject | —Unverified | 0 |
| HazyDet: Open-source Benchmark for Drone-view Object Detection with Depth-cues in Hazy Scenes | Sep 30, 2024 | Objectobject-detection | CodeCode Available | 2 |
| TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning | Sep 30, 2024 | Image CaptioningObject | CodeCode Available | 0 |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Sep 30, 2024 | HallucinationObject | CodeCode Available | 0 |
| DressRecon: Freeform 4D Human Reconstruction from Monocular Video | Sep 30, 2024 | ObjectOptical Flow Estimation | —Unverified | 0 |
| fCOP: Focal Length Estimation from Category-level Object Priors | Sep 29, 2024 | Depth EstimationMonocular Depth Estimation | —Unverified | 0 |
| Applying the Lower-Biased Teacher Model in Semi-Supervised Object Detection | Sep 29, 2024 | Objectobject-detection | —Unverified | 0 |