| FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction | Oct 10, 2024 | 3D Reconstruction, Common Sense Reasoning | Unverified | 0 |
| Self-Supervised Learning for Real-World Object Detection: a Survey | Oct 9, 2024 | Object, object-detection | Unverified | 0 |
| Progressive Multi-Modal Fusion for Robust 3D Object Detection | Oct 9, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Structured Spatial Reasoning with Open Vocabulary Object Detectors | Oct 9, 2024 | Object, Object Rearrangement | Unverified | 0 |
| AvatarGO: Zero-shot 4D Human-Object Interaction Generation and Animation | Oct 9, 2024 | Human-Object Interaction Detection, Human-Object Interaction Generation | Unverified | 0 |
| Learning Gaussian Data Augmentation in Feature Space for One-shot Object Detection in Manga | Oct 8, 2024 | Colorization, Data Augmentation | Unverified | 0 |
| Training-Free Open-Ended Object Detection and Segmentation via Attention as Prompts | Oct 8, 2024 | Instance Segmentation, Object | Unverified | 0 |
| Adver-City: Open-Source Multi-Modal Dataset for Collaborative Perception Under Adverse Weather Conditions | Oct 8, 2024 | Autonomous Vehicles, Object | Unverified | 0 |
| Believing is Seeing: Unobserved Object Detection using Generative Models | Oct 8, 2024 | Object, object-detection | Code Available | 0 |
| First experimental study of multiple orientation muon tomography, with image optimization in sparse data environments | Oct 8, 2024 | Object | Unverified | 0 |
| Improving Object Detection via Local-global Contrastive Learning | Oct 7, 2024 | Contrastive Learning, Image-to-Image Translation | Unverified | 0 |
| Next state prediction gives rise to entangled, yet compositional representations of objects | Oct 7, 2024 | Object | Unverified | 0 |
| StreetSurfGS: Scalable Urban Street Surface Reconstruction with Planar-based Gaussian Splatting | Oct 6, 2024 | Autonomous Driving, Novel View Synthesis | Unverified | 0 |
| Deformable NeRF using Recursively Subdivided Tetrahedra | Oct 6, 2024 | NeRF, Novel View Synthesis | Unverified | 0 |
| STONE: A Submodular Optimization Framework for Active 3D Object Detection | Oct 4, 2024 | 3D Object Detection, Active Learning | Code Available | 0 |
| Learning Object Properties Using Robot Proprioception via Differentiable Robot-Object Interaction | Oct 4, 2024 | Object | Unverified | 0 |
| Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models | Oct 4, 2024 | counterfactual, Data Augmentation | Code Available | 0 |
| Task-Decoupled Image Inpainting Framework for Class-specific Object Remover | Oct 3, 2024 | Image Inpainting, Object | Unverified | 0 |
| Perceptual Piercing: Human Visual Cue-based Object Detection in Low Visibility Conditions | Oct 2, 2024 | Autonomous Driving, Computational Efficiency | Code Available | 0 |
| Simplified priors for Object-Centric Learning | Oct 1, 2024 | Continual Learning, Object | Unverified | 0 |
| ARPOV: Expanding Visualization of Object Detection in AR with Panoramic Mosaic Stitching | Oct 1, 2024 | Object, object-detection | Unverified | 0 |
| Can We Remove the Ground? Obstacle-aware Point Cloud Compression for Remote Object Detection | Oct 1, 2024 | 3D Object Detection, Object | Unverified | 0 |
| DressRecon: Freeform 4D Human Reconstruction from Monocular Video | Sep 30, 2024 | Object, Optical Flow Estimation | Unverified | 0 |
| TROPE: TRaining-Free Object-Part Enhancement for Seamlessly Improving Fine-Grained Zero-Shot Image Captioning | Sep 30, 2024 | Image Captioning, Object | Code Available | 0 |
| HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding | Sep 30, 2024 | Hallucination, Object | Code Available | 0 |
| SuperPose: Improved 6D Pose Estimation with Robust Tracking and Mask-Free Initialization | Sep 30, 2024 | 6D Pose Estimation, Object | Unverified | 0 |
| Applying the Lower-Biased Teacher Model in Semi-Supervised Object Detection | Sep 29, 2024 | Object, object-detection | Unverified | 0 |
| fCOP: Focal Length Estimation from Category-level Object Priors | Sep 29, 2024 | Depth Estimation, Monocular Depth Estimation | Unverified | 0 |
| 1st Place Solution to the 8th HANDS Workshop Challenge -- ARCTIC Track: 3DGS-based Bimanual Category-agnostic Interaction Reconstruction | Sep 28, 2024 | 3DGS, Object | Unverified | 0 |
| Search3D: Hierarchical Open-Vocabulary 3D Segmentation | Sep 27, 2024 | 3D Instance Segmentation, 3D Part Segmentation | Unverified | 0 |
| Query matching for spatio-temporal action detection with query-based object detector | Sep 27, 2024 | Action Detection, Object | Unverified | 0 |
| You Only Speak Once to See | Sep 27, 2024 | Contrastive Learning, Object | Unverified | 0 |
| An Overview of Multi-Object Estimation via Labeled Random Finite Set | Sep 27, 2024 | Multi-Object Tracking, Object | Unverified | 0 |
| CAFF-DINO: Multi-spectral object detection transformers with cross-attention features fusion | Sep 27, 2024 | Multispectral Object Detection, Object | Unverified | 0 |
| Search and Detect: Training-Free Long Tail Object Detection via Web-Image Retrieval | Sep 26, 2024 | Image Retrieval, Object | Unverified | 0 |
| Advancing Object Detection in Transportation with Multimodal Large Language Models (MLLMs): A Comprehensive Review and Empirical Testing | Sep 26, 2024 | Event Detection, Object | Unverified | 0 |
| CAMOT: Camera Angle-aware Multi-Object Tracking | Sep 26, 2024 | Multi-Object Tracking, Object | Unverified | 0 |
| SOAR: Self-supervision Optimized UAV Action Recognition with Efficient Object-Aware Pretraining | Sep 26, 2024 | Action Recognition, Object | Unverified | 0 |
| Amodal Instance Segmentation with Diffusion Shape Prior Estimation | Sep 26, 2024 | Amodal Instance Segmentation, Instance Segmentation | Unverified | 0 |
| Hand-object reconstruction via interaction-aware graph attention mechanism | Sep 26, 2024 | Graph Attention, Graph Neural Network | Unverified | 0 |
| General Compression Framework for Efficient Transformer Object Tracking | Sep 26, 2024 | Model Compression, Object | Unverified | 0 |
| A Grasping Movement Intention Estimator for Intuitive Control of Assistive Devices | Sep 25, 2024 | Object | Unverified | 0 |
| Transient Adversarial 3D Projection Attacks on Object Detection in Autonomous Driving | Sep 25, 2024 | Autonomous Driving, Object | Unverified | 0 |
| Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM | Sep 25, 2024 | 3D Scene Reconstruction, Object | Unverified | 0 |
| A Versatile and Differentiable Hand-Object Interaction Representation | Sep 25, 2024 | Mixed Reality, Object | Unverified | 0 |
| Tiny Robotics Dataset and Benchmark for Continual Object Detection | Sep 24, 2024 | Autonomous Navigation, Continual Learning | Code Available | 0 |
| Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking | Sep 24, 2024 | Object | Unverified | 0 |
| Towards Robust Object Detection: Identifying and Removing Backdoors via Module Inconsistency Analysis | Sep 24, 2024 | backdoor defense, Object | Unverified | 0 |
| UICE-MIRNet guided image enhancement for underwater object detection | Sep 24, 2024 | feature selection, Image Enhancement | Unverified | 0 |
| OW-Rep: Open World Object Detection with Instance Representation Learning | Sep 24, 2024 | Novel Class Discovery, Object | Unverified | 0 |