| FlightScope: An Experimental Comparative Review of Aircraft Detection Algorithms in Satellite Imagery | Apr 3, 2024 | Objectobject-detection | CodeCode Available | 1 |
| ALOHa: A New Measure for Hallucination in Captioning Models | Apr 3, 2024 | HallucinationObject | —Unverified | 0 |
| One Noise to Rule Them All: Multi-View Adversarial Attacks with Universal Perturbation | Apr 2, 2024 | 3D Object RecognitionAll | CodeCode Available | 0 |
| GEARS: Local Geometry-aware Hand-object Interaction Synthesis | Apr 2, 2024 | Object | —Unverified | 0 |
| LR-FPN: Enhancing Remote Sensing Object Detection with Location Refined Feature Pyramid Network | Apr 2, 2024 | Objectobject-detection | —Unverified | 0 |
| Disentangled Pre-training for Human-Object Interaction Detection | Apr 2, 2024 | Action RecognitionDecoder | CodeCode Available | 1 |
| Task Integration Distillation for Object Detectors | Apr 2, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| Scene Adaptive Sparse Transformer for Event-based Object Detection | Apr 2, 2024 | Objectobject-detection | CodeCode Available | 2 |
| Event-assisted Low-Light Video Object Segmentation | Apr 2, 2024 | ObjectSemantic Segmentation | CodeCode Available | 1 |
| Uncertainty-aware Active Learning of NeRF-based Object Models for Robot Manipulators using Visual and Re-orientation Actions | Apr 2, 2024 | Active LearningInformativeness | —Unverified | 0 |
| EGTR: Extracting Graph from Transformer for Scene Graph Generation | Apr 2, 2024 | Graph GenerationMulti-Task Learning | CodeCode Available | 2 |
| Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection | Apr 2, 2024 | Objectobject-detection | —Unverified | 0 |
| Segment Any 3D Object with Language | Apr 2, 2024 | 3D Instance SegmentationDecoder | —Unverified | 0 |
| Open-Vocabulary Object Detectors: Robustness Challenges under Distribution Shifts | Apr 1, 2024 | Objectobject-detection | —Unverified | 0 |
| Object-conditioned Bag of Instances for Few-Shot Personalized Instance Recognition | Apr 1, 2024 | Objectobject-detection | —Unverified | 0 |
| ContactHandover: Contact-Guided Robot-to-Human Object Handover | Apr 1, 2024 | Object | —Unverified | 0 |
| SUGAR: Pre-training 3D Visual Representations for Robotics | Apr 1, 2024 | 3D Instance Segmentation3D Object Recognition | —Unverified | 0 |
| What is Point Supervision Worth in Video Instance Segmentation? | Apr 1, 2024 | Instance SegmentationObject | —Unverified | 0 |
| Detect2Interact: Localizing Object Key Field in Visual Question Answering (VQA) with LLMs | Apr 1, 2024 | Common Sense ReasoningObject | —Unverified | 0 |
| Text2HOI: Text-guided 3D Motion Generation for Hand-Object Interaction | Mar 31, 2024 | Motion GenerationObject | CodeCode Available | 2 |
| Object-level Copy-Move Forgery Image Detection based on Inconsistency Mining | Mar 31, 2024 | Forgery Image DetectionObject | —Unverified | 0 |
| Weak-to-Strong 3D Object Detection with X-Ray Distillation | Mar 31, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| Cognitive Planning for Object Goal Navigation using Generative AI Models | Mar 30, 2024 | Efficient ExplorationIn-Context Learning | —Unverified | 0 |
| HOI-M3:Capture Multiple Humans and Objects Interaction within Contextual Environment | Mar 30, 2024 | Human-Object Interaction DetectionObject | —Unverified | 0 |
| Constrained Layout Generation with Factor Graphs | Mar 30, 2024 | Graph Neural NetworkLayout Generation | —Unverified | 0 |
| VSRD: Instance-Aware Volumetric Silhouette Rendering for Weakly Supervised 3D Object Detection | Mar 29, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 1 |
| DVIS-DAQ: Improving Video Segmentation via Dynamic Anchor Queries | Mar 29, 2024 | ObjectVideo Instance Segmentation | CodeCode Available | 2 |
| PLoc: A New Evaluation Criterion Based on Physical Location for Autonomous Driving Datasets | Mar 29, 2024 | Autonomous DrivingObject | CodeCode Available | 0 |
| Temporally Consistent Referring Video Object Segmentation with Hybrid Memory | Mar 28, 2024 | HTRObject | CodeCode Available | 1 |
| OAKINK2: A Dataset of Bimanual Hands-Object Manipulation in Complex Task Completion | Mar 28, 2024 | Motion SynthesisObject | —Unverified | 0 |
| GraspXL: Generating Grasping Motions for Diverse Objects at Scale | Mar 28, 2024 | Object | —Unverified | 0 |
| Total-Decom: Decomposed 3D Scene Reconstruction with Minimal Interaction | Mar 28, 2024 | 3D geometry3D Reconstruction | CodeCode Available | 2 |
| Algorithmic Ways of Seeing: Using Object Detection to Facilitate Art Exploration | Mar 28, 2024 | Objectobject-detection | —Unverified | 0 |
| RiEMann: Near Real-Time SE(3)-Equivariant Robot Manipulation without Point Cloud Segmentation | Mar 28, 2024 | Imitation LearningObject | —Unverified | 0 |
| Enhancing Multiple Object Tracking Accuracy via Quantum Annealing | Mar 27, 2024 | ManagementMultiple Object Tracking | —Unverified | 0 |
| DODA: Diffusion for Object-detection Domain Adaptation in Agriculture | Mar 27, 2024 | Domain AdaptationHead Detection | CodeCode Available | 1 |
| FlexEdit: Flexible and Controllable Diffusion-based Object-centric Image Editing | Mar 27, 2024 | DenoisingObject | —Unverified | 0 |
| Tracking-Assisted Object Detection with Event Cameras | Mar 27, 2024 | AttributeObject | CodeCode Available | 0 |
| Online Embedding Multi-Scale CLIP Features into 3D Maps | Mar 27, 2024 | ObjectRetrieval | —Unverified | 0 |
| ObjectDrop: Bootstrapping Counterfactuals for Photorealistic Object Removal and Insertion | Mar 27, 2024 | counterfactualObject | —Unverified | 0 |
| Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations | Mar 27, 2024 | AttributeDiagnostic | CodeCode Available | 0 |
| Benchmarking Object Detectors with COCO: A New Path Forward | Mar 27, 2024 | BenchmarkingObject | CodeCode Available | 1 |
| BAM: Box Abstraction Monitors for Real-time OoD Detection in Object Detection | Mar 27, 2024 | Objectobject-detection | —Unverified | 0 |
| SpectralWaste Dataset: Multimodal Data for Waste Sorting Automation | Mar 26, 2024 | ManagementObject | —Unverified | 0 |
| EgoLifter: Open-world 3D Segmentation for Egocentric Perception | Mar 26, 2024 | 3D ReconstructionObject | CodeCode Available | 2 |
| Hierarchical Open-Vocabulary 3D Scene Graphs for Language-Grounded Robot Navigation | Mar 26, 2024 | ObjectRobot Navigation | —Unverified | 0 |
| DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions | Mar 26, 2024 | Object | —Unverified | 0 |
| Exploring Dynamic Transformer for Efficient Object Tracking | Mar 26, 2024 | ObjectObject Tracking | —Unverified | 0 |
| Efficient Image Pre-Training with Siamese Cropped Masked Autoencoders | Mar 26, 2024 | ObjectSelf-Supervised Learning | CodeCode Available | 2 |
| Efficient Video Object Segmentation via Modulated Cross-Attention Memory | Mar 26, 2024 | GPUObject | CodeCode Available | 2 |