| I'M HOI: Inertia-aware Monocular Capture of 3D Human-Object Interactions | Dec 10, 2023 | Human-Object Interaction Detection, Object | Unverified | 0 |
| The Quest for an Integrated Set of Neural Mechanisms Underlying Object Recognition in Primates | Dec 10, 2023 | Object, Object Recognition | Unverified | 0 |
| Correcting Diffusion Generation through Resampling | Dec 10, 2023 | Image Generation, Object | Code Available | 1 |
| Open World Object Detection in the Era of Foundation Models | Dec 10, 2023 | Medical Image Analysis, Object | Unverified | 0 |
| InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models | Dec 10, 2023 | Human-Object Interaction Generation, Object | Code Available | 1 |
| 3D Copy-Paste: Physically Plausible Object Insertion for Monocular 3D Detection | Dec 8, 2023 | 3D Object Detection, Data Augmentation | Code Available | 1 |
| Towards Enhanced Image Inpainting: Mitigating Unwanted Object Insertion and Preserving Color Consistency | Dec 8, 2023 | Decoder, Hallucination | Code Available | 1 |
| SmartMask: Context Aware High-Fidelity Mask Generation for Fine-grained Object Insertion and Layout Control | Dec 8, 2023 | Image Generation, Image Inpainting | Unverified | 0 |
| Benchmarking and Analysis of Unsupervised Object Segmentation from Real-world Single Images | Dec 8, 2023 | Benchmarking, Object | Code Available | 1 |
| Correspondences of the Third Kind: Camera Pose Estimation from Object Reflection | Dec 7, 2023 | Camera Pose Estimation, Motion Estimation | Unverified | 0 |
| PhysHOI: Physics-Based Imitation of Dynamic Human-Object Interaction | Dec 7, 2023 | Human-Object Interaction Detection, Object | Unverified | 0 |
| Natural-language-driven Simulation Benchmark and Copilot for Efficient Production of Object Interactions in Virtual Road Scenes | Dec 7, 2023 | Autonomous Driving, Object | Unverified | 0 |
| Gen2Det: Generate to Detect | Dec 7, 2023 | Image Generation, Object | Unverified | 0 |
| TeMO: Towards Text-Driven 3D Stylization for Multi-Object Meshes | Dec 7, 2023 | Graph Attention, Object | Unverified | 0 |
| High Pileup Particle Tracking with Object Condensation | Dec 6, 2023 | Edge Classification, Object | Code Available | 1 |
| Controllable Human-Object Interaction Synthesis | Dec 6, 2023 | Human-Object Interaction Detection, Object | Unverified | 0 |
| SurfaceAug: Closing the Gap in Multimodal Ground Truth Sampling | Dec 6, 2023 | Data Augmentation, Object | Unverified | 0 |
| Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion | Dec 6, 2023 | 3D Inpainting, NeRF | Unverified | 0 |
| Automated Multimodal Data Annotation via Calibration With Indoor Positioning System | Dec 6, 2023 | Object, object-detection | Unverified | 0 |
| TokenCompose: Text-to-Image Diffusion with Token-level Supervision | Dec 6, 2023 | Denoising, Image Generation | Code Available | 1 |
| A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting | Dec 6, 2023 | Image Inpainting, Object | Code Available | 0 |
| Texture-Semantic Collaboration Network for ORSI Salient Object Detection | Dec 6, 2023 | Decoder, Object | Code Available | 0 |
| MotionCtrl: A Unified and Flexible Motion Controller for Video Generation | Dec 6, 2023 | Object, Video Generation | Code Available | 3 |
| Low-shot Object Learning with Mutual Exclusivity Bias | Dec 6, 2023 | Object | Code Available | 0 |
| DreamComposer: Controllable 3D Object Generation via Multi-View Conditions | Dec 6, 2023 | 3D Object Reconstruction, Novel View Synthesis | Code Available | 1 |
| Mitigating Open-Vocabulary Caption Hallucinations | Dec 6, 2023 | Diversity, Hallucination | Code Available | 1 |
| Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation | Dec 6, 2023 | Object, Visual Navigation | Unverified | 0 |
| Boosting Segment Anything Model Towards Open-Vocabulary Learning | Dec 6, 2023 | model, Object | Code Available | 1 |
| DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing | Dec 5, 2023 | Object, Video Editing | Unverified | 0 |
| ScAR: Scaling Adversarial Robustness for LiDAR Object Detection | Dec 5, 2023 | 3D Object Detection, Adversarial Attack | Code Available | 0 |
| ZeroReg: Zero-Shot Point Cloud Registration with Foundation Models | Dec 5, 2023 | Decoder, Graph Matching | Unverified | 0 |
| Are Vision Transformers More Data Hungry Than Newborn Visual Systems? | Dec 5, 2023 | Object, Object Recognition | Code Available | 0 |
| SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints | Dec 5, 2023 | Model Optimization, Novel Concepts | Code Available | 2 |
| Diffusion-SS3D: Diffusion Model for Semi-supervised 3D Object Detection | Dec 5, 2023 | 3D Object Detection, Denoising | Code Available | 1 |
| RotaTR: Detection Transformer for Dense and Rotated Object | Dec 5, 2023 | Action Recognition, Decoder | Unverified | 0 |
| MANUS: Markerless Grasp Capture using Articulated 3D Gaussians | Dec 4, 2023 | Mixed Reality, Object | Unverified | 0 |
| Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites | Dec 4, 2023 | Hallucination, Hallucination Evaluation | Code Available | 1 |
| Object Recognition as Next Token Prediction | Dec 4, 2023 | Decoder, Language Modeling | Code Available | 1 |
| Light Field Imaging in the Restrictive Object Space based on Flexible Angular Plane | Dec 4, 2023 | 3D Reconstruction, Object | Unverified | 0 |
| Adaptive Confidence Threshold for ByteTrack in Multi-Object Tracking | Dec 4, 2023 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 0 |
| BEVNeXt: Reviving Dense BEV Frameworks for 3D Object Detection | Dec 4, 2023 | 3D Object Detection, Decoder | Code Available | 1 |
| Aligning and Prompting Everything All at Once for Universal Visual Perception | Dec 4, 2023 | All, Object | Code Available | 2 |
| SANeRF-HQ: Segment Anything for NeRF in High Quality | Dec 3, 2023 | NeRF, Novel View Synthesis | Unverified | 0 |
| SAGE: Bridging Semantic and Actionable Parts for GEneralizable Manipulation of Articulated Objects | Dec 3, 2023 | Language Modelling, Object | Unverified | 0 |
| ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models | Dec 3, 2023 | Novel View Synthesis, Object | Unverified | 0 |
| Toward Improving Robustness of Object Detectors Against Domain Shift | Dec 2, 2023 | Data Augmentation, Diversity | Code Available | 1 |
| Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction | Dec 2, 2023 | Object, Object Reconstruction | Unverified | 0 |
| Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D | Dec 2, 2023 | 3D Object Retrieval, Depth Estimation | Unverified | 0 |
| ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation | Dec 2, 2023 | 3D Generation, Object | Code Available | 2 |
| FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models | Dec 1, 2023 | 6D Pose Estimation, Object | Unverified | 0 |