| SurfaceAug: Closing the Gap in Multimodal Ground Truth Sampling | Dec 6, 2023 | Data Augmentation, Object | Unverified | 0 |
| Building Category Graphs Representation with Spatial and Temporal Attention for Visual Navigation | Dec 6, 2023 | Object, Visual Navigation | Unverified | 0 |
| Texture-Semantic Collaboration Network for ORSI Salient Object Detection | Dec 6, 2023 | Decoder, Object | Code Available | 0 |
| Inpaint3D: 3D Scene Content Generation using 2D Inpainting Diffusion | Dec 6, 2023 | 3D Inpainting, NeRF | Unverified | 0 |
| Low-shot Object Learning with Mutual Exclusivity Bias | Dec 6, 2023 | Object | Code Available | 0 |
| RotaTR: Detection Transformer for Dense and Rotated Object | Dec 5, 2023 | Action Recognition, Decoder | Unverified | 0 |
| Are Vision Transformers More Data Hungry Than Newborn Visual Systems? | Dec 5, 2023 | Object, Object Recognition | Code Available | 0 |
| DiffusionAtlas: High-Fidelity Consistent Diffusion Video Editing | Dec 5, 2023 | Object, Video Editing | Unverified | 0 |
| ScAR: Scaling Adversarial Robustness for LiDAR Object Detection | Dec 5, 2023 | 3D Object Detection, Adversarial Attack | Code Available | 0 |
| ZeroReg: Zero-Shot Point Cloud Registration with Foundation Models | Dec 5, 2023 | Decoder, Graph Matching | Unverified | 0 |
| Adaptive Confidence Threshold for ByteTrack in Multi-Object Tracking | Dec 4, 2023 | Multi-Object Tracking, Multiple Object Tracking | Code Available | 0 |
| Light Field Imaging in the Restrictive Object Space based on Flexible Angular Plane | Dec 4, 2023 | 3D Reconstruction, Object | Unverified | 0 |
| MANUS: Markerless Grasp Capture using Articulated 3D Gaussians | Dec 4, 2023 | Mixed Reality, Object | Unverified | 0 |
| ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models | Dec 3, 2023 | Novel View Synthesis, Object | Unverified | 0 |
| SAGE: Bridging Semantic and Actionable Parts for GEneralizable Manipulation of Articulated Objects | Dec 3, 2023 | Language Modelling, Object | Unverified | 0 |
| SANeRF-HQ: Segment Anything for NeRF in High Quality | Dec 3, 2023 | NeRF, Novel View Synthesis | Unverified | 0 |
| Neural Parametric Gaussians for Monocular Non-Rigid Object Reconstruction | Dec 2, 2023 | Object, Object Reconstruction | Unverified | 0 |
| Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D | Dec 2, 2023 | 3D Object Retrieval, Depth Estimation | Unverified | 0 |
| Open-vocabulary object 6D pose estimation | Dec 1, 2023 | 6D Pose Estimation, Language Modelling | Unverified | 0 |
| FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models | Dec 1, 2023 | 6D Pose Estimation, Object | Unverified | 0 |
| FoundPose: Unseen Object Pose Estimation with Foundation Features | Nov 30, 2023 | 6D Pose Estimation, Object | Unverified | 0 |
| Hy-Tracker: A Novel Framework for Enhancing Efficiency and Accuracy of Object Tracking in Hyperspectral Videos | Nov 30, 2023 | Object, object-detection | Unverified | 0 |
| LucidDreaming: Controllable Object-Centric 3D Generation | Nov 30, 2023 | 3D Generation, Benchmarking | Unverified | 0 |
| Union-over-Intersections: Object Detection beyond Winner-Takes-All | Nov 30, 2023 | All, Instance Segmentation | Code Available | 0 |
| TrafficMOT: A Challenging Dataset for Multi-Object Tracking in Complex Traffic Scenarios | Nov 30, 2023 | Multi-Object Tracking, Object | Unverified | 0 |
| TIDE: Test Time Few Shot Object Detection | Nov 30, 2023 | Data Augmentation, Few-Shot Object Detection | Code Available | 0 |
| SimulFlow: Simultaneously Extracting Feature and Identifying Target for Unsupervised Video Object Segmentation | Nov 30, 2023 | Object, object-detection | Unverified | 0 |
| Object-based (yet Class-agnostic) Video Domain Adaptation | Nov 29, 2023 | Action Recognition, Domain Adaptation | Unverified | 0 |
| A Stochastic-Geometrical Framework for Object Pose Estimation based on Mixture Models Avoiding the Correspondence Problem | Nov 29, 2023 | Object, Pose Estimation | Unverified | 0 |
| Leveraging VLM-Based Pipelines to Annotate 3D Objects | Nov 29, 2023 | In-Context Learning, Language Modeling | Unverified | 0 |
| Weakly-semi-supervised object detection in remotely sensed imagery | Nov 29, 2023 | Object, object-detection | Unverified | 0 |
| CG3D: Compositional Generation for Text-to-3D via Gaussian Splatting | Nov 29, 2023 | 3D Generation, Object | Unverified | 0 |
| StructRe: Rewriting for Structured Shape Modeling | Nov 29, 2023 | Object | Unverified | 0 |
| HiDiffusion: Unlocking Higher-Resolution Creativity and Efficiency in Pretrained Diffusion Models | Nov 29, 2023 | Attribute, Image Generation | Unverified | 0 |
| Informal Safety Guarantees for Simulated Optimizers Through Extrapolation from Partial Simulations | Nov 29, 2023 | Language Modeling, Language Modelling | Unverified | 0 |
| Feedback RoI Features Improve Aerial Object Detection | Nov 28, 2023 | feature selection, Object | Unverified | 0 |
| Point'n Move: Interactive Scene Object Manipulation on Gaussian Splatting Radiance Fields | Nov 28, 2023 | Object | Unverified | 0 |
| CLiC: Concept Learning in Context | Nov 28, 2023 | Object | Unverified | 0 |
| Large Model Based Referring Camouflaged Object Detection | Nov 28, 2023 | model, Object | Unverified | 0 |
| DyRA: Portable Dynamic Resolution Adjustment Network for Existing Detectors | Nov 28, 2023 | Object, object-detection | Code Available | 0 |
| Image segmentation with traveling waves in an exactly solvable recurrent neural network | Nov 28, 2023 | Image Segmentation, Object | Unverified | 0 |
| DepthSSC: Monocular 3D Semantic Scene Completion via Depth-Spatial Alignment and Voxel Adaptation | Nov 28, 2023 | 3D Semantic Scene Completion, Autonomous Driving | Unverified | 0 |
| HandyPriors: Physically Consistent Perception of Hand-Object Interactions with Differentiable Priors | Nov 28, 2023 | Human-Object Interaction Detection, Object | Unverified | 0 |
| EVCap: Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension | Nov 27, 2023 | Image Captioning, Object | Unverified | 0 |
| CG-HOI: Contact-Guided 3D Human-Object Interaction Generation | Nov 27, 2023 | Human-Object Interaction Detection, Human-Object Interaction Generation | Unverified | 0 |
| Obj-NeRF: Extract Object NeRFs from Multi-view Images | Nov 26, 2023 | 3D geometry, 3D Reconstruction | Unverified | 0 |
| OpenNet: Incremental Learning for Autonomous Driving Object Detection with Balanced Loss | Nov 25, 2023 | Autonomous Driving, Incremental Learning | Unverified | 0 |
| D-SCo: Dual-Stream Conditional Diffusion for Monocular Hand-Held Object Reconstruction | Nov 23, 2023 | Denoising, Object | Unverified | 0 |
| GS-Pose: Category-Level Object Pose Estimation via Geometric and Semantic Correspondence | Nov 23, 2023 | Object, Pose Estimation | Unverified | 0 |
| P2RBox: Point Prompt Oriented Object Detection with SAM | Nov 22, 2023 | Object, object-detection | Unverified | 0 |