| Transparent Object Tracking with Enhanced Fusion Module | Sep 13, 2023 | Object, Object Tracking | Code Available | 0 |
| Tree-Structured Shading Decomposition | Sep 13, 2023 | Object | Unverified | 0 |
| Zero-Shot Visual Classification with Guided Cropping | Sep 12, 2023 | Classification, Object | Unverified | 0 |
| Mobile Object Tracking in Panoramic Video and LiDAR for Radiological Source-Object Attribution and Improved Source Detection | Sep 12, 2023 | Object, object-detection | Unverified | 0 |
| Towards High-Quality Specular Highlight Removal by Leveraging Large-Scale Synthetic Data | Sep 12, 2023 | highlight removal, Object | Code Available | 1 |
| Beyond Generation: Harnessing Text to Image Models for Object Detection and Segmentation | Sep 12, 2023 | Image Captioning, Image Generation | Code Available | 1 |
| Grounded Language Acquisition From Object and Action Imagery | Sep 12, 2023 | Action Recognition, Contrastive Learning | Unverified | 0 |
| SCP: Scene Completion Pre-training for 3D Object Detection | Sep 12, 2023 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| SHIFT3D: Synthesizing Hard Inputs For Tricking 3D Detectors | Sep 11, 2023 | Autonomous Driving, Object | Unverified | 0 |
| Mobile Vision Transformer-based Visual Object Tracking | Sep 11, 2023 | GPU, Object | Code Available | 1 |
| Gall Bladder Cancer Detection from US Images with Only Image Level Labels | Sep 11, 2023 | Diagnostic, image-classification | Code Available | 0 |
| ViHOPE: Visuotactile In-Hand Object 6D Pose Estimation with Shape Completion | Sep 11, 2023 | 6D Pose Estimation, Generative Adversarial Network | Unverified | 0 |
| Interactive Class-Agnostic Object Counting | Sep 11, 2023 | Object, Object Counting | Unverified | 0 |
| Learning Geometric Representations of Objects via Interaction | Sep 11, 2023 | Object, Representation Learning | Code Available | 0 |
| Zero-Shot Co-salient Object Detection Framework | Sep 11, 2023 | Co-Salient Object Detection, Object | Code Available | 1 |
| Diffusion-Guided Reconstruction of Everyday Hand-Object Interaction Clips | Sep 11, 2023 | Object | Unverified | 0 |
| Multi3DRefer: Grounding Text Description to Multiple 3D Objects | Sep 11, 2023 | 3D visual grounding, Contrastive Learning | Code Available | 1 |
| MultIOD: Rehearsal-free Multihead Incremental Object Detector | Sep 11, 2023 | class-incremental learning, Class Incremental Learning | Unverified | 0 |
| A Skeleton-based Approach For Rock Crack Detection Towards A Climbing Robot Application | Sep 10, 2023 | Object, Segmentation | Code Available | 0 |
| Transformers in Small Object Detection: A Benchmark and Survey of State-of-the-Art | Sep 10, 2023 | Object, object-detection | Code Available | 1 |
| Reducing the False Positive Rate Using Bayesian Inference in Autonomous Driving Perception | Sep 9, 2023 | Autonomous Driving, Bayesian Inference | Unverified | 0 |
| UnitModule: A Lightweight Joint Image Enhancement Module for Underwater Object Detection | Sep 9, 2023 | Data Augmentation, Image Enhancement | Unverified | 0 |
| SortedAP: Rethinking evaluation metrics for instance segmentation | Sep 9, 2023 | Instance Segmentation, Object | Code Available | 0 |
| Semi-supervised Instance Segmentation with a Learned Shape Prior | Sep 9, 2023 | Cell Segmentation, Instance Segmentation | Unverified | 0 |
| DeNoising-MOT: Towards Multiple Object Tracking with Severe Occlusions | Sep 9, 2023 | Decoder, Denoising | Unverified | 0 |
| Four Ways to Improve Verbo-visual Fusion for Dense 3D Visual Grounding | Sep 8, 2023 | 3D Instance Segmentation, 3D visual grounding | Unverified | 0 |
| Language Prompt for Autonomous Driving | Sep 8, 2023 | Autonomous Driving, Object | Code Available | 1 |
| Unsupervised Object Localization with Representer Point Selection | Sep 8, 2023 | Object, Object Localization | Code Available | 0 |
| Weakly Supervised Point Clouds Transformer for 3D Object Detection | Sep 8, 2023 | 3D Object Detection, Object | Unverified | 0 |
| Enabling energy-Efficient object detection with surrogate gradient descent in spiking neural networks | Sep 7, 2023 | Object, object-detection | Unverified | 0 |
| ArtiGrasp: Physically Plausible Synthesis of Bi-Manual Dexterous Grasping and Articulation | Sep 7, 2023 | Diversity, hand-object pose | Unverified | 0 |
| SimNP: Learning Self-Similarity Priors Between Neural Points | Sep 7, 2023 | 3D Object Reconstruction, Object | Unverified | 0 |
| Temporal Collection and Distribution for Referring Video Object Segmentation | Sep 7, 2023 | Object, Referring Video Object Segmentation | Unverified | 0 |
| Sparse Federated Training of Object Detection in the Internet of Vehicles | Sep 7, 2023 | Federated Learning, Object | Unverified | 0 |
| Sparse 3D Reconstruction via Object-Centric Ray Sampling | Sep 6, 2023 | 3D Object Reconstruction, 3D Reconstruction | Code Available | 0 |
| FishMOT: A Simple and Effective Method for Fish Tracking Based on IoU Matching | Sep 6, 2023 | Fish Detection, Multi-Object Tracking | Code Available | 1 |
| Dynamic Hyperbolic Attention Network for Fine Hand-object Reconstruction | Sep 6, 2023 | Object, Object Reconstruction | Unverified | 0 |
| Fast and Resource-Efficient Object Tracking on Edge Devices: A Measurement Study | Sep 6, 2023 | Multi-Object Tracking, Object | Code Available | 1 |
| 3D Object Positioning Using Differentiable Multimodal Learning | Sep 6, 2023 | Autonomous Vehicles, Object | Unverified | 0 |
| Diffusion Model is Secretly a Training-free Open Vocabulary Semantic Segmenter | Sep 6, 2023 | Contrastive Learning, Denoising | Code Available | 1 |
| Vote2Cap-DETR++: Decoupling Localization and Describing for End-to-End 3D Dense Captioning | Sep 6, 2023 | 3D dense captioning, Caption Generation | Code Available | 1 |
| Iterative Superquadric Recomposition of 3D Objects from Multiple Views | Sep 5, 2023 | Inductive Bias, Object | Code Available | 1 |
| DR-Pose: A Two-stage Deformation-and-Registration Pipeline for Category-level 6D Object Pose Estimation | Sep 5, 2023 | 6D Pose Estimation using RGB, Object | Code Available | 1 |
| Physically Grounded Vision-Language Models for Robotic Manipulation | Sep 5, 2023 | Image Captioning, Language Modelling | Unverified | 0 |
| Dense Object Grounding in 3D Scenes | Sep 5, 2023 | Autonomous Driving, Decoder | Unverified | 0 |
| Diffusion-based 3D Object Detection with Random Boxes | Sep 5, 2023 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| SATAY: A Streaming Architecture Toolflow for Accelerating YOLO Models on FPGA Devices | Sep 4, 2023 | Autonomous Vehicles, GPU | Unverified | 0 |
| Semantic-Constraint Matching Transformer for Weakly Supervised Object Localization | Sep 4, 2023 | Object, Object Localization | Unverified | 0 |
| CoTDet: Affordance Knowledge Prompting for Task Driven Object Detection | Sep 3, 2023 | Object, object-detection | Unverified | 0 |
| EdaDet: Open-Vocabulary Object Detection Using Early Dense Alignment | Sep 3, 2023 | Object, object-detection | Unverified | 0 |