| Audio-Visual Segmentation by Exploring Cross-Modal Mutual Semantics | Jul 31, 2023 | Object, Segmentation | Unverified | 0 |
| Bridging the Gap: Exploring the Capabilities of Bridge-Architectures for Complex Visual Reasoning Tasks | Jul 31, 2023 | Image Retrieval, Object | Unverified | 0 |
| Detecting Out-of-distribution Objects Using Neuron Activation Patterns | Jul 31, 2023 | Autonomous Vehicles, Object | Code Available | 0 |
| A Modular Ontology for MODS -- Metadata Object Description Schema | Jul 31, 2023 | Knowledge Graphs, Object | Unverified | 0 |
| Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving | Jul 30, 2023 | Autonomous Driving, Mixture-of-Experts | Unverified | 0 |
| Implementing Edge Based Object Detection For Microplastic Debris | Jul 30, 2023 | Object, object-detection | Unverified | 0 |
| Enhancing Object Detection in Ancient Documents with Synthetic Data Generation and Transformer-Based Models | Jul 29, 2023 | Object, object-detection | Unverified | 0 |
| Generalized Open-World Semi-Supervised Object Detection | Jul 28, 2023 | Object, object-detection | Unverified | 0 |
| TrackAgent: 6D Object Tracking via Reinforcement Learning | Jul 28, 2023 | Object, Object Tracking | Unverified | 0 |
| Aligned Unsupervised Pretraining of Object Detectors with Self-training | Jul 28, 2023 | Few-Shot Object Detection, Object | Unverified | 0 |
| The detection and rectification for identity-switch based on unfalsified control | Jul 27, 2023 | Multi-Object Tracking, Object | Unverified | 0 |
| A Memory-Augmented Multi-Task Collaborative Framework for Unsupervised Traffic Accident Detection in Driving Videos | Jul 27, 2023 | Autonomous Driving, Object | Unverified | 0 |
| YOLOBench: Benchmarking Efficient Object Detectors on Embedded Systems | Jul 26, 2023 | Benchmarking, CPU | Code Available | 0 |
| Cos R-CNN for Online Few-shot Object Detection | Jul 25, 2023 | Few-Shot Object Detection, Object | Unverified | 0 |
| 3DRP-Net: 3D Relative Position-aware Network for 3D Visual Grounding | Jul 25, 2023 | 3D visual grounding, Object | Unverified | 0 |
| Learning Transferable Object-Centric Diffeomorphic Transformations for Data Augmentation in Medical Image Segmentation | Jul 25, 2023 | Data Augmentation, Image Segmentation | Unverified | 0 |
| LIST: Learning Implicitly from Spatial Transformers for Single-View 3D Reconstruction | Jul 23, 2023 | 3D Reconstruction, Object | Code Available | 0 |
| TransNet: Transparent Object Manipulation Through Category-Level Pose Estimation | Jul 23, 2023 | Depth Completion, Object | Unverified | 0 |
| Challenges for Monocular 6D Object Pose Estimation in Robotics | Jul 22, 2023 | 6D Pose Estimation using RGB, Object | Unverified | 0 |
| Leveraging Knowledge Graphs for Zero-Shot Object-agnostic State Classification | Jul 22, 2023 | Attribute, Classification | Unverified | 0 |
| Spatial Self-Distillation for Object Detection with Inaccurate Bounding Boxes | Jul 22, 2023 | Multiple Instance Learning, Object | Unverified | 0 |
| R2Det: Redemption from Range-view for Accurate 3D Object Detection | Jul 21, 2023 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| YOLOPose V2: Understanding and Improving Transformer-based 6D Pose Estimation | Jul 21, 2023 | 6D Pose Estimation, 6D Pose Estimation using RGB | Unverified | 0 |
| SCA-PVNet: Self-and-Cross Attention Based Aggregation of Point Cloud and Multi-View for 3D Object Retrieval | Jul 20, 2023 | 3D Object Retrieval, Object | Unverified | 0 |
| A novel integrated method of detection-grasping for specific object based on the box coordinate matching | Jul 20, 2023 | Instance Segmentation, Object | Unverified | 0 |
| Improving Online Lane Graph Extraction by Object-Lane Clustering | Jul 20, 2023 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Mining Conditional Part Semantics with Occluded Extrapolation for Human-Object Interaction Detection | Jul 19, 2023 | Human-Object Interaction Detection, Object | Unverified | 0 |
| Online Continual Learning for Robust Indoor Object Recognition | Jul 19, 2023 | Continual Learning, Object | Unverified | 0 |
| Divert More Attention to Vision-Language Object Tracking | Jul 19, 2023 | Attribute, Object | Unverified | 0 |
| Attacking by Aligning: Clean-Label Backdoor Attacks on Object Detection | Jul 19, 2023 | Autonomous Driving, Backdoor Attack | Code Available | 0 |
| Conditional 360-degree Image Synthesis for Immersive Indoor Scene Decoration | Jul 18, 2023 | Generative Adversarial Network, Image Generation | Unverified | 0 |
| Grounded Object Centric Learning | Jul 18, 2023 | Object, Object Discovery | Unverified | 0 |
| Learning Dynamic Attribute-factored World Models for Efficient Multi-object Reinforcement Learning | Jul 18, 2023 | Attribute, Object | Unverified | 0 |
| Rethinking Intersection Over Union for Small Object Detection in Few-Shot Regime | Jul 17, 2023 | Few-Shot Object Detection, Object | Unverified | 0 |
| Multi-Task Cross-Modality Attention-Fusion for 2D Object Detection | Jul 17, 2023 | 2D Object Detection, Autonomous Driving | Unverified | 0 |
| ROFusion: Efficient Object Detection using Hybrid Point-wise Radar-Optical Fusion | Jul 17, 2023 | Autonomous Driving, Object | Code Available | 0 |
| LiDAR-BEVMTN: Real-Time LiDAR Bird's-Eye View Multi-Task Perception Network for Autonomous Driving | Jul 17, 2023 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Does Visual Pretraining Help End-to-End Reasoning? | Jul 17, 2023 | image-classification, Image Classification | Unverified | 0 |
| Active Learning for Object Detection with Non-Redundant Informative Sampling | Jul 17, 2023 | Active Learning, Diversity | Unverified | 0 |
| Hierarchical Spatiotemporal Transformers for Video Object Segmentation | Jul 17, 2023 | Inductive Bias, Object | Unverified | 0 |
| CVSformer: Cross-View Synthesis Transformer for Semantic Scene Completion | Jul 16, 2023 | Object | Unverified | 0 |
| Multi-Object Discovery by Low-Dimensional Object Motion | Jul 16, 2023 | Depth Estimation, Monocular Depth Estimation | Unverified | 0 |
| Enforcing 3D Topological Constraints in Composite Objects via Implicit Functions | Jul 16, 2023 | 3D Object Reconstruction, 3D Reconstruction | Unverified | 0 |
| Learning from Pseudo-labeled Segmentation for Multi-Class Object Counting | Jul 15, 2023 | Object, Object Counting | Unverified | 0 |
| RFLA: A Stealthy Reflected Light Adversarial Attack in the Physical World | Jul 14, 2023 | Adversarial Attack, Object | Code Available | 0 |
| PiTL: Cross-modal Retrieval with Weakly-supervised Vision-language Pre-training via Prompting | Jul 14, 2023 | Cross-Modal Retrieval, Image to text | Unverified | 0 |
| Deteksi Sampah di Permukaan dan Dalam Perairan pada Objek Video dengan Metode Robust and Efficient Post-Processing dan Tubelet-Level Bounding Box Linking | Jul 14, 2023 | Management, Object | Unverified | 0 |
| Switching Head-Tail Funnel UNITER for Dual Referring Expression Comprehension with Fetch-and-Carry Tasks | Jul 14, 2023 | Object, Referring Expression | Unverified | 0 |
| MPDIoU: A Loss for Efficient and Accurate Bounding Box Regression | Jul 14, 2023 | Instance Segmentation, Object | Unverified | 0 |
| Multimodal Object Detection in Remote Sensing | Jul 13, 2023 | Object, object-detection | Unverified | 0 |