| GiVE: Guiding Visual Encoder to Perceive Overlooked Information | Oct 26, 2024 | Object, Question Answering | Unverified | 0 |
| Non-rigid Relative Placement through 3D Dense Diffusion | Oct 25, 2024 | Object, Robot Manipulation | Unverified | 0 |
| Semantics in Robotics: Environmental Data Can't Yield Conventions of Human Behaviour | Oct 25, 2024 | Object | Unverified | 0 |
| IPPON: Common Sense Guided Informative Path Planning for Object Goal Navigation | Oct 25, 2024 | Common Sense Reasoning, Language Modeling | Unverified | 0 |
| DECADE: Towards Designing Efficient-yet-Accurate Distance Estimation Modules for Collision Avoidance in Mobile Advanced Driver Assistance Systems | Oct 25, 2024 | 3D Object Detection, Collision Avoidance | Unverified | 0 |
| Radar and Camera Fusion for Object Detection and Tracking: A Comprehensive Survey | Oct 24, 2024 | Object, object-detection | Unverified | 0 |
| Dynamic 3D Gaussian Tracking for Graph-Based Neural Dynamics Modeling | Oct 24, 2024 | 3DGS, Object | Unverified | 0 |
| Zero-shot Object Navigation with Vision-Language Models Reasoning | Oct 24, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Learning Global Object-Centric Representations via Disentangled Slot Attention | Oct 24, 2024 | Object, Position | Unverified | 0 |
| YOLOv11: An Overview of the Key Architectural Enhancements | Oct 23, 2024 | Computational Efficiency, Instance Segmentation | Code Available | 0 |
| YOLO-Vehicle-Pro: A Cloud-Edge Collaborative Framework for Object Detection in Autonomous Driving under Adverse Weather Conditions | Oct 23, 2024 | Autonomous Driving, Image Dehazing | Unverified | 0 |
| Towards Real Zero-Shot Camouflaged Object Segmentation without Camouflaged Annotations | Oct 22, 2024 | Camouflaged Object Segmentation, Large Language Model | Code Available | 0 |
| Multi Kernel Estimation based Object Segmentation | Oct 22, 2024 | Object, Segmentation | Code Available | 0 |
| SINGAPO: Single Image Controlled Generation of Articulated Parts in Objects | Oct 21, 2024 | Object | Unverified | 0 |
| Online Pseudo-Label Unified Object Detection for Multiple Datasets Training | Oct 21, 2024 | Object, object-detection | Unverified | 0 |
| Few-shot target-driven instance detection based on open-vocabulary object detection models | Oct 21, 2024 | Image Augmentation, Object | Unverified | 0 |
| Deep Learning and Machine Learning -- Object Detection and Semantic Segmentation: From Theory to Applications | Oct 21, 2024 | Deep Learning, Model Optimization | Unverified | 0 |
| Object-Centric Temporal Consistency via Conditional Autoregressive Inductive Biases | Oct 21, 2024 | Object, Question Answering | Unverified | 0 |
| Joint Top-Down and Bottom-Up Frameworks for 3D Visual Grounding | Oct 21, 2024 | 3D visual grounding, Object | Unverified | 0 |
| Open-vocabulary vs. Closed-set: Best Practice for Few-shot Object Detection Considering Text Describability | Oct 20, 2024 | Few-Shot Object Detection, image-classification | Code Available | 0 |
| GRS: Generating Robotic Simulation Tasks from Real-World Images | Oct 20, 2024 | Object, Semantic Segmentation | Unverified | 0 |
| SOCIAL MEDIA MANAGEMENT SYSTEM PROJECT REPORT | Oct 20, 2024 | Management, Object | Unverified | 0 |
| 3D Multi-Object Tracking Employing MS-GLMB Filter for Autonomous Driving | Oct 19, 2024 | 3D Multi-Object Tracking, Autonomous Driving | Code Available | 0 |
| Skill Generalization with Verbs | Oct 18, 2024 | Object, Skill Generalization | Unverified | 0 |
| Interpretable end-to-end Neurosymbolic Reinforcement Learning agents | Oct 18, 2024 | Atari Games, Deep Reinforcement Learning | Unverified | 0 |
| Accelerating Object Detection with YOLOv4 for Real-Time Applications | Oct 17, 2024 | Object, object-detection | Unverified | 0 |
| Spatiotemporal Object Detection for Improved Aerial Vehicle Detection in Traffic Monitoring | Oct 17, 2024 | Object, object-detection | Unverified | 0 |
| Object Pose Estimation Using Implicit Representation For Transparent Objects | Oct 17, 2024 | NeRF, Object | Unverified | 0 |
| Generative Location Modeling for Spatially Aware Object Insertion | Oct 17, 2024 | Object | Unverified | 0 |
| GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction | Oct 17, 2024 | Human-Object Interaction Detection, Image Generation | Unverified | 0 |
| Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | All, Language Modeling | Code Available | 0 |
| Hiding-in-Plain-Sight (HiPS) Attack on CLIP for Targetted Object Removal from Images | Oct 16, 2024 | Image Captioning, Object | Unverified | 0 |
| Cocoon: Robust Multi-Modal Perception with Uncertainty-Aware Sensor Fusion | Oct 16, 2024 | 3D Object Detection, Object | Unverified | 0 |
| Optimizing YOLOv5s Object Detection through Knowledge Distillation algorithm | Oct 16, 2024 | Knowledge Distillation, Object | Unverified | 0 |
| Stable Object Placement Planning From Contact Point Robustness | Oct 16, 2024 | Object | Unverified | 0 |
| MambaBEV: An efficient 3D detection model with Mamba2 | Oct 16, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| Fractal Calibration for long-tailed object detection | Oct 15, 2024 | Instance Segmentation, Long-tailed Object Detection | Code Available | 0 |
| Jigsaw++: Imagining Complete Shape Priors for Object Reassembly | Oct 15, 2024 | Object | Unverified | 0 |
| Visual-Geometric Collaborative Guidance for Affordance Learning | Oct 15, 2024 | Human-Object Interaction Detection, Object | Code Available | 0 |
| SGEdit: Bridging LLM with Text2Image Generative Model for Scene Graph-based Image Editing | Oct 15, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Out-of-Bounding-Box Triggers: A Stealthy Approach to Cheat Object Detectors | Oct 14, 2024 | Adversarial Robustness, Object | Code Available | 0 |
| UAV3D: A Large-scale 3D Perception Benchmark for Unmanned Aerial Vehicles | Oct 14, 2024 | 3D Object Detection, Object | Unverified | 0 |
| DINTR: Tracking via Diffusion-based Interpolation | Oct 14, 2024 | Object, Object Tracking | Unverified | 0 |
| Data-Driven Approaches for Modelling Target Behaviour | Oct 14, 2024 | Gaussian Processes, Object | Unverified | 0 |
| Block-to-Scene Pre-training for Point Cloud Hybrid-Domain Masked Autoencoders | Oct 13, 2024 | Object, Position regression | Unverified | 0 |
| VideoSAM: Open-World Video Segmentation | Oct 11, 2024 | Autonomous Driving, Decoder | Unverified | 0 |
| VOVTrack: Exploring the Potentiality in Videos for Open-Vocabulary Object Tracking | Oct 11, 2024 | Multi-Object Tracking, Object | Unverified | 0 |
| HeightFormer: A Semantic Alignment Monocular 3D Object Detection Method from Roadside Perspective | Oct 10, 2024 | 3D Object Detection, Autonomous Driving | Unverified | 0 |
| FusionSense: Bridging Common Sense, Vision, and Touch for Robust Sparse-View Reconstruction | Oct 10, 2024 | 3D Reconstruction, Common Sense Reasoning | Unverified | 0 |
| SG-Nav: Online 3D Scene Graph Prompting for LLM-based Zero-shot Object Navigation | Oct 10, 2024 | Object | Unverified | 0 |