| A Fine-Grained Image Description Generation Method Based on Joint Objectives | Sep 2, 2023 | Image DescriptionObject | —Unverified | 0 |
| Discovering Predictive Relational Object Symbols with Symbolic Attentive Layers | Sep 2, 2023 | Object | —Unverified | 0 |
| ObjectLab: Automated Diagnosis of Mislabeled Images in Object Detection Data | Sep 2, 2023 | Autonomous VehiclesObject | —Unverified | 0 |
| Contrastive Grouping with Transformer for Referring Image Segmentation | Sep 2, 2023 | Contrastive LearningImage Segmentation | CodeCode Available | 1 |
| Object-Centric Multiple Object Tracking | Sep 1, 2023 | Multiple Object TrackingObject | CodeCode Available | 1 |
| What Makes Good Open-Vocabulary Detector: A Disassembling Perspective | Sep 1, 2023 | Objectobject-detection | —Unverified | 0 |
| Towards Addressing the Misalignment of Object Proposal Evaluation for Vision-Language Tasks via Semantic Grounding | Sep 1, 2023 | Graph GenerationImage Captioning | CodeCode Available | 0 |
| A Theoretical and Practical Framework for Evaluating Uncertainty Calibration in Object Detection | Sep 1, 2023 | Autonomous DrivingMedical Diagnosis | CodeCode Available | 0 |
| InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion | Aug 31, 2023 | 3D Human DynamicsHuman Dynamics | CodeCode Available | 2 |
| Coarse-to-Fine Amodal Segmentation with Shape Prior | Aug 31, 2023 | ObjectSegmentation | CodeCode Available | 1 |
| PointLLM: Empowering Large Language Models to Understand Point Clouds | Aug 31, 2023 | 3D Object Captioning3D Object Classification | CodeCode Available | 2 |
| Unsupervised Recognition of Unknown Objects for Open-World Object Detection | Aug 31, 2023 | Objectobject-detection | CodeCode Available | 1 |
| SA6D: Self-Adaptive Few-Shot 6D Pose Estimator for Novel and Occluded Objects | Aug 31, 2023 | 6D Pose EstimationObject | —Unverified | 0 |
| SoccerNet 2023 Tracking Challenge -- 3rd place MOT4MOT Team Technical Report | Aug 31, 2023 | Multi-Object TrackingObject | —Unverified | 0 |
| MS23D: A 3D Object Detection Method Using Multi-Scale Semantic Feature Points to Construct 3D Feature Layer | Aug 31, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Occlusion-Aware Detection and Re-ID Calibrated Network for Multi-Object Tracking | Aug 30, 2023 | Multi-Object TrackingObject | —Unverified | 0 |
| Exploring Multi-Modal Contextual Knowledge for Open-Vocabulary Object Detection | Aug 30, 2023 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| WALL-E: Embodied Robotic WAiter Load Lifting with Large Language Model | Aug 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CircleFormer: Circular Nuclei Detection in Whole Slide Images with Circle Queries and Attention | Aug 30, 2023 | DecoderMedical Image Analysis | CodeCode Available | 1 |
| Fusing Pseudo Labels with Weak Supervision for Dynamic Traffic Scenarios | Aug 30, 2023 | Decision MakingObject | —Unverified | 0 |
| On the Robustness of Object Detection Models on Aerial Images | Aug 29, 2023 | Data AugmentationObject | CodeCode Available | 1 |
| Ego-Motion Estimation and Dynamic Motion Separation from 3D Point Clouds for Accumulating Data and Improving 3D Object Detection | Aug 29, 2023 | 3D Object DetectionMotion Estimation | —Unverified | 0 |
| Modeling infant object perception as program induction | Aug 28, 2023 | AttributeInductive Learning | —Unverified | 0 |
| The Interstate-24 3D Dataset: a new benchmark for 3D multi-camera vehicle tracking | Aug 28, 2023 | 3D Object TrackingObject | —Unverified | 0 |
| RobustCLEVR: A Benchmark and Framework for Evaluating Robustness in Object-centric Learning | Aug 28, 2023 | Image GenerationObject | —Unverified | 0 |
| Bridging Cross-task Protocol Inconsistency for Distillation in Dense Object Detection | Aug 28, 2023 | Binary ClassificationClassification | CodeCode Available | 1 |
| Group Regression for Query Based Object Detection and Tracking | Aug 28, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Improving the performance of object detection by preserving label distribution | Aug 28, 2023 | Objectobject-detection | CodeCode Available | 0 |
| Image Coding for Machines with Object Region Learning | Aug 27, 2023 | Image CompressionObject | —Unverified | 0 |
| Nonrigid Object Contact Estimation With Regional Unwrapping Transformer | Aug 27, 2023 | Object | —Unverified | 0 |
| Joint Gaze-Location and Gaze-Object Detection | Aug 26, 2023 | Objectobject-detection | —Unverified | 0 |
| SOGDet: Semantic-Occupancy Guided Multi-view 3D Object Detection | Aug 26, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Integrating Boxes and Masks: A Multi-Object Framework for Unified Visual Tracking and Segmentation | Aug 25, 2023 | ObjectObject Tracking | CodeCode Available | 1 |
| Decoding Natural Images from EEG for Object Recognition | Aug 25, 2023 | Contrastive LearningEEG | CodeCode Available | 1 |
| Data-Side Efficiencies for Lightweight Convolutional Neural Networks | Aug 24, 2023 | image-classificationImage Classification | —Unverified | 0 |
| ROAM: Robust and Object-Aware Motion Generation Using Neural Pose Descriptors | Aug 24, 2023 | Motion GenerationMotion Synthesis | —Unverified | 0 |
| On Offline Evaluation of 3D Object Detection for Autonomous Driving | Aug 24, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| SCoRD: Subject-Conditional Relation Detection with Text-Augmented Data | Aug 24, 2023 | ObjectRelation | CodeCode Available | 0 |
| Learning Heavily-Degraded Prior for Underwater Object Detection | Aug 24, 2023 | Objectobject-detection | CodeCode Available | 1 |
| I3DOD: Towards Incremental 3D Object Detection via Prompting | Aug 24, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Perspective-aware Convolution for Monocular 3D Object Detection | Aug 24, 2023 | 3D Object DetectionAutonomous Driving | CodeCode Available | 0 |
| Computational models of object motion detectors accelerated using FPGA technology | Aug 23, 2023 | Motion DetectionObject | —Unverified | 0 |
| CHORUS: Learning Canonicalized 3D Human-Object Spatial Relations from Unbounded Synthesized Images | Aug 23, 2023 | Common Sense ReasoningDiversity | —Unverified | 0 |
| RefEgo: Referring Expression Comprehension Dataset from First-Person Perception of Ego4D | Aug 23, 2023 | ObjectObject Tracking | CodeCode Available | 1 |
| Blending-NeRF: Text-Driven Localized Editing in Neural Radiance Fields | Aug 23, 2023 | NeRFObject | —Unverified | 0 |
| AMSP-UOD: When Vortex Convolution and Stochastic Perturbation Meet Underwater Object Detection | Aug 23, 2023 | FADObject | CodeCode Available | 1 |
| Opening the Vocabulary of Egocentric Actions | Aug 22, 2023 | Action RecognitionObject | CodeCode Available | 0 |
| Small Object Detection for Birds with Swin Transformer | Aug 22, 2023 | Objectobject-detection | —Unverified | 0 |
| Ensemble Fusion for Small Object Detection | Aug 22, 2023 | Objectobject-detection | —Unverified | 0 |
| Object Detection Difficulty: Suppressing Over-aggregation for Faster and Better Video Object Detection | Aug 22, 2023 | Objectobject-detection | CodeCode Available | 0 |