| Vision-Aware Text Features in Referring Image Segmentation: From Object Understanding to Context Understanding | Apr 12, 2024 | DecoderImage Segmentation | CodeCode Available | 0 |
| IDD-X: A Multi-View Dataset for Ego-relative Important Object Localization and Explanation in Dense and Unstructured Traffic | Apr 12, 2024 | ObjectObject Localization | CodeCode Available | 0 |
| Into the Fog: Evaluating Robustness of Multiple Object Tracking | Apr 12, 2024 | Depth EstimationMonocular Depth Estimation | CodeCode Available | 0 |
| TDANet: Target-Directed Attention Network For Object-Goal Visual Navigation With Zero-Shot Ability | Apr 12, 2024 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Adapting the Segment Anything Model During Usage in Novel Situations | Apr 12, 2024 | Interactive SegmentationObject | —Unverified | 0 |
| Overcoming Scene Context Constraints for Object Detection in wild using Defilters | Apr 12, 2024 | Objectobject-detection | CodeCode Available | 0 |
| Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns | Apr 11, 2024 | 2D Object Detection3D Object Detection | —Unverified | 0 |
| Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange | Apr 11, 2024 | ObjectScene Understanding | CodeCode Available | 0 |
| Simplifying Two-Stage Detectors for On-Device Inference in Remote Sensing | Apr 11, 2024 | Model CompressionObject | —Unverified | 0 |
| RMAFF-PSN: A Residual Multi-Scale Attention Feature Fusion Photometric Stereo Network | Apr 11, 2024 | Object | CodeCode Available | 0 |
| Interactive Learning of Physical Object Properties Through Robot Manipulation and Database of Object Measurements | Apr 10, 2024 | Bayesian InferenceObject | CodeCode Available | 0 |
| Identification of Fine-grained Systematic Errors via Controlled Scene Generation | Apr 10, 2024 | Autonomous DrivingObject | —Unverified | 0 |
| O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation | Apr 10, 2024 | Image SegmentationObject | —Unverified | 0 |
| Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models | Apr 10, 2024 | AttributeObject | —Unverified | 0 |
| Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector | Apr 9, 2024 | Defect DetectionObject | —Unverified | 0 |
| Counting Objects in a Robotic Hand | Apr 9, 2024 | Contrastive LearningObject | —Unverified | 0 |
| Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training | Apr 9, 2024 | Causal Inferencecounterfactual | CodeCode Available | 0 |
| Object Dynamics Modeling with Hierarchical Point Cloud-based Representations | Apr 9, 2024 | Object | —Unverified | 0 |
| Reconstructing Hand-Held Objects in 3D from Images and Videos | Apr 9, 2024 | ObjectObject Reconstruction | —Unverified | 0 |
| LRR: Language-Driven Resamplable Continuous Representation against Adversarial Tracking Attacks | Apr 9, 2024 | ObjectObject Tracking | CodeCode Available | 0 |
| Label-Efficient 3D Object Detection For Road-Side Units | Apr 9, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| A Dataset and Framework for Learning State-invariant Object Representations | Apr 9, 2024 | ObjectObject Recognition | CodeCode Available | 0 |
| Spatial-Temporal Multi-level Association for Video Object Segmentation | Apr 9, 2024 | ObjectSegmentation | —Unverified | 0 |
| HOEG: A New Approach for Object-Centric Predictive Process Monitoring | Apr 8, 2024 | BenchmarkingGraph Neural Network | CodeCode Available | 0 |
| Learning a Category-level Object Pose Estimator without Pose Annotations | Apr 8, 2024 | ObjectPose Estimation | —Unverified | 0 |
| SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing | Apr 8, 2024 | Image GenerationObject | —Unverified | 0 |
| MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues | Apr 8, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Hyperbolic Learning with Synthetic Captions for Open-World Detection | Apr 7, 2024 | HallucinationNovel Concepts | —Unverified | 0 |
| GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling | Apr 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Few-Shot Object Detection: Research Advances and Challenges | Apr 7, 2024 | Few-Shot LearningFew-Shot Object Detection | —Unverified | 0 |
| Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models | Apr 6, 2024 | MMEObject | CodeCode Available | 0 |
| GLCM-Based Feature Combination for Extraction Model Optimization in Object Detection Using Machine Learning | Apr 6, 2024 | Computational EfficiencyModel Optimization | —Unverified | 0 |
| Learning Correlation Structures for Vision Transformers | Apr 5, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| Context-Aware Aerial Object Detection: Leveraging Inter-Object and Background Relationships | Apr 5, 2024 | Objectobject-detection | —Unverified | 0 |
| PreAfford: Universal Affordance-Based Pre-Grasping for Diverse Objects and Environments | Apr 4, 2024 | Object | —Unverified | 0 |
| OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning | Apr 4, 2024 | DescriptiveDiversity | —Unverified | 0 |
| SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Apr 4, 2024 | Grasp GenerationLanguage Modeling | —Unverified | 0 |
| You Only Scan Once: A Dynamic Scene Reconstruction Pipeline for 6-DoF Robotic Grasping of Novel Objects | Apr 4, 2024 | ObjectPose Tracking | —Unverified | 0 |
| BioVL-QR: Egocentric Biochemical Vision-and-Language Dataset Using Micro QR Codes | Apr 4, 2024 | ObjectVideo Understanding | —Unverified | 0 |
| Representation Alignment Contrastive Regularization for Multi-Object Tracking | Apr 3, 2024 | Multi-Object TrackingObject | CodeCode Available | 0 |
| Adjusting Interpretable Dimensions in Embedding Space with Human Judgments | Apr 3, 2024 | Object | —Unverified | 0 |
| Independently Keypoint Learning for Small Object Semantic Correspondence | Apr 3, 2024 | DecoderObject | —Unverified | 0 |
| Ego-Motion Aware Target Prediction Module for Robust Multi-Object Tracking | Apr 3, 2024 | Autonomous DrivingMulti-Object Tracking | CodeCode Available | 0 |
| I-Design: Personalized LLM Interior Designer | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ALOHa: A New Measure for Hallucination in Captioning Models | Apr 3, 2024 | HallucinationObject | —Unverified | 0 |
| GEARS: Local Geometry-aware Hand-object Interaction Synthesis | Apr 2, 2024 | Object | —Unverified | 0 |
| Sparse Semi-DETR: Sparse Learnable Queries for Semi-Supervised Object Detection | Apr 2, 2024 | Objectobject-detection | —Unverified | 0 |
| Task Integration Distillation for Object Detectors | Apr 2, 2024 | Knowledge DistillationObject | —Unverified | 0 |
| LR-FPN: Enhancing Remote Sensing Object Detection with Location Refined Feature Pyramid Network | Apr 2, 2024 | Objectobject-detection | —Unverified | 0 |
| One Noise to Rule Them All: Multi-View Adversarial Attacks with Universal Perturbation | Apr 2, 2024 | 3D Object RecognitionAll | CodeCode Available | 0 |