| Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models | Apr 11, 2024 | AttributeObject | CodeCode Available | 1 |
| Run-time Monitoring of 3D Object Detection in Automated Driving Systems Using Early Layer Neural Activation Patterns | Apr 11, 2024 | 2D Object Detection3D Object Detection | —Unverified | 0 |
| Mitigating Object Dependencies: Improving Point Cloud Self-Supervised Learning through Object Exchange | Apr 11, 2024 | ObjectScene Understanding | CodeCode Available | 0 |
| Interactive Learning of Physical Object Properties Through Robot Manipulation and Database of Object Measurements | Apr 10, 2024 | Bayesian InferenceObject | CodeCode Available | 0 |
| Object-Conditioned Energy-Based Attention Map Alignment in Text-to-Image Diffusion Models | Apr 10, 2024 | AttributeObject | —Unverified | 0 |
| O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation | Apr 10, 2024 | Image SegmentationObject | —Unverified | 0 |
| Identification of Fine-grained Systematic Errors via Controlled Scene Generation | Apr 10, 2024 | Autonomous DrivingObject | —Unverified | 0 |
| Counting Objects in a Robotic Hand | Apr 9, 2024 | Contrastive LearningObject | —Unverified | 0 |
| Reconstructing Hand-Held Objects in 3D from Images and Videos | Apr 9, 2024 | ObjectObject Reconstruction | —Unverified | 0 |
| Automatic Defect Detection in Sewer Network Using Deep Learning Based Object Detector | Apr 9, 2024 | Defect DetectionObject | —Unverified | 0 |
| Object Dynamics Modeling with Hierarchical Point Cloud-based Representations | Apr 9, 2024 | Object | —Unverified | 0 |
| A Dataset and Framework for Learning State-invariant Object Representations | Apr 9, 2024 | ObjectObject Recognition | CodeCode Available | 0 |
| YOLC: You Only Look Clusters for Tiny Object Detection in Aerial Images | Apr 9, 2024 | Objectobject-detection | CodeCode Available | 2 |
| ZeST: Zero-Shot Material Transfer from a Single Image | Apr 9, 2024 | Appearance TransferObject | CodeCode Available | 3 |
| Counterfactual Reasoning for Multi-Label Image Classification via Patching-Based Training | Apr 9, 2024 | Causal Inferencecounterfactual | CodeCode Available | 0 |
| Learning Embeddings with Centroid Triplet Loss for Object Identification in Robotic Grasping | Apr 9, 2024 | Image RetrievalObject | CodeCode Available | 2 |
| Spatial-Temporal Multi-level Association for Video Object Segmentation | Apr 9, 2024 | ObjectSegmentation | —Unverified | 0 |
| LRR: Language-Driven Resamplable Continuous Representation against Adversarial Tracking Attacks | Apr 9, 2024 | ObjectObject Tracking | CodeCode Available | 0 |
| Label-Efficient 3D Object Detection For Road-Side Units | Apr 9, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| HOEG: A New Approach for Object-Centric Predictive Process Monitoring | Apr 8, 2024 | BenchmarkingGraph Neural Network | CodeCode Available | 0 |
| MOSE: Boosting Vision-based Roadside 3D Object Detection with Scene Cues | Apr 8, 2024 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Detecting Every Object from Events | Apr 8, 2024 | Autonomous DrivingClass-agnostic Object Detection | CodeCode Available | 1 |
| DepthMOT: Depth Cues Lead to a Strong Multi-Object Tracker | Apr 8, 2024 | Camera Pose EstimationMulti-Object Tracking | CodeCode Available | 1 |
| Learning a Category-level Object Pose Estimator without Pose Annotations | Apr 8, 2024 | ObjectPose Estimation | —Unverified | 0 |
| Self-Supervised Multi-Object Tracking with Path Consistency | Apr 8, 2024 | Multi-Object TrackingObject | CodeCode Available | 1 |
| Retrieval-Augmented Open-Vocabulary Object Detection | Apr 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SwapAnything: Enabling Arbitrary Object Swapping in Personalized Visual Editing | Apr 8, 2024 | Image GenerationObject | —Unverified | 0 |
| Joint Reconstruction of 3D Human and Object via Contact-Based Refinement Transformer | Apr 7, 2024 | 3D Human Reconstruction3D Object Reconstruction | CodeCode Available | 2 |
| Few-Shot Object Detection: Research Advances and Challenges | Apr 7, 2024 | Few-Shot LearningFew-Shot Object Detection | —Unverified | 0 |
| MonoTAKD: Teaching Assistant Knowledge Distillation for Monocular 3D Object Detection | Apr 7, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 1 |
| Hyperbolic Learning with Synthetic Captions for Open-World Detection | Apr 7, 2024 | HallucinationNovel Concepts | —Unverified | 0 |
| GenEARL: A Training-Free Generative Framework for Multimodal Event Argument Role Labeling | Apr 7, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| GLCM-Based Feature Combination for Extraction Model Optimization in Object Detection Using Machine Learning | Apr 6, 2024 | Computational EfficiencyModel Optimization | —Unverified | 0 |
| Joint Visual and Text Prompting for Improved Object-Centric Perception with Multimodal Large Language Models | Apr 6, 2024 | MMEObject | CodeCode Available | 0 |
| Learning Correlation Structures for Vision Transformers | Apr 5, 2024 | Action ClassificationAction Recognition | —Unverified | 0 |
| SCAResNet: A ResNet Variant Optimized for Tiny Object Detection in Transmission and Distribution Towers | Apr 5, 2024 | 2D Object Detection2D Tiny Object Detection | CodeCode Available | 1 |
| Context-Aware Aerial Object Detection: Leveraging Inter-Object and Background Relationships | Apr 5, 2024 | Objectobject-detection | —Unverified | 0 |
| DQ-DETR: DETR with Dynamic Query for Tiny Object Detection | Apr 4, 2024 | Objectobject-detection | CodeCode Available | 2 |
| You Only Scan Once: A Dynamic Scene Reconstruction Pipeline for 6-DoF Robotic Grasping of Novel Objects | Apr 4, 2024 | ObjectPose Tracking | —Unverified | 0 |
| Is CLIP the main roadblock for fine-grained open-world perception? | Apr 4, 2024 | Autonomous DrivingNovel Concepts | CodeCode Available | 2 |
| OW-VISCapTor: Abstractors for Open-World Video Instance Segmentation and Captioning | Apr 4, 2024 | DescriptiveDiversity | —Unverified | 0 |
| PreAfford: Universal Affordance-Based Pre-Grasping for Diverse Objects and Environments | Apr 4, 2024 | Object | —Unverified | 0 |
| SemGrasp: Semantic Grasp Generation via Language Aligned Discretization | Apr 4, 2024 | Grasp GenerationLanguage Modeling | —Unverified | 0 |
| BioVL-QR: Egocentric Biochemical Vision-and-Language Dataset Using Micro QR Codes | Apr 4, 2024 | ObjectVideo Understanding | —Unverified | 0 |
| MonoCD: Monocular 3D Object Detection with Complementary Depths | Apr 4, 2024 | 3D Object DetectionDepth Estimation | CodeCode Available | 2 |
| Representation Alignment Contrastive Regularization for Multi-Object Tracking | Apr 3, 2024 | Multi-Object TrackingObject | CodeCode Available | 0 |
| Ego-Motion Aware Target Prediction Module for Robust Multi-Object Tracking | Apr 3, 2024 | Autonomous DrivingMulti-Object Tracking | CodeCode Available | 0 |
| Adjusting Interpretable Dimensions in Embedding Space with Human Judgments | Apr 3, 2024 | Object | —Unverified | 0 |
| Independently Keypoint Learning for Small Object Semantic Correspondence | Apr 3, 2024 | DecoderObject | —Unverified | 0 |
| I-Design: Personalized LLM Interior Designer | Apr 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |