| TransGOP: Transformer-Based Gaze Object Prediction | Feb 21, 2024 | Gaze EstimationObject | CodeCode Available | 1 |
| Weakly Supervised Object Detection in Chest X-Rays with Differentiable ROI Proposal Networks and Soft ROI Pooling | Feb 19, 2024 | image-classificationImage Classification | CodeCode Available | 1 |
| UncertaintyTrack: Exploiting Detection and Localization Uncertainty in Multi-Object Tracking | Feb 19, 2024 | Autonomous DrivingMulti-Object Tracking | CodeCode Available | 1 |
| LangXAI: Integrating Large Vision Models for Generating Textual Explanations to Enhance Explainability in Visual Perception Tasks | Feb 19, 2024 | Explainable artificial intelligenceExplainable Artificial Intelligence (XAI) | CodeCode Available | 1 |
| LiRaFusion: Deep Adaptive LiDAR-Radar Fusion for 3D Object Detection | Feb 18, 2024 | 3D Object Detectionobject-detection | CodeCode Available | 1 |
| GraphKD: Exploring Knowledge Distillation Towards Document Object Detection with Structured Graph Creation | Feb 17, 2024 | Knowledge Distillationobject-detection | CodeCode Available | 1 |
| ReViT: Enhancing Vision Transformers Feature Diversity with Attention Residual Connections | Feb 17, 2024 | Diversityimage-classification | CodeCode Available | 1 |
| LLMs as Bridges: Reformulating Grounded Multimodal Named Entity Recognition | Feb 15, 2024 | Grounded Multimodal Named Entity RecognitionMulti-modal Named Entity Recognition | CodeCode Available | 1 |
| Efficient One-stage Video Object Detection by Exploiting Temporal Consistency | Feb 14, 2024 | object-detectionObject Detection | CodeCode Available | 1 |
| Switch EMA: A Free Lunch for Better Flatness and Sharpness | Feb 14, 2024 | Attributeimage-classification | CodeCode Available | 1 |