| Cross-Attention Head Position Patterns Can Align with Human Visual Concepts in Text-to-Image Generative Models | Dec 3, 2024 | Image Generation, Position | Code Available | 1 |
| AI Meets Antimatter: Unveiling Antihydrogen Annihilations | Dec 1, 2024 | Deep Learning, Position | Unverified | 0 |
| Multi-scale Vehicle Localization In Heterogeneous Mobile Communication Networks | Dec 1, 2024 | Position | Unverified | 0 |
| PGSO: Prompt-based Generative Sequence Optimization Network for Aspect-based Sentiment Analysis | Dec 1, 2024 | Aspect-Based Sentiment Analysis (ABSA) | Unverified | 0 |
| Video-3D LLM: Learning Position-Aware Video Representation for 3D Scene Understanding | Nov 30, 2024 | 3D Question Answering (3D-QA), Position | Code Available | 0 |
| Prognostic Framework for Robotic Manipulators Operating Under Dynamic Task Severities | Nov 30, 2024 | Position | Unverified | 0 |
| MonoPP: Metric-Scaled Self-Supervised Monocular Depth Estimation by Planar-Parallax Geometry in Automotive Applications | Nov 29, 2024 | Depth Estimation, Depth Prediction | Unverified | 0 |
| On the Ethical Considerations of Generative Agents | Nov 28, 2024 | Position | Unverified | 0 |
| Grid-augmented vision: A simple yet effective approach for enhanced spatial understanding in multi-modal agents | Nov 27, 2024 | Autonomous Navigation, Object Recognition | Code Available | 0 |
| Enhancing Character-Level Understanding in LLMs through Token Internal Structure Learning | Nov 26, 2024 | Computational Efficiency, Position | Code Available | 0 |