| CityNavAgent: Aerial Vision-and-Language Navigation with Hierarchical Semantic Planning and Global Memory | May 8, 2025 | Large Language ModelNavigate | CodeCode Available | 1 | 5 |
| History Aware Multimodal Transformer for Vision-and-Language Navigation | Oct 25, 2021 | Decision MakingNavigate | CodeCode Available | 1 | 5 |
| HOLMES: Health OnLine Model Ensemble Serving for Deep Learning Models in Intensive Care Units | Aug 10, 2020 | NavigatePrediction | CodeCode Available | 1 | 5 |
| DualCam: A Novel Benchmark Dataset for Fine-grained Real-time Traffic Light Detection | Sep 3, 2022 | NavigateSelf-Driving Cars | CodeCode Available | 1 | 5 |
| How GPT learns layer by layer | Jan 13, 2025 | NavigateRepresentation Learning | CodeCode Available | 1 | 5 |
| CFGPT: Chinese Financial Assistant with Large Language Model | Sep 19, 2023 | Decision MakingFinancial Analysis | CodeCode Available | 1 | 5 |
| Catch Me If You Hear Me: Audio-Visual Navigation in Complex Unmapped Environments with Moving Sounds | Nov 29, 2021 | NavigateVisual Navigation | CodeCode Available | 1 | 5 |
| Embodied BERT: A Transformer Model for Embodied, Language-guided Visual Task Completion | Aug 10, 2021 | NavigateObject | CodeCode Available | 1 | 5 |
| AidUI: Toward Automated Recognition of Dark Patterns in User Interfaces | Mar 12, 2023 | Navigate | CodeCode Available | 1 | 5 |
| Hierarchical Generative Adversarial Imitation Learning with Mid-level Input Generation for Autonomous Driving on Urban Environments | Feb 9, 2023 | Autonomous DrivingAutonomous Navigation | CodeCode Available | 1 | 5 |