| Continuous Video Process: Modeling Videos as Continuous Multi-Dimensional Processes for Video Prediction | Dec 6, 2024 | Image GenerationNavigate | —Unverified | 0 |
| Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction | Dec 5, 2024 | Multimodal ReasoningNatural Language Visual Grounding | CodeCode Available | 3 |
| NaVILA: Legged Robot Vision-Language-Action Model for Navigation | Dec 5, 2024 | NavigateVision and Language Navigation | —Unverified | 0 |
| Using Cooperative Co-evolutionary Search to Generate Metamorphic Test Cases for Autonomous Driving Systems | Dec 5, 2024 | Autonomous DrivingEvolutionary Algorithms | —Unverified | 0 |
| Exploring the Role of AI-Powered Chatbots for Teens and Young Adults with ASD or Social Anxiety | Dec 4, 2024 | ChatbotNavigate | —Unverified | 0 |
| ObjectFinder: An Open-Vocabulary Assistive System for Interactive Object Search by Blind People | Dec 4, 2024 | Large Language ModelMultimodal Large Language Model | —Unverified | 0 |
| FANAL -- Financial Activity News Alerting Language Modeling Framework | Dec 4, 2024 | Event DetectionLanguage Modeling | —Unverified | 0 |
| Deep Learning, Machine Learning, Advancing Big Data Analytics and Management | Dec 3, 2024 | Anomaly DetectionDeep Learning | —Unverified | 0 |
| Single-Shot Metric Depth from Focused Plenoptic Cameras | Dec 3, 2024 | Depth EstimationNavigate | —Unverified | 0 |
| Best Practices for Large Language Models in Radiology | Dec 2, 2024 | Navigate | —Unverified | 0 |