| Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Dec 10, 2024 | Autonomous Driving, Descriptive | Code Available | 0 |
| Cardiometabolic Risk Factors in South Asians: An Epidemiological and Anthropological Study in an Urban Populace of Eastern India | Dec 8, 2024 | Descriptive | Unverified | 0 |
| Language-Guided Image Tokenization for Generation | Dec 8, 2024 | Descriptive, Image Generation | Unverified | 0 |
| ProtDAT: A Unified Framework for Protein Sequence Design from Any Protein Text Description | Dec 5, 2024 | Descriptive, Protein Design | Unverified | 0 |
| FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression | Dec 5, 2024 | Descriptive, Visual Question Answering | Code Available | 2 |
| Scaling Inference-Time Search with Vision Value Model for Improved Visual Comprehension | Dec 4, 2024 | Descriptive, Language Modeling | Code Available | 1 |
| Remote Sensing Temporal Vision-Language Models: A Comprehensive Survey | Dec 3, 2024 | Change Detection, Descriptive | Code Available | 3 |
| Analyzing the Impact of AI Tools on Student Study Habits and Academic Performance | Dec 3, 2024 | Descriptive | Unverified | 0 |
| EventGPT: Event Stream Understanding with Multimodal Large Language Models | Dec 1, 2024 | Descriptive | Unverified | 0 |
| SelfPrompt: Autonomously Evaluating LLM Robustness via Domain-Constrained Knowledge Guidelines and Refined Adversarial Prompts | Dec 1, 2024 | Descriptive, Knowledge Graphs | Unverified | 0 |