| AltGen: AI-Driven Alt Text Generation for Enhancing EPUB Accessibility | Dec 30, 2024 | Descriptive, Text Generation | Unverified | 0 |
| Is Your Text-to-Image Model Robust to Caption Noise? | Dec 27, 2024 | Descriptive, Hallucination | Unverified | 0 |
| Multi-Agent Norm Perception and Induction in Distributed Healthcare | Dec 24, 2024 | Descriptive | Unverified | 0 |
| Underutilization of Syntactic Processing by Chinese Learners of English in Comprehending English Sentences, Evidenced from Adapted Garden-Path Ambiguity Experiment | Dec 21, 2024 | Descriptive, Sentence | Unverified | 0 |
| TalkWithMachines: Enhancing Human-Robot Interaction for Interpretable Industrial Robotics Through Large/Vision Language Models | Dec 19, 2024 | Descriptive | Unverified | 0 |
| Descriptive Caption Enhancement with Visual Specialists for Multimodal Perception | Dec 18, 2024 | Descriptive, Human-Object Interaction Detection | Code Available | 0 |
| Real Classification by Description: Extending CLIP's Limits of Part Attributes Recognition | Dec 18, 2024 | Attribute, Descriptive | Code Available | 0 |
| JoVALE: Detecting Human Actions in Video Using Audiovisual and Language Contexts | Dec 18, 2024 | Action Detection, Descriptive | Code Available | 0 |
| SEKE: Specialised Experts for Keyword Extraction | Dec 18, 2024 | Descriptive, Keyword Extraction | Code Available | 0 |
| Digital Transformation in Switzerland: The Current State and Expectations | Dec 17, 2024 | Descriptive, Self-Learning | Unverified | 0 |
| Organizational culture and the usage of Industry 4.0 technologies: evidence from Swiss businesses | Dec 17, 2024 | Descriptive | Unverified | 0 |
| Implicit Location-Caption Alignment via Complementary Masking for Weakly-Supervised Dense Video Captioning | Dec 17, 2024 | Dense Video Captioning, Descriptive | Code Available | 0 |
| Is it the end of (generative) linguistics as we know it? | Dec 17, 2024 | Descriptive, POS | Unverified | 0 |
| Semi-automated analysis of audio-recorded lessons: The case of teachers' engaging messages | Dec 16, 2024 | Descriptive | Unverified | 0 |
| CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Dec 16, 2024 | Descriptive, Math | Code Available | 0 |
| Multilingual and Explainable Text Detoxification with Parallel Corpora | Dec 16, 2024 | Descriptive, Style Transfer | Code Available | 0 |
| Bridging Vision and Language: Modeling Causality and Temporality in Video Narratives | Dec 14, 2024 | Descriptive, Language Modeling | Unverified | 0 |
| Automated Image Captioning with CNNs and Transformers | Dec 13, 2024 | Descriptive, Hyperparameter Optimization | Code Available | 0 |
| MOPI-HFRS: A Multi-objective Personalized Health-aware Food Recommendation System with LLM-enhanced Interpretation | Dec 12, 2024 | Descriptive, Food recommendation | Code Available | 0 |
| Interpreting Graphic Notation with MusicLDM: An AI Improvisation of Cornelius Cardew's Treatise | Dec 12, 2024 | Descriptive, Music Generation | Unverified | 0 |
| Hallucination Elimination and Semantic Enhancement Framework for Vision-Language Models in Traffic Scenarios | Dec 10, 2024 | Autonomous Driving, Descriptive | Code Available | 0 |
| Cardiometabolic Risk Factors in South Asians: An Epidemiological and Anthropological Study in an Urban Populace of Eastern India | Dec 8, 2024 | Descriptive | Unverified | 0 |
| Language-Guided Image Tokenization for Generation | Dec 8, 2024 | Descriptive, Image Generation | Unverified | 0 |
| FlashSloth: Lightning Multimodal Large Language Models via Embedded Visual Compression | Dec 5, 2024 | Descriptive, Visual Question Answering | Code Available | 2 |
| ProtDAT: A Unified Framework for Protein Sequence Design from Any Protein Text Description | Dec 5, 2024 | Descriptive, Protein Design | Unverified | 0 |