| Autoregressive + Chain of Thought = Recurrent: Recurrence's Role in Language Models' Computability and a Revisit of Recurrent Transformer | Sep 14, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| PeriGuru: A Peripheral Robotic Mobile App Operation Assistant based on GUI Image Understanding and Prompting with LLM | Sep 14, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Infrared and Visible Image Fusion with Hierarchical Human Perception | Sep 14, 2024 | Infrared And Visible Image Fusion, Language Modeling | Unverified | 0 |
| Guiding Vision-Language Model Selection for Visual Question-Answering Across Tasks, Domains, and Knowledge Types | Sep 14, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Joint Semantic Knowledge Distillation and Masked Acoustic Modeling for Full-band Speech Restoration with Improved Intelligibility | Sep 14, 2024 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| Winning Solution For Meta KDD Cup' 24 | Sep 13, 2024 | Hallucination, Knowledge Graphs | Unverified | 0 |
| LLaQo: Towards a Query-Based Coach in Expressive Music Performance Assessment | Sep 13, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| DomURLs_BERT: Pre-trained BERT-based Model for Malicious Domains and URLs Detection and Classification | Sep 13, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Electrocardiogram Report Generation and Question Answering via Retrieval-Augmented Self-Supervised Modeling | Sep 13, 2024 | Decision Making, Language Modeling | Unverified | 0 |
| Towards Unified Facial Action Unit Recognition Framework by Large Language Models | Sep 13, 2024 | Facial Action Unit Detection, Language Modeling | Unverified | 0 |