| MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding | Sep 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding | Sep 22, 2024 | Anomaly DetectionGPU | CodeCode Available | 4 |
| Backtracking Improves Generation Safety | Sep 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Large Language Model and Denoising Diffusion Framework for Targeted Design of Microstructures with Commands in Natural Language | Sep 22, 2024 | Data AugmentationDenoising | —Unverified | 0 |
| Codec-SUPERB @ SLT 2024: A lightweight benchmark for neural audio codec models | Sep 21, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 5 |
| Test Time Learning for Time Series Forecasting | Sep 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ECHO: Environmental Sound Classification with Hierarchical Ontology-guided Semi-Supervised Learning | Sep 21, 2024 | Contrastive LearningEnvironmental Sound Classification | —Unverified | 0 |
| A Survey on Large Language Model-empowered Autonomous Driving | Sep 21, 2024 | Autonomous DrivingLanguage Modeling | —Unverified | 0 |
| OAEI-LLM: A Benchmark Dataset for Understanding Large Language Model Hallucinations in Ontology Matching | Sep 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Instruction Following without Instruction Tuning | Sep 21, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 1 |