| Boosting Few-Shot Detection with Large Language Models and Layout-to-Image Synthesis | Oct 9, 2024 | Image GenerationLanguage Modelling | —Unverified | 0 | 0 |
| An Implementation of Werewolf Agent That does not Truly Trust LLMs | Sep 3, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| DreamCS: Geometry-Aware Text-to-3D Generation with Unpaired 3D Reward Supervision | Jun 11, 2025 | 3D GenerationLarge Language Model | —Unverified | 0 | 0 |
| DRCap: Decoding CLAP Latents with Retrieval-Augmented Generation for Zero-shot Audio Captioning | Oct 12, 2024 | Audio captioningLarge Language Model | —Unverified | 0 | 0 |
| Boosting Code-Switching ASR with Mixture of Experts Enhanced Speech-Conditioned LLM | Sep 24, 2024 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 | 0 |
| D-Rax: Domain-specific Radiologic assistant leveraging multi-modal data and eXpert model predictions | Jul 2, 2024 | DiagnosticInstruction Following | —Unverified | 0 | 0 |
| Draw an Ugly Person An Exploration of Generative AIs Perceptions of Ugliness | Jul 16, 2025 | Large Language Model | —Unverified | 0 | 0 |
| BookGPT: A General Framework for Book Recommendation Empowered by Large Language Model | May 25, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Animating the Past: Reconstruct Trilobite via Video Generation | Oct 10, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| BongLLaMA: LLaMA for Bangla Language | Oct 28, 2024 | BenchmarkingData Augmentation | —Unverified | 0 | 0 |