| TED-VITON: Transformer-Empowered Diffusion Models for Virtual Try-On | Nov 26, 2024 | Large Language Model, Text Generation | Code Available | 0 |
| MotionLLaMA: A Unified Framework for Motion Synthesis and Comprehension | Nov 26, 2024 | Language Modeling | Code Available | 2 |
| DocEDA: Automated Extraction and Design of Analog Circuits from Documents with Large Language Model | Nov 25, 2024 | Language Modeling | Unverified | 0 |
| Towards Agentic Schema Refinement | Nov 25, 2024 | Language Modeling | Unverified | 0 |
| Enhancing In-Hospital Mortality Prediction Using Multi-Representational Learning with LLM-Generated Expert Summaries | Nov 25, 2024 | ICU Admission, Large Language Model | Unverified | 0 |
| Beyond Sight: Towards Cognitive Alignment in LVLM via Enriched Visual Knowledge | Nov 25, 2024 | Landmark Recognition, Large Language Model | Unverified | 0 |
| VideoOrion: Tokenizing Object Dynamics in Videos | Nov 25, 2024 | Language Modeling | Unverified | 0 |
| SAGEval: The frontiers of Satisfactory Agent based NLG Evaluation for reference-free open-ended text | Nov 25, 2024 | Language Modeling | Unverified | 0 |
| DreamRunner: Fine-Grained Storytelling Video Generation with Retrieval-Augmented Motion Adaptation | Nov 25, 2024 | Large Language Model, Motion Planning | Unverified | 0 |
| SAVEn-Vid: Synergistic Audio-Visual Integration for Enhanced Understanding in Long Video Context | Nov 25, 2024 | Large Language Model, MME | Unverified | 0 |