| Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning | Sep 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| InstructCV: Instruction-Tuned Text-to-Image Diffusion Models as Vision Generalists | Sep 30, 2023 | Depth EstimationImage Generation | CodeCode Available | 2 | 5 |
| Draw-and-Understand: Leveraging Visual Prompts to Enable MLLMs to Comprehend What You Want | Mar 29, 2024 | Instruction FollowingLanguage Modelling | CodeCode Available | 2 | 5 |
| Draft & Verify: Lossless Large Language Model Acceleration via Self-Speculative Decoding | Sep 15, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| Instruct2Act: Mapping Multi-modality Instructions to Robotic Actions with Large Language Model | May 18, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 2 | 5 |
| Jailbreaking Attack against Multimodal Large Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| iLLM-TSC: Integration reinforcement learning and large language model for traffic signal control policy improvement | Jul 8, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| DOCBENCH: A Benchmark for Evaluating LLM-based Document Reading Systems | Jul 15, 2024 | Language ModellingLarge Language Model | CodeCode Available | 2 | 5 |
| GLUS: Global-Local Reasoning Unified into A Single Large Language Model for Video Segmentation | Apr 10, 2025 | Contrastive LearningLanguage Modeling | CodeCode Available | 2 | 5 |
| in2IN: Leveraging individual Information to Generate Human INteractions | Apr 15, 2024 | DiversityLanguage Modelling | CodeCode Available | 2 | 5 |