| CrowdCLIP: Unsupervised Crowd Counting via Vision-Language Model | Apr 9, 2023 | Cross-Part Crowd Counting, Crowd Counting | Code Available | 1 | 5 |
| CrowdVLM-R1: Expanding R1 Ability to Vision Language Model for Crowd Counting using Fuzzy Group Relative Policy Reward | Mar 31, 2025 | Crowd Counting, Language Modeling | Code Available | 1 | 5 |
| LEARN: Knowledge Adaptation from Large Language Model to Recommendation for Practical Industrial Application | May 7, 2024 | Collaborative Filtering, Language Modeling | Code Available | 1 | 5 |
| ConZIC: Controllable Zero-shot Image Captioning by Sampling-Based Polishing | Mar 4, 2023 | Diversity, Image Captioning | Code Available | 1 | 5 |
| Language Conditioned Traffic Generation | Jul 16, 2023 | Decoder, Language Modeling | Code Available | 1 | 5 |
| Knowledge-Augmented Language Model Verification | Oct 19, 2023 | Language Modeling, Language Modelling | Code Available | 1 | 5 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech Recognition, Automatic Speech Recognition (ASR) | Code Available | 1 | 5 |
| CycleFormer : TSP Solver Based on Language Modeling | May 30, 2024 | Decoder, Language Modeling | Code Available | 1 | 5 |
| A Simple Contrastive Learning Objective for Alleviating Neural Text Degeneration | May 5, 2022 | Contrastive Learning, Dialogue Generation | Code Available | 1 | 5 |
| Character-Aware Neural Language Models | Aug 26, 2015 | Language Modeling, Language Modelling | Code Available | 1 | 5 |