| Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models | Jan 30, 2024 | Data CompressionLanguage Modelling | CodeCode Available | 2 | 5 |
| Keeping Yourself is Important in Downstream Tuning Multimodal Large Language Model | Mar 6, 2025 | General KnowledgeImage Captioning | CodeCode Available | 2 | 5 |
| ClinicalGPT-R1: Pushing reasoning capability of generalist disease diagnosis with large language model | Apr 13, 2025 | DiagnosticLanguage Modeling | CodeCode Available | 2 | 5 |
| ML-Agent: Reinforcing LLM Agents for Autonomous Machine Learning Engineering | May 29, 2025 | Large Language ModelPrompt Engineering | CodeCode Available | 2 | 5 |
| CMMLU: Measuring massive multitask language understanding in Chinese | Jun 15, 2023 | Large Language Model | CodeCode Available | 2 | 5 |
| MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents | Aug 26, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| CLIP-MoE: Towards Building Mixture of Experts for CLIP with Diversified Multiplet Upcycling | Sep 28, 2024 | image-classificationImage Classification | CodeCode Available | 2 | 5 |
| MoA: Mixture of Sparse Attention for Automatic Large Language Model Compression | Jun 21, 2024 | GPULanguage Modeling | CodeCode Available | 2 | 5 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 | 5 |
| KET-RAG: A Cost-Efficient Multi-Granular Indexing Framework for Graph-RAG | Feb 13, 2025 | Knowledge GraphsLarge Language Model | CodeCode Available | 2 | 5 |