| 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Aug 7, 2024 | 16k2k | CodeCode Available | 3 |
| Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset | May 17, 2024 | 16kBenchmarking | CodeCode Available | 3 |
| SnapKV: LLM Knows What You are Looking for Before Generation | Apr 22, 2024 | 16kGPU | CodeCode Available | 3 |
| Training-Free Long-Context Scaling of Large Language Models | Feb 27, 2024 | 16k | CodeCode Available | 3 |
| LongBench: A Bilingual, Multitask Benchmark for Long Context Understanding | Aug 28, 2023 | 16kCode Completion | CodeCode Available | 3 |
| Investigating Efficiently Extending Transformers for Long Input Summarization | Aug 8, 2022 | 16kLong-range modeling | CodeCode Available | 3 |
| UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents | May 27, 2025 | 16k | CodeCode Available | 2 |
| Training Long-Context LLMs Efficiently via Chunk-wise Optimization | May 22, 2025 | 16kGPU | CodeCode Available | 2 |
| Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Jan 16, 2025 | 16kHallucination | CodeCode Available | 2 |
| LV-Eval: A Balanced Long-Context Benchmark with 5 Length Levels Up to 256K | Feb 6, 2024 | 16kBenchmarking | CodeCode Available | 2 |