| ClusterKV: Manipulating LLM KV Cache in Semantic Space for Recallable Compression | Dec 4, 2024 | 2kLogical Reasoning | CodeCode Available | 1 |
| SEED4D: A Synthetic Ego--Exo Dynamic 4D Data Generator, Driving Dataset and Benchmark | Dec 1, 2024 | 2k4D reconstruction | CodeCode Available | 1 |
| How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs | Oct 24, 2024 | 2kMachine Translation | CodeCode Available | 1 |
| TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Oct 14, 2024 | 2kBenchmarking | CodeCode Available | 1 |
| HarmoniCa: Harmonizing Training and Inference for Better Feature Caching in Diffusion Transformer Acceleration | Oct 2, 2024 | 2kDenoising | CodeCode Available | 1 |
| Scene-Text Grounding for Text-Based Video Question Answering | Sep 22, 2024 | 2kContrastive Learning | CodeCode Available | 1 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k4k | CodeCode Available | 1 |
| Training Matting Models without Alpha Labels | Aug 20, 2024 | 2kImage Matting | CodeCode Available | 1 |
| Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector | Jun 17, 2024 | 2kHallucination | CodeCode Available | 1 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | May 21, 2024 | 2k8k | CodeCode Available | 1 |