| LongEmbed: Extending Embedding Models for Long Context Retrieval | Apr 18, 2024 | 4k8k | CodeCode Available | 2 | 5 |
| Odd-One-Out: Anomaly Detection by Comparing with Neighbors | Jun 28, 2024 | 8kAnomaly Detection | CodeCode Available | 2 | 5 |
| AbdomenAtlas-8K: Annotating 8,000 CT Volumes for Multi-Organ Segmentation in Three Weeks | May 16, 2023 | 8kActive Learning | CodeCode Available | 2 | 5 |
| Hyena Hierarchy: Towards Larger Convolutional Language Models | Feb 21, 2023 | 2k8k | CodeCode Available | 2 | 5 |
| CLUECorpus2020: A Large-scale Chinese Corpus for Pre-training Language Model | Mar 3, 2020 | 8kLanguage Modeling | CodeCode Available | 2 | 5 |
| Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Dec 28, 2023 | 8kFeature Splatting | CodeCode Available | 2 | 5 |
| MMLongBench: Benchmarking Long-Context Vision-Language Models Effectively and Thoroughly | May 15, 2025 | 8kBenchmarking | CodeCode Available | 2 | 5 |
| Rethinking Abdominal Organ Segmentation (RAOS) in the clinical scenario: A robustness evaluation benchmark with challenging cases | Jun 19, 2024 | 8kHallucination | CodeCode Available | 2 | 5 |
| C^2: Scalable Auto-Feedback for LLM-based Chart Generation | Oct 24, 2024 | 8kDiversity | CodeCode Available | 1 | 5 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | May 21, 2024 | 2k8k | CodeCode Available | 1 | 5 |