| MIMICS: A Large-Scale Data Collection for Search Clarification | Jun 17, 2020 | 2k | Code Available | 1 |
| DEPLAIN: A German Parallel Corpus with Intralingual Translations into Plain Language for Sentence and Document Simplification | May 30, 2023 | 2k, Sentence | Code Available | 1 |
| Divide, Conquer and Combine: A Training-Free Framework for High-Resolution Image Perception in Multimodal Large Language Models | Aug 28, 2024 | 2k, 4k | Code Available | 1 |
| OpenFWI: Large-Scale Multi-Structural Benchmark Datasets for Seismic Full Waveform Inversion | Nov 4, 2021 | 2k, Benchmarking | Code Available | 1 |
| How Good Are LLMs for Literary Translation, Really? Literary Translation Evaluation with Humans and LLMs | Oct 24, 2024 | 2k, Machine Translation | Code Available | 1 |
| CABM: Content-Aware Bit Mapping for Single Image Super-Resolution Network with Large Input | Apr 13, 2023 | 2k, 4k | Code Available | 1 |
| Identifying concept libraries from language about object structure | May 11, 2022 | 2k, Machine Translation | Code Available | 1 |
| Double Domain Guided Real-Time Low-Light Image Enhancement for Ultra-High-Definition Transportation Surveillance | Sep 15, 2023 | 2k, 4k | Code Available | 1 |
| Dual Adversarial Domain Adaptation | Jan 1, 2020 | 2k, Domain Adaptation | Code Available | 1 |
| Dataset Decomposition: Faster LLM Training with Variable Sequence Length Curriculum | May 21, 2024 | 2k, 8k | Code Available | 1 |
| Data-Efficient Instance Generation from Instance Discrimination | Jun 8, 2021 | 2k, Data Augmentation | Code Available | 1 |
| BAND-2k: Banding Artifact Noticeable Database for Banding Detection and Quality Assessment | Nov 29, 2023 | 2k, Image Quality Assessment | Code Available | 1 |
| DACO: Towards Application-Driven and Comprehensive Data Analysis via Code Generation | Mar 4, 2024 | 2k, Code Generation | Code Available | 1 |
| CycleNet: Rethinking Cycle Consistency in Text-Guided Diffusion for Image Manipulation | Oct 19, 2023 | 2k, GPU | Code Available | 1 |
| CoCA: Fusing Position Embedding with Collinear Constrained Attention in Transformers for Long Context Window Extending | Sep 15, 2023 | 2k, Position | Code Available | 1 |
| Gated Linear Attention Transformers with Hardware-Efficient Training | Dec 11, 2023 | 2k, Language Modeling | Code Available | 1 |
| Scene-Text Grounding for Text-Based Video Question Answering | Sep 22, 2024 | 2k, Contrastive Learning | Code Available | 1 |
| ClassSR: A General Framework to Accelerate Super-Resolution Networks by Data Characteristic | Mar 6, 2021 | 2k, 8k | Code Available | 1 |
| Segment Policy Optimization: Effective Segment-Level Credit Assignment in RL for Large Language Models | May 29, 2025 | 2k, 4k | Code Available | 1 |
| TemporalBench: Benchmarking Fine-grained Temporal Understanding for Multimodal Video Models | Oct 14, 2024 | 2k, Benchmarking | Code Available | 1 |
| Correlating and Predicting Human Evaluations of Language Models from Natural Language Processing Benchmarks | Feb 24, 2025 | 2k, ARC | —Unverified | 0 |
| A strengthened bound on the number of states required to characterize maximum parsimony distance | Jun 11, 2025 | 2k | —Unverified | 0 |
| Continuous Integration of Machine Learning Models with ease.ml/ci: Towards a Rigorous Yet Practical Treatment | Mar 1, 2019 | 2k, BIG-bench Machine Learning | —Unverified | 0 |
| Arbitrary-Depth Universal Approximation Theorems for Operator Neural Networks | Sep 23, 2021 | 2k | —Unverified | 0 |
| Consistent recovery threshold of hidden nearest neighbor graphs | Nov 18, 2019 | 2k | —Unverified | 0 |