| UniCode^2: Cascaded Large-scale Codebooks for Unified Multimodal Understanding and Generation | Jun 25, 2025 | 16k | —Unverified | 0 |
| MSTAR: Box-free Multi-query Scene Text Retrieval with Attention Recycling | Jun 12, 2025 | 16kRetrieval | CodeCode Available | 0 |
| How Far Are We from Optimal Reasoning Efficiency? | Jun 8, 2025 | 16kBenchmarking | CodeCode Available | 0 |
| FlashDMoE: Fast Distributed MoE in a Single Kernel | Jun 5, 2025 | 16kCPU | CodeCode Available | 3 |
| FAMA: The First Large-Scale Open-Science Speech Foundation Model for English and Italian | May 28, 2025 | 16k | CodeCode Available | 0 |
| UI-Genie: A Self-Improving Approach for Iteratively Boosting MLLM-based Mobile GUI Agents | May 27, 2025 | 16k | CodeCode Available | 2 |
| SpecExtend: A Drop-in Enhancement for Speculative Decoding of Long Sequences | May 27, 2025 | 16kLong-Context Understanding | CodeCode Available | 0 |
| MonarchAttention: Zero-Shot Conversion to Fast, Hardware-Aware Structured Attention | May 24, 2025 | 16k4k | CodeCode Available | 1 |
| Training Long-Context LLMs Efficiently via Chunk-wise Optimization | May 22, 2025 | 16kGPU | CodeCode Available | 2 |
| PSC: Extending Context Window of Large Language Models via Phase Shift Calibration | May 18, 2025 | 16kPosition | CodeCode Available | 0 |
| Achieving Scalable Robot Autonomy via neurosymbolic planning using lightweight local LLM | May 13, 2025 | 16k8k | CodeCode Available | 0 |
| FalseReject: A Resource for Improving Contextual Safety and Mitigating Over-Refusals in LLMs via Structured Reasoning | May 12, 2025 | 16kBenchmarking | —Unverified | 0 |
| KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Mar 21, 2025 | 16k4k | CodeCode Available | 0 |
| NSF-SciFy: Mining the NSF Awards Database for Scientific Claims | Mar 11, 2025 | 16kAbstract generation | —Unverified | 0 |
| X-LRM: X-ray Large Reconstruction Model for Extremely Sparse-View Computed Tomography Recovery in One Second | Mar 9, 2025 | 16kCT Reconstruction | CodeCode Available | 0 |
| Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation | Feb 26, 2025 | 16k2k | —Unverified | 0 |
| EpMAN: Episodic Memory AttentioN for Generalizing to Longer Contexts | Feb 20, 2025 | 16kDecoder | —Unverified | 0 |
| CLOVER: A Test Case Generation Benchmark with Coverage, Long-Context, and Verification | Feb 12, 2025 | 16k4k | —Unverified | 0 |
| Fairness through Difference Awareness: Measuring Desired Group Discrimination in LLMs | Feb 4, 2025 | 16kDescriptive | CodeCode Available | 1 |
| M+: Extending MemoryLLM with Scalable Long-Term Memory | Feb 1, 2025 | 16kGPU | CodeCode Available | 3 |
| Parallel Sequence Modeling via Generalized Spatial Propagation Network | Jan 21, 2025 | 16kComputational Efficiency | —Unverified | 0 |
| Mitigating Hallucinations in Large Vision-Language Models via DPO: On-Policy Data Hold the Key | Jan 16, 2025 | 16kHallucination | CodeCode Available | 2 |
| Depression and Anxiety Prediction Using Deep Language Models and Transfer Learning | Dec 30, 2024 | 16kBinary Classification | —Unverified | 0 |
| SparseAccelerate: Efficient Long-Context Inference for Mid-Range GPUs | Dec 9, 2024 | 16k | —Unverified | 0 |
| MVReward: Better Aligning and Evaluating Multi-View Diffusion Models with Human Preferences | Dec 9, 2024 | 16k | —Unverified | 0 |
| CNNSum: Exploring Long-Context Summarization with Large Language Models in Chinese Novels | Dec 3, 2024 | 16k | CodeCode Available | 0 |
| Bimanual Dexterity for Complex Tasks | Nov 20, 2024 | 16k | —Unverified | 0 |
| Piecing It All Together: Verifying Multi-Hop Multimodal Claims | Nov 14, 2024 | 16kAll | —Unverified | 0 |
| Model Editing for LLMs4Code: How Far are We? | Nov 11, 2024 | 16kCode Generation | CodeCode Available | 0 |
| Retrieval or Global Context Understanding? On Many-Shot In-Context Learning for Long-Context Evaluation | Nov 11, 2024 | 16kBenchmarking | CodeCode Available | 0 |
| Denial-of-Service Poisoning Attacks against Large Language Models | Oct 14, 2024 | 16kSpeech-to-Text | CodeCode Available | 1 |
| Neural Fourier Modelling: A Highly Compact Approach to Time-Series Analysis | Oct 7, 2024 | 16kAnomaly Detection | CodeCode Available | 1 |
| Correlation-Aware Select and Merge Attention for Efficient Fine-Tuning and Context Length Extension | Oct 5, 2024 | 16kData Augmentation | —Unverified | 0 |
| SwiftKV: Fast Prefill-Optimized Inference with Knowledge-Preserving Model Transformation | Oct 4, 2024 | 16kCode Generation | CodeCode Available | 3 |
| Extending Context Window of Large Language Models from a Distributional Perspective | Oct 2, 2024 | 16k8k | CodeCode Available | 0 |
| LongGenBench: Benchmarking Long-Form Generation in Long Context LLMs | Sep 3, 2024 | 16kBenchmarking | CodeCode Available | 1 |
| LinFusion: 1 GPU, 1 Minute, 16K Image | Sep 3, 2024 | 16kCausal Inference | CodeCode Available | 3 |
| 1.5-Pints Technical Report: Pretraining in Days, Not Months -- Your Language Model Thrives on Quality Data | Aug 7, 2024 | 16k2k | CodeCode Available | 3 |
| Global Structure-from-Motion Revisited | Jul 29, 2024 | 16k | CodeCode Available | 7 |
| SpaceJAM: a Lightweight and Regularization-free Method for Fast Joint Alignment of Images | Jul 16, 2024 | 16k | CodeCode Available | 1 |
| Using Large Language Models for Generating Smart Contracts for Health Insurance from Textual Policies | Jul 9, 2024 | 16kTask 2 | —Unverified | 0 |
| Learning to (Learn at Test Time): RNNs with Expressive Hidden States | Jul 5, 2024 | 16k8k | CodeCode Available | 5 |
| LongIns: A Challenging Long-context Instruction-based Exam for LLMs | Jun 25, 2024 | 16k4k | —Unverified | 0 |
| Inferring Pluggable Types with Machine Learning | Jun 21, 2024 | 16kLanguage Modeling | —Unverified | 0 |
| LLaSA: A Multimodal LLM for Human Activity Analysis Through Wearable and Smartphone Sensors | Jun 20, 2024 | 16kInstruction Following | CodeCode Available | 1 |
| GraphReader: Building Graph-based Agent to Enhance Long-Context Abilities of Large Language Models | Jun 20, 2024 | 16k4k | —Unverified | 0 |
| Code-Switching Red-Teaming: LLM Evaluation for Safety and Multilingual Understanding | Jun 17, 2024 | 16kLanguage Modelling | CodeCode Available | 0 |
| DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence | Jun 17, 2024 | 16kLanguage Modeling | CodeCode Available | 9 |
| An Empirical Study of Mamba-based Language Models | Jun 12, 2024 | 16kIn-Context Learning | CodeCode Available | 0 |
| Benchmarking Large Language Models on CFLUE -- A Chinese Financial Language Understanding Evaluation Dataset | May 17, 2024 | 16kBenchmarking | CodeCode Available | 3 |