| KL3M Tokenizers: A Family of Domain-Specific and Character-Level Tokenizers for Legal, Financial, and Preprocessing Applications | Mar 21, 2025 | 16k4k | CodeCode Available | 0 |
| Efficient but Vulnerable: Benchmarking and Defending LLM Batch Prompting Attack | Mar 18, 2025 | 8kBenchmarking | —Unverified | 0 |
| Evaluating the Suitability of Different Intraoral Scan Resolutions for Deep Learning-Based Tooth Segmentation | Feb 26, 2025 | 16k2k | —Unverified | 0 |
| ParallelComp: Parallel Long-Context Compressor for Length Extrapolation | Feb 20, 2025 | 4k8k | —Unverified | 0 |
| Unshackling Context Length: An Efficient Selective Attention Approach through Query-Key Compression | Feb 20, 2025 | 8k | —Unverified | 0 |
| CopySpec: Accelerating LLMs with Speculative Copy-and-Paste Without Compromising Quality | Feb 13, 2025 | 8kGPU | CodeCode Available | 0 |
| BCAT: A Block Causal Transformer for PDE Foundation Models for Fluid Dynamics | Jan 31, 2025 | 8kImage Generation | —Unverified | 0 |
| State Stream Transformer (SST) : Emergent Metacognitive Behaviours Through Latent State Persistence | Jan 30, 2025 | 8kARC | —Unverified | 0 |
| Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration | Jan 27, 2025 | 4k8k | —Unverified | 0 |
| LongProc: Benchmarking Long-Context Language Models on Long Procedural Generation | Jan 9, 2025 | 2k8k | —Unverified | 0 |