| AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation | Mar 25, 2025 | Domain AdaptationGPU | CodeCode Available | 0 |
| A Probabilistic Neuro-symbolic Layer for Algebraic Constraint Satisfaction | Mar 25, 2025 | GPU | CodeCode Available | 1 |
| PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | Mar 25, 2025 | CPUGPU | —Unverified | 0 |
| Scaling Down Text Encoders of Text-to-Image Diffusion Models | Mar 25, 2025 | GPUImage Generation | CodeCode Available | 2 |
| Improved Alignment of Modalities in Large Vision Language Models | Mar 25, 2025 | GPUImage Captioning | —Unverified | 0 |
| Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | Mar 25, 2025 | Breast Cancer DetectionGPU | —Unverified | 0 |
| Efficient Self-Supervised Adaptation for Medical Image Analysis | Mar 24, 2025 | GPUMedical Image Analysis | CodeCode Available | 1 |
| Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding | Mar 24, 2025 | 8kGPU | —Unverified | 0 |
| BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache | Mar 24, 2025 | Computational EfficiencyGPU | CodeCode Available | 2 |
| Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization | Mar 24, 2025 | GPULarge Language Model | —Unverified | 0 |