| Efficient Differentiable Approximation of Generalized Low-rank Regularization | May 21, 2025 | GPU | Code Available | 0 |
| Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection | May 21, 2025 | Anomaly Detection, GPU | Unverified | 0 |
| Guidelines for the Quality Assessment of Energy-Aware NAS Benchmarks | May 21, 2025 | Benchmarking, GPU | Unverified | 0 |
| RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | May 21, 2025 | GPU, Natural Language Queries | Unverified | 0 |
| Quaff: Quantized Parameter-Efficient Fine-Tuning under Outlier Spatial Stability Hypothesis | May 20, 2025 | GPU, parameter-efficient fine-tuning | Code Available | 1 |
| Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity | May 20, 2025 | GPU, Large Language Model | Code Available | 0 |
| Balanced and Elastic End-to-end Training of Dynamic LLMs | May 20, 2025 | GPU, Mixture-of-Experts | Unverified | 0 |
| UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | May 20, 2025 | GPU, Lifelong learning | Code Available | 2 |
| UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | May 20, 2025 | 4k, 8k | Unverified | 0 |
| Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | May 20, 2025 | GPU, Video Generation | Code Available | 2 |
| ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs | May 20, 2025 | GPU, Large Language Model | Unverified | 0 |
| 4D-ROLLS: 4D Radar Occupancy Learning via LiDAR Supervision | May 20, 2025 | Autonomous Vehicles, BEV Segmentation | Code Available | 0 |
| Multi-head Temporal Latent Attention | May 19, 2025 | GPU, speech-recognition | Code Available | 4 |
| Frozen Backpropagation: Relaxing Weight Symmetry in Temporally-Coded Deep Spiking Neural Networks | May 19, 2025 | GPU | Code Available | 0 |
| Half Search Space is All You Need | May 19, 2025 | All, GPU | Unverified | 0 |
| MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning | May 19, 2025 | GPU | Code Available | 1 |
| Fine-tuning Quantized Neural Networks with Zeroth-order Optimization | May 19, 2025 | GPU, Quantization | Code Available | 1 |
| FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference | May 19, 2025 | CPU, GPU | Unverified | 0 |
| TSPulse: Dual Space Tiny Pre-Trained Models for Rapid Time-Series Analysis | May 19, 2025 | Anomaly Detection, Disentanglement | Unverified | 0 |
| CALM: Co-evolution of Algorithms and Language Model for Automatic Heuristic Design | May 18, 2025 | GPU, Language Modeling | Unverified | 0 |
| HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing | May 18, 2025 | GPU | Unverified | 0 |
| ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates | May 18, 2025 | CPU, GPU | Unverified | 0 |
| LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference | May 18, 2025 | GPU, Retrieval | Code Available | 1 |
| VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold | May 18, 2025 | GPU | Unverified | 0 |
| A Case for Library-Level k-Means Binning in Histogram Gradient-Boosted Trees | May 18, 2025 | GPU | Code Available | 0 |