| ADGSyn: Dual-Stream Learning for Efficient Anticancer Drug Synergy Prediction | May 25, 2025 | GPU | CodeCode Available | 1 |
| Is Architectural Complexity Overrated? Competitive and Interpretable Knowledge Graph Completion with RelatE | May 25, 2025 | GPUKnowledge Graph Completion | —Unverified | 0 |
| KerZOO: Kernel Function Informed Zeroth-Order Optimization for Accurate and Accelerated LLM Fine-Tuning | May 24, 2025 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning | May 24, 2025 | GPUOffline RL | —Unverified | 0 |
| HD-PiSSA: High-Rank Distributed Orthogonal Adaptation | May 24, 2025 | Code GenerationGPU | —Unverified | 0 |
| Climate Implications of Diffusion-based Generative Visual AI Systems and their Mass Adoption | May 24, 2025 | GPU | —Unverified | 0 |
| VLA-RL: Towards Masterful and General Robotic Manipulation with Scalable Reinforcement Learning | May 24, 2025 | GPUReinforcement Learning (RL) | CodeCode Available | 3 |
| A DSP-Free Carrier Phase Recovery System using 16-Offset-QAM Laser Forwarded Links for 400Gb/s and Beyond | May 24, 2025 | GPU | —Unverified | 0 |
| Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models | May 23, 2025 | GPULanguage Modeling | CodeCode Available | 0 |
| Guided by Gut: Efficient Test-Time Scaling with Reinforced Intrinsic Confidence | May 23, 2025 | GPULarge Language Model | —Unverified | 0 |
| Dynamic Risk Assessments for Offensive Cybersecurity Agents | May 23, 2025 | GPU | CodeCode Available | 0 |
| A deep solver for backward stochastic Volterra integral equations | May 23, 2025 | GPU | CodeCode Available | 0 |
| JanusDNA: A Powerful Bi-directional Hybrid DNA Foundation Model | May 22, 2025 | GPULong-range modeling | CodeCode Available | 1 |
| FPQVAR: Floating Point Quantization for Visual Autoregressive Model with FPGA Hardware Co-design | May 22, 2025 | GPUImage Generation | CodeCode Available | 0 |
| Data-Driven Breakthroughs and Future Directions in AI Infrastructure: A Comprehensive Review | May 22, 2025 | Federated LearningGPU | —Unverified | 0 |
| GMatch: Geometry-Constrained Feature Matching for RGB-D Object Pose Estimation | May 22, 2025 | GPUPose Estimation | —Unverified | 0 |
| QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design | May 22, 2025 | CPUGPU | CodeCode Available | 2 |
| Training Long-Context LLMs Efficiently via Chunk-wise Optimization | May 22, 2025 | 16kGPU | CodeCode Available | 2 |
| CASS: Nvidia to AMD Transpilation with Data, Models, and Benchmark | May 22, 2025 | GPUTranslation | CodeCode Available | 1 |
| LLM-Based Emulation of the Radio Resource Control Layer: Towards AI-Native RAN Protocols | May 22, 2025 | GPU | —Unverified | 0 |
| PICT -- A Differentiable, GPU-Accelerated Multi-Block PISO Solver for Simulation-Coupled Learning Tasks in Fluid Dynamics | May 22, 2025 | GPU | CodeCode Available | 1 |
| The Polar Express: Optimal Matrix Sign Methods and Their Application to the Muon Algorithm | May 22, 2025 | GPU | —Unverified | 0 |
| Small Language Models in the Real World: Insights from Industrial Text Classification | May 21, 2025 | ClassificationDecoder | —Unverified | 0 |
| DeepCEE: Efficient Cross-Region Model Distributed Training System under Heterogeneous GPUs and Networks | May 21, 2025 | GPUPhilosophy | —Unverified | 0 |
| Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution | May 21, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Efficient Differentiable Approximation of Generalized Low-rank Regularization | May 21, 2025 | GPU | CodeCode Available | 0 |
| Flashback: Memory-Driven Zero-shot, Real-time Video Anomaly Detection | May 21, 2025 | Anomaly DetectionGPU | —Unverified | 0 |
| Guidelines for the Quality Assessment of Energy-Aware NAS Benchmarks | May 21, 2025 | BenchmarkingGPU | —Unverified | 0 |
| RAZER: Robust Accelerated Zero-Shot 3D Open-Vocabulary Panoptic Reconstruction with Spatio-Temporal Aggregation | May 21, 2025 | GPUNatural Language Queries | —Unverified | 0 |
| Quaff: Quantized Parameter-Efficient Fine-Tuning under Outlier Spatial Stability Hypothesis | May 20, 2025 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| Polar Sparsity: High Throughput Batched LLM Inferencing with Scalable Contextual Sparsity | May 20, 2025 | GPULarge Language Model | CodeCode Available | 0 |
| Balanced and Elastic End-to-end Training of Dynamic LLMs | May 20, 2025 | GPUMixture-of-Experts | —Unverified | 0 |
| UltraEdit: Training-, Subject-, and Memory-Free Lifelong Editing in Large Language Models | May 20, 2025 | GPULifelong learning | CodeCode Available | 2 |
| UHD Image Dehazing via anDehazeFormer with Atmospheric-aware KV Cache | May 20, 2025 | 4k8k | —Unverified | 0 |
| Grouping First, Attending Smartly: Training-Free Acceleration for Diffusion Transformers | May 20, 2025 | GPUVideo Generation | CodeCode Available | 2 |
| ServerlessLoRA: Minimizing Latency and Cost in Serverless Inference for LoRA-Based LLMs | May 20, 2025 | GPULarge Language Model | —Unverified | 0 |
| 4D-ROLLS: 4D Radar Occupancy Learning via LiDAR Supervision | May 20, 2025 | Autonomous VehiclesBEV Segmentation | CodeCode Available | 0 |
| Multi-head Temporal Latent Attention | May 19, 2025 | GPUspeech-recognition | CodeCode Available | 4 |
| Frozen Backpropagation: Relaxing Weight Symmetry in Temporally-Coded Deep Spiking Neural Networks | May 19, 2025 | GPU | CodeCode Available | 0 |
| Half Search Space is All You Need | May 19, 2025 | AllGPU | —Unverified | 0 |
| MVAR: Visual Autoregressive Modeling with Scale and Spatial Markovian Conditioning | May 19, 2025 | GPU | CodeCode Available | 1 |
| Fine-tuning Quantized Neural Networks with Zeroth-order Optimization | May 19, 2025 | GPUQuantization | CodeCode Available | 1 |
| FreeKV: Boosting KV Cache Retrieval for Efficient LLM Inference | May 19, 2025 | CPUGPU | —Unverified | 0 |
| TSPulse: Dual Space Tiny Pre-Trained Models for Rapid Time-Series Analysis | May 19, 2025 | Anomaly DetectionDisentanglement | —Unverified | 0 |
| CALM: Co-evolution of Algorithms and Language Model for Automatic Heuristic Design | May 18, 2025 | GPULanguage Modeling | —Unverified | 0 |
| HybridServe: Efficient Serving of Large AI Models with Confidence-Based Cascade Routing | May 18, 2025 | GPU | —Unverified | 0 |
| ZenFlow: Enabling Stall-Free Offloading Training via Asynchronous Updates | May 18, 2025 | CPUGPU | —Unverified | 0 |
| LightRetriever: A LLM-based Hybrid Retrieval Architecture with 1000x Faster Query Inference | May 18, 2025 | GPURetrieval | CodeCode Available | 1 |
| VGGT-SLAM: Dense RGB SLAM Optimized on the SL(4) Manifold | May 18, 2025 | GPU | —Unverified | 0 |
| A Case for Library-Level k-Means Binning in Histogram Gradient-Boosted Trees | May 18, 2025 | GPU | CodeCode Available | 0 |