| FlatCAD: Fast Curvature Regularization of Neural SDFs for CAD Models | Jun 19, 2025 | GPU | —Unverified | 0 |
| LazyEviction: Lagged KV Eviction with Attention Pattern Observation for Efficient Long Reasoning | Jun 19, 2025 | GPU | —Unverified | 0 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | Jun 18, 2025 | GPUStreaming video understanding | —Unverified | 0 |
| Utility-Driven Speculative Decoding for Mixture-of-Experts | Jun 17, 2025 | GPULarge Language Model | —Unverified | 0 |
| NeuralPDR: Neural Differential Equations as surrogate models for Photodissociation Regions | Jun 17, 2025 | GPU | CodeCode Available | 0 |
| VideoMAR: Autoregressive Video Generatio with Continuous Tokens | Jun 17, 2025 | GPUImage Generation | —Unverified | 0 |
| MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration | Jun 16, 2025 | GPUMamba | —Unverified | 0 |
| Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences | Jun 16, 2025 | Document SummarizationGPU | CodeCode Available | 3 |
| From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars | Jun 16, 2025 | GPUSpeech Synthesis | —Unverified | 0 |
| TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting | Jun 16, 2025 | GPUInverse Rendering | CodeCode Available | 0 |
| Vine Copulas as Differentiable Computational Graphs | Jun 16, 2025 | GPUScheduling | CodeCode Available | 3 |
| Parallel Branch Model Predictive Control on GPUs | Jun 16, 2025 | CPUGPU | —Unverified | 0 |
| Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the Torus | Jun 15, 2025 | CPUGPU | CodeCode Available | 0 |
| ECLIP: Energy-efficient and Practical Co-Location of ML Inference on Spatially Partitioned GPUs | Jun 14, 2025 | GPU | —Unverified | 0 |
| Deploying and Evaluating Multiple Deep Learning Models on Edge Devices for Diabetic Retinopathy Detection | Jun 14, 2025 | Diabetic Retinopathy DetectionGPU | —Unverified | 0 |
| GroupNL: Low-Resource and Robust CNN Design over Cloud and Device | Jun 14, 2025 | GPU | —Unverified | 0 |
| GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction | Jun 13, 2025 | 3DGS3D Semantic Occupancy Prediction | —Unverified | 0 |
| SecONNds: Secure Outsourced Neural Network Inference on ImageNet | Jun 13, 2025 | CPUGPU | CodeCode Available | 0 |
| FeNN: A RISC-V vector processor for Spiking Neural Network acceleration | Jun 13, 2025 | GPU | —Unverified | 0 |
| Farseer: A Refined Scaling Law in Large Language Models | Jun 12, 2025 | GPU | CodeCode Available | 1 |
| Prompts to Summaries: Zero-Shot Language-Guided Video Summarization | Jun 12, 2025 | GPUQuery focused video summarization | —Unverified | 0 |
| GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning | Jun 12, 2025 | GPUVideo Generation | —Unverified | 0 |
| MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | Jun 12, 2025 | CPUGPU | —Unverified | 0 |
| Vector Representations of Vessel Trees | Jun 11, 2025 | GPUvalid | —Unverified | 0 |
| Mutual-Supervised Learning for Sequential-to-Parallel Code Translation | Jun 11, 2025 | Code TranslationGPU | CodeCode Available | 1 |
| AtmosMJ: Revisiting Gating Mechanism for AI Weather Forecasting Beyond the Year Scale | Jun 11, 2025 | GPUWeather Forecasting | CodeCode Available | 0 |
| GPU-accelerated Modeling of Biological Regulatory Networks | Jun 10, 2025 | CPUglobal-optimization | —Unverified | 0 |
| Can A Gamer Train A Mathematical Reasoning Model? | Jun 10, 2025 | GPUMathematical Reasoning | CodeCode Available | 0 |
| A PDE-Based Image Dehazing Method via Atmospheric Scattering Theory | Jun 10, 2025 | GPUImage Dehazing | —Unverified | 0 |
| Towards Secure and Private Language Models for Nuclear Power Plants | Jun 10, 2025 | GPULanguage Modeling | —Unverified | 0 |
| SeerAttention-R: Sparse Attention Adaptation for Long Reasoning | Jun 10, 2025 | 4kGPU | CodeCode Available | 2 |
| ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network | Jun 10, 2025 | GPUMamba | —Unverified | 0 |
| Olica: Efficient Structured Pruning of Large Language Models without Retraining | Jun 10, 2025 | GPU | CodeCode Available | 0 |
| PerfTracker: Online Performance Troubleshooting for Large-scale Model Training in Production | Jun 10, 2025 | DiagnosticGPU | —Unverified | 0 |
| FZOO: Fast Zeroth-Order Optimizer for Fine-Tuning Large Language Models towards Adam-Scale Speed | Jun 10, 2025 | GPU | —Unverified | 0 |
| Plug-and-Play Linear Attention for Pre-trained Image and Video Restoration Models | Jun 10, 2025 | CPUDeblurring | CodeCode Available | 0 |
| Self Forcing: Bridging the Train-Test Gap in Autoregressive Video Diffusion | Jun 9, 2025 | GPUVideo Generation | —Unverified | 0 |
| GaussianVAE: Adaptive Learning Dynamics of 3D Gaussians for High-Fidelity Super-Resolution | Jun 9, 2025 | 3DGSComputational Efficiency | —Unverified | 0 |
| NeurIPS 2025 E2LM Competition : Early Training Evaluation of Language Models | Jun 9, 2025 | GPU | —Unverified | 0 |
| ProteinZero: Self-Improving Protein Generation via Online Reinforcement Learning | Jun 9, 2025 | DiversityGPU | —Unverified | 0 |
| MoE-GPS: Guidlines for Prediction Strategy for Dynamic Expert Duplication in MoE Load Balancing | Jun 9, 2025 | GPUMixture-of-Experts | —Unverified | 0 |
| Fractional-order Jacobian Matrix Differentiation and Its Application in Artificial Neural Networks | Jun 9, 2025 | GPU | —Unverified | 0 |
| Paged Attention Meets FlexAttention: Unlocking Long-Context Efficiency in Deployed Inference | Jun 8, 2025 | GPU | —Unverified | 0 |
| E-BATS: Efficient Backpropagation-Free Test-Time Adaptation for Speech Foundation Models | Jun 8, 2025 | GPUTest-time Adaptation | —Unverified | 0 |
| Faster than Fast: Accelerating Oriented FAST Feature Detection on Low-end Embedded GPUs | Jun 8, 2025 | GPUSimultaneous Localization and Mapping | —Unverified | 0 |
| FuncGNN: Learning Functional Semantics of Logic Circuits with Graph Neural Networks | Jun 7, 2025 | GPU | CodeCode Available | 0 |
| BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures | Jun 6, 2025 | BenchmarkingCPU | —Unverified | 0 |
| Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage | Jun 6, 2025 | CPUGPU | —Unverified | 0 |
| On-the-fly Reconstruction for Large-Scale Novel View Synthesis from Unposed Images | Jun 5, 2025 | 3DGSGPU | —Unverified | 0 |
| Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis | Jun 5, 2025 | GPUMulti-Label Classification | —Unverified | 0 |