| TrainVerify: Equivalence-Based Verification for Distributed LLM Training | Jun 19, 2025 | GPU | Unverified | 0 |
| LazyEviction: Lagged KV Eviction with Attention Pattern Observation for Efficient Long Reasoning | Jun 19, 2025 | GPU | Unverified | 0 |
| InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding | Jun 18, 2025 | GPU, Streaming video understanding | Unverified | 0 |
| Utility-Driven Speculative Decoding for Mixture-of-Experts | Jun 17, 2025 | GPU, Large Language Model | Unverified | 0 |
| NeuralPDR: Neural Differential Equations as surrogate models for Photodissociation Regions | Jun 17, 2025 | GPU | Code Available | 0 |
| VideoMAR: Autoregressive Video Generation with Continuous Tokens | Jun 17, 2025 | GPU, Image Generation | Unverified | 0 |
| MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration | Jun 16, 2025 | GPU, Mamba | Unverified | 0 |
| Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences | Jun 16, 2025 | Document Summarization, GPU | Code Available | 3 |
| From Flat to Feeling: A Feasibility and Impact Study on Dynamic Facial Emotions in AI-Generated Avatars | Jun 16, 2025 | GPU, Speech Synthesis | Unverified | 0 |
| Parallel Branch Model Predictive Control on GPUs | Jun 16, 2025 | CPU, GPU | Unverified | 0 |
| TextureSplat: Per-Primitive Texture Mapping for Reflective Gaussian Splatting | Jun 16, 2025 | GPU, Inverse Rendering | Code Available | 0 |
| Vine Copulas as Differentiable Computational Graphs | Jun 16, 2025 | GPU, Scheduling | Code Available | 3 |
| Versatile and Fast Location-Based Private Information Retrieval with Fully Homomorphic Encryption over the Torus | Jun 15, 2025 | CPU, GPU | Code Available | 0 |
| ECLIP: Energy-efficient and Practical Co-Location of ML Inference on Spatially Partitioned GPUs | Jun 14, 2025 | GPU | Unverified | 0 |
| Deploying and Evaluating Multiple Deep Learning Models on Edge Devices for Diabetic Retinopathy Detection | Jun 14, 2025 | Diabetic Retinopathy Detection, GPU | Unverified | 0 |
| GroupNL: Low-Resource and Robust CNN Design over Cloud and Device | Jun 14, 2025 | GPU | Unverified | 0 |
| GraphGSOcc: Semantic-Geometric Graph Transformer with Dynamic-Static Decoupling for 3D Gaussian Splatting-based Occupancy Prediction | Jun 13, 2025 | 3DGS, 3D Semantic Occupancy Prediction | Unverified | 0 |
| SecONNds: Secure Outsourced Neural Network Inference on ImageNet | Jun 13, 2025 | CPU, GPU | Code Available | 0 |
| FeNN: A RISC-V vector processor for Spiking Neural Network acceleration | Jun 13, 2025 | GPU | Unverified | 0 |
| GigaVideo-1: Advancing Video Generation via Automatic Feedback with 4 GPU-Hours Fine-Tuning | Jun 12, 2025 | GPU, Video Generation | Unverified | 0 |
| Prompts to Summaries: Zero-Shot Language-Guided Video Summarization | Jun 12, 2025 | GPU, Query focused video summarization | Unverified | 0 |
| Farseer: A Refined Scaling Law in Large Language Models | Jun 12, 2025 | GPU | Code Available | 1 |
| MNN-LLM: A Generic Inference Engine for Fast Large Language Model Deployment on Mobile Devices | Jun 12, 2025 | CPU, GPU | Unverified | 0 |
| Vector Representations of Vessel Trees | Jun 11, 2025 | GPU, valid | Unverified | 0 |
| Mutual-Supervised Learning for Sequential-to-Parallel Code Translation | Jun 11, 2025 | Code Translation, GPU | Code Available | 1 |