| FlowR: Flowing from Sparse to Dense 3D Reconstructions | Apr 2, 2025 | GPUNovel View Synthesis | —Unverified | 0 |
| Quattro: Transformer-Accelerated Iterative Linear Quadratic Regulator Framework for Fast Trajectory Optimization | Apr 2, 2025 | GPUModel Predictive Control | CodeCode Available | 1 |
| Accelerating IoV Intrusion Detection: Benchmarking GPU-Accelerated vs CPU-Based ML Libraries | Apr 2, 2025 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Improved Visual-Spatial Reasoning via R1-Zero-Like Training | Apr 1, 2025 | GPUSpatial Reasoning | CodeCode Available | 1 |
| SentenceKV: Efficient LLM Inference via Sentence-Level Semantic KV Caching | Apr 1, 2025 | Computational EfficiencyCPU | —Unverified | 0 |
| Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources | Apr 1, 2025 | GPULarge Language Model | —Unverified | 0 |
| SCRec: A Scalable Computational Storage System with Statistical Sharding and Tensor-train Decomposition for Recommendation Models | Apr 1, 2025 | CPUGPU | —Unverified | 0 |
| Adapting Vision Foundation Models for Real-time Ultrasound Image Segmentation | Mar 31, 2025 | GPUImage Segmentation | —Unverified | 0 |
| THEMIS: Towards Practical Intellectual Property Protection for Post-Deployment On-Device Deep Learning Models | Mar 31, 2025 | GPU | CodeCode Available | 2 |
| GPU-centric Communication Schemes for HPC and ML Applications | Mar 31, 2025 | CPUGPU | —Unverified | 0 |
| Deep Learning Model Deployment in Multiple Cloud Providers: an Exploratory Study Using Low Computing Power Environments | Mar 31, 2025 | CPUGPU | —Unverified | 0 |
| Orchestrate Multimodal Data with Batch Post-Balancing to Accelerate Multimodal Large Language Model Training | Mar 31, 2025 | GPULanguage Modeling | —Unverified | 0 |
| StochasticSplats: Stochastic Rasterization for Sorting-Free 3D Gaussian Splatting | Mar 31, 2025 | 3DGSGPU | —Unverified | 0 |
| Pan-LUT: Efficient Pan-sharpening via Learnable Look-Up Tables | Mar 31, 2025 | 8kComputational Efficiency | —Unverified | 0 |
| Cocktail: Chunk-Adaptive Mixed-Precision Quantization for Long-Context LLM Inference | Mar 30, 2025 | GPUQuantization | —Unverified | 0 |
| FastVAR: Linear Visual Autoregressive Modeling via Cached Token Pruning | Mar 30, 2025 | 2kGPU | CodeCode Available | 2 |
| CityGS-X: A Scalable Architecture for Efficient and Geometrically Accurate Large-Scale Scene Reconstruction | Mar 29, 2025 | GPU | —Unverified | 0 |
| PartialLoading: User Scheduling and Bandwidth Allocation for Parameter-sharing Edge Inference | Mar 29, 2025 | GPUScheduling | —Unverified | 0 |
| Disentangled 4D Gaussian Splatting: Towards Faster and More Efficient Dynamic Scene Rendering | Mar 28, 2025 | 3DGSGPU | —Unverified | 0 |
| WeatherMesh-3: Fast and accurate operational global weather forecasting | Mar 28, 2025 | Computational EfficiencyGPU | CodeCode Available | 3 |
| Scenario Dreamer: Vectorized Latent Diffusion for Generating Driving Simulation Environments | Mar 28, 2025 | GPUScene Generation | —Unverified | 0 |
| CPPO: Accelerating the Training of Group Relative Policy Optimization-Based Reasoning Models | Mar 28, 2025 | GPUGSM8K | CodeCode Available | 2 |
| Lobster: A GPU-Accelerated Framework for Neurosymbolic Programming | Mar 27, 2025 | GPU | —Unverified | 0 |
| FACETS: Efficient Once-for-all Object Detection via Constrained Iterative Search | Mar 27, 2025 | AllGPU | —Unverified | 0 |
| Stochastic Engrams for Efficient Continual Learning with Binarized Neural Networks | Mar 27, 2025 | Computational EfficiencyContinual Learning | —Unverified | 0 |
| ChatAnyone: Stylized Real-time Portrait Video Generation with Hierarchical Motion Diffusion Model | Mar 27, 2025 | GPUVideo Generation | —Unverified | 0 |
| Robust DNN Partitioning and Resource Allocation Under Uncertain Inference Time | Mar 27, 2025 | CPUGPU | —Unverified | 0 |
| Self-ReS: Self-Reflection in Large Vision-Language Models for Long Video Understanding | Mar 26, 2025 | GPUQuestion Answering | —Unverified | 0 |
| High Quality Diffusion Distillation on a Single GPU with Relative and Absolute Position Matching | Mar 26, 2025 | GPUImage Generation | —Unverified | 0 |
| Bridging Evolutionary Multiobjective Optimization and GPU Acceleration via Tensorization | Mar 26, 2025 | CPUGPU | CodeCode Available | 7 |
| AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation | Mar 25, 2025 | Domain AdaptationGPU | CodeCode Available | 0 |
| A Probabilistic Neuro-symbolic Layer for Algebraic Constraint Satisfaction | Mar 25, 2025 | GPU | CodeCode Available | 1 |
| Scaling Down Text Encoders of Text-to-Image Diffusion Models | Mar 25, 2025 | GPUImage Generation | CodeCode Available | 2 |
| Improved Alignment of Modalities in Large Vision Language Models | Mar 25, 2025 | GPUImage Captioning | —Unverified | 0 |
| PyGraph: Robust Compiler Support for CUDA Graphs in PyTorch | Mar 25, 2025 | CPUGPU | —Unverified | 0 |
| Optimizing Breast Cancer Detection in Mammograms: A Comprehensive Study of Transfer Learning, Resolution Reduction, and Multi-View Classification | Mar 25, 2025 | Breast Cancer DetectionGPU | —Unverified | 0 |
| Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding | Mar 24, 2025 | 8kGPU | —Unverified | 0 |
| Efficient Self-Supervised Adaptation for Medical Image Analysis | Mar 24, 2025 | GPUMedical Image Analysis | CodeCode Available | 1 |
| BitDecoding: Unlocking Tensor Cores for Long-Context LLMs Decoding with Low-Bit KV Cache | Mar 24, 2025 | Computational EfficiencyGPU | CodeCode Available | 2 |
| Oaken: Fast and Efficient LLM Serving with Online-Offline Hybrid KV Cache Quantization | Mar 24, 2025 | GPULarge Language Model | —Unverified | 0 |
| GRiNS: A Python Library for Simulating Gene Regulatory Network Dynamics | Mar 24, 2025 | GPU | CodeCode Available | 0 |
| SceneSplat: Gaussian Splatting-based Scene Understanding with Vision-Language Pretraining | Mar 23, 2025 | 3DGSBenchmarking | CodeCode Available | 3 |
| Co-SemDepth: Fast Joint Semantic Segmentation and Depth Estimation on Aerial Images | Mar 23, 2025 | Autonomous NavigationDepth Estimation | CodeCode Available | 0 |
| WindowKV: Task-Adaptive Group-Wise KV Cache Window Selection for Efficient LLM Inference | Mar 23, 2025 | GPU | CodeCode Available | 0 |
| Temporal Action Detection Model Compression by Progressive Block Drop | Mar 21, 2025 | Action DetectionAutonomous Driving | —Unverified | 0 |
| UniCon: Unidirectional Information Flow for Effective Control of Large-Scale Diffusion Models | Mar 21, 2025 | GPU | —Unverified | 0 |
| PP-DocLayout: A Unified Document Layout Detection Model to Accelerate Large-Scale Data Construction | Mar 21, 2025 | CPUDocument Layout Analysis | CodeCode Available | 9 |
| Robustness of deep learning classification to adversarial input on GPUs: asynchronous parallel accumulation is a source of vulnerability | Mar 21, 2025 | Adversarial RobustnessBayesian Optimization | —Unverified | 0 |
| Splat-LOAM: Gaussian Splatting LiDAR Odometry and Mapping | Mar 21, 2025 | GPUMotion Estimation | CodeCode Available | 2 |
| Improving the End-to-End Efficiency of Offline Inference for Multi-LLM Applications Based on Sampling and Simulation | Mar 21, 2025 | GPUScheduling | —Unverified | 0 |