| Minimal Interaction Seperated Tuning: A New Paradigm for Visual Adaptation | Jan 1, 2025 | CPUGPU | —Unverified | 0 |
| ICP: Immediate Compensation Pruning for Mid-to-high Sparsity | Jan 1, 2025 | GPU | —Unverified | 0 |
| Towards Sustainable Large Language Model Serving | Dec 31, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Debunking the CUDA Myth Towards GPU-based AI Systems | Dec 31, 2024 | GPU | —Unverified | 0 |
| FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs | Dec 30, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI | Dec 30, 2024 | 3D ReconstructionCPU | —Unverified | 0 |
| Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring | Dec 30, 2024 | FairnessGPU | —Unverified | 0 |
| IMSSA: Deploying modern state-space models on memristive in-memory compute hardware | Dec 28, 2024 | GPUQuantization | —Unverified | 0 |
| LoL-PIM: Long-Context LLM Decoding with Scalable DRAM-PIM System | Dec 28, 2024 | GPUManagement | —Unverified | 0 |
| MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing | Dec 28, 2024 | GPUMamba | —Unverified | 0 |
| Towards Ideal Temporal Graph Neural Networks: Evaluations and Conclusions after 10,000 GPU Hours | Dec 28, 2024 | BenchmarkingGPU | —Unverified | 0 |
| Pushing the Envelope of Low-Bit LLM via Dynamic Error Compensation | Dec 28, 2024 | CPUGPU | —Unverified | 0 |
| RAIN: Real-time Animation of Infinite Video Stream | Dec 27, 2024 | DenoisingGPU | —Unverified | 0 |
| Paleoinspired Vision: From Exploring Colour Vision Evolution to Inspiring Camera Design | Dec 27, 2024 | GPU | —Unverified | 0 |
| Learning to Forget: Bayesian Time Series Forecasting using Recurrent Sparse Spectrum Signature Gaussian Processes | Dec 27, 2024 | Gaussian ProcessesGPU | —Unverified | 0 |
| Assessing Text Classification Methods for Cyberbullying Detection on Social Media Platforms | Dec 27, 2024 | CPUGPU | —Unverified | 0 |
| Dovetail: A CPU/GPU Heterogeneous Speculative Decoding for LLM inference | Dec 25, 2024 | CPUGPU | —Unverified | 0 |
| KunServe: Efficient Parameter-centric Memory Management for LLM Serving | Dec 24, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Power- and Fragmentation-aware Online Scheduling for GPU Datacenters | Dec 23, 2024 | CPUGPU | CodeCode Available | 0 |
| Broadband Ground Motion Synthesis by Diffusion Model with Minimal Condition | Dec 23, 2024 | GPUMotion Synthesis | —Unverified | 0 |
| GQSA: Group Quantization and Sparsity for Accelerating Large Language Model Inference | Dec 23, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Balanced 3DGS: Gaussian-wise Parallelism Rendering with Fine-Grained Tiling | Dec 23, 2024 | 3DGSGPU | —Unverified | 0 |
| CoSurfGS:Collaborative 3D Surface Gaussian Splatting with Distributed Learning for Large Scene Reconstruction | Dec 23, 2024 | 3DGSGPU | —Unverified | 0 |
| Lillama: Large Language Models Compression via Low-Rank Feature Distillation | Dec 21, 2024 | GPUMamba | —Unverified | 0 |
| Less is More: Towards Green Code Large Language Models via Unified Structural Pruning | Dec 20, 2024 | Computational EfficiencyGPU | —Unverified | 0 |