| Wanda++: Pruning Large Language Models via Regional Gradients | Mar 6, 2025 | Decoder, GPU | Code Available | 0 |
| Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining | Mar 6, 2025 | GPU, Hyperparameter Optimization | Unverified | 0 |
| Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach | Mar 6, 2025 | GPU, Language Modeling | Unverified | 0 |
| JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba | Mar 5, 2025 | GPU, Mamba | Unverified | 0 |
| Partial Convolution Meets Visual Attention | Mar 5, 2025 | CPU, GPU | Unverified | 0 |
| CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory | Mar 4, 2025 | CPU, GPU | Unverified | 0 |
| Memory and Bandwidth are All You Need for Fully Sharded Data Parallel | Mar 4, 2025 | All, GPU | Unverified | 0 |
| OceanSim: A GPU-Accelerated Underwater Robot Perception Simulation Framework | Mar 3, 2025 | GPU, Sensor Modeling | Unverified | 0 |
| Category-level Meta-learned NeRF Priors for Efficient Object Mapping | Mar 3, 2025 | GPU, Meta-Learning | Unverified | 0 |
| Open-source framework for detecting bias and overfitting for large pathology images | Mar 3, 2025 | GPU, Self-Supervised Learning | Code Available | 0 |