| X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation | Mar 8, 2025 | GPUImage Generation | CodeCode Available | 2 |
| Real-Time Semantic Segmentation of Aerial Images Using an Embedded U-Net: A Comparison of CPU, GPU, and FPGA Workflows | Mar 7, 2025 | CPUGPU | —Unverified | 0 |
| Symbolic Mixture-of-Experts: Adaptive Skill-based Routing for Heterogeneous Reasoning | Mar 7, 2025 | GPUMath | —Unverified | 0 |
| Training and Inference Efficiency of Encoder-Decoder Speech Models | Mar 7, 2025 | DecoderGPU | —Unverified | 0 |
| Wanda++: Pruning Large Language Models via Regional Gradients | Mar 6, 2025 | DecoderGPU | CodeCode Available | 0 |
| Fine-Tuning Florence2 for Enhanced Object Detection in Un-constructed Environments: Vision-Language Model Approach | Mar 6, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Predictable Scale: Part I -- Optimal Hyperparameter Scaling Law in Large Language Model Pretraining | Mar 6, 2025 | GPUHyperparameter Optimization | —Unverified | 0 |
| Toward Lightweight and Fast Decoders for Diffusion Models in Image and Video Generation | Mar 6, 2025 | DecoderGPU | CodeCode Available | 1 |
| Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian Process | Mar 6, 2025 | Autonomous NavigationComputational Efficiency | CodeCode Available | 2 |
| Eventprop training for efficient neuromorphic applications | Mar 6, 2025 | BenchmarkingGPU | —Unverified | 0 |
| Partial Convolution Meets Visual Attention | Mar 5, 2025 | CPUGPU | —Unverified | 0 |
| JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba | Mar 5, 2025 | GPUMamba | —Unverified | 0 |
| Memory and Bandwidth are All You Need for Fully Sharded Data Parallel | Mar 4, 2025 | AllGPU | —Unverified | 0 |
| DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian Splatting | Mar 4, 2025 | Computational EfficiencyCPU | CodeCode Available | 1 |
| DivPrune: Diversity-based Visual Token Pruning for Large Multimodal Models | Mar 4, 2025 | DiversityGPU | CodeCode Available | 2 |
| CoServe: Efficient Collaboration-of-Experts (CoE) Model Inference with Limited Memory | Mar 4, 2025 | CPUGPU | —Unverified | 0 |
| KurTail : Kurtosis-based LLM Quantization | Mar 3, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Open-source framework for detecting bias and overfitting for large pathology images | Mar 3, 2025 | GPUSelf-Supervised Learning | CodeCode Available | 0 |
| Nature-Inspired Population-Based Evolution of Large Language Models | Mar 3, 2025 | GPUZero-shot Generalization | CodeCode Available | 1 |
| OceanSim: A GPU-Accelerated Underwater Robot Perception Simulation Framework | Mar 3, 2025 | GPUSensor Modeling | —Unverified | 0 |
| A Reconfigurable Stream-Based FPGA Accelerator for Bayesian Confidence Propagation Neural Networks | Mar 3, 2025 | GPUHigh-Level Synthesis | —Unverified | 0 |
| Category-level Meta-learned NeRF Priors for Efficient Object Mapping | Mar 3, 2025 | GPUMeta-Learning | —Unverified | 0 |
| LiteGS: A High-Performance Modular Framework for Gaussian Splatting Training | Mar 3, 2025 | 3DGSGPU | CodeCode Available | 3 |
| DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting | Mar 2, 2025 | CPUGPU | CodeCode Available | 1 |
| Streaming Video Question-Answering with In-context Video KV-Cache Retrieval | Mar 1, 2025 | GPUQuestion Answering | CodeCode Available | 2 |