| Omniwise: Predicting GPU Kernels Performance with LLMs | Jun 25, 2025 | GPU | Unverified | 0 |
| GPU Kernel Scientist: An LLM-Driven Framework for Iterative Kernel Optimization | Jun 25, 2025 | GPU | Unverified | 0 |
| Exploiting Lightweight Hierarchical ViT and Dynamic Framework for Efficient Visual Tracking | Jun 25, 2025 | GPU, Visual Tracking | Code Available | 1 |
| Fast ground penetrating radar dual-parameter full waveform inversion method accelerated by hybrid compilation of CUDA kernel function and PyTorch | Jun 25, 2025 | Computational Efficiency, GPR | Code Available | 1 |
| DuoGPT: Training-free Dual Sparsity through Activation-aware Pruning in LLMs | Jun 25, 2025 | GPU | Unverified | 0 |
| Scaling Speculative Decoding with Lookahead Reasoning | Jun 24, 2025 | GPU, GSM8K | Code Available | 0 |
| MegaFold: System-Level Optimizations for Accelerating Protein Structure Prediction Models | Jun 24, 2025 | GPU, Protein Folding | Code Available | 2 |
| Virtual Memory for 3D Gaussian Splatting | Jun 24, 2025 | GPU, Novel View Synthesis | Unverified | 0 |
| PocketVina Enables Scalable and Highly Accurate Physically Valid Docking through Multi-Pocket Conditioning | Jun 24, 2025 | Benchmarking, Drug Discovery | Code Available | 2 |
| DIP: Unsupervised Dense In-Context Post-training of Visual Representations | Jun 23, 2025 | GPU, Meta-Learning | Code Available | 1 |
| Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models | Jun 23, 2025 | Domain Adaptation, GPU | Code Available | 3 |
| Let Your Video Listen to Your Music! | Jun 23, 2025 | GPU, Music Generation | Unverified | 0 |
| Survey of HPC in US Research Institutions | Jun 23, 2025 | Benchmarking, GPU | Unverified | 0 |
| 4D-LRM: Large Space-Time Reconstruction Model From and To Any View at Any Time | Jun 23, 2025 | 4D reconstruction, GPU | Unverified | 0 |
| CommVQ: Commutative Vector Quantization for KV Cache Compression | Jun 23, 2025 | GPU, GSM8K | Code Available | 1 |
| TDACloud: Point Cloud Recognition Using Topological Data Analysis | Jun 23, 2025 | Autonomous Driving, GPU | Unverified | 0 |
| Lightweight RGB-T Tracking with Mobile Vision Transformers | Jun 23, 2025 | GPU, Object Tracking | Unverified | 0 |
| Confucius3-Math: A Lightweight High-Performance Reasoning LLM for Chinese K-12 Mathematics Learning | Jun 23, 2025 | GPU, Large Language Model | Code Available | 2 |
| ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation | Jun 22, 2025 | GPU, Image Generation | Code Available | 3 |
| Collaborative Texture Filtering | Jun 21, 2025 | GPU | Unverified | 0 |
| ConsumerBench: Benchmarking Generative AI Applications on End-User Devices | Jun 21, 2025 | Benchmarking, CPU | Code Available | 1 |
| VeriLocc: End-to-End Cross-Architecture Register Allocation via LLM | Jun 20, 2025 | GPU | Unverified | 0 |
| Beyond Blur: A Fluid Perspective on Generative Diffusion Models | Jun 20, 2025 | Diversity, GPU | Unverified | 0 |
| Speeding up Local Optimization in Vehicle Routing with Tensor-based GPU Acceleration | Jun 20, 2025 | Attribute, Computational Efficiency | Unverified | 0 |
| TrainVerify: Equivalence-Based Verification for Distributed LLM Training | Jun 19, 2025 | GPU | Unverified | 0 |