| Graph-Reward-SQL: Execution-Free Reinforcement Learning for Text-to-SQL via Graph Matching and Stepwise Reward | May 18, 2025 | GPUGraph Matching | CodeCode Available | 3 |
| Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation | May 17, 2025 | Dataset GenerationGPU | CodeCode Available | 1 |
| From Hand-Crafted Metrics to Evolved Training-Free Performance Predictors for Neural Architecture Search via Genetic Programming | May 16, 2025 | GPUNeural Architecture Search | —Unverified | 0 |
| Flash Invariant Point Attention | May 16, 2025 | GPU | CodeCode Available | 1 |
| HessFormer: Hessians at Foundation Scale | May 16, 2025 | GPU | —Unverified | 0 |
| Group Think: Multiple Concurrent Reasoning Agents Collaborating at Token Level Granularity | May 16, 2025 | GPU | —Unverified | 0 |
| Entropy-Driven Genetic Optimization for Deep-Feature-Guided Low-Light Image Enhancement | May 16, 2025 | GPUImage Enhancement | CodeCode Available | 0 |
| Gaussian Weight Sampling for Scalable, Efficient and Stable Pseudo-Quantization Training | May 16, 2025 | GPUQuantization | —Unverified | 0 |
| Group-in-Group Policy Optimization for LLM Agent Training | May 16, 2025 | GPUMathematical Reasoning | CodeCode Available | 5 |
| Accelerating Visual-Policy Learning through Parallel Differentiable Simulation | May 15, 2025 | GPU | CodeCode Available | 4 |
| VRSplat: Fast and Robust Gaussian Splatting for Virtual Reality | May 15, 2025 | 3DGSGPU | CodeCode Available | 2 |
| SpecOffload: Unlocking Latent GPU Capacity for LLM Inference on Resource-Constrained Devices | May 15, 2025 | CPUGPU | CodeCode Available | 1 |
| Marigold: Affordable Adaptation of Diffusion-Based Image Generators for Image Analysis | May 14, 2025 | DenoisingDepth Estimation | CodeCode Available | 7 |
| Single-shot prediction of parametric partial differential equations | May 14, 2025 | CPUGPU | —Unverified | 0 |
| AI Accelerators for Large Language Model In-ference: Architecture Analysis and Scaling Strategies | May 13, 2025 | GPULanguage Modeling | —Unverified | 0 |
| FlashMLA-ETAP: Efficient Transpose Attention Pipeline for Accelerating MLA Inference on NVIDIA H20 GPUs | May 13, 2025 | GPU | CodeCode Available | 1 |
| Generative Molecular Design with Steerable and Granular Synthesizability Control | May 13, 2025 | GPU | —Unverified | 0 |
| Scaling Multi Agent Reinforcement Learning for Underwater Acoustic Tracking via Autonomous Vehicles | May 13, 2025 | Autonomous VehiclesGPU | —Unverified | 0 |
| Fused3S: Fast Sparse Attention on Tensor Cores | May 12, 2025 | GPU | CodeCode Available | 0 |
| On the Cost and Benefits of Training Context with Utterance or Full Conversation Training: A Comparative Stud | May 12, 2025 | GPUHallucination | —Unverified | 0 |
| SLAG: Scalable Language-Augmented Gaussian Splatting | May 12, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | May 12, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| L-SWAG: Layer-Sample Wise Activation with Gradients information for Zero-Shot NAS on Vision Transformers | May 12, 2025 | GPUNeural Architecture Search | —Unverified | 0 |
| Private LoRA Fine-tuning of Open-Source LLMs with Homomorphic Encryption | May 12, 2025 | GPUKnowledge Base Question Answering | —Unverified | 0 |
| OnPrem.LLM: A Privacy-Conscious Document Intelligence Toolkit | May 12, 2025 | GPUPrivacy Preserving | CodeCode Available | 4 |