| AttriReBoost: A Gradient-Free Propagation Optimization Method for Cold Start Mitigation in Attribute Missing Graphs | Jan 1, 2025 | AttributeComputational Efficiency | CodeCode Available | 0 |
| Adjoint sharding for very long context training of state space models | Jan 1, 2025 | GPULarge Language Model | —Unverified | 0 |
| Towards Sustainable Large Language Model Serving | Dec 31, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Debunking the CUDA Myth Towards GPU-based AI Systems | Dec 31, 2024 | GPU | —Unverified | 0 |
| Lightweight G-YOLOv11: Advancing Efficient Fracture Detection in Pediatric Wrist X-rays | Dec 31, 2024 | Fracture detectionGPU | CodeCode Available | 1 |
| LTX-Video: Realtime Video Latent Diffusion | Dec 30, 2024 | DenoisingGPU | CodeCode Available | 9 |
| Efficient Multi-Task Inferencing with a Shared Backbone and Lightweight Task-Specific Adapters for Automatic Scoring | Dec 30, 2024 | FairnessGPU | —Unverified | 0 |
| FastCHGNet: Training one Universal Interatomic Potential to 1.5 Hours with 32 GPUs | Dec 30, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching and Clap-Ranked Preference Optimization | Dec 30, 2024 | Audio GenerationGPU | CodeCode Available | 4 |
| FPGA-based Acceleration of Neural Network for Image Classification using Vitis AI | Dec 30, 2024 | 3D ReconstructionCPU | —Unverified | 0 |