| STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting | Oct 1, 2024 | GPU | CodeCode Available | 1 |
| LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management | Oct 1, 2024 | GPULanguage Modeling | CodeCode Available | 3 |
| ManiSkill3: GPU Parallelized Robotics Simulation and Rendering for Generalizable Embodied AI | Oct 1, 2024 | GPUImitation Learning | CodeCode Available | 7 |
| Characterizing and Efficiently Accelerating Multimodal Generation Model Inference | Sep 30, 2024 | GPUmultimodal generation | —Unverified | 0 |
| HEADS-UP: Head-Mounted Egocentric Dataset for Trajectory Prediction in Blind Assistance Systems | Sep 30, 2024 | GPUPrediction | —Unverified | 0 |
| Simple and Fast Distillation of Diffusion Models | Sep 29, 2024 | GPUImage Generation | CodeCode Available | 3 |
| Simulation-based inference with the Python Package sbijax | Sep 28, 2024 | Bayesian InferenceCPU | —Unverified | 0 |
| Analog In-Memory Computing Attention Mechanism for Fast and Energy-Efficient Large Language Models | Sep 28, 2024 | GPU | CodeCode Available | 1 |
| Scalable Cross-Entropy Loss for Sequential Recommendations with Large Item Catalogs | Sep 27, 2024 | GPURecommendation Systems | CodeCode Available | 1 |
| TensorSocket: Shared Data Loading for Deep Learning Training | Sep 27, 2024 | Computational EfficiencyCPU | —Unverified | 0 |