| A Message Passing Neural Network Surrogate Model for Bond-Associated Peridynamic Material Correspondence Formulation | Oct 29, 2024 | GPU | —Unverified | 0 |
| Revisiting Reliability in Large-Scale Machine Learning Research Clusters | Oct 29, 2024 | GPU | —Unverified | 0 |
| AI-assisted Agile Propagation Modeling for Real-time Digital Twin Wireless Networks | Oct 29, 2024 | Computational EfficiencyCPU | —Unverified | 0 |
| Motion Graph Unleashed: A Novel Approach to Video Prediction | Oct 29, 2024 | GPUOptical Flow Estimation | CodeCode Available | 0 |
| Pushing the Performance Envelope of DNN-based Recommendation Systems Inference on GPUs | Oct 29, 2024 | GPURecommendation Systems | CodeCode Available | 0 |
| VL-Cache: Sparsity and Modality-Aware KV Cache Compression for Vision-Language Model Inference Acceleration | Oct 29, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Accelerated Bayesian parameter estimation and model selection for gravitational waves with normalizing flows | Oct 28, 2024 | CPUGPU | —Unverified | 0 |
| FusedInf: Efficient Swapping of DNN Models for On-Demand Serverless Inference Services on the Edge | Oct 28, 2024 | GPU | CodeCode Available | 0 |
| Deep Optimizer States: Towards Scalable Training of Transformer Models Using Interleaved Offloading | Oct 26, 2024 | CPUGPU | CodeCode Available | 0 |
| Computational Bottlenecks of Training Small-scale Large Language Models | Oct 25, 2024 | GPULanguage Modeling | —Unverified | 0 |