| MLTCP: Congestion Control for DNN Training | Feb 14, 2024 | GPU | —Unverified | 0 |
| Multi-Level GNN Preconditioner for Solving Large Scale Problems | Feb 13, 2024 | GPU | —Unverified | 0 |
| Graph Feature Preprocessor: Real-time Subgraph-based Feature Extraction for Financial Crime Detection | Feb 13, 2024 | CPUGPU | —Unverified | 0 |
| Benchmarking and Building Long-Context Retrieval Models with LoCo and M2-BERT | Feb 12, 2024 | BenchmarkingChunking | —Unverified | 0 |
| The I/O Complexity of Attention, or How Optimal is Flash Attention? | Feb 12, 2024 | GPU | —Unverified | 0 |
| Accelerating Distributed Deep Learning using Lossless Homomorphic Compression | Feb 12, 2024 | Computational EfficiencyCPU | CodeCode Available | 0 |
| Context-aware Multi-Model Object Detection for Diversely Heterogeneous Compute Systems | Feb 12, 2024 | GPUobject-detection | CodeCode Available | 0 |
| Cardiac ultrasound simulation for autonomous ultrasound navigation | Feb 9, 2024 | DiagnosticGPU | —Unverified | 0 |
| Anatomizing Deep Learning Inference in Web Browsers | Feb 8, 2024 | CPUDeep Learning | —Unverified | 0 |
| On the Convergence of Zeroth-Order Federated Tuning for Large Language Models | Feb 8, 2024 | Federated LearningGPU | —Unverified | 0 |