| Characterization of Large Language Model Development in the Datacenter | Mar 12, 2024 | GPU, Language Modeling | Code Available | 2 |
| Scalable Spatiotemporal Prediction with Bayesian Neural Fields | Mar 12, 2024 | Bayesian Inference, Demand Forecasting | Code Available | 2 |
| Mondrian: On-Device High-Performance Video Analytics with Compressive Packed Inference | Mar 12, 2024 | GPU, object-detection | Unverified | 0 |
| SSM Meets Video Diffusion Models: Efficient Long-Term Video Generation with Structured State Spaces | Mar 12, 2024 | GPU, Image Generation | Code Available | 1 |
| LookupFFN: Making Transformers Compute-lite for CPU inference | Mar 12, 2024 | CPU, GPU | Code Available | 1 |
| Multiple Population Alternate Evolution Neural Architecture Search | Mar 11, 2024 | Diversity, GPU | Unverified | 0 |
| A Converting Autoencoder Toward Low-latency and Energy-efficient DNN Inference at the Edge | Mar 11, 2024 | GPU, image-classification | Unverified | 0 |
| Smart-Infinity: Fast Large Language Model Training using Near-Storage Processing on a Real System | Mar 11, 2024 | GPU, Language Modeling | Code Available | 2 |
| Ensemble Quadratic Assignment Network for Graph Matching | Mar 11, 2024 | 3D Shape Classification, GPU | Unverified | 0 |
| SCORE: Self-supervised Correspondence Fine-tuning for Improved Content Representations | Mar 10, 2024 | Automatic Speech Recognition, Data Augmentation | Code Available | 0 |