| Efficient Subseasonal Weather Forecast using Teleconnection-informed Transformers | Jan 31, 2024 | GPUWeather Forecasting | —Unverified | 0 |
| Paramanu: A Family of Novel Efficient Generative Foundation Language Models for Indian Languages | Jan 31, 2024 | GPUReading Comprehension | —Unverified | 0 |
| SwapNet: Efficient Swapping for DNN Inference on Edge AI Devices Beyond the Memory Budget | Jan 30, 2024 | GPUModel Compression | —Unverified | 0 |
| GPU Cluster Scheduling for Network-Sensitive Deep Learning | Jan 29, 2024 | Deep LearningGPU | —Unverified | 0 |
| SHViT: Single-Head Vision Transformer with Memory Efficient Macro Design | Jan 29, 2024 | CPUGPU | CodeCode Available | 2 |
| M2-Encoder: Advancing Bilingual Image-Text Understanding by Large-scale Efficient Pretraining | Jan 29, 2024 | GPUzero-shot-classification | CodeCode Available | 0 |
| Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing | Jan 29, 2024 | GPURepresentation Learning | CodeCode Available | 2 |
| HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy | Jan 26, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| The Case for Co-Designing Model Architectures with Hardware | Jan 25, 2024 | Deep LearningGPU | —Unverified | 0 |
| FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design | Jan 25, 2024 | GPUQuantization | CodeCode Available | 3 |