| NGD-SLAM: Towards Real-Time Dynamic SLAM without GPU | May 12, 2024 | CPU, Deep Learning | Code Available | 3 |
| Input Snapshots Fusion for Scalable Discrete Dynamic Graph Neural Networks | May 11, 2024 | Denoising, GPU | Unverified | 0 |
| Selective Focus: Investigating Semantics Sensitivity in Post-training Quantization for Lane Detection | May 10, 2024 | Autonomous Driving, GPU | Unverified | 0 |
| SKVQ: Sliding-window Key and Value Cache Quantization for Large Language Models | May 10, 2024 | GPU, Quantization | Unverified | 0 |
| Aerial-NeRF: Adaptive Spatial Partitioning and Sampling for Large-Scale Aerial Rendering | May 10, 2024 | GPU, NeRF | Unverified | 0 |
| Mirage: A Multi-Level Superoptimizer for Tensor Programs | May 9, 2024 | GPU, Navigate | Code Available | 7 |
| Preble: Efficient Distributed Prompt Scheduling for LLM Serving | May 8, 2024 | GPU, Scheduling | Code Available | 2 |
| Vidur: A Large-Scale Simulation Framework For LLM Inference | May 8, 2024 | CPU, GPU | Code Available | 4 |
| You Only Cache Once: Decoder-Decoder Architectures for Language Models | May 8, 2024 | Decoder, GPU | Code Available | 0 |
| A New Dataset and Comparative Study for Aphid Cluster Detection and Segmentation in Sorghum Fields | May 7, 2024 | GPU, object-detection | Unverified | 0 |