| GPU-Accelerated WFST Beam Search Decoder for CTC-based Speech Recognition | Nov 8, 2023 | CPUDecoder | CodeCode Available | 1 |
| Prompt Cache: Modular Attention Reuse for Low-Latency Inference | Nov 7, 2023 | CPUGPU | CodeCode Available | 1 |
| VR-NeRF: High-Fidelity Virtualized Walkable Spaces | Nov 5, 2023 | 2kGPU | CodeCode Available | 1 |
| In Search of Lost Online Test-time Adaptation: A Survey | Oct 31, 2023 | BenchmarkingGPU | CodeCode Available | 1 |
| Network Contention-Aware Cluster Scheduling with Reinforcement Learning | Oct 31, 2023 | GPUreinforcement-learning | CodeCode Available | 1 |
| Prediction of Effective Elastic Moduli of Rocks using Graph Neural Networks | Oct 30, 2023 | GPU | CodeCode Available | 1 |
| DiffusionVID: Denoising Object Boxes with Spatio-temporal Conditioning for Video Object Detection | Oct 30, 2023 | DenoisingGPU | CodeCode Available | 1 |
| SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models | Oct 29, 2023 | GPUMixture-of-Experts | CodeCode Available | 1 |
| LLMSTEP: LLM proofstep suggestions in Lean | Oct 27, 2023 | CPUGPU | CodeCode Available | 1 |
| Metrically Scaled Monocular Depth Estimation through Sparse Priors for Underwater Robots | Oct 25, 2023 | CPUDecoder | CodeCode Available | 1 |