| Towards Safe Load Balancing based on Control Barrier Functions and Deep Reinforcement Learning | Jan 10, 2024 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems | Jan 9, 2024 | GPUMeta-Learning | —Unverified | 0 |
| A foundation for exact binarized morphological neural networks | Jan 8, 2024 | BinarizationGPU | CodeCode Available | 0 |
| FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference | Jan 8, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification | Jan 8, 2024 | GPURepresentation Learning | —Unverified | 0 |
| IntervalMDP.jl: Accelerated Value Iteration for Interval Markov Decision Processes | Jan 8, 2024 | CPUGPU | CodeCode Available | 0 |
| FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs | Jan 8, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| LLaMA Beyond English: An Empirical Study on Language Capability Transfer | Jan 2, 2024 | GPUInformativeness | —Unverified | 0 |
| LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering | Jan 1, 2024 | GPUNeRF | —Unverified | 0 |
| LAMP: Learn A Motion Pattern for Few-Shot Video Generation | Jan 1, 2024 | GPUImage Animation | —Unverified | 0 |