| Low-resource finetuning of foundation models beats state-of-the-art in histopathology | Jan 9, 2024 | GPU, Self-Supervised Learning | Code Available | 2 |
| G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems | Jan 9, 2024 | GPU, Meta-Learning | Unverified | 0 |
| Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Jan 9, 2024 | GPU | Code Available | 3 |
| RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Jan 9, 2024 | GPU, Math | Code Available | 3 |
| IntervalMDP.jl: Accelerated Value Iteration for Interval Markov Decision Processes | Jan 8, 2024 | CPU, GPU | Code Available | 0 |
| FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference | Jan 8, 2024 | GPU, Language Modeling | Unverified | 0 |
| Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification | Jan 8, 2024 | GPU, Representation Learning | Unverified | 0 |
| FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs | Jan 8, 2024 | Computational Efficiency, GPU | Unverified | 0 |
| WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Jan 8, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| A foundation for exact binarized morphological neural networks | Jan 8, 2024 | Binarization, GPU | Code Available | 0 |