| RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Jan 9, 2024 | GPUMath | CodeCode Available | 3 |
| G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems | Jan 9, 2024 | GPUMeta-Learning | —Unverified | 0 |
| Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Jan 9, 2024 | GPU | CodeCode Available | 3 |
| Low-resource finetuning of foundation models beats state-of-the-art in histopathology | Jan 9, 2024 | GPUSelf-Supervised Learning | CodeCode Available | 2 |
| IntervalMDP.jl: Accelerated Value Iteration for Interval Markov Decision Processes | Jan 8, 2024 | CPUGPU | CodeCode Available | 0 |
| FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs | Jan 8, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification | Jan 8, 2024 | GPURepresentation Learning | —Unverified | 0 |
| WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Jan 8, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference | Jan 8, 2024 | GPULanguage Modeling | —Unverified | 0 |
| A foundation for exact binarized morphological neural networks | Jan 8, 2024 | BinarizationGPU | CodeCode Available | 0 |
| CAVIAR: Co-simulation of 6G Communications, 3D Scenarios and AI for Digital Twins | Jan 6, 2024 | Autonomous VehiclesBenchmarking | CodeCode Available | 1 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Jan 5, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 2 |
| CoMoSVC: Consistency Model-based Singing Voice Conversion | Jan 3, 2024 | GPUmodel | CodeCode Available | 2 |
| LLaMA Beyond English: An Empirical Study on Language Capability Transfer | Jan 2, 2024 | GPUInformativeness | —Unverified | 0 |
| LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering | Jan 1, 2024 | GPUNeRF | —Unverified | 0 |
| LAMP: Learn A Motion Pattern for Few-Shot Video Generation | Jan 1, 2024 | GPUImage Animation | —Unverified | 0 |
| Scaling Laws for Data Filtering-- Data Curation cannot be Compute Agnostic | Jan 1, 2024 | GPU | —Unverified | 0 |
| Distraction is All You Need: Memory-Efficient Image Immunization against Diffusion-Based Image Editing | Jan 1, 2024 | AllDenoising | —Unverified | 0 |
| Resource-Efficient Transformer Pruning for Finetuning of Large Models | Jan 1, 2024 | GPUNatural Language Understanding | CodeCode Available | 1 |
| Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction | Jan 1, 2024 | 3D ReconstructionDenoising | —Unverified | 0 |
| Time- Memory- and Parameter-Efficient Visual Adaptation | Jan 1, 2024 | GPUVideo Classification | —Unverified | 0 |
| Learning to Select Views for Efficient Multi-View Understanding | Jan 1, 2024 | CPUGPU | —Unverified | 0 |
| TinyPredNet: A Lightweight Framework for Satellite Image Sequence Prediction | Jan 1, 2024 | DecoderGPU | CodeCode Available | 1 |
| MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining | Dec 29, 2023 | GPULanguage Modeling | CodeCode Available | 2 |
| Discovery of Small Ultra-short-period Planets Orbiting KG Dwarfs in Kepler Survey Using GPU Phase Folding and Deep Learning Detection System | Dec 28, 2023 | GPU | —Unverified | 0 |