| Cross-Scale MAE: A Tale of Multi-Scale Exploitation in Remote Sensing | Jan 29, 2024 | GPU, Representation Learning | Code Available | 2 |
| Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model | Jan 17, 2024 | GPU, Image Classification | Code Available | 2 |
| Towards High-Quality and Efficient Speech Bandwidth Extension with Parallel Amplitude and Phase Prediction | Jan 12, 2024 | Bandwidth Extension, CPU | Code Available | 2 |
| Low-resource finetuning of foundation models beats state-of-the-art in histopathology | Jan 9, 2024 | GPU, Self-Supervised Learning | Code Available | 2 |
| WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Jan 8, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Jan 5, 2024 | Arithmetic Reasoning, Code Generation | Code Available | 2 |
| CoMoSVC: Consistency Model-based Singing Voice Conversion | Jan 3, 2024 | GPU, model | Code Available | 2 |
| MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining | Dec 29, 2023 | GPU, Language Modeling | Code Available | 2 |
| Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Dec 28, 2023 | 8k, Feature Splatting | Code Available | 2 |
| Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference | Dec 23, 2023 | GPU, High-Level Synthesis | Code Available | 2 |