| RoSA: Accurate Parameter-Efficient Fine-Tuning via Robust Adaptation | Jan 9, 2024 | GPUMath | CodeCode Available | 3 |
| Lightning Attention-2: A Free Lunch for Handling Unlimited Sequence Lengths in Large Language Models | Jan 9, 2024 | GPU | CodeCode Available | 3 |
| Low-resource finetuning of foundation models beats state-of-the-art in histopathology | Jan 9, 2024 | GPUSelf-Supervised Learning | CodeCode Available | 2 |
| G-Meta: Distributed Meta Learning in GPU Clusters for Large-Scale Recommender Systems | Jan 9, 2024 | GPUMeta-Learning | —Unverified | 0 |
| IntervalMDP.jl: Accelerated Value Iteration for Interval Markov Decision Processes | Jan 8, 2024 | CPUGPU | CodeCode Available | 0 |
| FFSplit: Split Feed-Forward Network For Optimizing Accuracy-Efficiency Trade-off in Language Model Inference | Jan 8, 2024 | GPULanguage Modeling | —Unverified | 0 |
| Efficient Multiscale Multimodal Bottleneck Transformer for Audio-Video Classification | Jan 8, 2024 | GPURepresentation Learning | —Unverified | 0 |
| FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs | Jan 8, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| A foundation for exact binarized morphological neural networks | Jan 8, 2024 | BinarizationGPU | CodeCode Available | 0 |
| WidthFormer: Toward Efficient Transformer-based BEV View Transformation | Jan 8, 2024 | 3D Object DetectionAutonomous Driving | CodeCode Available | 2 |
| CAVIAR: Co-simulation of 6G Communications, 3D Scenarios and AI for Digital Twins | Jan 6, 2024 | Autonomous VehiclesBenchmarking | CodeCode Available | 1 |
| Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks | Jan 5, 2024 | Arithmetic ReasoningCode Generation | CodeCode Available | 2 |
| CoMoSVC: Consistency Model-based Singing Voice Conversion | Jan 3, 2024 | GPUmodel | CodeCode Available | 2 |
| LLaMA Beyond English: An Empirical Study on Language Capability Transfer | Jan 2, 2024 | GPUInformativeness | —Unverified | 0 |
| Scaling Laws for Data Filtering-- Data Curation cannot be Compute Agnostic | Jan 1, 2024 | GPU | —Unverified | 0 |
| Resource-Efficient Transformer Pruning for Finetuning of Large Models | Jan 1, 2024 | GPUNatural Language Understanding | CodeCode Available | 1 |
| LTM: Lightweight Textured Mesh Extraction and Refinement of Large Unbounded Scenes for Efficient Storage and Real-time Rendering | Jan 1, 2024 | GPUNeRF | —Unverified | 0 |
| Distraction is All You Need: Memory-Efficient Image Immunization against Diffusion-Based Image Editing | Jan 1, 2024 | AllDenoising | —Unverified | 0 |
| Slice3D: Multi-Slice Occlusion-Revealing Single View 3D Reconstruction | Jan 1, 2024 | 3D ReconstructionDenoising | —Unverified | 0 |
| LAMP: Learn A Motion Pattern for Few-Shot Video Generation | Jan 1, 2024 | GPUImage Animation | —Unverified | 0 |
| Time- Memory- and Parameter-Efficient Visual Adaptation | Jan 1, 2024 | GPUVideo Classification | —Unverified | 0 |
| Learning to Select Views for Efficient Multi-View Understanding | Jan 1, 2024 | CPUGPU | —Unverified | 0 |
| TinyPredNet: A Lightweight Framework for Satellite Image Sequence Prediction | Jan 1, 2024 | DecoderGPU | CodeCode Available | 1 |
| MosaicBERT: A Bidirectional Encoder Optimized for Fast Pretraining | Dec 29, 2023 | GPULanguage Modeling | CodeCode Available | 2 |
| Discovery of Small Ultra-short-period Planets Orbiting KG Dwarfs in Kepler Survey Using GPU Phase Folding and Deep Learning Detection System | Dec 28, 2023 | GPU | —Unverified | 0 |
| MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile Devices | Dec 28, 2023 | AutoMLCPU | CodeCode Available | 3 |
| Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis | Dec 28, 2023 | 8kFeature Splatting | CodeCode Available | 2 |
| City-on-Web: Real-time Neural Rendering of Large-scale Scenes on the Web | Dec 27, 2023 | 3D Scene ReconstructionGPU | CodeCode Available | 1 |
| FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNs | Dec 27, 2023 | BenchmarkingGPU | CodeCode Available | 0 |
| Masked Contrastive Reconstruction for Cross-modal Medical Image-Report Retrieval | Dec 26, 2023 | Contrastive LearningCross-Modal Retrieval | —Unverified | 0 |
| XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Dec 25, 2023 | CPUDeep Reinforcement Learning | CodeCode Available | 3 |
| Proximal Gradient Descent Unfolding Dense-spatial Spectral-attention Transformer for Compressive Spectral Imaging | Dec 25, 2023 | GPU | —Unverified | 0 |
| A Comprehensive Evaluation of Parameter-Efficient Fine-Tuning on Software Engineering Tasks | Dec 25, 2023 | GPUparameter-efficient fine-tuning | CodeCode Available | 0 |
| BiSwift: Bandwidth Orchestrator for Multi-Stream Video Analytics on Edge | Dec 25, 2023 | FairnessGPU | —Unverified | 0 |
| CARSS: Cooperative Attention-guided Reinforcement Subpath Synthesis for Solving Traveling Salesman Problem | Dec 24, 2023 | GPUMulti-agent Reinforcement Learning | —Unverified | 0 |
| PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs | Dec 23, 2023 | GPU | CodeCode Available | 0 |
| Understanding the Potential of FPGA-Based Spatial Acceleration for Large Language Model Inference | Dec 23, 2023 | GPUHigh-Level Synthesis | CodeCode Available | 2 |
| ZO-AdaMU Optimizer: Adapting Perturbation by the Momentum and Uncertainty in Zeroth-order Optimization | Dec 23, 2023 | GPU | CodeCode Available | 1 |
| Emage: Non-Autoregressive Text-to-Image Generation | Dec 22, 2023 | DenoisingGPU | —Unverified | 0 |
| BSS-Bench: Towards Reproducible and Effective Band Selection Search | Dec 22, 2023 | GPU | —Unverified | 0 |
| CRD: Collaborative Representation Distance for Practical Anomaly Detection | Dec 20, 2023 | Anomaly DetectionComputational Efficiency | —Unverified | 0 |
| NeRF-VO: Real-Time Sparse Visual Odometry with Neural Radiance Fields | Dec 20, 2023 | Depth EstimationDepth Prediction | —Unverified | 0 |
| PointeNet: A Lightweight Framework for Effective and Efficient Point Cloud Analysis | Dec 20, 2023 | 3D Object DetectionAutonomous Driving | —Unverified | 0 |
| Optimizing Distributed Training on Frontier for Large Language Models | Dec 20, 2023 | Computational EfficiencyGPU | —Unverified | 0 |
| Splatter Image: Ultra-Fast Single-View 3D Reconstruction | Dec 20, 2023 | 3D Object Reconstruction3D Reconstruction | CodeCode Available | 3 |
| Efficient LLM inference solution on Intel GPU | Dec 19, 2023 | DecoderGPU | —Unverified | 0 |
| Enhancing predictive capabilities in fusion burning plasmas through surrogate-based optimization in core transport solvers | Dec 19, 2023 | GPUPrediction | CodeCode Available | 1 |
| Lightning-Fast Image Inversion and Editing for Text-to-Image Diffusion Models | Dec 19, 2023 | GPU | CodeCode Available | 1 |
| IS-DARTS: Stabilizing DARTS through Precise Measurement on Candidate Importance | Dec 19, 2023 | GPUNeural Architecture Search | CodeCode Available | 0 |
| A Case Study in CUDA Kernel Fusion: Implementing FlashAttention-2 on NVIDIA Hopper Architecture using the CUTLASS Library | Dec 19, 2023 | GPU | CodeCode Available | 2 |