| Dynamic Adapter Meets Prompt Tuning: Parameter-Efficient Transfer Learning for Point Cloud Analysis | Mar 3, 2024 | 3D Parameter-Efficient Fine-Tuning for ClassificationGPU | CodeCode Available | 2 |
| LLM-PQ: Serving LLM on Heterogeneous Clusters with Phase-Aware Partition and Adaptive Quantization | Mar 2, 2024 | GPUQuantization | CodeCode Available | 1 |
| Parallel Hyperparameter Optimization Of Spiking Neural Network | Mar 1, 2024 | Bayesian OptimizationGPU | CodeCode Available | 0 |
| CollaFuse: Navigating Limited Resources and Privacy in Collaborative Generative AI | Feb 29, 2024 | Autonomous DrivingDenoising | CodeCode Available | 0 |
| WDM: 3D Wavelet Diffusion Models for High-Resolution Medical Image Synthesis | Feb 29, 2024 | DiversityGPU | CodeCode Available | 2 |
| Efficient Lifelong Model Evaluation in an Era of Rapid Progress | Feb 29, 2024 | BenchmarkingGPU | CodeCode Available | 1 |
| DistriFusion: Distributed Parallel Inference for High-Resolution Diffusion Models | Feb 29, 2024 | GPU | CodeCode Available | 4 |
| FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Feb 29, 2024 | GPULanguage Modeling | CodeCode Available | 5 |
| FlattenQuant: Breaking Through the Inference Compute-bound for Large Language Models with Per-tensor Quantization | Feb 28, 2024 | GPUQuantization | —Unverified | 0 |
| JMLR: Joint Medical LLM and Retrieval Training for Enhancing Reasoning and Professional Question Answering Capability | Feb 27, 2024 | GPUInformation Retrieval | CodeCode Available | 0 |
| DropBP: Accelerating Fine-Tuning of Large Language Models by Dropping Backward Propagation | Feb 27, 2024 | GPUparameter-efficient fine-tuning | CodeCode Available | 1 |
| Differentiable Biomechanics Unlocks Opportunities for Markerless Motion Capture | Feb 27, 2024 | GPUMarkerless Motion Capture | —Unverified | 0 |
| Scaling Supervised Local Learning with Augmented Auxiliary Networks | Feb 27, 2024 | GPUimage-classification | CodeCode Available | 0 |
| Multimodal Learned Sparse Retrieval with Probabilistic Expansion Control | Feb 27, 2024 | GPUImage Retrieval | CodeCode Available | 1 |
| Compass: A Decentralized Scheduler for Latency-Sensitive ML Workflows | Feb 27, 2024 | GPUManagement | —Unverified | 0 |
| Single Neuromorphic Memristor closely Emulates Multiple Synaptic Mechanisms for Energy Efficient Neural Networks | Feb 26, 2024 | GPUMeta-Learning | —Unverified | 0 |
| Video-Based Autism Detection with Deep Learning | Feb 26, 2024 | Autism detectionDeep Learning | —Unverified | 0 |
| Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning | Feb 26, 2024 | GPUMinecraft | CodeCode Available | 3 |
| PyGim: An Efficient Graph Neural Network Library for Real Processing-In-Memory Architectures | Feb 26, 2024 | CPUGPU | CodeCode Available | 1 |
| DEYO: DETR with YOLO for End-to-End Object Detection | Feb 26, 2024 | DecoderGPU | CodeCode Available | 2 |
| Data-freeWeight Compress and Denoise for Large Language Models | Feb 26, 2024 | GPUQuantization | —Unverified | 0 |
| Divide-Conquer-and-Merge: Memory- and Time-Efficient Holographic Displays | Feb 25, 2024 | 16k8k | —Unverified | 0 |
| Sustainable Supercomputing for AI: GPU Power Capping at HPC Scale | Feb 25, 2024 | GPU | —Unverified | 0 |
| Spec-Gaussian: Anisotropic View-Dependent Appearance for 3D Gaussian Splatting | Feb 24, 2024 | GPU | —Unverified | 0 |
| Sparse MeZO: Less Parameters for Better Performance in Zeroth-Order LLM Fine-Tuning | Feb 24, 2024 | GPUparameter-efficient fine-tuning | —Unverified | 0 |
| Fast Adversarial Attacks on Language Models In One GPU Minute | Feb 23, 2024 | Adversarial AttackComputational Efficiency | CodeCode Available | 2 |
| Sampling-based Distributed Training with Message Passing Neural Network | Feb 23, 2024 | GPUGraph Neural Network | —Unverified | 0 |
| Optimal Transport on the Lie Group of Roto-translations | Feb 23, 2024 | GPUTranslation | —Unverified | 0 |
| Second-Order Fine-Tuning without Pain for LLMs:A Hessian Informed Zeroth-Order Optimizer | Feb 23, 2024 | GPU | —Unverified | 0 |
| Automated Design and Optimization of Distributed Filtering Circuits via Reinforcement Learning | Feb 22, 2024 | CPUGPU | —Unverified | 0 |
| FinGPT-HPC: Efficient Pretraining and Finetuning Large Language Models for Financial Applications with High-Performance Computing | Feb 21, 2024 | GPUModel Compression | —Unverified | 0 |
| Learning to Retrieve for Job Matching | Feb 21, 2024 | GPURecommendation Systems | —Unverified | 0 |
| NeuroFlux: Memory-Efficient CNN Training Using Adaptive Local Learning | Feb 21, 2024 | GPU | —Unverified | 0 |
| Green AI: A Preliminary Empirical Study on Energy Consumption in DL Models Across Different Runtime Infrastructures | Feb 21, 2024 | CPUGPU | —Unverified | 0 |
| Identifying Unnecessary 3D Gaussians using Clustering for Fast Rendering of 3D Gaussian Splatting | Feb 21, 2024 | GPUNeRF | —Unverified | 0 |
| Ray Tracing Algorithm for Reconfigurable Intelligent Surfaces | Feb 20, 2024 | GPU | —Unverified | 0 |
| Me LLaMA: Foundation Large Language Models for Medical Applications | Feb 20, 2024 | Few-Shot LearningGPU | CodeCode Available | 2 |
| CST: Calibration Side-Tuning for Parameter and Memory Efficient Transfer Learning | Feb 20, 2024 | GPUObject | —Unverified | 0 |
| TorchCP: A Python Library for Conformal Prediction | Feb 20, 2024 | Conformal PredictionDeep Learning | CodeCode Available | 3 |
| Mechanistic Neural Networks for Scientific Machine Learning | Feb 20, 2024 | Equation DiscoveryGPU | CodeCode Available | 1 |
| EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs | Feb 19, 2024 | GPU | CodeCode Available | 0 |
| Short-Period Variables in TESS Full-Frame Image Light Curves Identified via Convolutional Neural Networks | Feb 19, 2024 | GPU | —Unverified | 0 |
| All Language Models Large and Small | Feb 19, 2024 | AllDecision Making | —Unverified | 0 |
| LTL learning on GPUs | Feb 19, 2024 | GPUProgram Synthesis | CodeCode Available | 0 |
| BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation | Feb 18, 2024 | GPUQuestion Answering | CodeCode Available | 1 |
| Turn Waste into Worth: Rectifying Top-k Router of MoE | Feb 17, 2024 | Computational EfficiencyGPU | —Unverified | 0 |
| SpikeNAS: A Fast Memory-Aware Neural Architecture Search Framework for Spiking Neural Network-based Autonomous Agents | Feb 17, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| Expressive Higher-Order Link Prediction through Hypergraph Symmetry Breaking | Feb 17, 2024 | GPULink Prediction | CodeCode Available | 0 |
| PointMamba: A Simple State Space Model for Point Cloud Analysis | Feb 16, 2024 | GPUMamba | CodeCode Available | 4 |
| Fully Differentiable Lagrangian Convolutional Neural Network for Continuity-Consistent Physics-Informed Precipitation Nowcasting | Feb 16, 2024 | GPU | —Unverified | 0 |