| Searching Priors Makes Text-to-Video Synthesis Better | Jun 5, 2024 | GPU | —Unverified | 0 |
| A Comprehensive Library for Benchmarking Multi-class Visual Anomaly Detection | Jun 5, 2024 | Anomaly DetectionBenchmarking | —Unverified | 0 |
| Multi-Stage Speech Bandwidth Extension with Flexible Sampling Rate Control | Jun 4, 2024 | Bandwidth ExtensionCPU | CodeCode Available | 2 |
| Scalable MatMul-free Language Modeling | Jun 4, 2024 | GPULanguage Modeling | CodeCode Available | 7 |
| Leveraging Visual Tokens for Extended Text Contexts in Multi-Modal Learning | Jun 4, 2024 | document understandingGPU | CodeCode Available | 1 |
| A Study of Optimizations for Fine-tuning Large Language Models | Jun 4, 2024 | GPU | —Unverified | 0 |
| Speeding up Policy Simulation in Supply Chain RL | Jun 4, 2024 | GPU | —Unverified | 0 |
| Flash Diffusion: Accelerating Any Conditional Diffusion Model for Few Steps Image Generation | Jun 4, 2024 | Face SwappingGPU | CodeCode Available | 4 |
| LlamaCare: A Large Medical Language Model for Enhancing Healthcare Knowledge Sharing | Jun 4, 2024 | ClassificationGPU | CodeCode Available | 1 |
| SUBLLM: A Novel Efficient Architecture with Token Sequence Subsampling for LLM | Jun 3, 2024 | DecoderGPU | CodeCode Available | 2 |
| ACCO: Accumulate While You Communicate for Communication-Overlapped Sharded LLM Training | Jun 3, 2024 | Distributed OptimizationFederated Learning | CodeCode Available | 1 |
| GPU-Accelerated Rule Evaluation and Evolution | Jun 3, 2024 | Explainable artificial intelligenceGPU | —Unverified | 0 |
| OLoRA: Orthonormal Low-Rank Adaptation of Large Language Models | Jun 3, 2024 | GPULanguage Modeling | —Unverified | 0 |
| D-CPT Law: Domain-specific Continual Pre-Training Scaling Law for Large Language Models | Jun 3, 2024 | GPUMath | —Unverified | 0 |
| CE-NAS: An End-to-End Carbon-Efficient Neural Architecture Search Framework | Jun 3, 2024 | GPUNeural Architecture Search | —Unverified | 0 |
| Helix: Serving Large Language Models over Heterogeneous GPUs and Network via Max-Flow | Jun 3, 2024 | GPULanguage Modeling | CodeCode Available | 2 |
| ZeroSmooth: Training-free Diffuser Adaptation for High Frame Rate Video Generation | Jun 3, 2024 | GPUVideo Generation | CodeCode Available | 2 |
| RGFN: Synthesizable Molecular Generation Using GFlowNets | Jun 1, 2024 | GPU | CodeCode Available | 1 |
| Multi-Objective Neural Architecture Search by Learning Search Space Partitions | Jun 1, 2024 | Bayesian OptimizationGPU | —Unverified | 0 |
| AudioLCM: Text-to-Audio Generation with Latent Consistency Models | Jun 1, 2024 | Audio GenerationAudio Synthesis | CodeCode Available | 5 |
| Advancing Supervised Local Learning Beyond Classification with Long-term Feature Bank | Jun 1, 2024 | GPUimage-classification | —Unverified | 0 |
| μLO: Compute-Efficient Meta-Generalization of Learned Optimizers | May 31, 2024 | GPUZero-shot Generalization | CodeCode Available | 1 |
| S3D: A Simple and Cost-Effective Self-Speculative Decoding Scheme for Low-Memory GPUs | May 30, 2024 | GPUQuantization | —Unverified | 0 |
| MotionFollower: Editing Video Motion via Lightweight Score-Guided Diffusion | May 30, 2024 | DenoisingGPU | CodeCode Available | 3 |
| Knowledge Graph Tuning: Real-time Large Language Model Personalization based on Human Feedback | May 30, 2024 | GPUKnowledge Graphs | —Unverified | 0 |