| Disrupting Diffusion-based Inpainters with Semantic Digression | Jul 14, 2024 | GPU, Misinformation | Unverified | 0 |
| LeanQuant: Accurate Large Language Model Quantization with Loss-Error-Aware Grid | Jul 14, 2024 | GPU, Language Modeling | Unverified | 0 |
| Enhancing Training Efficiency Using Packing with Flash Attention | Jul 12, 2024 | GPU | Unverified | 0 |
| Weight Block Sparsity: Training, Compilation, and AI Engine Accelerators | Jul 12, 2024 | Code Generation, GPU | Unverified | 0 |
| Analyzing Machine Learning Performance in a Hybrid Quantum Computing and HPC Environment | Jul 10, 2024 | CPU, GPU | Unverified | 0 |
| Swiss DINO: Efficient and Versatile Vision Framework for On-device Personal Object Search | Jul 10, 2024 | Few-Shot Learning, GPU | Code Available | 0 |
| HAFormer: Unleashing the Power of Hierarchy-Aware Features for Lightweight Semantic Segmentation | Jul 10, 2024 | GPU, Semantic Segmentation | Code Available | 0 |
| Parameter Efficient Fine Tuning for Multi-scanner PET to PET Reconstruction | Jul 10, 2024 | Decoder, GPU | Unverified | 0 |
| INSIGHT: Universal Neural Simulator for Analog Circuits Harnessing Autoregressive Transformers | Jul 10, 2024 | GPU | Unverified | 0 |
| 3D Gaussian Ray Tracing: Fast Tracing of Particle Scenes | Jul 9, 2024 | GPU | Unverified | 0 |
| Mobius: A High Efficient Spatial-Temporal Parallel Training Paradigm for Text-to-Video Generation Task | Jul 9, 2024 | GPU, Text-to-Video Generation | Code Available | 0 |
| DεpS: Delayed ε-Shrinking for Faster Once-For-All Training | Jul 8, 2024 | All, GPU | Unverified | 0 |
| Accelerating MRI Uncertainty Estimation with Mask-based Bayesian Neural Network | Jul 7, 2024 | CPU, Diagnostic | Unverified | 0 |
| The Solution for the AIGC Inference Performance Optimization Competition | Jul 6, 2024 | Computational Efficiency, GPU | Unverified | 0 |
| Lazarus: Resilient and Elastic Training of Mixture-of-Experts Models with Adaptive Expert Placement | Jul 5, 2024 | GPU, Mixture-of-Experts | Unverified | 0 |
| Re-Tuning: Overcoming the Compositionality Limits of Large Language Models with Recursive Tuning | Jul 5, 2024 | GPU | Code Available | 0 |
| Autoverse: An Evolvable Game Language for Learning Robust Embodied Agents | Jul 5, 2024 | GPU, Imitation Learning | Unverified | 0 |
| PatchEX: High-Quality Real-Time Temporal Supersampling through Patch-based Parallel Extrapolation | Jul 5, 2024 | GPU | Unverified | 0 |
| GOALPlace: Begin with the End in Mind | Jul 5, 2024 | GPU | Unverified | 0 |
| LoCo: Low-Bit Communication Adaptor for Large-scale Model Training | Jul 5, 2024 | GPU | Code Available | 0 |
| Green Multigrid Network | Jul 4, 2024 | GPU, Operator learning | Unverified | 0 |
| Learning Scalable Model Soup on a Single GPU: An Efficient Subspace Training Strategy | Jul 4, 2024 | GPU | Code Available | 0 |
| Implementation and Analysis of GPU Algorithms for Vecchia Approximation | Jul 3, 2024 | Gaussian Processes, GPU | Code Available | 0 |
| Achieving High Throughput with a Trainable Neural-Network-Based Equalizer for Communications on FPGA | Jul 3, 2024 | GPU | Unverified | 0 |
| Benchmarking End-To-End Performance of AI-Based Chip Placement Algorithms | Jul 3, 2024 | Benchmarking, CPU | Unverified | 0 |
| M5: A Whole Genome Bacterial Encoder at Single Nucleotide Resolution | Jul 3, 2024 | GPU | Unverified | 0 |
| Automated Text Scoring in the Age of Generative AI for the GPU-poor | Jul 2, 2024 | GPU | Unverified | 0 |
| SparseSSP: 3D Subcellular Structure Prediction from Sparse-View Transmitted Light Images | Jul 2, 2024 | GPU | Code Available | 0 |
| M^2IST: Multi-Modal Interactive Side-Tuning for Efficient Referring Expression Comprehension | Jul 1, 2024 | GPU, Referring Expression | Unverified | 0 |
| Needle in the Haystack for Memory Based Large Language Models | Jul 1, 2024 | Decoder, GPU | Unverified | 0 |
| PQCache: Product Quantization-based KVCache for Long Context LLM Inference | Jul 1, 2024 | GPU, Quantization | Unverified | 0 |
| SpectralKAN: Kolmogorov-Arnold Network for Hyperspectral Images Change Detection | Jul 1, 2024 | Change Detection, Computational Efficiency | Code Available | 0 |
| Badllama 3: removing safety finetuning from Llama 3 in minutes | Jul 1, 2024 | GPU | Unverified | 0 |
| Parm: Efficient Training of Large Sparsely-Activated Models with Dedicated Schedules | Jun 30, 2024 | GPU | Code Available | 0 |
| Hierarchical Memory for Long Video QA | Jun 30, 2024 | GPU, Question Answering | Unverified | 0 |
| LASSI: An LLM-based Automated Self-Correcting Pipeline for Translating Parallel Scientific Codes | Jun 30, 2024 | GPU | Unverified | 0 |
| Explore as a Storm, Exploit as a Raindrop: On the Benefit of Fine-Tuning Kernel Schedulers with Coordinate Descent | Jun 28, 2024 | GPU, Scheduling | Code Available | 0 |
| Meta Large Language Model Compiler: Foundation Models of Compiler Optimization | Jun 27, 2024 | Compiler Optimization, GPU | Unverified | 0 |
| MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data | Jun 26, 2024 | Decoder, GPU | Unverified | 0 |
| Real-time Structure Flow | Jun 26, 2024 | Autonomous Vehicles, GPU | Unverified | 0 |
| DICE: End-to-end Deformation Capture of Hand-Face Interactions from a Single Image | Jun 26, 2024 | GPU | Unverified | 0 |
| Graph Neural Network as Computationally Efficient Emulator of Ice-sheet and Sea-level System Model (ISSM) | Jun 26, 2024 | CPU, GPU | Unverified | 0 |
| The Overcooked Generalisation Challenge | Jun 25, 2024 | GPU | Code Available | 0 |
| BlockLLM: Memory-Efficient Adaptation of LLMs by Selecting and Optimizing the Right Coordinate Blocks | Jun 25, 2024 | GPU | Code Available | 0 |
| Video-Infinity: Distributed Long Video Generation | Jun 24, 2024 | GPU, Video Generation | Unverified | 0 |
| GraphPipe: Improving Performance and Scalability of DNN Training with Graph Pipeline Parallelism | Jun 24, 2024 | GPU | Unverified | 0 |
| MLAAN: Scaling Supervised Local Learning with Multilaminar Leap Augmented Auxiliary Network | Jun 24, 2024 | GPU | Code Available | 0 |
| Hardware-Aware Neural Dropout Search for Reliable Uncertainty Prediction on FPGA | Jun 23, 2024 | Decision Making, GPU | Code Available | 0 |
| LaneSegNet Design Study | Jun 22, 2024 | Autonomous Vehicles, Decoder | Unverified | 0 |
| ExVideo: Extending Video Diffusion Models via Parameter-Efficient Post-Tuning | Jun 20, 2024 | GPU, Video Generation | Code Available | 0 |