| Flexible and Scalable Deep Dendritic Spiking Neural Networks with Multiple Nonlinear Branching | Dec 9, 2024 | Few-Shot LearningGPU | —Unverified | 0 |
| Improving text-conditioned latent diffusion for cancer pathology | Dec 9, 2024 | GPUSynthetic Data Generation | CodeCode Available | 0 |
| GraphNeuralNetworks.jl: Deep Learning on Graphs with Julia | Dec 9, 2024 | Deep LearningGPU | CodeCode Available | 3 |
| Edge Delayed Deep Deterministic Policy Gradient: efficient continuous control for edge scenarios | Dec 9, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| ASGDiffusion: Parallel High-Resolution Generation with Asynchronous Structure Guidance | Dec 9, 2024 | DenoisingGPU | —Unverified | 0 |
| MCP-MedSAM: A Powerful Lightweight Medical Segment Anything Model Trained with a Single GPU in Just One Day | Dec 8, 2024 | GPUImage Segmentation | CodeCode Available | 1 |
| Batch-Max: Higher LLM Throughput using Larger Batch Sizes and KV Cache Compression | Dec 7, 2024 | GPU | —Unverified | 0 |
| Code generation and runtime techniques for enabling data-efficient deep learning training on GPUs | Dec 6, 2024 | Code GenerationDeep Learning | —Unverified | 0 |
| Flash Communication: Reducing Tensor Parallelization Bottleneck for Fast Large Language Model Inference | Dec 6, 2024 | GPULanguage Modeling | —Unverified | 0 |
| APOLLO: SGD-like Memory, AdamW-level Performance | Dec 6, 2024 | GPUQuantization | CodeCode Available | 3 |