| Automated Filtering of Human Feedback Data for Aligning Text-to-Image Diffusion Models | Oct 14, 2024 | Diversity, GPU | Code Available | 0 |
| MuseTalk: Real-Time High-Fidelity Video Dubbing via Spatio-Temporal Sampling | Oct 14, 2024 | Audio-Visual Synchronization, GPU | Code Available | 9 |
| Deep Compression Autoencoder for Efficient High-Resolution Diffusion Models | Oct 14, 2024 | GPU, Image Generation | Unverified | 0 |
| SANA: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformers | Oct 14, 2024 | Decoder, GPU | Code Available | 9 |
| PromptGCN: Bridging Subgraph Gaps in Lightweight GCNs | Oct 14, 2024 | GPU, Recommendation Systems | Unverified | 0 |
| MoIN: Mixture of Introvert Experts to Upcycle an LLM | Oct 13, 2024 | GPU, Language Modeling | Unverified | 0 |
| CtrLoRA: An Extensible and Efficient Framework for Controllable Image Generation | Oct 12, 2024 | Conditional Image Generation, GPU | Code Available | 3 |
| ActNAS : Generating Efficient YOLO Models using Activation NAS | Oct 11, 2024 | CPU, GPU | Unverified | 0 |
| Superpipeline: A Universal Approach for Reducing GPU Memory Usage in Large Models | Oct 11, 2024 | CPU, GPU | Code Available | 0 |
| Parallel Watershed Partitioning: GPU-Based Hierarchical Image Segmentation | Oct 11, 2024 | GPU, Image Segmentation | Unverified | 0 |
| VIBES -- Vision Backbone Efficient Selection | Oct 11, 2024 | GPU | Unverified | 0 |
| SPA: 3D Spatial-Awareness Enables Effective Embodied Representation | Oct 10, 2024 | GPU, Neural Rendering | Code Available | 1 |
| HM-DF SNN: Transcending Conventional Online Learning with Advanced Training and Deployment | Oct 10, 2024 | GPU | Unverified | 0 |
| Neural Reasoning Networks: Efficient Interpretable Neural Networks With Automatic Textual Explanations | Oct 10, 2024 | Fairness, Feature Importance | Code Available | 1 |
| CSA: Data-efficient Mapping of Unimodal Features to Multimodal Features | Oct 10, 2024 | Cross-Modal Retrieval, GPU | Unverified | 0 |
| QuAILoRA: Quantization-Aware Initialization for LoRA | Oct 9, 2024 | Causal Language Modeling, GPU | Unverified | 0 |
| TinyClick: Single-Turn Agent for Empowering GUI Automation | Oct 9, 2024 | Data Augmentation, GPU | Unverified | 0 |
| MoE++: Accelerating Mixture-of-Experts Methods with Zero-Computation Experts | Oct 9, 2024 | GPU, Mixture-of-Experts | Code Available | 4 |
| Addax: Utilizing Zeroth-Order Gradients to Improve Memory Efficiency and Performance of SGD for Fine-Tuning Language Models | Oct 9, 2024 | GPU | Unverified | 0 |
| TorchTitan: One-stop PyTorch native solution for production ready LLM pre-training | Oct 9, 2024 | GPU | Code Available | 9 |
| Do better language models have crisper vision? | Oct 9, 2024 | Decoder, GPU | Unverified | 0 |
| Automated Quality Control System for Canned Tuna Production using Artificial Vision | Oct 8, 2024 | GPU, Optical Character Recognition (OCR) | Unverified | 0 |
| PortLLM: Personalizing Evolving Large Language Models with Training-Free and Portable Model Patches | Oct 8, 2024 | GPU, GSM8K | Unverified | 0 |
| Pyramidal Flow Matching for Efficient Video Generative Modeling | Oct 8, 2024 | GPU, Text-to-Video Generation | Code Available | 7 |
| ViBiDSampler: Enhancing Video Interpolation Using Bidirectional Diffusion Sampler | Oct 8, 2024 | GPU, Video Generation | Unverified | 0 |