| LinFusion: 1 GPU, 1 Minute, 16K Image | Sep 3, 2024 | 16k, Causal Inference | Code Available | 3 |
| GaussianPU: A Hybrid 2D-3D Upsampling Framework for Enhancing Color Point Clouds via 3D Gaussian Splatting | Sep 3, 2024 | 3DGS, GPU | Unverified | 0 |
| Compressing VAE-Based Out-of-Distribution Detectors for Embedded Deployment | Sep 2, 2024 | CPU, GPU | Unverified | 0 |
| TempMe: Video Temporal Token Merging for Efficient Text-Video Retrieval | Sep 2, 2024 | GPU, Retrieval | Code Available | 1 |
| Follow-Your-Canvas: Higher-Resolution Video Outpainting with Extensive Content Generation | Sep 2, 2024 | GPU | Code Available | 2 |
| Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications | Sep 2, 2024 | CPU, Federated Learning | Code Available | 2 |
| VideoLLaMB: Long-context Video Understanding with Recurrent Memory Bridges | Sep 2, 2024 | GPU, MVBench | Unverified | 0 |
| OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model | Sep 2, 2024 | GPU, Video Generation | Unverified | 0 |
| Accelerating Hybrid Agent-Based Models and Fuzzy Cognitive Maps: How to Combine Agents who Think Alike? | Sep 1, 2024 | Community Detection, GPU | Unverified | 0 |
| LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models | Aug 31, 2024 | 8k, GPU | Code Available | 2 |
| ContextVLM: Zero-Shot and Few-Shot Context Understanding for Autonomous Driving using Vision Language Models | Aug 30, 2024 | Autonomous Driving, GPU | Unverified | 0 |
| VQ4DiT: Efficient Post-Training Vector Quantization for Diffusion Transformers | Aug 30, 2024 | GPU, Image Generation | Unverified | 0 |
| Training Ultra Long Context Language Model with Fully Pipelined Distributed Transformer | Aug 30, 2024 | GPU, Language Modeling | Unverified | 0 |
| MemLong: Memory-Augmented Retrieval for Long Text Modeling | Aug 30, 2024 | 4k, Decoder | Code Available | 2 |
| H-SGANet: Hybrid Sparse Graph Attention Network for Deformable Medical Image Registration | Aug 29, 2024 | Deformable Medical Image Registration, GPU | Unverified | 0 |
| TinyTNAS: GPU-Free, Time-Bound, Hardware-Aware Neural Architecture Search for TinyML Time Series Classification | Aug 29, 2024 | CPU, Diagnostic | Code Available | 1 |
| 3-in-1: 2D Rotary Adaptation for Efficient Finetuning, Efficient Batching and Composability | Aug 28, 2024 | Arithmetic Reasoning, GPU | Code Available | 0 |
| Conan-embedding: General Text Embedding with More and Better Negative Samples | Aug 28, 2024 | Contrastive Learning, GPU | Unverified | 0 |
| microYOLO: Towards Single-Shot Object Detection on Microcontrollers | Aug 28, 2024 | GPU, Object | Unverified | 0 |
| InstanSeg: an embedding-based instance segmentation algorithm optimized for accurate, efficient and portable cell segmentation | Aug 28, 2024 | Cell Segmentation, GPU | Code Available | 3 |
| SCAN-Edge: Finding MobileNet-speed Hybrid Networks for Diverse Edge Devices via Hardware-Aware Evolutionary Search | Aug 27, 2024 | CPU, GPU | Unverified | 0 |
| GPU-Accelerated Counterfactual Regret Minimization | Aug 27, 2024 | counterfactual, GPU | Code Available | 1 |
| OctFusion: Octree-based Diffusion Models for 3D Shape Generation | Aug 27, 2024 | 3D Generation, 3D Shape Generation | Code Available | 3 |
| Text-guided Foundation Model Adaptation for Long-Tailed Medical Image Classification | Aug 27, 2024 | Diagnostic, GPU | Unverified | 0 |
| The Mamba in the Llama: Distilling and Accelerating Hybrid Models | Aug 27, 2024 | GPU, Language Modeling | Code Available | 3 |