| Momentum-GS: Momentum Gaussian Self-Distillation for High-Quality Large Scene Reconstruction | Dec 6, 2024 | 3D Reconstruction, 3D Scene Reconstruction | Code Available | 2 |
| Dynamic-LLaVA: Efficient Multimodal Large Language Models via Dynamic Vision-language Context Sparsification | Dec 1, 2024 | GPU, Visual Question Answering | Code Available | 2 |
| Playable Game Generation | Dec 1, 2024 | GPU, Image Generation | Code Available | 2 |
| Real-Time Metric-Semantic Mapping for Autonomous Navigation in Outdoor Environments | Nov 30, 2024 | Autonomous Navigation, GPU | Code Available | 2 |
| Stochastic Taylor Derivative Estimator: Efficient amortization for arbitrary differential operators | Nov 27, 2024 | GPU | Code Available | 2 |
| Collaborative Decoding Makes Visual Auto-Regressive Modeling Efficient | Nov 26, 2024 | GPU, Image Generation | Code Available | 2 |
| GaussianPretrain: A Simple Unified 3D Gaussian Representation for Visual Pre-training in Autonomous Driving | Nov 19, 2024 | 3D Object Detection, Autonomous Driving | Code Available | 2 |
| AEROMamba: An efficient architecture for audio super-resolution using generative adversarial networks and state space models | Nov 11, 2024 | Audio Super-Resolution, GPU | Code Available | 2 |
| Brain Tumour Removing and Missing Modality Generation using 3D WDM | Nov 7, 2024 | GPU, Prediction | Code Available | 2 |
| DeeR-VLA: Dynamic Inference of Multimodal Large Language Models for Efficient Robot Execution | Nov 4, 2024 | GPU, Robot Manipulation | Code Available | 2 |