| Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification | May 20, 2024 | Hyperspectral Image Classificationimage-classification | CodeCode Available | 2 |
| GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details | May 20, 2024 | 3D Generation3D Geometry Prediction | CodeCode Available | 2 |
| Large-Scale Multi-Center CT and MRI Segmentation of Pancreas with Deep Learning | May 20, 2024 | BenchmarkingMRI segmentation | CodeCode Available | 2 |
| Diff-BGM: A Diffusion Model for Video Background Music Generation | May 20, 2024 | DiversityMusic Generation | CodeCode Available | 2 |
| SEMv3: A Fast and Robust Approach to Table Separation Line Detection | May 20, 2024 | Line Detection | CodeCode Available | 2 |
| SSAMBA: Self-Supervised Audio Representation Learning with Mamba State Space Model | May 20, 2024 | Audio ClassificationGPU | CodeCode Available | 2 |
| Imp: Highly Capable Large Multimodal Models for Mobile Devices | May 20, 2024 | QuantizationVisual Question Answering | CodeCode Available | 2 |
| MTVQA: Benchmarking Multilingual Text-Centric Visual Question Answering | May 20, 2024 | BenchmarkingQuestion Answering | CodeCode Available | 2 |
| MathBench: Evaluating the Theory and Application Proficiency of LLMs with a Hierarchical Mathematics Benchmark | May 20, 2024 | College MathematicsGSM8K | CodeCode Available | 2 |
| MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise | May 20, 2024 | | CodeCode Available | 2 |
| AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field | May 20, 2024 | 3DGSNovel View Synthesis | CodeCode Available | 2 |
| End-to-End Full-Page Optical Music Recognition for Pianoform Sheet Music | May 20, 2024 | Synthetic Data Generation | CodeCode Available | 2 |
| Slicedit: Zero-Shot Video Editing With Text-to-Image Diffusion Models Using Spatio-Temporal Slices | May 20, 2024 | Image GenerationVideo Editing | CodeCode Available | 2 |
| CoR-GS: Sparse-View 3D Gaussian Splatting via Co-Regularization | May 20, 2024 | 3DGSNovel View Synthesis | CodeCode Available | 2 |
| DATR: Unsupervised Domain Adaptive Detection Transformer with Dataset-Level Adaptation and Prototypical Alignment | May 20, 2024 | Contrastive LearningDomain Adaptation | CodeCode Available | 2 |
| Transcriptomics-guided Slide Representation Learning in Computational Pathology | May 19, 2024 | Contrastive LearningRepresentation Learning | CodeCode Available | 2 |
| Advancing 6-DoF Instrument Pose Estimation in Variable X-Ray Imaging Geometries | May 19, 2024 | 6D Pose EstimationGPU | CodeCode Available | 2 |
| NetMamba: Efficient Network Traffic Classification via Pre-training Unidirectional Mamba | May 19, 2024 | ClassificationFew-Shot Learning | CodeCode Available | 2 |
| SLAB: Efficient Transformers with Simplified Linear Attention and Progressive Re-parameterized Batch Normalization | May 19, 2024 | image-classificationImage Classification | CodeCode Available | 2 |
| Your Transformer is Secretly Linear | May 19, 2024 | | CodeCode Available | 2 |
| MotionGS : Compact Gaussian Splatting SLAM by Motion Filter | May 18, 2024 | 3DGSNeRF | CodeCode Available | 2 |
| Dreamer XL: Towards High-Resolution Text-to-3D Generation via Trajectory Score Matching | May 18, 2024 | 3D GenerationDenoising | CodeCode Available | 2 |
| MAMCA -- Optimal on Accuracy and Efficiency for Automatic Modulation Classification with Extended Signal Length | May 18, 2024 | DenoisingGPU | CodeCode Available | 2 |
| MediCLIP: Adapting CLIP for Few-shot Medical Image Anomaly Detection | May 18, 2024 | Anomaly DetectionDecision Making | CodeCode Available | 2 |
| GinAR: An End-To-End Multivariate Time Series Forecasting Model Suitable for Variable Missing | May 18, 2024 | Multivariate Time Series ForecastingTime Series | CodeCode Available | 2 |