| Efficient Visual State Space Model for Image Deblurring | May 23, 2024 | Deblurring, Image Deblurring | Code Available | 2 | 5 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language Modeling, Language Modelling | Code Available | 2 | 5 |
| DiMeR: Disentangled Mesh Reconstruction Model | Apr 24, 2025 | Image to 3D, model | Code Available | 2 | 5 |
| Towards Vision-Language Geo-Foundation Model: A Survey | Jun 13, 2024 | Earth Observation, Image Captioning | Code Available | 2 | 5 |
| A Foundation Model for Music Informatics | Nov 6, 2023 | Information Retrieval, model | Code Available | 2 | 5 |
| Diffusion Recommender Model | Apr 11, 2023 | Denoising, Image Generation | Code Available | 2 | 5 |
| Diffusion Model Quantization: A Review | May 8, 2025 | model, Quantization | Code Available | 2 | 5 |
| DiffusionSat: A Generative Foundation Model for Satellite Imagery | Dec 6, 2023 | Crop Yield Prediction, Image Generation | Code Available | 2 | 5 |
| BAMM: Bidirectional Autoregressive Motion Model | Mar 28, 2024 | Denoising, model | Code Available | 2 | 5 |
| A Multimodal Vision Foundation Model for Clinical Dermatology | Oct 19, 2024 | Diagnostic, Lesion Segmentation | Code Available | 2 | 5 |