| DiffMM: Multi-Modal Diffusion Model for Recommendation | Jun 17, 2024 | Contrastive Learning, model | Code Available | 2 |
| Towards Vision-Language Geo-Foundation Model: A Survey | Jun 13, 2024 | Earth Observation, Image Captioning | Code Available | 2 |
| Binarized Diffusion Model for Image Super-Resolution | Jun 9, 2024 | Attribute, Binarization | Code Available | 2 |
| SF-V: Single Forward Video Generation Model | Jun 6, 2024 | Denoising, model | Code Available | 2 |
| Evaluating the World Model Implicit in a Generative Model | Jun 6, 2024 | Logical Reasoning, model | Code Available | 2 |
| RecDiff: Diffusion Model for Social Recommendation | Jun 1, 2024 | Denoising, model | Code Available | 2 |
| Improved Canonicalization for Model Agnostic Equivariance | May 23, 2024 | Contrastive Learning, model | Code Available | 2 |
| Agent Planning with World Knowledge Model | May 23, 2024 | model, World Knowledge | Code Available | 2 |
| Efficient Visual State Space Model for Image Deblurring | May 23, 2024 | Deblurring, Image Deblurring | Code Available | 2 |
| Not All Language Model Features Are Linear | May 23, 2024 | All, Language Modeling | Code Available | 2 |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | May 1, 2024 | Image Segmentation, model | Code Available | 2 |
| OAEI Machine Learning Dataset for Online Model Generation | Apr 29, 2024 | Graph Matching, model | Code Available | 2 |
| WorldGPT: Empowering LLM as Multimodal World Model | Apr 28, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Graphic Design with Large Multimodal Model | Apr 22, 2024 | Layout Generation, model | Code Available | 2 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactual, model | Code Available | 2 |
| LaVy: Vietnamese Multimodal Large Language Model | Apr 11, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Test-Time Model Adaptation with Only Forward Passes | Apr 2, 2024 | model, Test-time Adaptation | Code Available | 2 |
| BAMM: Bidirectional Autoregressive Motion Model | Mar 28, 2024 | Denoising, model | Code Available | 2 |
| SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model | Mar 27, 2024 | Denoising, Domain Adaptation | Code Available | 2 |
| SelfIE: Self-Interpretation of Large Language Model Embeddings | Mar 16, 2024 | Language Modeling, Language Modelling | Code Available | 2 |
| Faceptor: A Generalist Model for Face Perception | Mar 14, 2024 | Age Estimation, Attribute | Code Available | 2 |
| Face Swap via Diffusion Model | Mar 2, 2024 | Face Alignment, Face Detection | Code Available | 2 |
| HiGPT: Heterogeneous Graph Language Model | Feb 25, 2024 | Graph Learning, Language Modeling | Code Available | 2 |
| CoLLaVO: Crayon Large Language and Vision mOdel | Feb 17, 2024 | Large Language Model, model | Code Available | 2 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question Answering, Instruction Following | Code Available | 2 |