| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question AnsweringImage Captioning | CodeCode Available | 2 |
| LERT: A Linguistically-motivated Pre-trained Language Model | Nov 10, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question AnsweringInstruction Following | CodeCode Available | 2 |
| FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model | Mar 17, 2023 | Face Detectionmodel | CodeCode Available | 2 |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | May 1, 2024 | Image Segmentationmodel | CodeCode Available | 2 |
| Faceptor: A Generalist Model for Face Perception | Mar 14, 2024 | Age EstimationAttribute | CodeCode Available | 2 |
| Evaluating the World Model Implicit in a Generative Model | Jun 6, 2024 | Logical Reasoningmodel | CodeCode Available | 2 |
| Face Swap via Diffusion Model | Mar 2, 2024 | Face AlignmentFace Detection | CodeCode Available | 2 |
| Efficient Visual State Space Model for Image Deblurring | May 23, 2024 | DeblurringImage Deblurring | CodeCode Available | 2 |
| Editing Language Model-based Knowledge Graph Embeddings | Jan 25, 2023 | EDIT Taskknowledge editing | CodeCode Available | 2 |