| DiffMM: Multi-Modal Diffusion Model for Recommendation | Jun 17, 2024 | Contrastive Learningmodel | CodeCode Available | 2 |
| Towards Vision-Language Geo-Foundation Model: A Survey | Jun 13, 2024 | Earth ObservationImage Captioning | CodeCode Available | 2 |
| Binarized Diffusion Model for Image Super-Resolution | Jun 9, 2024 | AttributeBinarization | CodeCode Available | 2 |
| Evaluating the World Model Implicit in a Generative Model | Jun 6, 2024 | Logical Reasoningmodel | CodeCode Available | 2 |
| SF-V: Single Forward Video Generation Model | Jun 6, 2024 | Denoisingmodel | CodeCode Available | 2 |
| RecDiff: Diffusion Model for Social Recommendation | Jun 1, 2024 | Denoisingmodel | CodeCode Available | 2 |
| Efficient Visual State Space Model for Image Deblurring | May 23, 2024 | DeblurringImage Deblurring | CodeCode Available | 2 |
| Agent Planning with World Knowledge Model | May 23, 2024 | modelWorld Knowledge | CodeCode Available | 2 |
| Improved Canonicalization for Model Agnostic Equivariance | May 23, 2024 | Contrastive Learningmodel | CodeCode Available | 2 |
| Not All Language Model Features Are Linear | May 23, 2024 | AllLanguage Modeling | CodeCode Available | 2 |
| ASAM: Boosting Segment Anything Model with Adversarial Tuning | May 1, 2024 | Image Segmentationmodel | CodeCode Available | 2 |
| OAEI Machine Learning Dataset for Online Model Generation | Apr 29, 2024 | Graph Matchingmodel | CodeCode Available | 2 |
| WorldGPT: Empowering LLM as Multimodal World Model | Apr 28, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Graphic Design with Large Multimodal Model | Apr 22, 2024 | Layout Generationmodel | CodeCode Available | 2 |
| Decomposing and Editing Predictions by Modeling Model Computation | Apr 17, 2024 | counterfactualmodel | CodeCode Available | 2 |
| LaVy: Vietnamese Multimodal Large Language Model | Apr 11, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Test-Time Model Adaptation with Only Forward Passes | Apr 2, 2024 | modelTest-time Adaptation | CodeCode Available | 2 |
| BAMM: Bidirectional Autoregressive Motion Model | Mar 28, 2024 | Denoisingmodel | CodeCode Available | 2 |
| SingularTrajectory: Universal Trajectory Predictor Using Diffusion Model | Mar 27, 2024 | DenoisingDomain Adaptation | CodeCode Available | 2 |
| SelfIE: Self-Interpretation of Large Language Model Embeddings | Mar 16, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Faceptor: A Generalist Model for Face Perception | Mar 14, 2024 | Age EstimationAttribute | CodeCode Available | 2 |
| Face Swap via Diffusion Model | Mar 2, 2024 | Face AlignmentFace Detection | CodeCode Available | 2 |
| HiGPT: Heterogeneous Graph Language Model | Feb 25, 2024 | Graph LearningLanguage Modeling | CodeCode Available | 2 |
| CoLLaVO: Crayon Large Language and Vision mOdel | Feb 17, 2024 | Large Language Modelmodel | CodeCode Available | 2 |
| GraphTranslator: Aligning Graph Model to Large Language Model for Open-ended Tasks | Feb 11, 2024 | Graph Question AnsweringInstruction Following | CodeCode Available | 2 |
| Jailbreaking Attack against Multimodal Large Language Model | Feb 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Towards Efficient Exact Optimization of Language Model Alignment | Feb 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| ChemDFM: A Large Language Foundation Model for Chemistry | Jan 26, 2024 | Formmodel | CodeCode Available | 2 |
| DsDm: Model-Aware Dataset Selection with Datamodels | Jan 23, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CloSe: A 3D Clothing Segmentation Dataset and Model | Jan 22, 2024 | Continual Learningmodel | CodeCode Available | 2 |
| Spatial-Temporal Large Language Model for Traffic Prediction | Jan 18, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CoMoSVC: Consistency Model-based Singing Voice Conversion | Jan 3, 2024 | GPUmodel | CodeCode Available | 2 |
| DiffLoc: Diffusion Model for Outdoor LiDAR Localization | Jan 1, 2024 | Denoisingmodel | CodeCode Available | 2 |
| Reducing Energy Bloat in Large Model Training | Dec 12, 2023 | model | CodeCode Available | 2 |
| DiffusionSat: A Generative Foundation Model for Satellite Imagery | Dec 6, 2023 | Crop Yield PredictionImage Generation | CodeCode Available | 2 |
| PixelLM: Pixel Reasoning with Large Multimodal Model | Dec 4, 2023 | Decodermodel | CodeCode Available | 2 |
| CritiqueLLM: Towards an Informative Critique Generation Model for Evaluation of Large Language Model Generation | Nov 30, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LLMGA: Multimodal Large Language Model based Generation Assistant | Nov 27, 2023 | Image GenerationLanguage Modeling | CodeCode Available | 2 |
| Algorithm Evolution Using Large Language Model | Nov 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Controlled Text Generation via Language Model Arithmetic | Nov 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Diffusion Model Alignment Using Direct Preference Optimization | Nov 21, 2023 | modelText-to-Image Generation | CodeCode Available | 2 |
| SpectralGPT: Spectral Remote Sensing Foundation Model | Nov 13, 2023 | Change Detectionmodel | CodeCode Available | 2 |
| Neuro-GPT: Towards A Foundation Model for EEG | Nov 7, 2023 | Brain Computer InterfaceEEG | CodeCode Available | 2 |
| GLaMM: Pixel Grounding Large Multimodal Model | Nov 6, 2023 | Conversational Question AnsweringImage Captioning | CodeCode Available | 2 |
| A Foundation Model for Music Informatics | Nov 6, 2023 | Information Retrievalmodel | CodeCode Available | 2 |
| OWL: A Large Language Model for IT Operations | Sep 17, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| CityDreamer: Compositional Generative Model of Unbounded 3D Cities | Sep 1, 2023 | modelScene Generation | CodeCode Available | 2 |
| LLaSM: Large Language and Speech Model | Aug 30, 2023 | Instruction FollowingLanguage Modeling | CodeCode Available | 2 |
| DiffusionTrack: Diffusion Model For Multi-Object Tracking | Aug 19, 2023 | Denoisingmodel | CodeCode Available | 2 |
| Shepherd: A Critic for Language Model Generation | Aug 8, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 2 |