| Language Model Beats Diffusion -- Tokenizer is Key to Visual Generation | Oct 9, 2023 | Action RecognitionImage Generation | CodeCode Available | 4 |
| SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing | May 7, 2024 | Image ManipulationLanguage Modeling | CodeCode Available | 4 |
| Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | May 23, 2025 | DecoderImage Captioning | CodeCode Available | 4 |
| Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach | Feb 7, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| SEED-Story: Multimodal Long Story Generation with Large Language Model | Jul 11, 2024 | Image GenerationLanguage Modeling | CodeCode Available | 4 |
| Self-Play Preference Optimization for Language Model Alignment | May 1, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Sailor: Open Language Models for South-East Asia | Apr 4, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Safurai 001: New Qualitative Approach for Code LLM Evaluation | Sep 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Samba: Simple Hybrid State Space Models for Efficient Unlimited Context Language Modeling | Jun 11, 2024 | 4kLanguage Modeling | CodeCode Available | 4 |
| AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling | Feb 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| INT2.1: Towards Fine-Tunable Quantized Large Language Models with Error Correction through Low-Rank Adaptation | Jun 13, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model | Dec 28, 2023 | Instance SegmentationLanguage Modeling | CodeCode Available | 4 |
| Interpretability in the Wild: a Circuit for Indirect Object Identification in GPT-2 small | Nov 1, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Choices are More Important than Efforts: LLM Enables Efficient Multi-Agent Exploration | Oct 3, 2024 | DiversityLanguage Modeling | CodeCode Available | 4 |
| RewardBench: Evaluating Reward Models for Language Modeling | Mar 20, 2024 | Instruction FollowingLanguage Modeling | CodeCode Available | 4 |
| ImgEdit: A Unified Image Editing Dataset and Benchmark | May 26, 2025 | Image Editing | CodeCode Available | 4 |
| Image Fusion via Vision-Language Model | Feb 3, 2024 | DecoderLanguage Modeling | CodeCode Available | 4 |
| ChatDoctor: A Medical Chat Model Fine-Tuned on a Large Language Model Meta-AI (LLaMA) Using Medical Domain Knowledge | Mar 24, 2023 | Information RetrievalLanguage Modeling | CodeCode Available | 4 |
| ChatHaruhi: Reviving Anime Character in Reality via Large Language Model | Aug 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Jun 14, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| RepoAgent: An LLM-Powered Open-Source Framework for Repository-level Code Documentation Generation | Feb 26, 2024 | Code Documentation GenerationCode Generation | CodeCode Available | 4 |
| RaTEScore: A Metric for Radiology Report Generation | Jun 24, 2024 | DiagnosticEntity Embeddings | CodeCode Available | 4 |
| ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Feb 10, 2025 | Hierarchical Reinforcement LearningLanguage Modeling | CodeCode Available | 4 |
| SNAC: Multi-Scale Neural Audio Codec | Oct 18, 2024 | Audio CompressionAudio Generation | CodeCode Available | 4 |
| Can Machines Help Us Answering Question 16 in Datasheets, and In Turn Reflecting on Inappropriate Content? | Feb 14, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| G-LLaVA: Solving Geometric Problem with Multi-Modal Large Language Model | Dec 18, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| GigaAM: Efficient Self-Supervised Learner for Speech Recognition | Jun 1, 2025 | Automatic Speech RecognitionLanguage Modeling | CodeCode Available | 4 |
| GLIPv2: Unifying Localization and Vision-Language Understanding | Jun 12, 2022 | 2D Object DetectionContrastive Learning | CodeCode Available | 4 |
| Gated Delta Networks: Improving Mamba2 with Delta Rule | Dec 9, 2024 | Common Sense ReasoningLanguage Modeling | CodeCode Available | 4 |
| Phoenix: Democratizing ChatGPT across Languages | Apr 20, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Photo-Realistic Image Restoration in the Wild with Controlled Vision-Language Models | Apr 15, 2024 | Image GenerationImage Restoration | CodeCode Available | 4 |
| BLOOM: A 176B-Parameter Open-Access Multilingual Language Model | Nov 9, 2022 | DecoderLanguage Modeling | CodeCode Available | 4 |
| Galactica: A Large Language Model for Science | Nov 16, 2022 | AnachronismsBias Detection | CodeCode Available | 4 |
| Generative Representational Instruction Tuning | Feb 15, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models | Mar 12, 2025 | DenoisingLanguage Modeling | CodeCode Available | 4 |
| FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Dec 13, 2023 | 3D Object Detection3D Object Tracking | CodeCode Available | 4 |
| Partition Generative Modeling: Masked Modeling Without Masks | May 24, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 4 |
| BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text | Mar 27, 2024 | ArticlesLanguage Modeling | CodeCode Available | 4 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| OLMoE: Open Mixture-of-Experts Language Models | Sep 3, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Beyond Reward Hacking: Causal Rewards for Large Language Model Alignment | Jan 16, 2025 | Causal Inferencecounterfactual | CodeCode Available | 4 |
| Optimizing Prompts for Text-to-Image Generation | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Jun 6, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| N-Grammer: Augmenting Transformers with latent n-grams | Jul 13, 2022 | Common Sense ReasoningCoreference Resolution | CodeCode Available | 4 |
| BLIP-2: Bootstrapping Language-Image Pre-training with Frozen Image Encoders and Large Language Models | Jan 30, 2023 | Generative Visual Question AnsweringImage Captioning | CodeCode Available | 4 |
| Reasoning with Language Model is Planning with World Model | May 24, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Baize: An Open-Source Chat Model with Parameter-Efficient Tuning on Self-Chat Data | Apr 3, 2023 | ChatbotLanguage Modeling | CodeCode Available | 4 |
| Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Apr 19, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| Flamingo: a Visual Language Model for Few-Shot Learning | Apr 29, 2022 | Few-Shot LearningGenerative Visual Question Answering | CodeCode Available | 4 |
| MutaPLM: Protein Language Modeling for Mutation Explanation and Engineering | Oct 30, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 4 |