| Open Source Planning & Control System with Language Agents for Autonomous Scientific Discovery | Jul 9, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| DeSTA2.5-Audio: Toward General-Purpose Large Audio Language Model with Self-Generated Cross-Modal Alignment | Jul 3, 2025 | cross-modal alignment, Instruction Following | Code Available | 2 |
| OctoThinker: Mid-training Incentivizes Reinforcement Learning Scaling | Jun 25, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Language Modeling by Language Models | Jun 25, 2025 | Code Generation, Language Modeling | Code Available | 2 |
| Pre-Trained LLM is a Semantic-Aware and Generalizable Segmentation Booster | Jun 22, 2025 | Decoder, Image Segmentation | Code Available | 2 |
| Watermarking Autoregressive Image Generation | Jun 19, 2025 | Image Generation, Language Modeling | Code Available | 2 |
| BMFM-RNA: An Open Framework for Building and Evaluating Transcriptomic Foundation Models | Jun 17, 2025 | Benchmarking, Language Modeling | Code Available | 2 |
| Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning | Jun 2, 2025 | Fact Verification, Language Modeling | Code Available | 2 |
| MagiCodec: Simple Masked Gaussian-Injected Codec for High-Fidelity Reconstruction and Generation | May 31, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| GeoVision Labeler: Zero-Shot Geospatial Classification with Vision and Language Models | May 30, 2025 | Classification, Disaster Response | Code Available | 2 |
| ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL | May 30, 2025 | Image Generation, Language Modeling | Code Available | 2 |
| Zero-Shot Vision Encoder Grafting via LLM Surrogates | May 28, 2025 | Decoder, Language Modeling | Code Available | 2 |
| Improved Representation Steering for Language Models | May 27, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| LLaMEA-BO: A Large Language Model Evolutionary Algorithm for Automatically Generating Bayesian Optimization Algorithms | May 27, 2025 | Bayesian Optimization, Benchmarking | Code Available | 2 |
| WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference | May 26, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| DanmakuTPPBench: A Multi-modal Benchmark for Temporal Point Process Modeling and Understanding | May 23, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding | May 22, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| Structure-Aligned Protein Language Model | May 22, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 |
| CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation | May 20, 2025 | Code Generation, Language Modeling | Code Available | 2 |
| Efficient Speech Language Modeling via Energy Distance in Continuous Latent Space | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning | May 19, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| SLOT: Sample-specific Language Model Optimization at Test-time | May 18, 2025 | GSM8K, Language Modeling | Code Available | 2 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | May 17, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners | May 17, 2025 | Language Modeling, Language Modelling | Code Available | 2 |
| WorldPM: Scaling Human Preference Modeling | May 15, 2025 | Language Modeling, Language Modelling | Code Available | 2 |