| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Jun 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Jun 17, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| Sampling from Your Language Model One Byte at a Time | Jun 17, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| SeqPE: Transformer with Sequential Position Encoding | Jun 16, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Jun 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Diffusion Sequence Models for Enhanced Protein Representation and Generation | Jun 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Universal Offline Black-Box Optimization via Learning Language Model Embeddings | Jun 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SAFE: Finding Sparse and Flat Minima to Improve Pruning | Jun 7, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model | Jun 5, 2025 | Instance SegmentationLanguage Modeling | CodeCode Available | 1 |
| POSS: Position Specialist Generates Better Draft for Speculative Decoding | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition | May 29, 2025 | Handwritten Mathmatical Expression RecognitionLanguage Modeling | CodeCode Available | 1 |
| VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation | May 29, 2025 | Caption GenerationLanguage Modeling | CodeCode Available | 1 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Pretraining Language Models to Ponder in Continuous Space | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | May 26, 2025 | Data AugmentationInformation Retrieval | CodeCode Available | 1 |
| Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World | May 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Beyond Prompt Engineering: Robust Behavior Control in LLMs via Steering Target Atoms | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Daily-Omni: Towards Audio-Visual Reasoning with Temporal Alignment across Modalities | May 23, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 1 |
| RePrompt: Reasoning-Augmented Reprompting for Text-to-Image Generation via Reinforcement Learning | May 23, 2025 | Image GenerationLanguage Modeling | CodeCode Available | 1 |
| ChemMLLM: Chemical Multimodal Large Language Model | May 22, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | May 22, 2025 | Combinatorial OptimizationLanguage Modeling | CodeCode Available | 1 |
| Speculative Decoding Reimagined for Multimodal Large Language Models | May 20, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| U-SAM: An audio language Model for Unified Speech, Audio, and Music Understanding | May 20, 2025 | cross-modal alignmentLanguage Modeling | CodeCode Available | 1 |
| R3: Robust Rubric-Agnostic Reward Models | May 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| 3D Visual Illusion Depth Estimation | May 19, 2025 | Common Sense ReasoningDepth Estimation | CodeCode Available | 1 |
| Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation | May 16, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Unifying Segment Anything in Microscopy with Multimodal Large Language Model | May 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Multi-Token Prediction Needs Registers | May 15, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | May 15, 2025 | Continual LearningLanguage Modeling | CodeCode Available | 1 |
| Extending Large Vision-Language Model for Diverse Interactive Tasks in Autonomous Driving | May 13, 2025 | 3D visual groundingAutonomous Driving | CodeCode Available | 1 |
| Kalman Filter Enhanced GRPO for Reinforcement Learning-Based Language Model Reasoning | May 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Symbolic Regression with Multimodal Large Language Models and Kolmogorov Arnold Networks | May 12, 2025 | Kolmogorov-Arnold NetworksLanguage Modeling | CodeCode Available | 1 |
| MM-Skin: Enhancing Dermatology Vision-Language Model with an Image-Text Dataset Derived from Textbooks | May 9, 2025 | DiagnosticInstruction Following | CodeCode Available | 1 |
| CreoPep: A Universal Deep Learning Framework for Target-Specific Peptide Design and Optimization | May 5, 2025 | DiversityLanguage Modeling | CodeCode Available | 1 |
| WirelessAgent: Large Language Model Agents for Intelligent Wireless Networks | May 2, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Visual Test-time Scaling for GUI Agent Grounding | May 1, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Apr 30, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 1 |
| Reviving Any-Subset Autoregressive Models with Principled Parallel Sampling and Speculative Decoding | Apr 29, 2025 | Code GenerationDensity Estimation | CodeCode Available | 1 |
| PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Apr 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LEAM: A Prompt-only Large Language Model-enabled Antenna Modeling Method | Apr 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LongMamba: Enhancing Mamba's Long Context Capabilities via Training-Free Receptive Field Enlargement | Apr 22, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 1 |