| GPTailor: Large Language Model Pruning Through Layer Cutting and Stitching | Jun 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| LMR-BENCH: Evaluating LLM Agent's Ability on Reproducing Language Modeling Research | Jun 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Sampling from Your Language Model One Byte at a Time | Jun 17, 2025 | Code GenerationLanguage Modeling | CodeCode Available | 1 |
| RMIT-ADM+S at the SIGIR 2025 LiveRAG Challenge | Jun 17, 2025 | Answer GenerationLanguage Modeling | CodeCode Available | 1 |
| SeqPE: Transformer with Sequential Position Encoding | Jun 16, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| TagRouter: Learning Route to LLMs through Tags for Open-Domain Text Generation Tasks | Jun 14, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Diffusion Sequence Models for Enhanced Protein Representation and Generation | Jun 9, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Towards Universal Offline Black-Box Optimization via Learning Language Model Embeddings | Jun 8, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SAFE: Finding Sparse and Flat Minima to Improve Pruning | Jun 7, 2025 | image-classificationImage Classification | CodeCode Available | 1 |
| DAM: Dynamic Attention Mask for Long-Context Large Language Model Inference Acceleration | Jun 6, 2025 | Computational EfficiencyLanguage Modeling | CodeCode Available | 1 |
| OpenMaskDINO3D : Reasoning 3D Segmentation via Large Language Model | Jun 5, 2025 | Instance SegmentationLanguage Modeling | CodeCode Available | 1 |
| POSS: Position Specialist Generates Better Draft for Speculative Decoding | Jun 4, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Period-LLM: Extending the Periodic Capability of Multimodal Large Language Model | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Can Slow-thinking LLMs Reason Over Time? Empirical Studies in Time Series Forecasting | May 30, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Uni-MuMER: Unified Multi-Task Fine-Tuning of Vision-Language Model for Handwritten Mathematical Expression Recognition | May 29, 2025 | Handwritten Mathmatical Expression RecognitionLanguage Modeling | CodeCode Available | 1 |
| VCapsBench: A Large-scale Fine-grained Benchmark for Video Caption Quality Evaluation | May 29, 2025 | Caption GenerationLanguage Modeling | CodeCode Available | 1 |
| ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking | May 28, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Pretraining Language Models to Ponder in Continuous Space | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | May 26, 2025 | Data AugmentationInformation Retrieval | CodeCode Available | 1 |
| Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World | May 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Decoupled Visual Interpretation and Linguistic Reasoning for Math Problem Solving | May 23, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |