| Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations | May 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| NeuroGen: Neural Network Parameter Generation via Large Language Models | May 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems | May 18, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering | May 17, 2025 | Document RankingLarge Language Model | —Unverified | 0 |
| Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation | May 17, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation | May 17, 2025 | Dataset GenerationGPU | CodeCode Available | 1 |
| SpecMemo: Speculative Decoding is in Your Pocket | May 16, 2025 | Large Language Model | —Unverified | 0 |
| Tool-Aided Evolutionary LLM for Generative Policy Toward Efficient Resource Management in Wireless Federated Learning | May 16, 2025 | Federated LearningLarge Language Model | —Unverified | 0 |
| Noise Injection Systemically Degrades Large Language Model Safety Guardrails | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | May 16, 2025 | FormLanguage Modeling | CodeCode Available | 0 |
| THELMA: Task Based Holistic Evaluation of Large Language Model Applications-RAG Question Answering | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Token-Level Uncertainty Estimation for Large Language Model Reasoning | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents | May 16, 2025 | Large Language Model | —Unverified | 0 |
| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| Scaling Reasoning can Improve Factuality in Large Language Models | May 16, 2025 | Knowledge GraphsLarge Language Model | CodeCode Available | 0 |
| Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation | May 16, 2025 | General KnowledgeLarge Language Model | —Unverified | 0 |
| Explain What You Mean: Intent Augmented Knowledge Graph Recommender Built With An LLM | May 16, 2025 | Knowledge GraphsLarge Language Model | —Unverified | 0 |
| REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning? | May 16, 2025 | Large Language ModelRobot Task Planning | —Unverified | 0 |
| Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM | May 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |