| Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations | May 18, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems | May 18, 2025 | Computational EfficiencyLanguage Modeling | —Unverified | 0 |
| NeuroGen: Neural Network Parameter Generation via Large Language Models | May 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Beyond Retrieval: Joint Supervision and Multimodal Document Ranking for Textbook Question Answering | May 17, 2025 | Document RankingLarge Language Model | —Unverified | 0 |
| LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission | May 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Tiny QA Benchmark++: Ultra-Lightweight, Synthetic Multilingual Dataset Generation & Smoke-Tests for Continuous LLM Evaluation | May 17, 2025 | Dataset GenerationGPU | CodeCode Available | 1 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features | May 17, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation | May 17, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| SpecMemo: Speculative Decoding is in Your Pocket | May 16, 2025 | Large Language Model | —Unverified | 0 |
| Tool-Aided Evolutionary LLM for Generative Policy Toward Efficient Resource Management in Wireless Federated Learning | May 16, 2025 | Federated LearningLarge Language Model | —Unverified | 0 |
| Noise Injection Systemically Degrades Large Language Model Safety Guardrails | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | May 16, 2025 | FormLanguage Modeling | CodeCode Available | 0 |
| Token-Level Uncertainty Estimation for Large Language Model Reasoning | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| THELMA: Task Based Holistic Evaluation of Large Language Model Applications-RAG Question Answering | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| EnvInjection: Environmental Prompt Injection Attack to Multi-modal Web Agents | May 16, 2025 | Large Language Model | —Unverified | 0 |
| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text SummarizationLanguage Modeling | CodeCode Available | 0 |
| Scaling Reasoning can Improve Factuality in Large Language Models | May 16, 2025 | Knowledge GraphsLarge Language Model | CodeCode Available | 0 |
| Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation | May 16, 2025 | General KnowledgeLarge Language Model | —Unverified | 0 |
| Explain What You Mean: Intent Augmented Knowledge Graph Recommender Built With An LLM | May 16, 2025 | Knowledge GraphsLarge Language Model | —Unverified | 0 |
| REI-Bench: Can Embodied Agents Understand Vague Human Instructions in Task Planning? | May 16, 2025 | Large Language ModelRobot Task Planning | —Unverified | 0 |
| Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM | May 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| EdgeWisePersona: A Dataset for On-Device User Profiling from Natural Language Interactions | May 16, 2025 | Large Language Model | CodeCode Available | 0 |
| Large Language Model Use Impact Locus of Control | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DRAGON: A Large-Scale Dataset of Realistic Images Generated by Diffusion Models | May 16, 2025 | Image GenerationLarge Language Model | —Unverified | 0 |
| Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese | May 16, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| A Light and Smart Wearable Platform with Multimodal Foundation Model for Enhanced Spatial Reasoning in People with Blindness and Low Vision | May 16, 2025 | Large Language ModelNavigate | —Unverified | 0 |
| On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating | May 16, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unifying Segment Anything in Microscopy with Multimodal Large Language Model | May 16, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Model Performance-Guided Evaluation Data Selection for Effective Prompt Optimization | May 15, 2025 | BenchmarkingClustering | —Unverified | 0 |
| CRPE: Expanding The Reasoning Capability of Large Language Model for Code Generation | May 15, 2025 | Code GenerationLanguage Modeling | —Unverified | 0 |
| Automating Security Audit Using Large Language Model based Agent: An Exploration Experiment | May 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Neural Thermodynamic Laws for Large Language Model Training | May 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cross-Image Contrastive Decoding: Precise, Lossless Suppression of Language Priors in Large Vision-Language Models | May 15, 2025 | Image CaptioningLanguage Modeling | —Unverified | 0 |
| ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | May 15, 2025 | Continual LearningLanguage Modeling | CodeCode Available | 1 |
| Does Feasibility Matter? Understanding the Impact of Feasibility on Synthetic Training Data | May 15, 2025 | AttributeLarge Language Model | CodeCode Available | 0 |
| Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors | May 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Learning Virtual Machine Scheduling in Cloud Computing through Language Agents | May 15, 2025 | Cloud ComputingLarge Language Model | —Unverified | 0 |
| Towards a Deeper Understanding of Reasoning Capabilities in Large Language Models | May 15, 2025 | Large Language ModelMath | CodeCode Available | 0 |
| ChronoSteer: Bridging Large Language Model and Time Series Foundation Model via Synthetic Data | May 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AI2MMUM: AI-AI Oriented Multi-Modal Universal Model Leveraging Telecom Domain Large Model | May 15, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Emotion Knowledge Enhancement for Vision Large Language Models: A Self-Verification Approach for High-Quality Emotion Instruction Data Generation | May 14, 2025 | Emotion RecognitionLarge Language Model | —Unverified | 0 |
| Contrastive Cross-Course Knowledge Tracing via Concept Graph Guided Knowledge Transfer | May 14, 2025 | Knowledge TracingLarge Language Model | CodeCode Available | 0 |
| FAS-LLM: Large Language Model-Based Channel Prediction for OTFS-Enabled Satellite-FAS Links | May 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Contextual Phenotyping of Pediatric Sepsis Cohort Using Large Language Models | May 14, 2025 | ClusteringLarge Language Model | —Unverified | 0 |
| Trustless Autonomy: Understanding Motivations, Benefits and Governance Dilemma in Self-Sovereign Decentralized AI Agents | May 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Models Are More Persuasive Than Incentivized Human Persuaders | May 14, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |