| Towards DS-NER: Unveiling and Addressing Latent Noise in Distant Annotations | May 18, 2025 | Language Modeling | Code Available | 0 |
| DS-ProGen: A Dual-Structure Deep Language Model for Functional Protein Design | May 18, 2025 | Language Modeling | Unverified | 0 |
| SLOT: Sample-specific Language Model Optimization at Test-time | May 18, 2025 | GSM8K, Language Modeling | Code Available | 2 |
| Self-Destructive Language Model | May 18, 2025 | Language Modeling | Unverified | 0 |
| Mitigating Content Effects on Reasoning in Language Models through Fine-Grained Activation Steering | May 18, 2025 | Language Modeling | Unverified | 0 |
| Beyond Frameworks: Unpacking Collaboration Strategies in Multi-Agent Systems | May 18, 2025 | Computational Efficiency, Language Modeling | Unverified | 0 |
| NeuroGen: Neural Network Parameter Generation via Large Language Models | May 18, 2025 | Language Modeling | Unverified | 0 |
| SGDPO: Self-Guided Direct Preference Optimization for Language Model Alignment | May 18, 2025 | Language Modeling | Unverified | 0 |
| From n-gram to Attention: How Model Architectures Learn and Propagate Bias in Language Modeling | May 18, 2025 | Language Modeling | Unverified | 0 |
| mCLM: A Function-Infused and Synthesis-Friendly Modular Chemical Language Model | May 18, 2025 | Language Modeling | Unverified | 0 |
| Bridging Generative and Discriminative Learning: Few-Shot Relation Extraction via Two-Stage Knowledge-Guided Pre-training | May 18, 2025 | Contrastive Learning, In-Context Learning | Code Available | 0 |
| LifelongAgentBench: Evaluating LLM Agents as Lifelong Learners | May 17, 2025 | Language Modeling | Code Available | 2 |
| LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades | May 17, 2025 | Language Modeling | Unverified | 0 |
| Chain-of-Model Learning for Language Model | May 17, 2025 | Language Modeling | Code Available | 0 |
| Internal Causal Mechanisms Robustly Predict Language Model Out-of-Distribution Behaviors | May 17, 2025 | counterfactual, Instruction Following | Code Available | 0 |
| An Explanation of Intrinsic Self-Correction via Linear Representations and Latent Concepts | May 17, 2025 | Concept Alignment, Language Modeling | Unverified | 0 |
| Communication-Efficient Hybrid Language Model via Uncertainty-Aware Opportunistic and Compressed Transmission | May 17, 2025 | Language Modeling | Unverified | 0 |
| Efficiently Building a Domain-Specific Large Language Model from Scratch: A Case Study of a Classical Chinese Large Language Model | May 17, 2025 | Language Modeling | Unverified | 0 |
| Recursive Question Understanding for Complex Question Answering over Heterogeneous Personal Data | May 17, 2025 | Language Modeling | Unverified | 0 |
| PRS-Med: Position Reasoning Segmentation with Vision-Language Model in Medical Imaging | May 17, 2025 | Image Segmentation, Language Modeling | Unverified | 0 |
| CorBenchX: Large-Scale Chest X-Ray Error Dataset and Vision-Language Model Benchmark for Report Error Correction | May 17, 2025 | Language Modeling | Unverified | 0 |
| SOCIA: An End-to-End Agentic Framework for Automated Cyber-Physical-Social Simulator Generation | May 17, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| Demystifying and Enhancing the Efficiency of Large Language Model Based Search Agents | May 17, 2025 | Language Modeling | Code Available | 2 |
| Reasoning Large Language Model Errors Arise from Hallucinating Critical Problem Features | May 17, 2025 | Language Modeling | Code Available | 0 |
| TinyRS-R1: Compact Multimodal Language Model for Remote Sensing | May 17, 2025 | Language Modeling | Unverified | 0 |
| Noise Injection Systemically Degrades Large Language Model Safety Guardrails | May 16, 2025 | Language Modeling | Unverified | 0 |
| Token-Level Uncertainty Estimation for Large Language Model Reasoning | May 16, 2025 | Language Modeling | Unverified | 0 |
| THELMA: Task Based Holistic Evaluation of Large Language Model Applications-RAG Question Answering | May 16, 2025 | Language Modeling | Unverified | 0 |
| An agentic system with reinforcement-learned subsystem improvements for parsing form-like documents | May 16, 2025 | Form, Language Modeling | Code Available | 0 |
| Feasibility with Language Models for Open-World Compositional Zero-Shot Learning | May 16, 2025 | Attribute, Compositional Zero-Shot Learning | Unverified | 0 |
| Towards Cultural Bridge by Bahnaric-Vietnamese Translation Using Transfer Learning of Sequence-To-Sequence Pre-training Language Model | May 16, 2025 | Data Augmentation, Language Modeling | Unverified | 0 |
| Large Language Model Use Impact Locus of Control | May 16, 2025 | Language Modeling | Unverified | 0 |
| Sample Efficient Reinforcement Learning via Large Vision Language Model Distillation | May 16, 2025 | Decision Making, Language Modeling | Code Available | 1 |
| Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese | May 16, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline | May 16, 2025 | Abstractive Text Summarization, Language Modeling | Code Available | 0 |
| Maximizing Asynchronicity in Event-based Neural Networks | May 16, 2025 | Event-based vision, Language Modeling | Unverified | 0 |
| Efficient Attention via Pre-Scoring: Prioritizing Informative Keys in Transformers | May 16, 2025 | Clustering, Language Modeling | Code Available | 0 |
| Improving the Data-efficiency of Reinforcement Learning by Warm-starting with LLM | May 16, 2025 | Language Modeling | Code Available | 0 |
| Enhancing Low-Resource Minority Language Translation with LLMs and Retrieval-Augmented Generation for Cultural Nuances | May 16, 2025 | Language Modeling | Unverified | 0 |
| On DeepSeekMoE: Statistical Benefits of Shared Experts and Normalized Sigmoid Gating | May 16, 2025 | Language Modeling | Unverified | 0 |
| Unifying Segment Anything in Microscopy with Multimodal Large Language Model | May 16, 2025 | Language Modeling | Code Available | 1 |
| ADALog: Adaptive Unsupervised Anomaly detection in Logs with Self-attention Masked Language Model | May 15, 2025 | Anomaly Detection, Language Modeling | Unverified | 0 |
| ChestyBot: Detecting and Disrupting Chinese Communist Party Influence Stratagems | May 15, 2025 | Language Modeling | Unverified | 0 |
| Tracr-Injection: Distilling Algorithms into Pre-trained Language Models | May 15, 2025 | Language Modeling | Code Available | 0 |
| UDDETTS: Unifying Discrete and Dimensional Emotions for Controllable Emotional Text-to-Speech | May 15, 2025 | Emotional Speech Synthesis, Language Modeling | Unverified | 0 |
| Automating Security Audit Using Large Language Model based Agent: An Exploration Experiment | May 15, 2025 | Language Modeling | Unverified | 0 |
| CRPE: Expanding The Reasoning Capability of Large Language Model for Code Generation | May 15, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention | May 15, 2025 | Code Generation, Language Modeling | Code Available | 0 |
| ImagineBench: Evaluating Reinforcement Learning with Large Language Model Rollouts | May 15, 2025 | Continual Learning, Language Modeling | Code Available | 1 |
| Advanced Crash Causation Analysis for Freeway Safety: A Large Language Model Approach to Identifying Key Contributing Factors | May 15, 2025 | Language Modeling | Unverified | 0 |