| Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering | May 22, 2025 | Global Facts, Language Modeling | Code Available | 0 |
| Structure-Aligned Protein Language Model | May 22, 2025 | Contrastive Learning, Language Modeling | Code Available | 2 |
| LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning | May 22, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP | May 22, 2025 | Continual Pretraining, Diagnostic | Code Available | 0 |
| On Multilingual Encoder Language Model Compression for Low-Resource Languages | May 22, 2025 | Knowledge Distillation, Language Modeling | Unverified | 0 |
| INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling | May 22, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| CASTILLO: Characterizing Response Length Distributions of Large Language Models | May 22, 2025 | Instruction Following, Language Modeling | Code Available | 0 |
| Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality | May 22, 2025 | Abstractive Text Summarization, Informativeness | Code Available | 0 |
| EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions | May 22, 2025 | Claim Verification, Fact Checking | Code Available | 0 |
| Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models | May 22, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| LaViDa: A Large Diffusion Language Model for Multimodal Understanding | May 22, 2025 | Instruction Following, Language Modeling | Code Available | 3 |
| TensorAR: Refinement is All You Need in Autoregressive Image Generation | May 22, 2025 | All, Image Generation | Unverified | 0 |
| Large Language Model-Empowered Interactive Load Forecasting | May 22, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization | May 22, 2025 | Combinatorial Optimization, Language Modeling | Code Available | 1 |
| Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine | May 22, 2025 | Causal Inference, Drug Discovery | Unverified | 0 |
| MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing | May 22, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks | May 22, 2025 | Code Generation, Language Modeling | Unverified | 0 |
| CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning | May 22, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Edge-First Language Model Inference: Models, Metrics, and Tradeoffs | May 22, 2025 | Benchmarking, Language Modeling | Unverified | 0 |
| Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector | May 21, 2025 | Bias Detection, In-Context Learning | Unverified | 0 |
| Ensembling Sparse Autoencoders | May 21, 2025 | Diversity, Language Modeling | Unverified | 0 |
| ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation | May 21, 2025 | Decision Making, Language Modeling | Code Available | 0 |
| Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition | May 21, 2025 | Dialogue Generation, Language Modeling | Unverified | 0 |
| Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Diagnosing our datasets: How does my language model learn clinical information? | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Likelihood Variance as Text Importance for Resampling Texts to Map Language Models | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Internal and External Impacts of Natural Language Processing Papers | May 21, 2025 | Articles, Ethics | Unverified | 0 |
| Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Revealing Language Model Trajectories via Kullback-Leibler Divergence | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | May 21, 2025 | Benchmarking, Language Modeling | Code Available | 0 |
| MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling | May 21, 2025 | Emotion Recognition, Face Detection | Unverified | 0 |
| Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective | May 21, 2025 | Instruction Following, Language Modeling | Unverified | 0 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | Benchmarking, Language Modeling | Code Available | 0 |
| Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering | May 21, 2025 | counterfactual, Denoising | Unverified | 0 |
| lmgame-Bench: How Good are LLMs at Playing Games? | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 4 |
| ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMs | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution | May 21, 2025 | GPU, Language Modeling | Unverified | 0 |
| Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Human in the Loop Adaptive Optimization for Improved Time Series Forecasting | May 21, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning | May 21, 2025 | Domain Generalization, Language Modeling | Unverified | 0 |
| Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning | May 21, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Large Language Model-Driven Distributed Integrated Multimodal Sensing and Semantic Communications | May 20, 2025 | Language Modeling, Language Modelling | Unverified | 0 |