| X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Diagnosing our datasets: How does my language model learn clinical information? | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Likelihood Variance as Text Importance for Resampling Texts to Map Language Models | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Internal and External Impacts of Natural Language Processing Papers | May 21, 2025 | ArticlesEthics | —Unverified | 0 |
| Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Revealing Language Model Trajectories via Kullback-Leibler Divergence | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling | May 21, 2025 | Emotion RecognitionFace Detection | —Unverified | 0 |
| Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective | May 21, 2025 | Instruction FollowingLanguage Modeling | —Unverified | 0 |
| Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | BenchmarkingLanguage Modeling | CodeCode Available | 0 |
| Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering | May 21, 2025 | counterfactualDenoising | —Unverified | 0 |
| lmgame-Bench: How Good are LLMs at Playing Games? | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 4 |
| ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMs | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution | May 21, 2025 | GPULanguage Modeling | —Unverified | 0 |
| Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Human in the Loop Adaptive Optimization for Improved Time Series Forecasting | May 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning | May 21, 2025 | Domain GeneralizationLanguage Modeling | —Unverified | 0 |
| Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning | May 21, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Large Language Model-Driven Distributed Integrated Multimodal Sensing and Semantic Communications | May 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |