Title | Date | Tasks | Code
Attention with Trained Embeddings Provably Selects Important Tokens | May 22, 2025 | Binary Classification, Language Modeling | Unverified
Diagnosing our datasets: How does my language model learn clinical information? | May 21, 2025 | Language Modeling, Language Modelling | Code Available
Human in the Loop Adaptive Optimization for Improved Time Series Forecasting | May 21, 2025 | Language Modeling, Language Modelling | Code Available
Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector | May 21, 2025 | Bias Detection, In-Context Learning | Unverified
CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective | May 21, 2025 | Instruction Following, Language Modeling | Unverified
DEBATE, TRAIN, EVOLVE: Self Evolution of Language Model Reasoning | May 21, 2025 | Domain Generalization, Language Modeling | Unverified
Ensembling Sparse Autoencoders | May 21, 2025 | Diversity, Language Modeling | Unverified
Internal and External Impacts of Natural Language Processing Papers | May 21, 2025 | Articles, Ethics | Unverified
Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering | May 21, 2025 | counterfactual, Denoising | Unverified
Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition | May 21, 2025 | Dialogue Generation, Language Modeling | Unverified
ClickSight: Interpreting Student Clickstreams to Reveal Insights on Learning Strategies via LLMs | May 21, 2025 | Language Modeling, Language Modelling | Code Available
MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling | May 21, 2025 | Emotion Recognition, Face Detection | Unverified
Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model | May 21, 2025 | Language Modeling, Language Modelling | Code Available
Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering | May 21, 2025 | Benchmarking, Language Modeling | Code Available
LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing | May 21, 2025 | Language Modeling, Language Modelling | Code Available
Mechanistic evaluation of Transformers and state space models | May 21, 2025 | Language Modelling, Mamba | Unverified
Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation | May 21, 2025 | Language Modeling, Language Modelling | Code Available
Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution | May 21, 2025 | GPU, Language Modeling | Unverified
Likelihood Variance as Text Importance for Resampling Texts to Map Language Models | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory | May 21, 2025 | Benchmarking, Language Modeling | Code Available
Revealing Language Model Trajectories via Kullback-Leibler Divergence | May 21, 2025 | Language Modeling, Language Modelling | Unverified
ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation | May 21, 2025 | Decision Making, Language Modeling | Code Available
X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System | May 21, 2025 | Language Modeling, Language Modelling | Code Available
Your Language Model Can Secretly Write Like Humans: Contrastive Paraphrase Attacks on LLM-Generated Text Detectors | May 21, 2025 | Language Modeling, Language Modelling | Unverified
Vision-Language Modeling Meets Remote Sensing: Models, Datasets and Perspectives | May 20, 2025 | Caption Generation, Contrastive Learning | Unverified
UniGen: Enhanced Training & Test-Time Strategies for Unified Multimodal Understanding and Generation | May 20, 2025 | Image Generation, Language Modeling | Unverified
CAFES: A Collaborative Multi-Agent Framework for Multi-Granular Multimodal Essay Scoring | May 20, 2025 | Automated Essay Scoring, Diversity | Unverified
Improve Language Model and Brain Alignment via Associative Memory | May 20, 2025 | Language Modeling, Language Modelling | Code Available
HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing | May 20, 2025 | Language Modeling, Language Modelling | Unverified
CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation | May 20, 2025 | Conditional Text Generation, Language Modeling | Unverified
Automated Journalistic Questions: A New Method for Extracting 5W1H in French | May 20, 2025 | Articles, Language Modeling | Unverified
Improving Noise Robustness of LLM-based Zero-shot TTS via Discrete Acoustic Token Denoising | May 20, 2025 | Decoder, Denoising | Unverified
Exploring Graph Representations of Logical Forms for Language Modeling | May 20, 2025 | Language Modeling, Language Modelling | Code Available
FuxiMT: Sparsifying Large Language Models for Chinese-Centric Multilingual Machine Translation | May 20, 2025 | Language Modeling, Language Modelling | Unverified
Large Language Model-Driven Distributed Integrated Multimodal Sensing and Semantic Communications | May 20, 2025 | Language Modeling, Language Modelling | Unverified
Structured Agent Distillation for Large Language Model | May 20, 2025 | Decision Making, Imitation Learning | Unverified
Studying the Role of Input-Neighbor Overlap in Retrieval-Augmented Language Models Training Efficiency | May 20, 2025 | Language Modeling, Language Modelling | Unverified
Too Long, Didn't Model: Decomposing LLM Long-Context Understanding With Novels | May 20, 2025 | Language Modeling, Language Modelling | Code Available
MultiHal: Multilingual Dataset for Knowledge-Graph Grounded Evaluation of LLM Hallucinations | May 20, 2025 | Fact Checking, Hallucination | Code Available
sudoLLM : On Multi-role Alignment of Language Models | May 20, 2025 | Language Modeling, Language Modelling | Unverified
TRATES: Trait-Specific Rubric-Assisted Cross-Prompt Essay Scoring | May 20, 2025 | Automated Essay Scoring, Language Modeling | Unverified
Rank-K: Test-Time Reasoning for Listwise Reranking | May 20, 2025 | Language Modeling, Language Modelling | Code Available
MAS-KCL: Knowledge component graph structure learning with large language model-based agentic workflow | May 20, 2025 | Graph structure learning, Language Modeling | Unverified