Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models May 22, 2025 Benchmarking Language Modeling
— Unverified 0LaViDa: A Large Diffusion Language Model for Multimodal Understanding May 22, 2025 Instruction Following Language Modeling
Code Code Available 3SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning May 22, 2025 Language Modeling Language Modelling
Code Code Available 0Large Language Model-Empowered Interactive Load Forecasting May 22, 2025 Language Modeling Language Modelling
— Unverified 0CASTILLO: Characterizing Response Length Distributions of Large Language Models May 22, 2025 Instruction Following Language Modeling
Code Code Available 0PaTH Attention: Position Encoding via Accumulating Householder Transformations May 22, 2025 Language Modeling Language Modelling
— Unverified 0DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation May 22, 2025 Language Modeling Language Modelling
— Unverified 0EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions May 22, 2025 Claim Verification Fact Checking
Code Code Available 0LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning May 22, 2025 Language Modeling Language Modelling
— Unverified 0How do Scaling Laws Apply to Knowledge Graph Engineering Tasks? The Impact of Model Size on Large Language Model Performance May 22, 2025 Language Modeling Language Modelling
— Unverified 0TensorAR: Refinement is All You Need in Autoregressive Image Generation May 22, 2025 All Image Generation
— Unverified 0Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering May 22, 2025 Global Facts Language Modeling
Code Code Available 0Latent Principle Discovery for Language Model Self-Improvement May 22, 2025 Clustering Language Modeling
— Unverified 0Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality May 22, 2025 Abstractive Text Summarization Informativeness
Code Code Available 0Dimple: Discrete Diffusion Multimodal Large Language Model with Parallel Decoding May 22, 2025 Language Modeling Language Modelling
Code Code Available 2Incremental Sequence Classification with Temporal Consistency May 22, 2025 Classification Language Modeling
— Unverified 0Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning May 22, 2025 Language Modeling Language Modelling
— Unverified 0A Comprehensive Evaluation of Contemporary ML-Based Solvers for Combinatorial Optimization May 22, 2025 Combinatorial Optimization Language Modeling
Code Code Available 1On Multilingual Encoder Language Model Compression for Low-Resource Languages May 22, 2025 Knowledge Distillation Language Modeling
— Unverified 0Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning May 22, 2025 Language Modeling Language Modelling
— Unverified 0Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks May 22, 2025 Code Generation Language Modeling
— Unverified 0CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning May 22, 2025 Language Modeling Language Modelling
— Unverified 0Edge-First Language Model Inference: Models, Metrics, and Tradeoffs May 22, 2025 Benchmarking Language Modeling
— Unverified 0Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine May 22, 2025 Causal Inference Drug Discovery
— Unverified 0MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing May 22, 2025 Language Modeling Language Modelling
— Unverified 0Any Large Language Model Can Be a Reliable Judge: Debiasing with a Reasoning-based Bias Detector May 21, 2025 Bias Detection In-Context Learning
— Unverified 0Forging Time Series with Language: A Large Language Model Approach to Synthetic Data Generation May 21, 2025 Language Modeling Language Modelling
— Unverified 0Aligning Dialogue Agents with Global Feedback via Large Language Model Reward Decomposition May 21, 2025 Dialogue Generation Language Modeling
— Unverified 0Ensembling Sparse Autoencoders May 21, 2025 Diversity Language Modeling
— Unverified 0ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding Validation May 21, 2025 Decision Making Language Modeling
Code Code Available 0Keep Security! Benchmarking Security Policy Preservation in Large Language Model Contexts Against Indirect Attacks in Question Answering May 21, 2025 Benchmarking Language Modeling
Code Code Available 0Internal and External Impacts of Natural Language Processing Papers May 21, 2025 Articles Ethics
— Unverified 0Diagnosing our datasets: How does my language model learn clinical information? May 21, 2025 Language Modeling Language Modelling
Code Code Available 0MIKU-PAL: An Automated and Standardized Multi-Modal Method for Speech Paralinguistic and Affect Labeling May 21, 2025 Emotion Recognition Face Detection
— Unverified 0Diffusion vs. Autoregressive Language Models: A Text Embedding Perspective May 21, 2025 Instruction Following Language Modeling
— Unverified 0CP-LLM: Context and Pixel Aware Large Language Model for Video Quality Assessment May 21, 2025 Language Modeling Language Modelling
— Unverified 0Segmentation-Variant Codebooks for Preservation of Paralinguistic and Prosodic Information May 21, 2025 Language Modeling Language Modelling
— Unverified 0Likelihood Variance as Text Importance for Resampling Texts to Map Language Models May 21, 2025 Language Modeling Language Modelling
— Unverified 0Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model May 21, 2025 Language Modeling Language Modelling
— Unverified 0LyapLock: Bounded Knowledge Preservation in Sequential Large Language Model Editing May 21, 2025 Language Modeling Language Modelling
Code Code Available 0Leveraging Online Data to Enhance Medical Knowledge in a Small Persian Language Model May 21, 2025 Language Modeling Language Modelling
Code Code Available 0Revealing Language Model Trajectories via Kullback-Leibler Divergence May 21, 2025 Language Modeling Language Modelling
— Unverified 0X-WebAgentBench: A Multilingual Interactive Web Benchmark for Evaluating Global Agentic System May 21, 2025 Language Modeling Language Modelling
Code Code Available 0Self-GIVE: Associative Thinking from Limited Structured Knowledge for Enhanced Large Language Model Reasoning May 21, 2025 Language Modeling Language Modelling
— Unverified 0Short-Range Dependency Effects on Transformer Instability and a Decomposed Attention Solution May 21, 2025 GPU Language Modeling
— Unverified 0Listen to the Context: Towards Faithful Large Language Models for Retrieval Augmented Generation on Climate Questions May 21, 2025 Language Modeling Language Modelling
— Unverified 0Lost in Benchmarks? Rethinking Large Language Model Benchmarking with Item Response Theory May 21, 2025 Benchmarking Language Modeling
Code Code Available 0Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering May 21, 2025 counterfactual Denoising
— Unverified 0Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation May 21, 2025 Language Modeling Language Modelling
Code Code Available 0Trajectory Bellman Residual Minimization: A Simple Value-Based Method for LLM Reasoning May 21, 2025 Language Modeling Language Modelling
— Unverified 0