Skip-Thinking: Chunk-wise Chain-of-Thought Distillation Enable Smaller Language Models to Reason Better and Faster May 24, 2025 Heuristic Search Language Modeling
— Unverified 0TULUN: Transparent and Adaptable Low-resource Machine Translation May 24, 2025 Domain Adaptation Language Modeling
Code Code Available 0MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors May 24, 2025 Language Modeling Language Modelling
— Unverified 0Anchored Diffusion Language Model May 24, 2025 Language Modeling Language Modelling
— Unverified 0Chain-of-Zoom: Extreme Super-Resolution via Scale Autoregression and Preference Alignment May 24, 2025 Image Super-Resolution Language Modeling
— Unverified 0EvdCLIP: Improving Vision-Language Retrieval with Entity Visual Descriptions from Large Language Models May 24, 2025 Image-text Retrieval Language Modeling
— Unverified 0Beyond Masked and Unmasked: Discrete Diffusion Models via Partial Masking May 24, 2025 Image Generation Language Modelling
— Unverified 0Inference Compute-Optimal Video Vision Language Models May 24, 2025 Language Modeling Language Modelling
— Unverified 0Building a Functional Machine Translation Corpus for Kpelle May 24, 2025 Data Augmentation Language Modelling
— Unverified 0Disentangling Knowledge Representations for Large Language Model Editing May 24, 2025 Disentanglement knowledge editing
— Unverified 0BiomechGPT: Towards a Biomechanically Fluent Multimodal Foundation Model for Clinically Relevant Motion Tasks May 24, 2025 Activity Recognition Descriptive
— Unverified 0ELDeR: Getting Efficient LLMs through Data-Driven Regularized Layer-wise Pruning May 23, 2025 Language Modeling Language Modelling
— Unverified 0Inference-Time Decomposition of Activations (ITDA): A Scalable Approach to Interpreting Large Language Models May 23, 2025 GPU Language Modeling
Code Code Available 0Large language model as user daily behavior data generator: balancing population diversity and individual personality May 23, 2025 Data Augmentation Diversity
— Unverified 0Simulating Macroeconomic Expectations using LLM Agents May 23, 2025 Language Modeling Language Modelling
— Unverified 0Multi-agent Systems for Misinformation Lifecycle : Detection, Correction And Source Identification May 23, 2025 AI Agent Language Modeling
— Unverified 0QwenLong-CPRS: Towards -LLMs with Dynamic Context Optimization May 23, 2025 4k Language Modeling
— Unverified 0Selection Mechanisms for Sequence Modeling using Linear State Space Models May 23, 2025 Fault Detection Language Modeling
— Unverified 0SpectraLDS: Provable Distillation for Linear Dynamical Systems May 23, 2025 Language Modeling Language Modelling
— Unverified 0keepitsimple at SemEval-2025 Task 3: LLM-Uncertainty based Approach for Multilingual Hallucination Span Detection May 23, 2025 Hallucination Language Modeling
Code Code Available 0Runaway is Ashamed, But Helpful: On the Early-Exit Behavior of Large Language Model-based Agents in Embodied Environments May 23, 2025 Language Modeling Language Modelling
Code Code Available 0Plan-R1: Safe and Feasible Trajectory Planning as Language Modeling May 23, 2025 Autonomous Driving Collision Avoidance
— Unverified 0Retrieval Augmented Generation-based Large Language Models for Bridging Transportation Cybersecurity Legal Knowledge Gaps May 23, 2025 Language Modeling Language Modelling
— Unverified 0NSNQuant: A Double Normalization Approach for Calibration-Free Low-Bit Vector Quantization of KV Cache May 23, 2025 Language Modeling Language Modelling
— Unverified 0Taming LLMs with Negative Samples: A Reference-Free Framework to Evaluate Presentation Content with Actionable Feedback May 23, 2025 Language Modeling Language Modelling
— Unverified 0LLaDA-V: Large Language Diffusion Models with Visual Instruction Tuning May 22, 2025 Language Modeling Language Modelling
— Unverified 0Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality May 22, 2025 Abstractive Text Summarization Informativeness
Code Code Available 0TensorAR: Refinement is All You Need in Autoregressive Image Generation May 22, 2025 All Image Generation
— Unverified 0PaTH Attention: Position Encoding via Accumulating Householder Transformations May 22, 2025 Language Modeling Language Modelling
— Unverified 0Large Language Model-Empowered Interactive Load Forecasting May 22, 2025 Language Modeling Language Modelling
— Unverified 0Plan and Budget: Effective and Efficient Test-Time Scaling on Large Language Model Reasoning May 22, 2025 Language Modeling Language Modelling
— Unverified 0SATURN: SAT-based Reinforcement Learning to Unleash Language Model Reasoning May 22, 2025 Language Modeling Language Modelling
Code Code Available 0On Multilingual Encoder Language Model Compression for Low-Resource Languages May 22, 2025 Knowledge Distillation Language Modeling
— Unverified 0Latent Principle Discovery for Language Model Self-Improvement May 22, 2025 Clustering Language Modeling
— Unverified 0Small-to-Large Generalization: Data Influences Models Consistently Across Scale May 22, 2025 Language Modeling Language Modelling
— Unverified 0MM-MovieDubber: Towards Multi-Modal Learning for Multi-Modal Movie Dubbing May 22, 2025 Language Modeling Language Modelling
— Unverified 0Mechanistic Understanding and Mitigation of Language Confusion in English-Centric Large Language Models May 22, 2025 Benchmarking Language Modeling
— Unverified 0Evaluating Large Language Model with Knowledge Oriented Language Specific Simple Question Answering May 22, 2025 Global Facts Language Modeling
Code Code Available 0Incremental Sequence Classification with Temporal Consistency May 22, 2025 Classification Language Modeling
— Unverified 0INFERENCEDYNAMICS: Efficient Routing Across LLMs through Structured Capability and Knowledge Profiling May 22, 2025 Language Modeling Language Modelling
— Unverified 0DeepRec: Towards a Deep Dive Into the Item Space with Large Language Model Based Recommendation May 22, 2025 Language Modeling Language Modelling
— Unverified 0Beyond Correlation: Towards Causal Large Language Model Agents in Biomedicine May 22, 2025 Causal Inference Drug Discovery
— Unverified 0Incentivizing Dual Process Thinking for Efficient Large Language Model Reasoning May 22, 2025 Language Modeling Language Modelling
— Unverified 0CASTILLO: Characterizing Response Length Distributions of Large Language Models May 22, 2025 Instruction Following Language Modeling
Code Code Available 0Edge-First Language Model Inference: Models, Metrics, and Tradeoffs May 22, 2025 Benchmarking Language Modeling
— Unverified 0How do Scaling Laws Apply to Knowledge Graph Engineering Tasks? The Impact of Model Size on Large Language Model Performance May 22, 2025 Language Modeling Language Modelling
— Unverified 0A Japanese Language Model and Three New Evaluation Benchmarks for Pharmaceutical NLP May 22, 2025 Continual Pretraining Diagnostic
Code Code Available 0Code Graph Model (CGM): A Graph-Integrated Large Language Model for Repository-Level Software Engineering Tasks May 22, 2025 Code Generation Language Modeling
— Unverified 0CTRAP: Embedding Collapse Trap to Safeguard Large Language Models from Harmful Fine-Tuning May 22, 2025 Language Modeling Language Modelling
— Unverified 0EMULATE: A Multi-Agent Framework for Determining the Veracity of Atomic Claims by Emulating Human Actions May 22, 2025 Claim Verification Fact Checking
Code Code Available 0