| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improved Representation Steering for Language Models | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction | May 27, 2025 | Domain AdaptationHallucination | —Unverified | 0 |
| Creativity in LLM-based Multi-Agent Systems: A Survey | May 27, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| StreamLink: Large-Language-Model Driven Distributed Data Engineering System | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Automated Privacy Information Annotation in Large Language Model Interactions | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion | May 27, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework | May 27, 2025 | DiagnosticKnowledge Graphs | —Unverified | 0 |
| LLM Web Dynamics: Tracing Model Collapse in a Network of LLMs | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| SafeDPO: A Simple Approach to Direct Preference Optimization with Enhanced Safety | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Ankh3: Multi-Task Pretraining with Sequence Denoising and Completion Enhances Protein Representations | May 26, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Learning to Select In-Context Demonstration Preferred by Large Language Model | May 26, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Language Model-Enhanced Message Passing for Heterophilic Graph Learning | May 26, 2025 | Active LearningGraph Learning | —Unverified | 0 |
| MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning? | May 26, 2025 | In-Context Learningknowledge editing | —Unverified | 0 |
| Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Style2Code: A Style-Controllable Code Generation Framework with Dual-Modal Contrastive Representation Learning | May 26, 2025 | Code GenerationContrastive Learning | CodeCode Available | 0 |
| Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models | May 26, 2025 | DisentanglementHallucination | CodeCode Available | 0 |
| ImgEdit: A Unified Image Editing Dataset and Benchmark | May 26, 2025 | Image Editing | CodeCode Available | 4 |
| Attention! You Vision Language Model Could Be Maliciously Manipulated | May 26, 2025 | Decision MakingHallucination | —Unverified | 0 |
| ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining | May 26, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 |
| SeMe: Training-Free Language Model Merging via Semantic Alignment | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| It's High Time: A Survey of Temporal Information Retrieval and Question Answering | May 26, 2025 | ArticlesInformation Retrieval | —Unverified | 0 |
| ResSVD: Residual Compensated SVD for Large Language Model Compression | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Causal Distillation: Transferring Structured Explanations from Large to Compact Language Models | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| REARANK: Reasoning Re-ranking Agent via Reinforcement Learning | May 26, 2025 | Data AugmentationInformation Retrieval | CodeCode Available | 1 |
| Balancing Computation Load and Representation Expressivity in Parallel Hybrid Neural Networks | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| DiffVLA: Vision-Language Guided Diffusion Planning for Autonomous Driving | May 26, 2025 | Autonomous DrivingDiversity | —Unverified | 0 |
| TrojanStego: Your Language Model Can Secretly Be A Steganographic Privacy Leaking Agent | May 26, 2025 | Human DetectionLanguage Modeling | CodeCode Available | 0 |
| On the Same Page: Dimensions of Perceived Shared Understanding in Human-AI Interaction | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Dynamically Learned Test-Time Model Routing in Language Model Zoos with Service Level Guarantees | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation | May 26, 2025 | DecoderLanguage Modeling | CodeCode Available | 3 |
| LLM-Agent-Controller: A Universal Multi-Agent Large Language Model System as a Control Engineer | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Adaptive Classifier-Free Guidance via Dynamic Low-Confidence Masking | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Evaluating Steering Techniques using Human Similarity Judgments | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Paying Alignment Tax with Contrastive Learning | May 25, 2025 | Contrastive LearningLanguage Modeling | —Unverified | 0 |
| LLaDA 1.5: Variance-Reduced Preference Optimization for Large Language Diffusion Models | May 25, 2025 | GSM8KHumanEval | —Unverified | 0 |
| ScreenExplorer: Training a Vision-Language Model for Diverse Exploration in Open GUI World | May 25, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| OpenHOI: Open-World Hand-Object Interaction Synthesis with Multimodal Large Language Model | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| FiLLM -- A Filipino-optimized Large Language Model based on Southeast Asia Large Language Model (SEALLM) | May 25, 2025 | Dependency ParsingLanguage Modeling | —Unverified | 0 |
| REACT: Representation Extraction And Controllable Tuning to Overcome Overfitting in LLM Knowledge Editing | May 25, 2025 | knowledge editingLanguage Modeling | —Unverified | 0 |