| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improved Representation Steering for Language Models | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| StreamLink: Large-Language-Model Driven Distributed Data Engineering System | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction | May 27, 2025 | Domain AdaptationHallucination | —Unverified | 0 |
| Creativity in LLM-based Multi-Agent Systems: A Survey | May 27, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion | May 27, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework | May 27, 2025 | DiagnosticKnowledge Graphs | —Unverified | 0 |
| Automated Privacy Information Annotation in Large Language Model Interactions | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| LLM Web Dynamics: Tracing Model Collapse in a Network of LLMs | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| VSCBench: Bridging the Gap in Vision-Language Model Safety Calibration | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| What Changed? Detecting and Evaluating Instruction-Guided Image Edits with Multimodal Large Language Models | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Can Compressed LLMs Truly Act? An Empirical Evaluation of Agentic Capabilities in LLM Compression | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Unifying Multimodal Large Language Model Capabilities and Modalities via Model Merging | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Ankh3: Multi-Task Pretraining with Sequence Denoising and Completion Enhances Protein Representations | May 26, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Style2Code: A Style-Controllable Code Generation Framework with Dual-Modal Contrastive Representation Learning | May 26, 2025 | Code GenerationContrastive Learning | CodeCode Available | 0 |
| Hierarchical Tree Search-based User Lifelong Behavior Modeling on Large Language Model | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Causal-LLaVA: Causal Disentanglement for Mitigating Hallucination in Multimodal Large Language Models | May 26, 2025 | DisentanglementHallucination | CodeCode Available | 0 |
| Editing as Unlearning: Are Knowledge Editing Methods Strong Baselines for Large Language Model Unlearning? | May 26, 2025 | In-Context Learningknowledge editing | —Unverified | 0 |
| WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference | May 26, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| Learning to Select In-Context Demonstration Preferred by Large Language Model | May 26, 2025 | In-Context LearningLanguage Modeling | —Unverified | 0 |
| Language Model-Enhanced Message Passing for Heterophilic Graph Learning | May 26, 2025 | Active LearningGraph Learning | —Unverified | 0 |
| MSD-LLM: Predicting Ship Detention in Port State Control Inspections with Large Language Model | May 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |