| CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models | May 27, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| HAD: Hybrid Architecture Distillation Outperforms Teacher in Genomic Sequence Modeling | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Improved Representation Steering for Language Models | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| PolarGrad: A Class of Matrix-Gradient Optimizers from a Unifying Preconditioning Perspective | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| A Lightweight Multi-Expert Generative Language Model System for Engineering Information and Knowledge Extraction | May 27, 2025 | Domain AdaptationHallucination | —Unverified | 0 |
| REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning | May 27, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| StreamLink: Large-Language-Model Driven Distributed Data Engineering System | May 27, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Creativity in LLM-based Multi-Agent Systems: A Survey | May 27, 2025 | Image GenerationLanguage Modeling | —Unverified | 0 |
| Complex System Diagnostics Using a Knowledge Graph-Informed and Large Language Model-Enhanced Framework | May 27, 2025 | DiagnosticKnowledge Graphs | —Unverified | 0 |
| Accelerating Diffusion Language Model Inference via Efficient KV Caching and Guided Diffusion | May 27, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |