| MID-L: Matrix-Interpolated Dropout Layer with Layer-wise Neuron Selection | May 16, 2025 | Informativeness | —Unverified | 0 |
| MolTextNet: A Two-Million Molecule-Text Dataset for Multimodal Molecular Learning | May 15, 2025 | Drug DiscoveryInformativeness | —Unverified | 0 |
| Are LLM-generated plain language summaries truly understandable? A large-scale crowdsourced evaluation | May 15, 2025 | InformativenessMultiple-choice | —Unverified | 0 |
| Judging the Judges: Can Large Vision-Language Models Fairly Evaluate Chart Comprehension and Reasoning? | May 13, 2025 | Chart Question AnsweringFact Checking | CodeCode Available | 0 |
| Emotion-Gradient Metacognitive RSI (Part I): Theoretical Foundations and Single-Agent Architecture | May 12, 2025 | Informativeness | —Unverified | 0 |
| Towards a Unified Representation Evaluation Framework Beyond Downstream Tasks | May 9, 2025 | DisentanglementInformativeness | CodeCode Available | 0 |
| Gap the (Theory of) Mind: Sharing Beliefs About Teammates' Goals Boosts Collaboration Perception, Not Performance | May 6, 2025 | Informativeness | —Unverified | 0 |
| An Empirical Study of Evaluating Long-form Question Answering | Apr 25, 2025 | FormInformativeness | CodeCode Available | 0 |
| Assessing the Potential of Generative Agents in Crowdsourced Fact-Checking | Apr 24, 2025 | Decision MakingFact Checking | —Unverified | 0 |
| Machine Learning Interpretation of Optical Spectroscopy Using Peak-Sensitive Logistic Regression | Apr 23, 2025 | Feature ImportanceInformativeness | CodeCode Available | 0 |