| LLM-SEM: A Sentiment-Based Student Engagement Metric Using LLMS for E-Learning Platforms | Dec 18, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| The Reliability Paradox: Exploring How Shortcut Learning Undermines Language Model Calibration | Dec 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| On the Structural Memory of LLM Agents | Dec 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Posterior Mean Matching: Generative Modeling through Online Bayesian Inference | Dec 17, 2024 | Bayesian Inference, Image Generation | Unverified | 0 |
| DnDScore: Decontextualization and Decomposition for Factuality Verification in Long-Form Text Generation | Dec 17, 2024 | Form, Language Modeling | Code Available | 0 |
| LMUnit: Fine-grained Evaluation with Natural Language Unit Tests | Dec 17, 2024 | Language Model Evaluation, Language Modeling | Unverified | 0 |
| Core Context Aware Attention for Long Context Language Modeling | Dec 17, 2024 | Computational Efficiency, Language Modeling | Unverified | 0 |
| DuSSS: Dual Semantic Similarity-Supervised Vision-Language Model for Semi-Supervised Medical Image Segmentation | Dec 17, 2024 | Contrastive Learning, Image Segmentation | Code Available | 1 |
| SnakModel: Lessons Learned from Training an Open Danish Large Language Model | Dec 17, 2024 | Language Modeling, Language Modelling | Code Available | 1 |
| Uncertainty-Aware Hybrid Inference with On-Device Small and Remote Large Language Models | Dec 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |