| Language Generation with Strictly Proper Scoring Rules | May 29, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Guided Visual Question Answering: Elevate Your Multimodal Language Model Using Knowledge-Enriched Prompts | Oct 31, 2023 | Image CaptioningLanguage Modeling | CodeCode Available | 1 |
| Mathfish: Evaluating Language Model Math Reasoning via Grounding in Educational Curricula | Aug 8, 2024 | GSM8KLanguage Modeling | CodeCode Available | 1 |
| CAFe: Unifying Representation and Generation with Contrastive-Autoregressive Finetuning | Mar 25, 2025 | HallucinationLanguage Modeling | CodeCode Available | 1 |
| Evaluation Benchmarks for Spanish Sentence Representations | Apr 15, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Model Decoding as Likelihood-Utility Alignment | Oct 13, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| SecureBERT: A Domain-Specific Language Model for Cybersecurity | Apr 6, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Evaluating Human-Language Model Interaction | Dec 19, 2022 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| Language Modeling on Tabular Data: A Survey of Foundations, Techniques and Evolution | Aug 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 |
| EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Mar 11, 2025 | ChatbotLanguage Modeling | CodeCode Available | 1 |