| Citrus: Leveraging Expert Cognitive Pathways in a Medical Language Model for Advanced Medical Decision Support | Feb 25, 2025 | Decision Making, Diagnostic | Code Available | 2 |
| Can LLMs Explain Themselves Counterfactually? | Feb 25, 2025 | counterfactual, Counterfactual Reasoning | Unverified | 0 |
| olmOCR: Unlocking Trillions of Tokens in PDFs with Vision Language Models | Feb 25, 2025 | Diversity, Language Modeling | Code Available | 11 |
| AMPO: Active Multi-Preference Optimization | Feb 25, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| LDGen: Enhancing Text-to-Image Synthesis via Large Language Model-Driven Language Representation | Feb 25, 2025 | Image Generation, Language Modeling | Unverified | 0 |
| Rank1: Test-Time Compute for Reranking in Information Retrieval | Feb 25, 2025 | Information Retrieval, Instruction Following | Code Available | 2 |
| Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization | Feb 25, 2025 | Language Modeling, Language Modelling | Code Available | 0 |
| TextGames: Learning to Self-Play Text-Based Puzzle Games via Language Model Reasoning | Feb 25, 2025 | Instruction Following, Language Modeling | Code Available | 0 |
| MindMem: Multimodal for Predicting Advertisement Memorability Using LLMs and Deep Learning | Feb 25, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| SPECTRE: An FFT-Based Efficient Drop-In Replacement to Self-Attention for Long Contexts | Feb 25, 2025 | Language Modeling, Language Modelling | Code Available | 2 |