| Improving Weak-to-Strong Generalization with Scalable Oversight and Ensemble Learning | Feb 1, 2024 | Ensemble LearningIn-Context Learning | CodeCode Available | 1 |
| Prompting Large Language Models for Zero-Shot Clinical Prediction with Structured Longitudinal Electronic Health Record Data | Jan 25, 2024 | Decision MakingIn-Context Learning | CodeCode Available | 1 |
| K-QA: A Real-World Medical Q&A Benchmark | Jan 25, 2024 | HallucinationIn-Context Learning | CodeCode Available | 1 |
| Enhancing In-context Learning via Linear Probe Calibration | Jan 22, 2024 | In-Context Learning | CodeCode Available | 1 |
| Revisiting Demonstration Selection Strategies in In-Context Learning | Jan 22, 2024 | In-Context Learning | CodeCode Available | 1 |
| Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs | Jan 18, 2024 | In-Context Learning | CodeCode Available | 1 |
| Batch-ICL: Effective, Efficient, and Order-Agnostic In-Context Learning | Jan 12, 2024 | In-Context LearningZero-Shot Learning | CodeCode Available | 1 |
| The Unreasonable Effectiveness of Easy Training Data for Hard Tasks | Jan 12, 2024 | General KnowledgeIn-Context Learning | CodeCode Available | 1 |
| MobileAgent: enhancing mobile control via human-machine interaction and SOP integration | Jan 4, 2024 | In-Context Learning | CodeCode Available | 1 |
| DIALIGHT: Lightweight Multilingual Development and Evaluation of Task-Oriented Dialogue Systems with Large Language Models | Jan 4, 2024 | In-Context LearningTask-Oriented Dialogue Systems | CodeCode Available | 1 |