| ADIFF: Explaining audio difference using natural language | Feb 6, 2025 | AudioCapsAudio captioning | CodeCode Available | 1 | 5 |
| Backpack Language Models | May 26, 2023 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| Dialogue State Tracking with a Language Model using Schema-Driven Prompting | Sep 15, 2021 | Dialogue State TrackingLanguage Modeling | CodeCode Available | 1 | 5 |
| Dialogue Summaries as Dialogue States (DS2), Template-Guided Summarization for Few-shot Dialogue State Tracking | Mar 3, 2022 | Abstractive Dialogue SummarizationDialogue State Tracking | CodeCode Available | 1 | 5 |
| How Language Model Hallucinations Can Snowball | May 22, 2023 | HallucinationLanguage Modeling | CodeCode Available | 1 | 5 |
| AgentGen: Enhancing Planning Abilities for Large Language Model based Agent via Environment and Task Generation | Aug 1, 2024 | DiversityLanguage Modeling | CodeCode Available | 1 | 5 |
| Automated Generation of Challenging Multiple-Choice Questions for Vision Language Model Evaluation | Jan 6, 2025 | Language Model EvaluationLanguage Modeling | CodeCode Available | 1 | 5 |
| CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation | Jul 9, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |
| CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models | Feb 20, 2025 | BlockingLanguage Modeling | CodeCode Available | 1 | 5 |
| How Much Knowledge Can You Pack Into the Parameters of a Language Model? | Feb 10, 2020 | Language ModelingLanguage Modelling | CodeCode Available | 1 | 5 |