| Pragmatically Appropriate Diversity for Dialogue Evaluation | Apr 6, 2023 | Dialogue Evaluation, Diversity | Unverified | 0 |
| GLM-Dialog: Noise-tolerant Pre-training for Knowledge-grounded Dialogue Generation | Feb 28, 2023 | Dialogue Evaluation, Dialogue Generation | Code Available | 1 |
| Improving Open-Domain Dialogue Evaluation with a Causal Inference Model | Jan 31, 2023 | Causal Inference, counterfactual | Unverified | 0 |
| Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems | Dec 18, 2022 | Chatbot, Dialogue Evaluation | Code Available | 1 |
| PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment | Dec 18, 2022 | Data Augmentation, Dialogue Evaluation | Unverified | 0 |
| FineD-Eval: Fine-grained Automatic Dialogue-Level Evaluation | Oct 25, 2022 | Dialogue Evaluation | Code Available | 1 |
| Joint Goal Segmentation and Goal Success Prediction on Multi-Domain Conversations | Oct 1, 2022 | Dialogue Evaluation, Multi-Task Learning | Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue Evaluation, Offline RL | Unverified | 0 |
| SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation | Aug 17, 2022 | Contrastive Learning, Dialogue Evaluation | Code Available | 0 |
| Explaining Dialogue Evaluation Metrics using Adversarial Behavioral Analysis | Jul 1, 2022 | Dialogue Evaluation | Unverified | 0 |