| U-NEED: A Fine-grained Dataset for User Needs-Centric E-commerce Conversational Recommendation | May 5, 2023 | Conversational RecommendationDialogue Evaluation | —Unverified | 0 |
| Pragmatically Appropriate Diversity for Dialogue Evaluation | Apr 6, 2023 | Dialogue EvaluationDiversity | —Unverified | 0 |
| Improving Open-Domain Dialogue Evaluation with a Causal Inference Model | Jan 31, 2023 | Causal Inferencecounterfactual | —Unverified | 0 |
| PoE: a Panel of Experts for Generalized Automatic Dialogue Assessment | Dec 18, 2022 | Data AugmentationDialogue Evaluation | —Unverified | 0 |
| Joint Goal Segmentation and Goal Success Prediction on Multi-Domain Conversations | Oct 1, 2022 | Dialogue EvaluationMulti-Task Learning | —Unverified | 0 |
| Dialogue Evaluation with Offline Reinforcement Learning | Sep 2, 2022 | Dialogue EvaluationOffline RL | —Unverified | 0 |
| SelF-Eval: Self-supervised Fine-grained Dialogue Evaluation | Aug 17, 2022 | Contrastive LearningDialogue Evaluation | CodeCode Available | 0 |
| Explaining Dialogue Evaluation Metrics using Adversarial Behavioral Analysis | Jul 1, 2022 | Dialogue Evaluation | —Unverified | 0 |
| MME-CRS: Multi-Metric Evaluation Based on Correlation Re-Scaling for Evaluating Open-Domain Dialogue | Jun 19, 2022 | Dialogue EvaluationMME | —Unverified | 0 |
| AdaCoach: A Virtual Coach for Training Customer Service Agents | Apr 27, 2022 | Dialogue Evaluation | —Unverified | 0 |