SOTAVerified

Open-Domain Dialog

Papers

Showing 1-25 of 60 papers

Title | Status | Hype
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | Code | 2
GODEL: Large-Scale Pre-Training for Goal-Directed Dialog | Code | 2
Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation | Code | 1
KILT: a Benchmark for Knowledge Intensive Language Tasks | Code | 1
RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems | Code | 1
Dialogue Response Ranking Training with Large-Scale Human Feedback Data | Code | 1
Unsupervised Evaluation of Interactive Dialog with DialoGPT | Code | 1
Hurdles to Progress in Long-form Question Answering | Code | 1
InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning | Code | 1
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers | Code | 1
Re2G: Retrieve, Rerank, Generate | Code | 1
CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation | Code | 1
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | Code | 1
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical Study | Code | 0
Open-Domain Dialog Evaluation using Follow-Ups Likelihood | Code | 0
GenSF: Simultaneous Adaptation of Generative Pre-trained Models and Slot Filling | Code | 0
Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog Systems | Code | 0
Probing Neural Dialog Models for Conversational Understanding | Code | 0
Evaluating Coherence in Dialogue Systems using Entailment | Code | 0
Investigating Robustness of Dialog Models to Popular Figurative Language Constructs | Code | 0
Improving Automated Evaluation of Open Domain Dialog via Diverse Reference Augmentation | Code | 0
Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple References | Code | 0
An Empirical Study on Context Length for Open-Domain Dialog Generation | Code | 0
Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog Generation | Code | 0
HERALD: An Annotation Efficient Method to Detect User Disengagement in Social Conversations | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Hindsight | KILT-RL | 11.92 | | Unverified
2 | Re2G | KILT-RL | 11.39 | | Unverified
3 | intersect | KILT-RL | 10.45 | | Unverified
4 | KGI | KILT-RL | 10.36 | | Unverified
5 | RAG | KILT-RL | 7.59 | | Unverified
6 | Wikipedia | KILT-RL | 6.55 | | Unverified
7 | Multitask DPR + BART | KILT-RL | 5.91 | | Unverified
8 | Routing Transformer, c-REALM | KILT-RL | 4.41 | | Unverified
9 | BART + DPR | KILT-RL | 3.71 | | Unverified
10 | multitask | KILT-RL | 2.04 | | Unverified