SOTAVerified

Open-Domain Dialog

Papers

Showing 1-25 of 60 papers

| Title | Status | Hype |
|---|---|---|
| GODEL: Large-Scale Pre-Training for Goal-Directed Dialog | Code | 2 |
| CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AI | Code | 2 |
| Re2G: Retrieve, Rerank, Generate | Code | 1 |
| InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction Tuning | Code | 1 |
| CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response Generation | Code | 1 |
| Hurdles to Progress in Long-form Question Answering | Code | 1 |
| Dialogue Response Ranking Training with Large-Scale Human Feedback Data | Code | 1 |
| KILT: a Benchmark for Knowledge Intensive Language Tasks | Code | 1 |
| Unsupervised Evaluation of Interactive Dialog with DialoGPT | Code | 1 |
| Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog Evaluation | Code | 1 |
| USR: An Unsupervised and Reference Free Evaluation Metric for Dialog Generation | Code | 1 |
| ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers | Code | 1 |
| RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog Systems | Code | 1 |
| An Empirical Study on Context Length for Open-Domain Dialog Generation | Code | 0 |
| Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations | | 0 |
| Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog Generation | Code | 0 |
| Enhancing Task Bot Engagement with Synthesized Open-Domain Dialog | | 0 |
| Open-Domain Dialog Evaluation using Follow-Ups Likelihood | Code | 0 |
| Interactive Evaluation of Dialog Track at DSTC9 | | 0 |
| Uncovering Surprising Event Boundaries in Narratives | | 0 |
| Sketching a Linguistically-Driven Reasoning Dialog Model for Social Talk | | 0 |
| What is wrong with you?: Leveraging User Sentiment for Automatic Dialog Evaluation | Code | 0 |
| Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks | | 0 |
| Towards Learning Through Open-Domain Dialog | | 0 |
| An Empirical Study of Topic Transition in Dialogue | | 0 |

Benchmark Results

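| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Hindsight | KILT-RL | 11.92 | | Unverified |
| 2 | Re2G | KILT-RL | 11.39 | | Unverified |
| 3 | intersect | KILT-RL | 10.45 | | Unverified |
| 4 | KGI | KILT-RL | 10.36 | | Unverified |
| 5 | RAG | KILT-RL | 7.59 | | Unverified |
| 6 | Wikipedia | KILT-RL | 6.55 | | Unverified |
| 7 | Multitask DPR + BART | KILT-RL | 5.91 | | Unverified |
| 8 | Routing Transformer, c-REALM | KILT-RL | 4.41 | | Unverified |
| 9 | BART + DPR | KILT-RL | 3.71 | | Unverified |
| 10 | multitask | KILT-RL | 2.04 | | Unverified |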