SOTAVerified

Open-Domain Dialog

Papers

Showing 150 of 60 papers

TitleStatusHype
GODEL: Large-Scale Pre-Training for Goal-Directed DialogCode2
CPED: A Large-Scale Chinese Personalized and Emotional Dialogue Dataset for Conversational AICode2
Re2G: Retrieve, Rerank, GenerateCode1
InstructDial: Improving Zero and Few-shot Generalization in Dialogue through Instruction TuningCode1
CoMAE: A Multi-factor Hierarchical Framework for Empathetic Response GenerationCode1
Hurdles to Progress in Long-form Question AnsweringCode1
Dialogue Response Ranking Training with Large-Scale Human Feedback DataCode1
KILT: a Benchmark for Knowledge Intensive Language TasksCode1
Unsupervised Evaluation of Interactive Dialog with DialoGPTCode1
Beyond User Self-Reported Likert Scale Ratings: A Comparison Model for Automatic Dialog EvaluationCode1
USR: An Unsupervised and Reference Free Evaluation Metric for Dialog GenerationCode1
ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact CentersCode1
RUBER: An Unsupervised Method for Automatic Evaluation of Open-Domain Dialog SystemsCode1
An Empirical Study on Context Length for Open-Domain Dialog GenerationCode0
Social Commonsense-Guided Search Query Generation for Open-Domain Knowledge-Powered Conversations0
Dior-CVAE: Pre-trained Language Models and Diffusion Priors for Variational Dialog GenerationCode0
Enhancing Task Bot Engagement with Synthesized Open-Domain Dialog0
Open-Domain Dialog Evaluation using Follow-Ups LikelihoodCode0
Interactive Evaluation of Dialog Track at DSTC90
Uncovering Surprising Event Boundaries in Narratives0
Sketching a Linguistically-Driven Reasoning Dialog Model for Social Talk0
What is wrong with you?: Leveraging User Sentiment for Automatic Dialog EvaluationCode0
Towards Identifying Social Bias in Dialog Systems: Frame, Datasets, and Benchmarks0
Towards Learning Through Open-Domain Dialog0
An Empirical Study of Topic Transition in Dialogue0
User Response and Sentiment Prediction for Automatic Dialogue Evaluation0
Investigating the Impact of Pre-trained Language Models on Dialog Evaluation0
Investigating Robustness of Dialog Models to Popular Figurative Language ConstructsCode0
Enhancing Self-Disclosure In Neural Dialog Models By Candidate Re-ranking0
REAM : An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation0
GenSF: Simultaneous Adaptation of Generative Pre-trained Models and Slot FillingCode0
Improving Automated Evaluation of Open Domain Dialog via Diverse Reference AugmentationCode0
HERALD: An Annotation Efficient Method to Detect User Disengagement in Social ConversationsCode0
REAM: An Enhancement Approach to Reference-based Evaluation Metrics for Open-domain Dialog Generation0
ProphetNet-X: Large-Scale Pre-training Models for English, Chinese, Multi-lingual, Dialog, and Code GenerationCode0
Discovering Dialog Structure Graph for Open-Domain Dialog Generation0
Self-attention Comparison Module for Boosting Performance on Retrieval-based Open-Domain Dialog Systems0
Topic-relevant Response Generation using Optimal Transport for an Open-domain Dialog System0
Policy-Driven Neural Response Generation for Knowledge-Grounded Dialog Systems0
Are Neural Open-Domain Dialog Systems Robust to Speech Recognition Errors in the Dialog History? An Empirical StudyCode0
Probing Neural Dialog Models for Conversational UnderstandingCode0
Non-Autoregressive Neural Dialogue Generation0
Way Off-Policy Batch Deep Reinforcement Learning of Human Preferences in Dialog0
Hierarchical Reinforcement Learning for Open-Domain DialogCode0
A Multi-Turn Emotionally Engaging Dialog ModelCode0
Investigating Evaluation of Open-Domain Dialogue Systems With Human Generated Multiple ReferencesCode0
Deep Reinforcement Learning For Modeling Chit-Chat Dialog With Discrete Attributes0
Large-Scale Transfer Learning for Natural Language GenerationCode0
Way Off-Policy Batch Deep Reinforcement Learning of Implicit Human Preferences in DialogCode0
Approximating Interactive Human Evaluation with Self-Play for Open-Domain Dialog SystemsCode0
Show:102550
← PrevPage 1 of 2Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1HindsightKILT-RL11.92Unverified
2Re2GKILT-RL11.39Unverified
3intersectKILT-RL10.45Unverified
4KGIKILT-RL10.36Unverified
5RAGKILT-RL7.59Unverified
6WikipediaKILT-RL6.55Unverified
7Multitask DPR + BARTKILT-RL5.91Unverified
8Routing Transformer, c-REALMKILT-RL4.41Unverified
9BART + DPRKILT-RL3.71Unverified
10multitaskKILT-RL2.04Unverified