Dialogue Safety Prediction

Determine the safety of a given dialogue context.

Papers

Showing 1–3 of 3 papers

Title	Date	Tasks	Status	Hype
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red Teaming	Apr 6, 2024	Adversarial RobustnessDialogue Safety Prediction	CodeCode Available	2
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for Hallucinations	Apr 15, 2024	BenchmarkingBias Detection	CodeCode Available	1
ProsocialDialog: A Prosocial Backbone for Conversational Agents	May 25, 2022	Dialogue GenerationDialogue Safety Prediction	CodeCode Available	1

Show:10 25 50

No leaderboard results yet.