SOTAVerified

Dialogue Safety Prediction

Determine the safety of a given dialogue context.

Papers

Showing 13 of 3 papers

TitleStatusHype
ALERT: A Comprehensive Benchmark for Assessing Large Language Models' Safety through Red TeamingCode2
Benchmarking Llama2, Mistral, Gemma and GPT for Factuality, Toxicity, Bias and Propensity for HallucinationsCode1
ProsocialDialog: A Prosocial Backbone for Conversational AgentsCode1
Show:102550

No leaderboard results yet.