SOTAVerified

Chatbot

Chatbot or conversational AI is a language model designed and implemented to have conversations with humans.

Source: Open Data Chatbot

Image source

Papers

Showing 2650 of 971 papers

TitleStatusHype
AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals0
What is Stigma Attributed to? A Theory-Grounded, Expert-Annotated Interview Corpus for Demystifying Mental-Health StigmaCode1
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language ModelsCode1
Enhancing User-Oriented Proactivity in Open-Domain Dialogues with Critic Guidance0
Let's have a chat with the EU AI Act0
GenAI Security: Outsmarting the Bots with a Proactive Testing Framework0
WaLLM -- Insights from an LLM-Powered Chatbot deployment via WhatsApp0
An empathic GPT-based chatbot to talk about mental disorders with Spanish teenagers0
Large Language Models are often politically extreme, usually ideologically inconsistent, and persuasive even in informational contexts0
Steerable Chatbots: Personalizing LLMs with Preference-Based Activation Steering0
A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models0
LlamaFirewall: An open source guardrail system for building secure AI agents0
Social Biases in Knowledge Representations of Wikidata separates Global North from Global SouthCode0
LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech SynthesisCode3
Emotions in the Loop: A Survey of Affective Computing for Emotional Support0
Enhancing ML Model Interpretability: Leveraging Fine-Tuned Large Language Models for Better Understanding of AI0
The Leaderboard Illusion0
Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses0
AI Chatbots for Mental Health: Values and Harms from Lived Experiences of Depression0
Scaling Laws For Scalable Oversight0
Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False PresuppositionsCode0
CHARM: Calibrating Reward Models With Chatbot Arena ScoresCode1
Confirmation Bias in Generative AI Chatbots: Mechanisms, Risks, Mitigation Strategies, and Future Research Directions0
Learning from Elders: Making an LLM-powered Chatbot for Retirement Communities more Accessible through User-centered Design0
Data Requirement Goal Modeling for Machine Learning Systems0
Show:102550
← PrevPage 2 of 39Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Yi 34B ChatAverage win rate27.2Unverified