SOTAVerified

Chatbot

A chatbot, or conversational AI, is a language model designed to hold conversations with humans.

Papers

Showing 51–75 of 971 papers

Title	Date	Tasks	Status	Hype	Score
Characteristic AI Agents via Large Language Models	Mar 19, 2024	Chatbot	Code Available	1	5
ChatGPT: Jack of all trades, master of none	Feb 21, 2023	All, Chatbot	Code Available	1	5
CHARM: Calibrating Reward Models With Chatbot Arena Scores	Apr 14, 2025	Chatbot	Code Available	1	5
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark	Feb 27, 2024	Chatbot	Code Available	1	5
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principles	May 22, 2023	Chatbot, Decision Making	Code Available	1	5
Learning to Assist Humans without Inferring Rewards	Nov 4, 2024	Chatbot, reinforcement-learning	Code Available	1	5
Knowledge Graph-Driven Retrieval-Augmented Generation: Integrating Deepseek-R1 with Weaviate for Advanced Chatbot Applications	Feb 16, 2025	Chatbot, Language Modeling	Code Available	1	5
Learning Implicit User Profiles for Personalized Retrieval-Based Chatbot	Aug 18, 2021	Chatbot, Retrieval	Code Available	1	5
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances	Apr 22, 2022	Chatbot, Retrieval	Code Available	1	5
ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI	Sep 20, 2021	Abuse Detection, Abusive Language	Code Available	1	5
Hydragen: High-Throughput LLM Inference with Shared Prefixes	Feb 7, 2024	16k, Chatbot	Code Available	1	5
Improving Ontology Requirements Engineering with OntoChat and Participatory Prompting	Aug 9, 2024	Chatbot	Code Available	1	5
HHH: An Online Medical Chatbot System based on Knowledge Graph and Hierarchical Bi-Directional Attention	Feb 8, 2020	Chatbot, Medical question pair similarity computation	Code Available	1	5
CataractBot: An LLM-Powered Expert-in-the-Loop Chatbot for Cataract Patients	Feb 7, 2024	Chatbot	Code Available	1	5
How Robust is Google's Bard to Adversarial Image Attacks?	Sep 21, 2023	Adversarial Robustness, Chatbot	Code Available	1	5
Improving Your Model Ranking on Chatbot Arena by Vote Rigging	Jan 29, 2025	Chatbot	Code Available	1	5
Bring Your Own Data! Self-Supervised Evaluation for Large Language Models	Jun 23, 2023	Chatbot, Language Modeling	Code Available	1	5
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process	Jan 26, 2024	Chatbot, RAG	Code Available	1	5
Few Shot Dialogue State Tracking using Meta-learning	Jan 17, 2021	Chatbot, Dialogue State Tracking	Code Available	1	5
Causal Inference for Chatting Handoff	Oct 6, 2022	Causal Inference, Chatbot	Code Available	1	5
Inverse Constitutional AI: Compressing Preferences into Principles	Jun 2, 2024	Chatbot, Language Modelling	Code Available	1	5
BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational Bioimaging	Oct 23, 2023	Chatbot, Information Retrieval	Code Available	1	5
A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents	Feb 24, 2021	Chatbot, Gesture Generation	Code Available	1	5
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees	Mar 11, 2025	Chatbot, Language Modeling	Code Available	1	5
Automatic Evaluation and Moderation of Open-domain Dialogue Systems	Nov 3, 2021	Chatbot, Dialogue Evaluation	Code Available	1	5

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Yi 34B Chat	Average win rate	27.2	—	Unverified