SOTAVerified|Agents Browse Leaderboard About Blog

Chatbot

Chatbot or conversational AI is a language model designed and implemented to have conversations with humans.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 26–50 of 971 papers

Title	Date	Tasks	Status	Hype
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education	Aug 5, 2023	ChatbotLanguage Modeling	CodeCode Available	2
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training	Mar 17, 2022	Chatbot	CodeCode Available	2
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development	May 22, 2025	Bug fixingChatbot	CodeCode Available	2
SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support	Apr 30, 2023	Chatbot	CodeCode Available	2
Ten Quick Tips for Harnessing the Power of ChatGPT/GPT-4 in Computational Biology	Mar 29, 2023	ChatbotPrompt Engineering	CodeCode Available	2
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models	May 24, 2023	ChatbotNatural Language Understanding	CodeCode Available	2
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning	Apr 18, 2022	ChatbotOffline RL	CodeCode Available	2
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding	Feb 14, 2024	ChatbotCode Generation	CodeCode Available	2
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts	Oct 3, 2023	ChatbotImage Captioning	CodeCode Available	2
MemoryBank: Enhancing Large Language Models with Long-Term Memory	May 17, 2023	Chatbot	CodeCode Available	2
Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction	Feb 28, 2024	ChatbotReconstruction Attack	CodeCode Available	2
LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation	Dec 28, 2023	Answer GenerationChatbot	CodeCode Available	2
Language Model Powered Digital Biology with BRAD	Sep 4, 2024	ChatbotCode Generation	CodeCode Available	2
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models	Jun 26, 2024	ChatbotRed Teaming	CodeCode Available	2
Few Shot Dialogue State Tracking using Meta-learning	Jan 17, 2021	ChatbotDialogue State Tracking	CodeCode Available	1
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process	Jan 26, 2024	ChatbotRAG	CodeCode Available	1
Faithful Persona-based Conversational Dataset Generation with Large Language Models	Dec 15, 2023	ChatbotDataset Generation	CodeCode Available	1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees	Mar 11, 2025	ChatbotLanguage Modeling	CodeCode Available	1
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems	Oct 15, 2021	ChatbotDialogue State Tracking	CodeCode Available	1
Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation	Jun 28, 2023	ChatbotDialogue Generation	CodeCode Available	1
Empathy-driven Arabic Conversational Chatbot	Dec 1, 2020	ChatbotDecoder	CodeCode Available	1
ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction	Jul 1, 2022	ChatbotGrammatical Error Correction	CodeCode Available	1
Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot Consistency	Jun 4, 2021	ChatbotNatural Language Inference	CodeCode Available	1
Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems	Dec 18, 2022	ChatbotDialogue Evaluation	CodeCode Available	1
Domain-specific ChatBots for Science using Embeddings	Jun 15, 2023	ChatbotDiversity	CodeCode Available	1

Show:10 25 50

← PrevPage 2 of 39Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	Yi 34B Chat	Average win rate	27.2	—	Unverified