SOTAVerified

Chatbot

A chatbot, or conversational AI system, is a language model designed to hold conversations with humans.

Source: Open Data Chatbot

Papers

Showing 51-75 of 971 papers

Title | Status | Hype
Characteristic AI Agents via Large Language Models | Code | 1
ChatGPT: Jack of all trades, master of none | Code | 1
CHARM: Calibrating Reward Models With Chatbot Arena Scores | Code | 1
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark | Code | 1
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principles | Code | 1
Learning to Assist Humans without Inferring Rewards | Code | 1
Knowledge Graph-Driven Retrieval-Augmented Generation: Integrating Deepseek-R1 with Weaviate for Advanced Chatbot Applications | Code | 1
Learning Implicit User Profiles for Personalized Retrieval-Based Chatbot | Code | 1
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few Utterances | Code | 1
ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI | Code | 1
Hydragen: High-Throughput LLM Inference with Shared Prefixes | Code | 1
Improving Ontology Requirements Engineering with OntoChat and Participatory Prompting | Code | 1
HHH: An Online Medical Chatbot System based on Knowledge Graph and Hierarchical Bi-Directional Attention | Code | 1
CataractBot: An LLM-Powered Expert-in-the-Loop Chatbot for Cataract Patients | Code | 1
How Robust is Google's Bard to Adversarial Image Attacks? | Code | 1
Improving Your Model Ranking on Chatbot Arena by Vote Rigging | Code | 1
Bring Your Own Data! Self-Supervised Evaluation for Large Language Models | Code | 1
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process | Code | 1
Few Shot Dialogue State Tracking using Meta-learning | Code | 1
Causal Inference for Chatting Handoff | Code | 1
Inverse Constitutional AI: Compressing Preferences into Principles | Code | 1
BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational Bioimaging | Code | 1
A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents | Code | 1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Code | 1
Automatic Evaluation and Moderation of Open-domain Dialogue Systems | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Yi 34B Chat | Average win rate | 27.2 | - | Unverified