SOTAVerified

Chatbot

A chatbot, or conversational AI, is a language model designed and implemented to hold conversations with humans.

Source: Open Data Chatbot

Papers

Showing 26-50 of 971 papers

Title | Status | Hype
EduChat: A Large-Scale Language Model-based Chatbot System for Intelligent Education | Code | 2
EVA2.0: Investigating Open-Domain Chinese Dialogue Systems with Large-Scale Pre-Training | Code | 2
SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Code | 2
SMILE: Single-turn to Multi-turn Inclusive Language Expansion via ChatGPT for Mental Health Support | Code | 2
Ten Quick Tips for Harnessing the Power of ChatGPT/GPT-4 in Computational Biology | Code | 2
Cheap and Quick: Efficient Vision-Language Instruction Tuning for Large Language Models | Code | 2
CHAI: A CHatbot AI for Task-Oriented Dialogue with Offline Reinforcement Learning | Code | 2
SafeDecoding: Defending against Jailbreak Attacks via Safety-Aware Decoding | Code | 2
MathVista: Evaluating Mathematical Reasoning of Foundation Models in Visual Contexts | Code | 2
MemoryBank: Enhancing Large Language Models with Long-Term Memory | Code | 2
Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction | Code | 2
LLM4EDA: Emerging Progress in Large Language Models for Electronic Design Automation | Code | 2
Language Model Powered Digital Biology with BRAD | Code | 2
WildTeaming at Scale: From In-the-Wild Jailbreaks to (Adversarially) Safer Language Models | Code | 2
Few Shot Dialogue State Tracking using Meta-learning | Code | 1
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process | Code | 1
Faithful Persona-based Conversational Dataset Generation with Large Language Models | Code | 1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Code | 1
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems | Code | 1
Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation | Code | 1
Empathy-driven Arabic Conversational Chatbot | Code | 1
ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction | Code | 1
Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot Consistency | Code | 1
Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems | Code | 1
Domain-specific ChatBots for Science using Embeddings | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Yi 34B Chat | Average win rate | 27.2 | - | Unverified