SOTAVerified

Chatbot

A chatbot, or conversational AI, is a language model designed and implemented to hold conversations with humans.

Source: Open Data Chatbot

Papers

Showing 51–100 of 971 papers

Title | Status | Hype
MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute Control | Code | 1
LLM4TS: Aligning Pre-Trained LLMs as Data-Efficient Time-Series Forecasters | Code | 1
MTSI-BERT: A Session-aware Knowledge-based Conversational Agent | Code | 1
One Chatbot Per Person: Creating Personalized Chatbots based on Implicit User Profiles | Code | 1
LLM Roleplay: Simulating Human-Chatbot Interaction | Code | 1
Measuring and Controlling Instruction (In)Stability in Language Model Dialogs | Code | 1
A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered Applications are Vulnerable to PromptWares | Code | 1
MiniGPT-Pancreas: Multimodal Large Language Model for Pancreas Cancer Classification and Detection | Code | 1
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum Learning | Code | 1
Red Teaming Language Models with Language Models | Code | 1
Representing Rule-based Chatbots with Transformers | Code | 1
Retail-GPT: leveraging Retrieval Augmented Generation (RAG) for building E-commerce Chat Assistants | Code | 1
A Recipe For Building a Compliant Real Estate Chatbot | Code | 1
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue Benchmark | Code | 1
Learning Implicit User Profiles for Personalized Retrieval-Based Chatbot | Code | 1
Inverse Constitutional AI: Compressing Preferences into Principles | Code | 1
Hydragen: High-Throughput LLM Inference with Shared Prefixes | Code | 1
Improving Ontology Requirements Engineering with OntoChat and Participatory Prompting | Code | 1
Learning to Assist Humans without Inferring Rewards | Code | 1
From RAG to QA-RAG: Integrating Generative AI for Pharmaceutical Regulatory Compliance Process | Code | 1
Few-Shot Bot: Prompt-Based Learning for Dialogue Systems | Code | 1
HHH: An Online Medical Chatbot System based on Knowledge Graph and Hierarchical Bi-Directional Attention | Code | 1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability Trees | Code | 1
Asking Questions the Human Way: Scalable Question-Answer Generation from Text Corpus | Code | 1
Enhancing Dialogue Generation via Dynamic Graph Knowledge Aggregation | Code | 1
Faithful Persona-based Conversational Dataset Generation with Large Language Models | Code | 1
How Robust is Google's Bard to Adversarial Image Attacks? | Code | 1
Domain-specific ChatBots for Science using Embeddings | Code | 1
Diagnosing Infeasible Optimization Problems Using Large Language Models | Code | 1
Empathy-driven Arabic Conversational Chatbot | Code | 1
ErAConD: Error Annotated Conversational Dialog Dataset for Grammatical Error Correction | Code | 1
Don't Forget Your ABC's: Evaluating the State-of-the-Art in Chat-Oriented Dialogue Systems | Code | 1
A Framework for Integrating Gesture Generation Models into Interactive Conversational Agents | Code | 1
Few Shot Dialogue State Tracking using Meta-learning | Code | 1
Designing a Dashboard for Transparency and Control of Conversational AI | Code | 1
ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AI | Code | 1
Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term Conversations | Code | 1
Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue Utterances | Code | 1
A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19 | Code | 1
Improving Your Model Ranking on Chatbot Arena by Vote Rigging | Code | 1
Citation-Enhanced Generation for LLM-based Chatbots | Code | 1
Knowledge Graph-Driven Retrieval-Augmented Generation: Integrating Deepseek-R1 with Weaviate for Advanced Chatbot Applications | Code | 1
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principles | Code | 1
K-PLUG: KNOWLEDGE-INJECTED PRE-TRAINED LANGUAGE MODEL FOR NATURAL LANGUAGE UNDERSTANDING AND GENERATION | Code | 1
Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models | Code | 1
CHARM: Calibrating Reward Models With Chatbot Arena Scores | Code | 1
Characteristic AI Agents via Large Language Models | Code | 1
Causal Inference for Chatting Handoff | Code | 1
Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot Consistency | Code | 1
CataractBot: An LLM-Powered Expert-in-the-Loop Chatbot for Cataract Patients | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Yi 34B Chat | Average win rate | 27.2 | - | Unverified