SOTAVerified

Chatbot

Chatbot or conversational AI is a language model designed and implemented to have conversations with humans.

Source: Open Data Chatbot

Image source

Papers

Showing 51100 of 971 papers

TitleStatusHype
Pchatbot: A Large-Scale Dataset for Personalized ChatbotCode1
PLATO-2: Towards Building an Open-Domain Chatbot via Curriculum LearningCode1
One Chatbot Per Person: Creating Personalized Chatbots based on Implicit User ProfilesCode1
MIRACLE: Towards Personalized Dialogue Generation with Latent-Space Multiple Personal Attribute ControlCode1
Conversations Are Not Flat: Modeling the Dynamic Information Flow across Dialogue UtterancesCode1
Diagnosing Infeasible Optimization Problems Using Large Language ModelsCode1
A Jailbroken GenAI Model Can Cause Substantial Harm: GenAI-powered Applications are Vulnerable to PromptWaresCode1
Domain-specific ChatBots for Science using EmbeddingsCode1
OntoChatGPT Information System: Ontology-Driven Structured Prompts for ChatGPT Meta-LearningCode1
Prompted LLMs as Chatbot Modules for Long Open-domain ConversationCode1
Red Teaming Language Models with Language ModelsCode1
Measuring and Controlling Instruction (In)Stability in Language Model DialogsCode1
Citation-Enhanced Generation for LLM-based ChatbotsCode1
Meet Your Favorite Character: Open-domain Chatbot Mimicking Fictional Characters with only a Few UtterancesCode1
A Qualitative Evaluation of Language Models on Automatic Question-Answering for COVID-19Code1
ChatGPT Chemistry Assistant for Text Mining and Prediction of MOF SynthesisCode1
ChatGPT: Jack of all trades, master of noneCode1
LLM Roleplay: Simulating Human-Chatbot InteractionCode1
CHARM: Calibrating Reward Models With Chatbot Arena ScoresCode1
Characteristic AI Agents via Large Language ModelsCode1
Learning Implicit User Profiles for Personalized Retrieval-Based ChatbotCode1
K-PLUG: KNOWLEDGE-INJECTED PRE-TRAINED LANGUAGE MODEL FOR NATURAL LANGUAGE UNDERSTANDING AND GENERATIONCode1
Learning to Assist Humans without Inferring RewardsCode1
LLM4TS: Aligning Pre-Trained LLMs as Data-Efficient Time-Series ForecastersCode1
metaCAT: A Metadata-based Task-oriented Chatbot Annotation ToolCode1
Improving Your Model Ranking on Chatbot Arena by Vote RiggingCode1
Improving Ontology Requirements Engineering with OntoChat and Participatory PromptingCode1
Inverse Constitutional AI: Compressing Preferences into PrinciplesCode1
Causal Inference for Chatting HandoffCode1
KoDialogBench: Evaluating Conversational Understanding of Language Models with Korean Dialogue BenchmarkCode1
CataractBot: An LLM-Powered Expert-in-the-Loop Chatbot for Cataract PatientsCode1
Bring Your Own Data! Self-Supervised Evaluation for Large Language ModelsCode1
A Framework for Integrating Gesture Generation Models into Interactive Conversational AgentsCode1
BioImage.IO Chatbot: A Community-Driven AI Assistant for Integrative Computational BioimagingCode1
How Robust is Google's Bard to Adversarial Image Attacks?Code1
HHH: An Online Medical Chatbot System based on Knowledge Graph and Hierarchical Bi-Directional AttentionCode1
Hydragen: High-Throughput LLM Inference with Shared PrefixesCode1
Few-Shot Bot: Prompt-Based Learning for Dialogue SystemsCode1
Automatic Evaluation and Moderation of Open-domain Dialogue SystemsCode1
Few Shot Dialogue State Tracking using Meta-learningCode1
EvalTree: Profiling Language Model Weaknesses via Hierarchical Capability TreesCode1
CLASS: A Design Framework for building Intelligent Tutoring Systems based on Learning Science principlesCode1
A Recipe For Building a Compliant Real Estate ChatbotCode1
Compress to Impress: Unleashing the Potential of Compressive Memory in Real-World Long-Term ConversationsCode1
Faithful Persona-based Conversational Dataset Generation with Large Language ModelsCode1
Assigning personality/identity to a chatting machine for coherent conversation generationCode1
MKA: A Scalable Medical Knowledge Assisted Mechanism for Generative Models on Medical Conversation TasksCode1
MTSI-BERT: A Session-aware Knowledge-based Conversational AgentCode1
Addressing Inquiries about History: An Efficient and Practical Framework for Evaluating Open-domain Chatbot ConsistencyCode1
ConvAbuse: Data, Analysis, and Benchmarks for Nuanced Abuse Detection in Conversational AICode1
Show:102550
← PrevPage 2 of 20Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Yi 34B ChatAverage win rate27.2Unverified