SOTAVerified

Chatbot

A chatbot, or conversational AI, is a language model designed to hold conversations with humans.

Source: Open Data Chatbot


Papers

Showing 901–950 of 971 papers

Title | Status | Hype
The Illusion of Empathy: How AI Chatbots Shape Conversation Perception | Code | 0
DAGFiNN: A Conversational Conference Assistant | Code | 0
MulMarker: a comprehensive framework for identifying multi-gene prognostic signatures | Code | 0
Chatbot for admissions | Code | 0
Interactive Learning in Computer Science Education Supported by a Discord Chatbot | Code | 0
Assessing Political Prudence of Open-domain Chatbots | Code | 0
The impact of responding to patient messages with large language model assistance | Code | 0
Exploring ChatGPT's Empathic Abilities | Code | 0
Multi-Turn Response Selection for Chatbots with Deep Attention Matching Network | Code | 0
BoilerBot: A reliable task-oriented chatbot enhanced with large language models | Code | 0
ChaI-TeA: A Benchmark for Evaluating Autocompletion of Interactions with LLM-based Chatbots | Code | 0
Structured Dialogue System for Mental Health: An LLM Chatbot Leveraging the PM+ Guidelines | Code | 0
Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives | Code | 0
Neural Generation Meets Real People: Building a Social, Informative Open-Domain Dialogue Agent | Code | 0
AI Predicts AGI: Leveraging AGI Forecasting and Peer Review to Explore LLMs' Complex Reasoning Capabilities | Code | 0
CASS: Towards Building a Social-Support Chatbot for Online Health Community | Code | 0
Exploiting Persona Information for Diverse Generation of Conversational Responses | Code | 0
RICoTA: Red-teaming of In-the-wild Conversation with Test Attempts | Code | 0
Can You Follow Me? Testing Situational Understanding in ChatGPT | Code | 0
Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the Wild | Code | 0
Evaluator for Emotionally Consistent Chatbots | Code | 0
KatzBot: Revolutionizing Academic Chatbot for Enhanced Communication | Code | 0
A Chatbot for Asylum-Seeking Migrants in Europe | Code | 0
RV4Chatbot: Are Chatbots Allowed to Dream of Electric Sheep? | Code | 0
KL Penalty Control via Perturbation for Direct Preference Optimization | Code | 0
Evaluation of Large Language Models via Coupled Token Generation | Code | 0
SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness | Code | 0
Evaluation of In-Person Counseling Strategies To Develop Physical Activity Chatbot for Women | Code | 0
Evaluation and Improvement of Chatbot Text Classification Data Quality Using Plausible Negative Examples | Code | 0
Language Model Alignment with Elastic Reset | Code | 0
Evaluating Natural Language Understanding Services for Conversational Question Answering Systems | Code | 0
Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions | Code | 0
Subword Semantic Hashing for Intent Classification on Small Datasets | Code | 0
ConvoCache: Smart Re-Use of Chatbot Responses | Code | 0
A Statistical Framework for Ranking LLM-Based Chatbots | Code | 0
A Bi-Encoder LSTM Model For Learning Unstructured Dialogs | Code | 0
Auto-Arena: Automating LLM Evaluations with Agent Peer Battles and Committee Discussions | Code | 0
A New Perspective on ADHD Research: Knowledge Graph Construction with LLMs and Network Based Insights | Code | 0
Can AI Relate: Testing Large Language Model Response for Mental Health Support | Code | 0
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack Defense | Code | 0
Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and Education | Code | 0
Learning from Dialogue after Deployment: Feed Yourself, Chatbot! | Code | 0
ConveRT for FAQ Answering | Code | 0
Learning From Free-Text Human Feedback -- Collect New Datasets Or Extend Existing Ones? | Code | 0
Evaluating Large Language Models with Human Feedback: Establishing a Swedish Benchmark | Code | 0
Learning Improvised Chatbots from Adversarial Modifications of Natural Language Feedback | Code | 0
An Ontology-Based Dialogue Management System for Banking and Finance Dialogue Systems | Code | 0
Using Adaptive Empathetic Responses for Teaching English | Code | 0
ErAConD : Error Annotated Conversational Dialog Dataset for Grammatical Error Correction | Code | 0
Learning to love diligent trolls: Accounting for rater effects in the dialogue safety task | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Yi 34B Chat | Average win rate | 27.2 | | Unverified