SOTAVerified

Chatbot

Chatbot or conversational AI is a language model designed and implemented to have conversations with humans.

Source: Open Data Chatbot

Image source

Papers

Showing 926950 of 971 papers

TitleStatusHype
Evaluation of Large Language Models via Coupled Token GenerationCode0
SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its UsefulnessCode0
Evaluation of In-Person Counseling Strategies To Develop Physical Activity Chatbot for WomenCode0
Evaluation and Improvement of Chatbot Text Classification Data Quality Using Plausible Negative ExamplesCode0
Language Model Alignment with Elastic ResetCode0
Evaluating Natural Language Understanding Services for Conversational Question Answering SystemsCode0
Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False PresuppositionsCode0
Subword Semantic Hashing for Intent Classification on Small DatasetsCode0
ConvoCache: Smart Re-Use of Chatbot ResponsesCode0
A Statistical Framework for Ranking LLM-Based ChatbotsCode0
A Bi-Encoder LSTM Model For Learning Unstructured DialogsCode0
Auto-Arena: Automating LLM Evaluations with Agent Peer Battles and Committee DiscussionsCode0
A New Perspective on ADHD Research: Knowledge Graph Construction with LLMs and Network Based InsightsCode0
Can AI Relate: Testing Large Language Model Response for Mental Health SupportCode0
Layer-Level Self-Exposure and Patch: Affirmative Token Mitigation for Jailbreak Attack DefenseCode0
Bio-Eng-LMM AI Assist chatbot: A Comprehensive Tool for Research and EducationCode0
Learning from Dialogue after Deployment: Feed Yourself, Chatbot!Code0
ConveRT for FAQ AnsweringCode0
Learning From Free-Text Human Feedback -- Collect New Datasets Or Extend Existing Ones?Code0
Evaluating Large Language Models with Human Feedback: Establishing a Swedish BenchmarkCode0
Learning Improvised Chatbots from Adversarial Modifications of Natural Language FeedbackCode0
An Ontology-Based Dialogue Management System for Banking and Finance Dialogue SystemsCode0
Using Adaptive Empathetic Responses for Teaching EnglishCode0
ErAConD : Error Annotated Conversational Dialog Dataset for Grammatical Error CorrectionCode0
Learning to love diligent trolls: Accounting for rater effects in the dialogue safety taskCode0
Show:102550
← PrevPage 38 of 39Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Yi 34B ChatAverage win rate27.2Unverified