SOTAVerified

Chatbot

Chatbot or conversational AI is a language model designed and implemented to have conversations with humans.

Source: Open Data Chatbot

Image source

Papers

Showing 351375 of 971 papers

TitleStatusHype
Impact of Decoding Methods on Human Alignment of Conversational LLMs0
Interactive Learning in Computer Science Education Supported by a Discord ChatbotCode0
Enhancing Model Performance: Another Approach to Vision-Language Instruction Tuning0
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement0
WildHallucinations: Evaluating Long-form Factuality in LLMs with Real-World Entity Queries0
MoRSE: Bridging the Gap in Cybersecurity Expertise with Retrieval Augmented Generation0
Impacts of Anthropomorphizing Large Language Models in Learning Environments0
Chatbot-Based Ontology Interaction Using Large Language Models and Domain-Specific Standards0
Unipa-GPT: Large Language Models for university-oriented QA in ItalianCode0
Improving Engagement and Efficacy of mHealth Micro-Interventions for Stress Coping: an In-The-Wild Study0
Trust No Bot: Discovering Personal Disclosures in Human-LLM Conversations in the WildCode0
zIA: a GenAI-powered local auntie assists tourists in Italy0
Arena Learning: Build Data Flywheel for LLMs Post-training via Simulated Chatbot Arena0
A Chatbot for Asylum-Seeking Migrants in EuropeCode0
SoupLM: Model Integration in Large Language and Multi-Modal Models0
Analyzing Large language models chatbots: An experimental approach using a probability test0
Empirical Study of Symmetrical Reasoning in Conversational Chatbots0
Towards Optimizing and Evaluating a Retrieval Augmented QA Chatbot using LLMs with Human in the Loop0
Stephanie: Step-by-Step Dialogues for Mimicking Human Interactions in Social Conversations0
Zero-shot Persuasive Chatbots with LLM-Generated Strategies and Information Retrieval0
On the Benchmarking of LLMs for Open-Domain Dialogue Evaluation0
Improving Conversational Abilities of Quantized Large Language Models via Direct Preference Alignment0
Model-Enhanced LLM-Driven VUI Testing of VPA Apps0
Lightweight Large Language Model for Medication Enquiry: Med-Pal0
Self-Cognition in Large Language Models: An Exploratory Study0
Show:102550
← PrevPage 15 of 39Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Yi 34B ChatAverage win rate27.2Unverified