SOTAVerified

Chatbot

Chatbot or conversational AI is a language model designed and implemented to have conversations with humans.

Source: Open Data Chatbot

Image source

Papers

Showing 951971 of 971 papers

TitleStatusHype
Code-Mixer Ya Nahi: Novel Approaches to Measuring Multilingual LLMs' Code-Mixing Capabilities0
Collaborative Evaluation of Deepfake Text with Deliberation-Enhancing Dialogue Systems0
Towards Efficient Educational Chatbots: Benchmarking RAG Frameworks0
Combining knowledge graphs and LLMs for hazardous chemical information management and reuse0
Combining Textual Content and Structure to Improve Dialog Similarity0
Comparative Analysis of GPT-4 and Human Graders in Evaluating Praise Given to Students in Synthetic Dialogues0
Comparing informativeness of an NLG chatbot vs graphical app in diet-information domain0
Comparing Software Developers with ChatGPT: An Empirical Investigation0
Comparing the Utility, Preference, and Performance of Course Material Search Functionality and Retrieval-Augmented Generation Large Language Model (RAG-LLM) AI Chatbots in Information-Seeking Tasks0
Comprehensive Audio Query Handling System with Integrated Expert Models and Contextual Understanding0
Comprehensive Lipidomic Automation Workflow using Large Language Models0
A Unified Pre-training Framework for Conversational AI0
Computer says 'no': Exploring systemic bias in ChatGPT using an audit approach0
Token Trails: Navigating Contextual Depths in Conversational AI with ChatLLM0
Confirmation Bias in Generative AI Chatbots: Mechanisms, Risks, Mitigation Strategies, and Future Research Directions0
WildVis: Open Source Visualizer for Million-Scale Chat Logs in the Wild0
Topic-Based Question Generation0
CONSCENDI: A Contrastive and Scenario-Guided Distillation Approach to Guardrail Models for Virtual Assistants0
ConSCompF: Consistency-focused Similarity Comparison Framework for Generative Large Language Models0
ConstitutionMaker: Interactively Critiquing Large Language Models by Converting Feedback into Principles0
Constructing Domain-Specific Evaluation Sets for LLM-as-a-judge0
Show:102550
← PrevPage 20 of 20Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Yi 34B ChatAverage win rate27.2Unverified