SOTAVerified

Chatbot

A chatbot, or conversational AI, is a language model designed and implemented to hold conversations with humans.

Source: Open Data Chatbot


Papers

Showing 1-50 of 971 papers

| Title | Status | Hype |
| --- | --- | --- |
| TuneShield: Mitigating Toxicity in Conversational AI while Fine-tuning on Untrusted Data | | 0 |
| Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Code | 0 |
| Exploring the Effects of Chatbot Anthropomorphism and Human Empathy on Human Prosocial Behavior Toward Chatbots | | 0 |
| Mapping Caregiver Needs to AI Chatbot Design: Strengths and Gaps in Mental Health Support for Alzheimer's and Dementia Caregivers | | 0 |
| ProfiLLM: An LLM-Based Framework for Implicit Profiling of Chatbot Users | | 0 |
| The Safety Reminder: A Soft Prompt to Reactivate Delayed Safety Awareness in Vision-Language Models | | 0 |
| Transforming Chatbot Text: A Sequence-to-Sequence Approach | | 0 |
| Building Trustworthy AI by Addressing its 16+2 Desiderata with Goal-Directed Commonsense Reasoning | | 0 |
| Understanding Human-AI Trust in Education | | 0 |
| Evaluating AI-Powered Learning Assistants in Engineering Higher Education: Student Engagement, Ethical Challenges, and Policy Implications | | 0 |
| "We need to avail ourselves of GenAI to enhance knowledge distribution": Empowering Older Adults through GenAI Literacy | | 0 |
| Urania: Differentially Private Insights into AI Use | | 0 |
| Privacy and Security Threat for OpenAI GPTs | | 0 |
| A Practical Approach for Building Production-Grade Conversational Agents with Workflow Graphs | | 0 |
| From Chat Logs to Collective Insights: Aggregative Question Answering | | 0 |
| Position: Uncertainty Quantification Needs Reassessment for Large-language Model Agents | | 0 |
| Design and testing of an agent chatbot supporting decision making with public transport data | | 0 |
| MEDAL: A Framework for Benchmarking LLMs as Multilingual Open-Domain Chatbots and Dialogue Evaluators | Code | 0 |
| Is Your LLM Overcharging You? Tokenization, Transparency, and Incentives | Code | 0 |
| Project Riley: Multimodal Multi-Agent LLM Collaboration with Emotional Reasoning and Voting | | 0 |
| The Impact of a Chatbot's Ephemerality-Framing on Self-Disclosure Perceptions | | 0 |
| A Fully Generative Motivational Interviewing Counsellor Chatbot for Moving Smokers Towards the Decision to Quit | Code | 0 |
| SWE-Dev: Evaluating and Training Autonomous Feature-Driven Software Development | Code | 2 |
| X-MAS: Towards Building Multi-Agent Systems with Heterogeneous LLMs | Code | 0 |
| Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought | | 0 |
| AI vs. Human Judgment of Content Moderation: LLM-as-a-Judge and Ethics-Based Response Refusals | | 0 |
| What is Stigma Attributed to? A Theory-Grounded, Expert-Annotated Interview Corpus for Demystifying Mental-Health Stigma | Code | 1 |
| Decentralized Arena: Towards Democratic and Scalable Automatic Evaluation of Language Models | Code | 1 |
| Enhancing User-Oriented Proactivity in Open-Domain Dialogues with Critic Guidance | | 0 |
| Let's have a chat with the EU AI Act | | 0 |
| GenAI Security: Outsmarting the Bots with a Proactive Testing Framework | | 0 |
| WaLLM -- Insights from an LLM-Powered Chatbot deployment via WhatsApp | | 0 |
| An empathic GPT-based chatbot to talk about mental disorders with Spanish teenagers | | 0 |
| Large Language Models are often politically extreme, usually ideologically inconsistent, and persuasive even in informational contexts | | 0 |
| Steerable Chatbots: Personalizing LLMs with Preference-Based Activation Steering | | 0 |
| A Proposal for Evaluating the Operational Risk for ChatBots based on Large Language Models | | 0 |
| LlamaFirewall: An open source guardrail system for building secure AI agents | | 0 |
| Social Biases in Knowledge Representations of Wikidata separates Global North from Global South | Code | 0 |
| LLaMA-Omni2: LLM-based Real-time Spoken Chatbot with Autoregressive Streaming Speech Synthesis | Code | 3 |
| Emotions in the Loop: A Survey of Affective Computing for Emotional Support | | 0 |
| Enhancing ML Model Interpretability: Leveraging Fine-Tuned Large Language Models for Better Understanding of AI | | 0 |
| The Leaderboard Illusion | | 0 |
| Chatbot Arena Meets Nuggets: Towards Explanations and Diagnostics in the Evaluation of LLM Responses | | 0 |
| AI Chatbots for Mental Health: Values and Harms from Lived Experiences of Depression | | 0 |
| Scaling Laws For Scalable Oversight | | 0 |
| Cancer-Myth: Evaluating AI Chatbot on Patient Questions with False Presuppositions | Code | 0 |
| CHARM: Calibrating Reward Models With Chatbot Arena Scores | Code | 1 |
| Confirmation Bias in Generative AI Chatbots: Mechanisms, Risks, Mitigation Strategies, and Future Research Directions | | 0 |
| Learning from Elders: Making an LLM-powered Chatbot for Retirement Communities more Accessible through User-centered Design | | 0 |
| Data Requirement Goal Modeling for Machine Learning Systems | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | Yi 34B Chat | Average win rate | 27.2 | | Unverified |