| When Persuasion Overrides Truth in Multi-Agent LLM Debates: Introducing a Confidence-Weighted Persuasion Override Rate (CW-POR) | Apr 1, 2025 | Language Modeling | Unverified | 0 |
| BEATS: Bias Evaluation and Assessment Test Suite for Large Language Models | Mar 31, 2025 | Ethics, Fairness | Unverified | 0 |
| A Multi-Agent Framework with Automated Decision Rule Optimization for Cross-Domain Misinformation Detection | Mar 30, 2025 | Misinformation | Unverified | 0 |
| Identifying Multi-modal Knowledge Neurons in Pretrained Transformers via Two-stage Filtering | Mar 29, 2025 | Caption Generation, knowledge editing | Unverified | 0 |
| A Framework for Cryptographic Verifiability of End-to-End AI Pipelines | Mar 28, 2025 | Misinformation | Unverified | 0 |
| Susceptibility of Large Language Models to User-Driven Factors in Medical Queries | Mar 26, 2025 | Diagnostic, MedQA | Unverified | 0 |
| Detection of Somali-written Fake News and Toxic Messages on the Social Media Using Transformer-based Language Models | Mar 23, 2025 | Language Modeling | Unverified | 0 |
| Safe RLHF-V: Safe Reinforcement Learning from Human Feedback in Multimodal Large Language Models | Mar 22, 2025 | Misinformation, Safe Reinforcement Learning | Unverified | 0 |
| Deceptive Humor: A Synthetic Multilingual Benchmark Dataset for Bridging Fabricated Claims with Humorous Content | Mar 20, 2025 | Humor Detection, Misinformation | Unverified | 0 |
| ChatGPT or A Silent Everywhere Helper: A Survey of Large Language Models | Mar 19, 2025 | Misinformation | Unverified | 0 |
| Entity-aware Cross-lingual Claim Detection for Automated Fact-checking | Mar 19, 2025 | Entity Linking, Fact Checking | Code Available | 0 |
| Iffy-Or-Not: Extending the Web to Support the Critical Evaluation of Fallacious Texts | Mar 18, 2025 | Misinformation | Unverified | 0 |
| Reliable and Efficient Amortized Model-based Evaluation | Mar 17, 2025 | Diagnostic, Mathematical Reasoning | Unverified | 0 |
| Team NYCU at Defactify4: Robust Detection and Source Identification of AI-Generated Images Using CNN and CLIP-Based Models | Mar 13, 2025 | Misinformation | Code Available | 0 |
| VaxGuard: A Multi-Generator, Multi-Type, and Multi-Role Dataset for Detecting LLM-Generated Vaccine Misinformation | Mar 12, 2025 | Misinformation, Text Generation | Unverified | 0 |
| Battling Misinformation: An Empirical Study on Adversarial Factuality in Open-Source Large Language Models | Mar 12, 2025 | Misinformation | Unverified | 0 |
| How to Protect Yourself from 5G Radiation? Investigating LLM Responses to Implicit Misinformation | Mar 12, 2025 | counterfactual, Misconceptions | Code Available | 0 |
| Certainly Bot Or Not? Trustworthy Social Bot Detection via Robust Multi-Modal Neural Processes | Mar 11, 2025 | Misinformation | Unverified | 0 |
| A Graph-based Verification Framework for Fact-Checking | Mar 10, 2025 | Fact Checking, graph construction | Unverified | 0 |
| TH-Bench: Evaluating Evading Attacks via Humanizing AI Text on Machine-Generated Text Detectors | Mar 10, 2025 | Misinformation | Unverified | 0 |
| Simulating Influence Dynamics with LLM Agents | Mar 10, 2025 | Misinformation | Unverified | 0 |
| Fine-Grained Bias Detection in LLM: Enhancing detection mechanisms for nuanced biases | Mar 8, 2025 | Bias Detection, counterfactual | Unverified | 0 |
| Evaluating open-source Large Language Models for automated fact-checking | Mar 7, 2025 | Fact Checking, Misinformation | Unverified | 0 |
| Maximum Hallucination Standards for Domain-Specific Large Language Models | Mar 7, 2025 | Attribute, Hallucination | Unverified | 0 |
| SafeArena: Evaluating the Safety of Autonomous Web Agents | Mar 6, 2025 | Misinformation, Safety Alignment | Unverified | 0 |