| Loki: An Open-Source Tool for Fact Verification | Oct 2, 2024 | Claim VerificationFact Checking | CodeCode Available | 5 |
| Fake It Until You Break It: On the Adversarial Robustness of AI-generated Image Detectors | Oct 2, 2024 | Adversarial RobustnessMisinformation | CodeCode Available | 0 |
| ThreatGram 101 - Extreme Telegram Replies Data with Threat Levels | Sep 30, 2024 | Abusive LanguageHate Speech Detection | CodeCode Available | 0 |
| Wait, but Tylenol is Acetaminophen... Investigating and Improving Language Models' Ability to Resist Requests for Misinformation | Sep 30, 2024 | Logical ReasoningMisinformation | —Unverified | 0 |
| Multimodal Misinformation Detection by Learning from Synthetic Data with Multimodal LLMs | Sep 29, 2024 | Fact CheckingMisinformation | —Unverified | 0 |
| Explainable Artifacts for Synthetic Western Blot Source Attribution | Sep 27, 2024 | ArticlesMisinformation | CodeCode Available | 0 |
| Enhancing Guardrails for Safe and Secure Healthcare AI | Sep 25, 2024 | HallucinationMisinformation | —Unverified | 0 |
| FMDLlama: Financial Misinformation Detection based on Large Language Models | Sep 24, 2024 | Explanation GenerationInstruction Following | CodeCode Available | 0 |
| CHBench: A Chinese Dataset for Evaluating Health in Large Language Models | Sep 24, 2024 | Misinformation | CodeCode Available | 0 |
| XTRUST: On the Multilingual Trustworthiness of Large Language Models | Sep 24, 2024 | EthicsFairness | CodeCode Available | 1 |