| Modeling the Human Visual System: Comparative Insights from Response-Optimized and Task-Optimized Vision Models, Language Models, and different Readout Mechanisms | Oct 17, 2024 | cross-modal alignment, Large Language Model | Unverified | 0 |
| LLM Agent Honeypot: Monitoring AI Hacking Agents in the Wild | Oct 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Detecting AI-Generated Texts in Cross-Domains | Oct 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Comparing the Utility, Preference, and Performance of Course Material Search Functionality and Retrieval-Augmented Generation Large Language Model (RAG-LLM) AI Chatbots in Information-Seeking Tasks | Oct 17, 2024 | Chatbot, Language Modeling | Unverified | 0 |
| MIRAGE-Bench: Automatic Multilingual Benchmark Arena for Retrieval-Augmented Generation Systems | Oct 17, 2024 | Answer Generation, Language Modeling | Code Available | 1 |
| MedINST: Meta Dataset of Biomedical Instructions | Oct 17, 2024 | Language Modeling, Language Modelling | Code Available | 0 |
| Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models | Oct 17, 2024 | Language Modelling, Large Language Model | Unverified | 0 |
| Collaborative AI in Sentiment Analysis: System Architecture, Data Prediction and Deployment Strategies | Oct 17, 2024 | Language Modeling, Language Modelling | Unverified | 0 |
| Help Me Identify: Is an LLM+VQA System All We Need to Identify Visual Concepts? | Oct 17, 2024 | All, Language Modeling | Code Available | 0 |
| Trust but Verify: Programmatic VLM Evaluation in the Wild | Oct 17, 2024 | Benchmarking, Language Modelling | Unverified | 0 |