| Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning | Nov 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating the Effect of Retrieval Augmentation on Social Biases | Feb 24, 2025 | Large Language ModelQuestion Answering | —Unverified | 0 |
| Evaluating the Efficacy of LLM-Based Reasoning for Multiobjective HPC Job Scheduling | May 29, 2025 | Computational EfficiencyFairness | —Unverified | 0 |
| Evaluating The Performance of Using Large Language Models to Automate Summarization of CT Simulation Orders in Radiation Oncology | Jan 27, 2025 | Large Language Model | —Unverified | 0 |
| Measuring the Quality of Answers in Political Q&As with Large Language Models | Apr 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluating Voice Command Pipelines for Drone Control: From STT and LLM to Direct Classification and Siamese Networks | Jul 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluation of AI Chatbots for Patient-Specific EHR Questions | Jun 5, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers | Jun 7, 2023 | Document ClassificationLanguage Modeling | —Unverified | 0 |
| Evaluation of large language model performance on the Biomedical Language Understanding and Reasoning Benchmark | May 17, 2024 | Document ClassificationLanguage Modeling | —Unverified | 0 |