| Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning | Nov 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Steering Techniques using Human Similarity Judgments | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| A Reproducibility Study of Graph-Based Legal Case Retrieval | Apr 11, 2025 | Information RetrievalLarge Language Model | —Unverified | 0 | 0 |
| Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models | Oct 17, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners | Feb 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Nuanced Bias in Large Language Model Free Response Answers | Jul 11, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 | 0 |
| Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions | Jul 7, 2025 | Large Language ModelRAG | —Unverified | 0 | 0 |
| CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems | Mar 4, 2024 | AllLanguage Modelling | —Unverified | 0 | 0 |
| Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice | Dec 24, 2024 | Decision MakingFairness | —Unverified | 0 | 0 |
| Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation | May 16, 2025 | General KnowledgeLarge Language Model | —Unverified | 0 | 0 |
| Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey | Mar 28, 2025 | Large Language Model | —Unverified | 0 | 0 |
| Evaluating LLaMA 3.2 for Software Vulnerability Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | Feb 26, 2024 | Data Augmentationdocument understanding | —Unverified | 0 | 0 |
| Evaluating Large Language Model Creativity from a Literary Perspective | Nov 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? | Sep 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Nov 8, 2024 | Fact CheckingLanguage Modeling | —Unverified | 0 | 0 |
| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactualEconometrics | —Unverified | 0 | 0 |
| Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness | Apr 7, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 | 0 |
| CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | Jun 16, 2025 | Decision MakingFinancial Analysis | —Unverified | 0 | 0 |
| Are Human Conversations Special? A Large Language Model Perspective | Mar 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating GPT-4 with Vision on Detection of Radiological Findings on Chest Radiographs | Mar 22, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 | 0 |
| Integrating Diverse Knowledge Sources for Online One-shot Learning of Novel Tasks | Aug 19, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System | Mar 8, 2024 | AttributeFairness | —Unverified | 0 | 0 |
| Evaluating Consistencies in LLM responses through a Semantic Clustering of Question Answering | Oct 20, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| CephGPT-4: An Interactive Multimodal Cephalometric Measurement and Diagnostic System with Visual Large Language Model | Jul 1, 2023 | DiagnosticLanguage Modeling | —Unverified | 0 | 0 |
| ACE: All-round Creator and Editor Following Instructions via Diffusion Transformer | Sep 30, 2024 | AllLarge Language Model | —Unverified | 0 | 0 |
| Evaluating ChatGPT text-mining of clinical records for obesity monitoring | Aug 3, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Chatbots to Promote Users' Trust -- Practices and Open Problems | Sep 9, 2023 | ChatbotLanguage Modeling | —Unverified | 0 | 0 |
| Evaluating Apple Intelligence's Writing Tools for Privacy Against Large Language Model-Based Inference Attacks: Insights from Early Datasets | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| CDEMapper: Enhancing NIH Common Data Element Normalization using Large Language Models | Nov 30, 2024 | Large Language Model | —Unverified | 0 | 0 |
| EvalLM: Interactive Evaluation of Large Language Model Prompts on User-Defined Criteria | Sep 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| CCoE: A Compact LLM with Collaboration of Experts | Jul 16, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| A Red Teaming Roadmap Towards System-Level Safety | May 30, 2025 | Large Language ModelRed Teaming | —Unverified | 0 | 0 |
| EuroLLM-9B: Technical Report | Jun 4, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| ETimeline: An Extensive Timeline Generation Dataset based on Large Language Model | Feb 11, 2025 | ArticlesLanguage Modeling | —Unverified | 0 | 0 |
| EtC: Temporal Boundary Expand then Clarify for Weakly Supervised Video Grounding with Multimodal Large Language Model | Dec 5, 2023 | Boundary DetectionLanguage Modeling | —Unverified | 0 | 0 |
| CBT-LLM: A Chinese Large Language Model for Cognitive Behavioral Therapy-based Mental Health Question Answering | Mar 24, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Estimating Contribution Quality in Online Deliberations Using a Large Language Model | Aug 21, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Arch-LLM: Taming LLMs for Neural Architecture Generation via Unsupervised Discrete Representation Learning | Mar 28, 2025 | Large Language ModelNeural Architecture Search | —Unverified | 0 | 0 |
| Agents for self-driving laboratories applied to quantum computing | Dec 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Accuracy of a Large Language Model in Distinguishing Anti- And Pro-vaccination Messages on Social Media: The Case of Human Papillomavirus Vaccination | Apr 10, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Explain What You Mean: Intent Augmented Knowledge Graph Recommender Built With An LLM | May 16, 2025 | Knowledge GraphsLarge Language Model | —Unverified | 0 | 0 |
| Infusing Environmental Captions for Long-Form Video Language Grounding | Aug 5, 2024 | FormLanguage Modeling | —Unverified | 0 | 0 |
| E-Sparse: Boosting the Large Language Model Inference through Entropy-based N:M Sparsity | Oct 24, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| ESLM: Risk-Averse Selective Language Modeling for Efficient Pretraining | May 26, 2025 | Knowledge DistillationLanguage Modeling | —Unverified | 0 | 0 |
| Escaping Collapse: The Strength of Weak Data for Large Language Model Training | Feb 13, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| ERABAL: Enhancing Role-Playing Agents through Boundary-Aware Learning | Sep 23, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Data Augmentations for Improved (Large) Language Model Generalization | Oct 19, 2023 | Attributecounterfactual | —Unverified | 0 | 0 |