| Evaluating the Effectiveness of Retrieval-Augmented Large Language Models in Scientific Document Reasoning | Nov 7, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Text Creativity across Diverse Domains: A Dataset and Large Language Model Evaluator | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Steering Techniques using Human Similarity Judgments | May 25, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| A Reproducibility Study of Graph-Based Legal Case Retrieval | Apr 11, 2025 | Information RetrievalLarge Language Model | —Unverified | 0 | 0 |
| Evaluating Self-Generated Documents for Enhancing Retrieval-Augmented Generation with Large Language Models | Oct 17, 2024 | Language ModellingLarge Language Model | —Unverified | 0 | 0 |
| Chaining text-to-image and large language model: A novel approach for generating personalized e-commerce banners | Feb 28, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Nuanced Bias in Large Language Model Free Response Answers | Jul 11, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 | 0 |
| Evaluating Memory in LLM Agents via Incremental Multi-Turn Interactions | Jul 7, 2025 | Large Language ModelRAG | —Unverified | 0 | 0 |
| CFunModel: A "Funny" Language Model Capable of Chinese Humor Generation and Processing | Mar 26, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Are More LLM Calls All You Need? Towards Scaling Laws of Compound Inference Systems | Mar 4, 2024 | AllLanguage Modelling | —Unverified | 0 | 0 |
| Agents on the Bench: Large Language Model Based Multi Agent Framework for Trustworthy Digital Justice | Dec 24, 2024 | Decision MakingFairness | —Unverified | 0 | 0 |
| Who You Are Matters: Bridging Topics and Social Roles via LLM-Enhanced Logical Recommendation | May 16, 2025 | General KnowledgeLarge Language Model | —Unverified | 0 | 0 |
| Evaluating LLM-based Agents for Multi-Turn Conversations: A Survey | Mar 28, 2025 | Large Language Model | —Unverified | 0 | 0 |
| Evaluating LLaMA 3.2 for Software Vulnerability Detection | Mar 10, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Read and Think: An Efficient Step-wise Multimodal Language Model for Document Understanding and Reasoning | Feb 26, 2024 | Data Augmentationdocument understanding | —Unverified | 0 | 0 |
| Evaluating Large Language Model Creativity from a Literary Perspective | Nov 30, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Are Large Language Model-based Evaluators the Solution to Scaling Up Multilingual Evaluation? | Sep 14, 2023 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating Large Language Model Capability in Vietnamese Fact-Checking Data Generation | Nov 8, 2024 | Fact CheckingLanguage Modeling | —Unverified | 0 | 0 |
| Evaluating Large Language Model Capabilities in Assessing Spatial Econometrics Research | Jun 4, 2025 | counterfactualEconometrics | —Unverified | 0 | 0 |
| Evaluating Knowledge Graph Based Retrieval Augmented Generation Methods under Knowledge Incompleteness | Apr 7, 2025 | Knowledge GraphsLanguage Modeling | —Unverified | 0 | 0 |
| CFBenchmark-MM: Chinese Financial Assistant Benchmark for Multimodal Large Language Model | Jun 16, 2025 | Decision MakingFinancial Analysis | —Unverified | 0 | 0 |
| Are Human Conversations Special? A Large Language Model Perspective | Mar 8, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| Evaluating GPT-4 with Vision on Detection of Radiological Findings on Chest Radiographs | Mar 22, 2024 | DiagnosticLanguage Modeling | —Unverified | 0 | 0 |
| Integrating Diverse Knowledge Sources for Online One-shot Learning of Novel Tasks | Aug 19, 2022 | Language ModelingLanguage Modelling | —Unverified | 0 | 0 |
| CFaiRLLM: Consumer Fairness Evaluation in Large-Language Model Recommender System | Mar 8, 2024 | AttributeFairness | —Unverified | 0 | 0 |