| Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings | Mar 19, 2025 | Instruction FollowingLarge Language Model | CodeCode Available | 0 |
| Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gender and content bias in Large Language Models: a case study on Google Gemini 2.0 Flash Experimental | Mar 18, 2025 | FairnessLarge Language Model | —Unverified | 0 |
| MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | Mar 18, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Engineering Scientific Assistants using Interactive Structured Induction of Programs | Mar 18, 2025 | Large Language Model | —Unverified | 0 |
| SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |