| Entropy-based Exploration Conduction for Multi-step Reasoning | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 | Mar 20, 2025 | Large Language ModelLogical Reasoning | —Unverified | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs | Mar 20, 2025 | DiversityLarge Language Model | —Unverified | 0 |
| Cultural Alignment in Large Language Models Using Soft Prompt Tuning | Mar 20, 2025 | In-Context LearningLarge Language Model | —Unverified | 0 |
| Using Language Models to Decipher the Motivation Behind Human Behaviors | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities | Mar 20, 2025 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | Mar 20, 2025 | Large Language ModelText Generation | —Unverified | 0 |
| Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Probing the topology of the space of tokens with structured prompts | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation | Mar 19, 2025 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings | Mar 19, 2025 | Instruction FollowingLarge Language Model | CodeCode Available | 0 |
| Robust Transmission of Punctured Text with Large Language Model-based Recovery | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models | Mar 19, 2025 | Bayesian OptimizationCode Generation | —Unverified | 0 |
| GenM^3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation | Mar 19, 2025 | Large Language ModelMotion Generation | —Unverified | 0 |
| Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection | Mar 19, 2025 | Contrastive LearningDeepFake Detection | —Unverified | 0 |
| LEGION: Learning to Ground and Explain for Synthetic Image Detection | Mar 19, 2025 | Artifact DetectionImage Manipulation | —Unverified | 0 |
| MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | Mar 18, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Engineering Scientific Assistants using Interactive Structured Induction of Programs | Mar 18, 2025 | Large Language Model | —Unverified | 0 |
| Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Gricean Norms as a Basis for Effective Collaboration | Mar 18, 2025 | Large Language ModelNavigate | CodeCode Available | 0 |
| SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |