| Does the Prompt-based Large Language Model Recognize Students' Demographics and Introduce Bias in Essay Scoring? | Apr 30, 2025 | Automated Essay Scoring, Fairness | Unverified | 0 |
| DeepSeek-Prover-V2: Advancing Formal Mathematical Reasoning via Reinforcement Learning for Subgoal Decomposition | Apr 30, 2025 | Automated Theorem Proving, Large Language Model | Code Available | 5 |
| MF-LLM: Simulating Population Decision Dynamics via a Mean-Field Large Language Model Framework | Apr 30, 2025 | Decision Making, Language Modeling | Code Available | 1 |
| Consistency-aware Fake Videos Detection on Short Video Platforms | Apr 30, 2025 | Large Language Model, Multimodal Large Language Model | Code Available | 0 |
| Iterative Tool Usage Exploration for Multimodal Agents via Step-wise Preference Tuning | Apr 30, 2025 | Large Language Model | Unverified | 0 |
| LLM Enhancer: Merged Approach using Vector Embedding for Reducing Large Language Model Hallucinations with External Knowledge | Apr 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Detecting Manipulated Contents Using Knowledge-Grounded Inference | Apr 29, 2025 | Claim Verification, Fact Checking | Code Available | 0 |
| CoCo-Bench: A Comprehensive Code Benchmark For Multi-task Large Language Model Evaluation | Apr 29, 2025 | Code Generation, Language Model Evaluation | Unverified | 0 |
| Computational Reasoning of Large Language Models | Apr 29, 2025 | Code Generation, Language Modeling | Code Available | 0 |
| BrAIcht, a theatrical agent that speaks like Bertolt Brecht's characters | Apr 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| WenyanGPT: A Large Language Model for Classical Chinese Tasks | Apr 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| A Framework to Assess the Persuasion Risks Large Language Model Chatbots Pose to Democratic Societies | Apr 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| LLM-Enabled EV Charging Stations Recommendation | Apr 29, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| Cognitive maps are generative programs | Apr 29, 2025 | Computational Efficiency, Large Language Model | Unverified | 0 |
| GVPO: Group Variance Policy Optimization for Large Language Model Post-Training | Apr 28, 2025 | Language Modeling, Language Modelling | Unverified | 0 |
| AutoJudge: Judge Decoding Without Manual Annotation | Apr 28, 2025 | GSM8K, Large Language Model | Unverified | 0 |
| Fitness Landscape of Large Language Model-Assisted Automated Algorithm Search | Apr 28, 2025 | Combinatorial Optimization, Language Modeling | Unverified | 0 |
| PhenoAssistant: A Conversational Multi-Agent AI System for Automated Plant Phenotyping | Apr 28, 2025 | Language Modeling, Language Modelling | Code Available | 1 |
| semi-PD: Towards Efficient LLM Serving via Phase-Wise Disaggregated Computation and Unified Storage | Apr 28, 2025 | GPU, Large Language Model | Unverified | 0 |
| Intelligent4DSE: Optimizing High-Level Synthesis Design Space Exploration with Graph Neural Networks and Large Language Models | Apr 28, 2025 | Evolutionary Algorithms, Graph Neural Network | Unverified | 0 |
| An Automated Reinforcement Learning Reward Design Framework with Large Language Model for Cooperative Platoon Coordination | Apr 28, 2025 | Code Generation, Hallucination | Unverified | 0 |
| Leveraging LLM to Strengthen ML-Based Cross-Site Scripting Detection | Apr 28, 2025 | Large Language Model | Unverified | 0 |
| CodeBC: A More Secure Large Language Model for Smart Contract Code Generation in Blockchain | Apr 28, 2025 | Code Generation, Language Modeling | Code Available | 0 |
| SPC: Evolving Self-Play Critic via Adversarial Games for LLM Reasoning | Apr 27, 2025 | Large Language Model, Mathematical Reasoning | Unverified | 0 |
| GenTorrent: Scaling Large Language Model Serving with An Overley Network | Apr 27, 2025 | Language Modeling, Language Modelling | Unverified | 0 |