| Modifying Large Language Model Post-Training for Diverse Creative Writing | Mar 21, 2025 | DiversityLanguage Modeling | CodeCode Available | 2 |
| Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation | Mar 21, 2025 | Click-Through Rate PredictionContrastive Learning | —Unverified | 0 |
| Language Models May Verbatim Complete Text They Were Not Explicitly Trained On | Mar 21, 2025 | Large Language Model | —Unverified | 0 |
| Improving Quantization with Post-Training Model Expansion | Mar 21, 2025 | Large Language Modelmodel | —Unverified | 0 |
| Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent | Mar 21, 2025 | Large Language ModelPrivacy Preserving | —Unverified | 0 |
| Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks | Mar 21, 2025 | ArticlesBinary Classification | —Unverified | 0 |
| Variance Control via Weight Rescaling in LLM Pre-training | Mar 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Mar 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities | Mar 20, 2025 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object DetectionDistributed Computing | CodeCode Available | 1 |
| Entropy-based Exploration Conduction for Multi-step Reasoning | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination | Mar 20, 2025 | BenchmarkingLarge Language Model | CodeCode Available | 1 |
| Using Language Models to Decipher the Motivation Behind Human Behaviors | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Cultural Alignment in Large Language Models Using Soft Prompt Tuning | Mar 20, 2025 | In-Context LearningLarge Language Model | —Unverified | 0 |
| LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | Mar 20, 2025 | Large Language ModelText Generation | —Unverified | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| ChatGPT and U(X): A Rapid Review on Measuring the User Experience | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 | Mar 20, 2025 | Large Language ModelLogical Reasoning | —Unverified | 0 |
| Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs | Mar 20, 2025 | DiversityLarge Language Model | —Unverified | 0 |
| GenM^3: Generative Pretrained Multi-path Motion Model for Text Conditional Human Motion Generation | Mar 19, 2025 | Large Language ModelMotion Generation | —Unverified | 0 |
| Unlocking the Capabilities of Vision-Language Models for Generalizable and Explainable Deepfake Detection | Mar 19, 2025 | Contrastive LearningDeepFake Detection | —Unverified | 0 |
| Robust Transmission of Punctured Text with Large Language Model-based Recovery | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Probing the topology of the space of tokens with structured prompts | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| LEGION: Learning to Ground and Explain for Synthetic Image Detection | Mar 19, 2025 | Artifact DetectionImage Manipulation | —Unverified | 0 |
| Aligning Crowd-sourced Human Feedback for Reinforcement Learning on Code Generation by Large Language Models | Mar 19, 2025 | Bayesian OptimizationCode Generation | —Unverified | 0 |
| UPME: An Unsupervised Peer Review Framework for Multimodal Large Language Model Evaluation | Mar 19, 2025 | Language Model EvaluationLanguage Modeling | —Unverified | 0 |
| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Does Context Matter? ContextualJudgeBench for Evaluating LLM-based Judges in Contextual Settings | Mar 19, 2025 | Instruction FollowingLarge Language Model | CodeCode Available | 0 |
| Leveraging MoE-based Large Language Model for Zero-Shot Multi-Task Semantic Communication | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Gender and content bias in Large Language Models: a case study on Google Gemini 2.0 Flash Experimental | Mar 18, 2025 | FairnessLarge Language Model | —Unverified | 0 |
| Enabling Inclusive Systematic Reviews: Incorporating Preprint Articles with Large Language Model-Driven Evaluations | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| Towards a Barrier-free GeoQA Portal: Natural Language Interaction with Geospatial Data Using Multi-Agent LLMs and Semantic Search | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | Mar 18, 2025 | Decision MakingLanguage Modeling | —Unverified | 0 |
| Good/Evil Reputation Judgment of Celebrities by LLMs via Retrieval Augmented Generation | Mar 18, 2025 | ArticlesLanguage Modeling | —Unverified | 0 |
| SpaceVLLM: Endowing Multimodal Large Language Model with Spatio-Temporal Video Grounding Capability | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Engineering Scientific Assistants using Interactive Structured Induction of Programs | Mar 18, 2025 | Large Language Model | —Unverified | 0 |
| Gricean Norms as a Basis for Effective Collaboration | Mar 18, 2025 | Large Language ModelNavigate | CodeCode Available | 0 |
| The Empty Chair: Using LLMs to Raise Missing Perspectives in Policy Deliberations | Mar 18, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| KVShare: An LLM Service System with Efficient and Effective Multi-Tenant KV Cache Reuse | Mar 17, 2025 | DiversityLanguage Modeling | —Unverified | 0 |
| HiDe-LLaVA: Hierarchical Decoupling for Continual Instruction Tuning of Multimodal Large Language Model | Mar 17, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| AccelGen: Heterogeneous SLO-Guaranteed High-Throughput LLM Inference Serving for Diverse Applications | Mar 17, 2025 | ChunkingGPU | —Unverified | 0 |
| Pensez: Less Data, Better Reasoning -- Rethinking French LLM | Mar 17, 2025 | Large Language ModelMath | —Unverified | 0 |
| Mitigating KV Cache Competition to Enhance User Experience in LLM Inference | Mar 17, 2025 | Large Language Model | —Unverified | 0 |
| Analytic Subspace Routing: How Recursive Least Squares Works in Continual Learning of Large Language Model | Mar 17, 2025 | Continual LearningLanguage Modeling | —Unverified | 0 |
| Knowledge-Aware Iterative Retrieval for Multi-Agent Systems | Mar 17, 2025 | Evidence SelectionLarge Language Model | —Unverified | 0 |
| PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing | Mar 17, 2025 | DenoisingLanguage Modeling | —Unverified | 0 |
| Lifelong Reinforcement Learning with Similarity-Driven Weighting by Large Models | Mar 17, 2025 | Large Language Modelreinforcement-learning | —Unverified | 0 |