| Modifying Large Language Model Post-Training for Diverse Creative Writing | Mar 21, 2025 | DiversityLanguage Modeling | CodeCode Available | 2 |
| Autonomous Radiotherapy Treatment Planning Using DOLA: A Privacy-Preserving, LLM-Based Optimization Agent | Mar 21, 2025 | Large Language ModelPrivacy Preserving | —Unverified | 0 |
| Improving Quantization with Post-Training Model Expansion | Mar 21, 2025 | Large Language Modelmodel | —Unverified | 0 |
| Assessing Consistency and Reproducibility in the Outputs of Large Language Models: Evidence Across Diverse Finance and Accounting Tasks | Mar 21, 2025 | ArticlesBinary Classification | —Unverified | 0 |
| Federated Cross-Domain Click-Through Rate Prediction With Large Language Model Augmentation | Mar 21, 2025 | Click-Through Rate PredictionContrastive Learning | —Unverified | 0 |
| Language Models May Verbatim Complete Text They Were Not Explicitly Trained On | Mar 21, 2025 | Large Language Model | —Unverified | 0 |
| Variance Control via Weight Rescaling in LLM Pre-training | Mar 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| CVE-Bench: A Benchmark for AI Agents' Ability to Exploit Real-World Web Application Vulnerabilities | Mar 21, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 2 |
| How Robust Are Router-LLMs? Analysis of the Fragility of LLM Routing Capabilities | Mar 20, 2025 | General KnowledgeLanguage Modeling | CodeCode Available | 0 |
| Code Evolution Graphs: Understanding Large Language Model Driven Design of Algorithms | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Distributed LLMs and Multimodal Large Language Models: A Survey on Advances, Challenges, and Future Directions | Mar 20, 2025 | 2D Object DetectionDistributed Computing | CodeCode Available | 1 |
| Entropy-based Exploration Conduction for Multi-step Reasoning | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination | Mar 20, 2025 | BenchmarkingLarge Language Model | CodeCode Available | 1 |
| Cultural Alignment in Large Language Models Using Soft Prompt Tuning | Mar 20, 2025 | In-Context LearningLarge Language Model | —Unverified | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| LLM Braces: Straightening Out LLM Predictions with Relevant Sub-Updates | Mar 20, 2025 | Large Language ModelText Generation | —Unverified | 0 |
| Video-VoT-R1: An efficient video inference model integrating image packing and AoE architecture | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Using Language Models to Decipher the Motivation Behind Human Behaviors | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| ChatGPT and U(X): A Rapid Review on Measuring the User Experience | Mar 20, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Mar 20, 2025 | Decision MakingLanguage Modeling | CodeCode Available | 4 |
| Bridging Technology and Humanities: Evaluating the Impact of Large Language Models on Social Sciences Research with DeepSeek-R1 | Mar 20, 2025 | Large Language ModelLogical Reasoning | —Unverified | 0 |
| Unify and Triumph: Polyglot, Diverse, and Self-Consistent Generation of Unit Tests with LLMs | Mar 20, 2025 | DiversityLarge Language Model | —Unverified | 0 |
| SWEET-RL: Training Multi-Turn LLM Agents on Collaborative Reasoning Tasks | Mar 19, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 3 |
| Probing the topology of the space of tokens with structured prompts | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Robust Transmission of Punctured Text with Large Language Model-based Recovery | Mar 19, 2025 | Language ModelingLanguage Modelling | —Unverified | 0 |