| Enabling High-Sparsity Foundational Llama Models with Efficient Pretraining and Deployment | May 6, 2024 | Arithmetic Reasoning, Code Generation | —Unverified | 0 |
| Evaluating Gender Bias in Large Language Models via Chain-of-Thought Prompting | Jan 28, 2024 | Arithmetic Reasoning, Fact Checking | —Unverified | 0 |
| Exploring Group and Symmetry Principles in Large Language Models | Feb 9, 2024 | Arithmetic Reasoning, Negation | —Unverified | 0 |
| Fact-Consistency Evaluation of Text-to-SQL Generation for Business Intelligence Using Exaone 3.5 | Apr 30, 2025 | Arithmetic Reasoning, Text to SQL | —Unverified | 0 |
| Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together | Jul 15, 2024 | Arithmetic Reasoning, Language Modeling | —Unverified | 0 |
| FinLMM-R1: Enhancing Financial Reasoning in LMM through Scalable Data and Reward Design | Jun 16, 2025 | Answer Generation, Arithmetic Reasoning | —Unverified | 0 |
| GaLore+: Boosting Low-Rank Adaptation for LLMs with Cross-Head Projection | Dec 15, 2024 | Arithmetic Reasoning, Text Generation | —Unverified | 0 |
| On-Policy Distillation of Language Models: Learning from Self-Generated Mistakes | Jun 23, 2023 | Arithmetic Reasoning, Knowledge Distillation | —Unverified | 0 |
| Hint Marginalization for Improved Reasoning in Large Language Models | Dec 17, 2024 | Arithmetic Reasoning | —Unverified | 0 |
| Inference-Time Computations for LLM Reasoning and Planning: A Benchmark and Insights | Feb 18, 2025 | Arithmetic Reasoning, Common Sense Reasoning | —Unverified | 0 |