| Deep Knowledge Tracing for Personalized Adaptive Learning at Historically Black Colleges and Universities | Oct 2, 2024 | Knowledge TracingMath | —Unverified | 0 |
| Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks | Oct 2, 2024 | MathNavigate | —Unverified | 0 |
| PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation | Oct 2, 2024 | Data AugmentationDiversity | —Unverified | 0 |
| Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo | Oct 2, 2024 | Math | —Unverified | 0 |
| Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Oct 2, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| Not All LLM Reasoners Are Created Equal | Oct 2, 2024 | AllCode Generation | —Unverified | 0 |
| Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge Tracing | Oct 2, 2024 | Contrastive LearningKnowledge Tracing | CodeCode Available | 0 |
| Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models | Oct 2, 2024 | Cross-Lingual TransferMath | —Unverified | 0 |
| Scheherazade: Evaluating Chain-of-Thought Math Reasoning in LLMs with Chain-of-Problems | Sep 30, 2024 | GSM8KMath | CodeCode Available | 0 |
| The Perfect Blend: Redefining RLHF with Mixture of Judges | Sep 30, 2024 | Instruction FollowingMath | —Unverified | 0 |
| Instance-adaptive Zero-shot Chain-of-Thought Prompting | Sep 30, 2024 | GSM8KMath | —Unverified | 0 |
| INC-Math: Integrating Natural Language and Code for Enhanced Mathematical Reasoning in Large Language Models | Sep 28, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| Revisiting the Superficial Alignment Hypothesis | Sep 27, 2024 | Instruction FollowingMath | —Unverified | 0 |
| On the Inductive Bias of Stacking Towards Improving Reasoning | Sep 27, 2024 | Inductive BiasMath | —Unverified | 0 |
| Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy | Sep 26, 2024 | Knowledge TracingMath | —Unverified | 0 |
| LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ | Sep 25, 2024 | ChatbotGSM8K | —Unverified | 0 |
| Democratizing Signal Processing and Machine Learning: Math Learning Equity for Elementary and Middle School Students | Sep 25, 2024 | Math | —Unverified | 0 |
| Models Can and Should Embrace the Communicative Nature of Human-Generated Math | Sep 25, 2024 | Math | —Unverified | 0 |
| PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning | Sep 25, 2024 | GSM8KMath | —Unverified | 0 |
| PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Sep 21, 2024 | MathText to SQL | CodeCode Available | 0 |
| Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-Tuning | Sep 20, 2024 | Language ModelingLanguage Modelling | CodeCode Available | 0 |
| ControlMath: Controllable Data Generation Promotes Math Generalist Models | Sep 20, 2024 | Data AugmentationDiversity | —Unverified | 0 |
| InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning | Sep 19, 2024 | MathMathematical Reasoning | —Unverified | 0 |
| GRIN: GRadient-INformed MoE | Sep 18, 2024 | HellaSwagHumanEval | —Unverified | 0 |
| Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement | Sep 18, 2024 | GSM8KMath | —Unverified | 0 |
| Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning | Sep 17, 2024 | Few-Shot LearningIn-Context Learning | CodeCode Available | 0 |
| NVLM: Open Frontier-Class Multimodal LLMs | Sep 17, 2024 | MathMultimodal Reasoning | —Unverified | 0 |
| GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students | Sep 16, 2024 | Math | —Unverified | 0 |
| Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia | Sep 13, 2024 | MathMultiple-choice | —Unverified | 0 |
| CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks | Sep 13, 2024 | ARCCode Generation | —Unverified | 0 |
| Knowledge Tagging with Large Language Model based Multi-Agent System | Sep 12, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Alignment with Preference Optimization Is All You Need for LLM Safety | Sep 12, 2024 | AllMath | —Unverified | 0 |
| Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models | Sep 11, 2024 | Language ModellingLarge Language Model | —Unverified | 0 |
| A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio | Sep 10, 2024 | Emotional IntelligenceMath | —Unverified | 0 |
| Mathematical Formalized Problem Solving and Theorem Proving in Different Fields in Lean 4 | Sep 9, 2024 | Abstract AlgebraAutomated Theorem Proving | CodeCode Available | 0 |
| Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs | Sep 4, 2024 | Mathparameter-efficient fine-tuning | —Unverified | 0 |
| Wavelet GPT: Wavelet Inspired Large Language Models | Sep 4, 2024 | DecoderMath | —Unverified | 0 |
| Building Math Agents with Multi-Turn Iterative Preference Learning | Sep 4, 2024 | GSM8KMath | —Unverified | 0 |
| Prompt Baking | Sep 4, 2024 | ARCGSM8K | —Unverified | 0 |
| More is More: Addition Bias in Large Language Models | Sep 4, 2024 | MathText Summarization | CodeCode Available | 0 |
| S^3c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners | Sep 3, 2024 | GSM8KMath | —Unverified | 0 |
| Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic | Aug 29, 2024 | GSM8KLanguage Modeling | —Unverified | 0 |
| Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems | Aug 29, 2024 | Math | —Unverified | 0 |
| Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity | Aug 29, 2024 | Code GenerationDiversity | —Unverified | 0 |
| SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models | Aug 28, 2024 | Data AugmentationGSM8K | —Unverified | 0 |
| Generative Verifiers: Reward Modeling as Next-Token Prediction | Aug 27, 2024 | MathPrediction | —Unverified | 0 |
| Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class | Aug 26, 2024 | Math | —Unverified | 0 |
| Multi-tool Integration Application for Math Reasoning Using Large Language Model | Aug 22, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Mathematical Information Retrieval: Search and Question Answering | Aug 21, 2024 | Information RetrievalMath | —Unverified | 0 |