SOTAVerified

Math

Papers

Showing 9511000 of 1596 papers

TitleStatusHype
Deep Knowledge Tracing for Personalized Adaptive Learning at Historically Black Colleges and Universities0
Can We Further Elicit Reasoning in LLMs? Critic-Guided Planning with Retrieval-Augmentation for Solving Challenging Tasks0
PersonaMath: Enhancing Math Reasoning through Persona-Driven Data Augmentation0
Step-by-Step Reasoning for Math Problems via Twisted Sequential Monte Carlo0
Mind Scramble: Unveiling Large Language Model Psychology Via TypoglycemiaCode0
Not All LLM Reasoners Are Created Equal0
Automated Knowledge Concept Annotation and Question Representation Learning for Knowledge TracingCode0
Layer Swapping for Zero-Shot Cross-Lingual Transfer in Large Language Models0
Scheherazade: Evaluating Chain-of-Thought Math Reasoning in LLMs with Chain-of-ProblemsCode0
The Perfect Blend: Redefining RLHF with Mixture of Judges0
Instance-adaptive Zero-shot Chain-of-Thought Prompting0
INC-Math: Integrating Natural Language and Code for Enhanced Mathematical Reasoning in Large Language Models0
Revisiting the Superficial Alignment Hypothesis0
On the Inductive Bias of Stacking Towards Improving Reasoning0
Learning to Love Edge Cases in Formative Math Assessment: Using the AMMORE Dataset and Chain-of-Thought Prompting to Improve Grading Accuracy0
LLaMa-SciQ: An Educational Chatbot for Answering Science MCQ0
Democratizing Signal Processing and Machine Learning: Math Learning Equity for Elementary and Middle School Students0
Models Can and Should Embrace the Communicative Nature of Human-Generated Math0
PMSS: Pretrained Matrices Skeleton Selection for LLM Fine-tuning0
PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQLCode0
Beyond Accuracy Optimization: Computer Vision Losses for Large Language Model Fine-TuningCode0
ControlMath: Controllable Data Generation Promotes Math Generalist Models0
InfiMM-WebMath-40B: Advancing Multimodal Pre-Training for Enhanced Mathematical Reasoning0
GRIN: GRadient-INformed MoE0
Qwen2.5-Math Technical Report: Toward Mathematical Expert Model via Self-Improvement0
Reasoning Graph Enhanced Exemplars Retrieval for In-Context LearningCode0
NVLM: Open Frontier-Class Multimodal LLMs0
GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students0
Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia0
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks0
Knowledge Tagging with Large Language Model based Multi-Agent System0
Alignment with Preference Optimization Is All You Need for LLM Safety0
Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models0
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio0
Mathematical Formalized Problem Solving and Theorem Proving in Different Fields in Lean 4Code0
Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs0
Wavelet GPT: Wavelet Inspired Large Language Models0
Building Math Agents with Multi-Turn Iterative Preference Learning0
Prompt Baking0
More is More: Addition Bias in Large Language ModelsCode0
S^3c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners0
Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems0
Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic0
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity0
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models0
Generative Verifiers: Reward Modeling as Next-Token Prediction0
Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class0
Multi-tool Integration Application for Math Reasoning Using Large Language Model0
Mathematical Information Retrieval: Search and Question Answering0
Show:102550
← PrevPage 20 of 32Next →

No leaderboard results yet.