SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 976–1000 of 1596 papers

Title	Date	Tasks	Status	Hype
Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning	Sep 17, 2024	Few-Shot LearningIn-Context Learning	CodeCode Available	0
NVLM: Open Frontier-Class Multimodal LLMs	Sep 17, 2024	MathMultimodal Reasoning	—Unverified	0
GPT takes the SAT: Tracing changes in Test Difficulty and Math Performance of Students	Sep 16, 2024	Math	—Unverified	0
Cracking the Code: Multi-domain LLM Evaluation on Real-World Professional Exams in Indonesia	Sep 13, 2024	MathMultiple-choice	—Unverified	0
CPL: Critical Plan Step Learning Boosts LLM Generalization in Reasoning Tasks	Sep 13, 2024	ARCCode Generation	—Unverified	0
Knowledge Tagging with Large Language Model based Multi-Agent System	Sep 12, 2024	Language ModelingLanguage Modelling	—Unverified	0
Alignment with Preference Optimization Is All You Need for LLM Safety	Sep 12, 2024	AllMath	—Unverified	0
Leveraging Unstructured Text Data for Federated Instruction Tuning of Large Language Models	Sep 11, 2024	Language ModellingLarge Language Model	—Unverified	0
A Practice of Post-Training on Llama-3 70B with Optimal Selection of Additional Language Mixture Ratio	Sep 10, 2024	Emotional IntelligenceMath	—Unverified	0
Mathematical Formalized Problem Solving and Theorem Proving in Different Fields in Lean 4	Sep 9, 2024	Abstract AlgebraAutomated Theorem Proving	CodeCode Available	0
Deconfounded Causality-aware Parameter-Efficient Fine-Tuning for Problem-Solving Improvement of LLMs	Sep 4, 2024	Mathparameter-efficient fine-tuning	—Unverified	0
Wavelet GPT: Wavelet Inspired Large Language Models	Sep 4, 2024	DecoderMath	—Unverified	0
Building Math Agents with Multi-Turn Iterative Preference Learning	Sep 4, 2024	GSM8KMath	—Unverified	0
Prompt Baking	Sep 4, 2024	ARCGSM8K	—Unverified	0
More is More: Addition Bias in Large Language Models	Sep 4, 2024	MathText Summarization	CodeCode Available	0
S^3c-Math: Spontaneous Step-level Self-correction Makes Large Language Models Better Mathematical Reasoners	Sep 3, 2024	GSM8KMath	—Unverified	0
Logic Contrastive Reasoning with Lightweight Large Language Model for Math Word Problems	Aug 29, 2024	GSM8KLanguage Modeling	—Unverified	0
Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic	Aug 29, 2024	GSM8KLanguage Modeling	—Unverified	0
Physics of Language Models: Part 2.2, How to Learn From Mistakes on Grade-School Math Problems	Aug 29, 2024	Math	—Unverified	0
Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity	Aug 29, 2024	Code GenerationDiversity	—Unverified	0
SIaM: Self-Improving Code-Assisted Mathematical Reasoning of Large Language Models	Aug 28, 2024	Data AugmentationGSM8K	—Unverified	0
Generative Verifiers: Reward Modeling as Next-Token Prediction	Aug 27, 2024	MathPrediction	—Unverified	0
Students' Perceived Roles, Opportunities, and Challenges of a Generative AI-powered Teachable Agent: A Case of Middle School Math Class	Aug 26, 2024	Math	—Unverified	0
Multi-tool Integration Application for Math Reasoning Using Large Language Model	Aug 22, 2024	Language ModelingLanguage Modelling	—Unverified	0
Mathematical Information Retrieval: Search and Question Answering	Aug 21, 2024	Information RetrievalMath	—Unverified	0

Show:10 25 50

← PrevPage 40 of 64Next →

No leaderboard results yet.