SOTAVerified

Math

Papers

Showing 901–925 of 1596 papers

Title	Date	Tasks	Status	Hype
Instance-adaptive Zero-shot Chain-of-Thought Prompting	Sep 30, 2024	GSM8K, Math	Unverified	0
Instruction-Following Pruning for Large Language Models	Jan 3, 2025	Instruction Following, Math	Unverified	0
Integer Networks for Data Compression with Latent-Variable Models	May 1, 2019	Data Compression, Math	Unverified	0
Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving	Feb 12, 2025	Math, multimodal interaction	Unverified	0
Interleaved Reasoning for Large Language Models via Reinforcement Learning	May 26, 2025	Logical Reasoning, Math	Unverified	0
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models	Dec 11, 2023	Diversity, Math	Unverified	0
Interpretable Factorization for Neural Network ECG Models	Jun 26, 2020	Math	Unverified	0
Interpretable Math Word Problem Solution Generation Via Step-by-step Planning	Jun 1, 2023	GSM8K, Language Modeling	Unverified	0
Intriguing Properties of Large Language and Vision Models	Oct 7, 2024	cross-modal alignment, Large Language Model	Unverified	0
Introducing the Mathematics Meme Repository	Oct 19, 2021	Math	Unverified	0
Introduction to Coresets: Accurate Coresets	Oct 19, 2019	Math	Unverified	0
Investigating Large Language Models in Diagnosing Students' Cognitive Skills in Math Problem-solving	Apr 1, 2025	Math	Unverified	0
Investigating Math Word Problems using Pretrained Multilingual Language Models	Jan 16, 2022	Machine Translation, Math	Unverified	0
Investigating Symbolic Capabilities of Large Language Models	May 21, 2024	Math, Navigate	Unverified	0
Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination	Jun 10, 2023	Math, Mathematical Reasoning	Unverified	0
Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thoughts Prompting	Sep 30, 2023	Math	Unverified	0
Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation	Feb 18, 2025	Diversity, Math	Unverified	0
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations	Apr 1, 2024	Benchmarking, Math	Unverified	0
Solving Functional Optimization with Deep Networks and Variational Principles	Oct 8, 2024	Math	Unverified	0
Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs	Jan 21, 2025	GSM8K, In-Context Learning	Unverified	0
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist	Jul 11, 2024	GSM8K, Math	Unverified	0
Iterative Reasoning Preference Optimization	Apr 30, 2024	ARC, GSM8K	Unverified	0
Yi-Lightning Technical Report	Dec 2, 2024	Chatbot, Large Language Model	Unverified	0
Adaptive Guidance Accelerates Reinforcement Learning of Reasoning Models	Jun 16, 2025	Math, reinforcement-learning	Unverified	0
JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation	Oct 22, 2024	Math	Unverified	0

No leaderboard results yet.