SOTAVerified

Math

Papers

Showing 15811590 of 1596 papers

TitleStatusHype
MIRB: Mathematical Information Retrieval BenchmarkCode0
Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-MakingCode0
Distinguishing affixoid formations from compoundsCode0
Mixed-R1: Unified Reward Perspective For Reasoning Capability in Multimodal Large Language ModelsCode0
Enhancing the Transformer with Explicit Relational Encoding for Math Problem SolvingCode0
AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and GuardrailsCode0
MMATH: A Multilingual Benchmark for Mathematical ReasoningCode0
Learning a Continue-Thinking Token for Enhanced Test-Time ScalingCode0
Algebra Error Classification with Large Language ModelsCode0
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMsCode0
Show:102550
← PrevPage 159 of 160Next →

No leaderboard results yet.