SOTAVerified

Math

Papers

Showing 626650 of 1596 papers

TitleStatusHype
Mind Scramble: Unveiling Large Language Model Psychology Via TypoglycemiaCode0
MIRB: Mathematical Information Retrieval BenchmarkCode0
Meta-Reasoning Improves Tool Use in Large Language ModelsCode0
How Should We Enhance the Safety of Large Reasoning Models: An Empirical StudyCode0
How Is LLM Reasoning Distracted by Irrelevant Context? An Analysis Using a Controlled BenchmarkCode0
metboost: Exploratory regression analysis with hierarchically clustered dataCode0
How Do Humans Write Code? Large Models Do It the Same Way TooCode0
ConciseRL: Conciseness-Guided Reinforcement Learning for Efficient Reasoning ModelsCode0
Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-MakingCode0
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language ModelsCode0
MAWPS: A Math Word Problem RepositoryCode0
Heteroclinic cycling and extinction in May-Leonard models with demographic stochasticityCode0
ComSearch: Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak SupervisionCode0
Math Word Problem Solving by Generating Linguistic Variants of Problem StatementsCode0
Algebra Error Classification with Large Language ModelsCode0
Helpful assistant or fruitful facilitator? Investigating how personas affect language model behaviorCode0
ASyMOB: Algebraic Symbolic Mathematical Operations BenchmarkCode0
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical ReasoningCode0
Computationally Identifying Funneling and Focusing Questions in Classroom DiscourseCode0
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical BenchmarkCode0
Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal ModelsCode0
Compositional Processing Emerges in Neural Networks Solving Math ProblemsCode0
MathScale: Scaling Instruction Tuning for Mathematical ReasoningCode0
HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate ClassCode0
Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition ExtractionCode0
Show:102550
← PrevPage 26 of 64Next →

No leaderboard results yet.