SOTAVerified

Math

Papers

Showing 851900 of 1596 papers

TitleStatusHype
MindStar: Enhancing Math Reasoning in Pre-trained LLMs at Inference Time0
Mining Commonsense and Domain Knowledge from Math Word Problems0
Mining Math Conjectures from LLMs: A Pruning Approach0
Assignment Flows for Data Labeling on Graphs: Convergence and Stability0
Assessing the impact of social activity permissiveness on the COVID-19 infection curve of several countries0
Mixture of Parrots: Experts improve memorization more than reasoning0
ML2SC: Deploying Machine Learning Models as Smart Contracts on the Blockchain0
MME-CoT: Benchmarking Chain-of-Thought in Large Multimodal Models for Reasoning Quality, Robustness, and Efficiency0
Assessing the Impact of Prompting Methods on ChatGPT's Mathematical Capabilities0
MMTM: Multi-Tasking Multi-Decoder Transformer for Math Word Problems0
MNIST-Fraction: Enhancing Math Education with AI-Driven Fraction Detection and Analysis0
Modeling and Analyzing Scorer Preferences in Short-Answer Math Questions0
Where to find Grokking in LLM Pretraining? Monitor Memorization-to-Generalization without Test0
Modeling Student Response Times: Towards Efficient One-on-one Tutoring Dialogues0
Modelling silicosis: dynamics of a model with piecewise constant rate coefficients0
Models Can and Should Embrace the Communicative Nature of Human-Generated Math0
MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models0
MoL for LLMs: Dual-Loss Optimization to Enhance Domain Expertise While Preserving General Capabilities0
Assessing and Verifying Task Utility in LLM-Powered Applications0
More Thinking, Less Seeing? Assessing Amplified Hallucination in Multimodal Reasoning Models0
MSA at BEA 2025 Shared Task: Disagreement-Aware Instruction Tuning for Multi-Dimensional Evaluation of LLMs as Math Tutors0
Multi-lingual Functional Evaluation for Large Language Models0
Multimodal Assessment of Classroom Discourse Quality: A Text-Centered Attention-Based Multi-Task Learning Approach0
A Chinese Math Word Problem Solving System Based on Linguistic Theory and Non-statistical Approach0
Multi-Stage Pre-Training for Math-Understanding: ^2(AL)BERT0
Multi-Step Alignment as Markov Games: An Optimistic Online Gradient Descent Approach with Convergence Guarantees0
Multi-step Problem Solving Through a Verifier: An Empirical Analysis on Model-induced Process Supervision0
Multi-tool Integration Application for Math Reasoning Using Large Language Model0
Ask-Before-Detection: Identifying and Mitigating Conformity Bias in LLM-Powered Error Detector for Math Word Problem Solutions0
Tree-structured Decoding for Solving Math Word Problems0
A Rule-Based Computational Model of Cognitive Arithmetic0
MV-MATH: Evaluating Multimodal Math Reasoning in Multi-Visual Contexts0
Which Data Attributes Stimulate Math and Code Reasoning? An Investigation via Influence Functions0
MWP-BERT: Numeracy-Augmented Pre-training for Math Word Problem Solving0
MWPRanker: An Expression Similarity Based Math Word Problem Retriever0
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science0
TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games0
NaturalReasoning: Reasoning in the Wild with 2.8M Challenging Questions0
Natural- to formal-language generation using Tensor Product Representations0
Navigating the Labyrinth: Evaluating and Enhancing LLMs' Ability to Reason About Search Problems0
AceReason-Nemotron: Advancing Math and Code Reasoning through Reinforcement Learning0
NEOLAF, an LLM-powered neural-symbolic cognitive architecture0
"Turing Tests" For An AI Scientist0
Network psychometrics and cognitive network science open new ways for detecting, understanding and tackling the complexity of math anxiety: A review0
Neural Math Word Problem Solver with Reinforcement Learning0
Neural Symbolic Reader: Scalable Integration of Distributed and Symbolic Representations for Reading Comprehension0
Neuro-Symbolic Data Generation for Math Reasoning0
NLU for Game-based Learning in Real: Initial Evaluations0
No Free Lunch: Rethinking Internal Feedback for LLM Reasoning0
Arithmetic Reasoning with LLM: Prolog Generation & Permutation0
Show:102550
← PrevPage 18 of 32Next →

No leaderboard results yet.