SOTAVerified

Math

Papers

Showing 801850 of 1596 papers

TitleStatusHype
Towards Thinking-Optimal Scaling of Test-Time Compute for LLM Reasoning0
MATHion: Solving Math Word Problems with Logically Consistent Problems0
Towards Tractable Mathematical Reasoning: Challenges, Strategies, and Opportunities for Solving Math Word Problems0
A Theme-Rewriting Approach for Generating Algebra Word Problems0
Math Multiple Choice Question Generation via Human-Large Language Model Collaboration0
Towards Trustworthy AutoGrading of Short, Multi-lingual, Multi-type Answers0
Atari games and Intel processors0
Math Operation Embeddings for Open-ended Solution Analysis and Feedback0
MATH-Perturb: Benchmarking LLMs' Math Reasoning Abilities against Hard Perturbations0
MathPhys-Guided Coarse-to-Fine Anomaly Synthesis with SQE-Driven Bi-Level Optimization for Anomaly Detection0
Trace-of-Thought Prompting: Investigating Prompt-Based Knowledge Distillation Through Question Decomposition0
math-PVS: A Large Language Model Framework to Map Scientific Publications to PVS Theories0
MathQA: Towards Interpretable Math Word Problem Solving with Operation-Based Formalisms0
Math Search for the Masses: Multimodal Search Interfaces and Appearance-Based Retrieval0
MathVC: An LLM-Simulated Multi-Character Virtual Classroom for Mathematics Education0
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?0
A Tag-based English Math Word Problem Solver with Understanding, Reasoning and Explanation0
When Dimensionality Reduction Meets Graph (Drawing) Theory: Introducing a Common Framework, Challenges and Opportunities0
When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs0
Math Word Problem Generation with Mathematical Consistency and Problem Context Constraints0
Matryoshka Model Learning for Improved Elastic Student Models0
Asymptotic expression for the fixation probability of a mutant in star graphs0
Maximizing Confidence Alone Improves Reasoning0
MDIT: A Model-free Data Interpolation Method for Diverse Instruction Tuning0
Training Large Language Models to Reason via EM Policy Gradient0
Measurement to Meaning: A Validity-Centered Framework for AI Evaluation0
Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning0
Measuring and Improving BERT's Mathematical Abilities by Predicting the Order of Reasoning.0
When Not to Answer: Evaluating Prompts on GPT Models for Effective Abstention in Unanswerable Math Word Problems0
Measuring Large Language Models Capacity to Annotate Journalistic Sourcing0
Asymptotic behavior of mean fixation times in the Moran process with frequency-independent fitnesses0
Mechanochemical models for calcium waves in embryonic epithelia0
To Err is Machine: Vulnerability Detection Challenges LLM Reasoning0
Med-RLVR: Emerging Medical Reasoning from a 3B base model via reinforcement Learning0
A Survey on Multimodal Large Language Models0
A Survey on Feedback-based Multi-step Reasoning for Large Language Models on Mathematics0
Translating a Math Word Problem to a Expression Tree0
Mental Stress Detection: Development and Evaluation of a Wearable In-Ear Plethysmography0
Metacognitive Capabilities of LLMs: An Exploration in Mathematical Problem Solving0
A Survey of Slow Thinking-based Reasoning LLMs using Reinforced Learning and Inference-time Scaling Law0
A Survey of Question Answering for Math and Science Problem0
INC-Math: Integrating Natural Language and Code for Enhanced Mathematical Reasoning in Large Language Models0
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges0
A Study on Leveraging Search and Self-Feedback for Agent Reasoning0
Metric-agnostic Ranking Optimization0
MIaS: Math-Aware Retrieval in Digital Mathematical Libraries0
MiLoRA: Efficient Mixture of Low-Rank Adaptation for Large Language Models Fine-tuning0
A Study of PHOC Spatial Region Configurations for Math Formula Retrieval0
MIND: Math Informed syNthetic Dialogues for Pretraining LLMs0
Mind meets machine: Unravelling GPT-4's cognitive psychology0
Show:102550
← PrevPage 17 of 32Next →

No leaderboard results yet.