SOTAVerified

Math

Papers

Showing 926950 of 1596 papers

TitleStatusHype
Evaluating and Optimizing Educational Content with Large Language Model JudgmentsCode0
Experimenting with Generative AI: Does ChatGPT Really Increase Everyone's Productivity?0
The Claude 3 Model Family: Opus, Sonnet, Haiku0
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training0
Brilla AI: AI Contestant for the National Science and Maths QuizCode1
Masked Thought: Simply Masking Partial Reasoning Steps Can Improve Mathematical Reasoning Learning of Language ModelsCode1
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning0
Improving the Validity of Automatically Generated Feedback via Reinforcement LearningCode1
ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data0
Functional Benchmarks for Robust Evaluation of Reasoning Performance, and the Reasoning GapCode2
PRSA: Prompt Stealing Attacks against Real-World Prompt Services0
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem SolversCode2
StarCoder 2 and The Stack v2: The Next GenerationCode7
Data Interpreter: An LLM Agent For Data Science0
Adversarial Math Word Problem GenerationCode0
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical ReasoningCode0
Case-Based or Rule-Based: How Do Transformers Do the Math?Code1
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs0
Stepwise Self-Consistent Mathematical Reasoning with Large Language ModelsCode1
How Do Humans Write Code? Large Models Do It the Same Way TooCode0
MATHWELL: Generating Educational Math Word Problems Using Teacher AnnotationsCode1
Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought ProcessesCode0
ConceptMath: A Bilingual Concept-wise Benchmark for Measuring Mathematical Reasoning of Large Language ModelsCode1
Measuring Multimodal Mathematical Reasoning with MATH-Vision DatasetCode2
MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models0
Show:102550
← PrevPage 38 of 64Next →

No leaderboard results yet.