SOTAVerified

Math

Papers

Showing 1376–1400 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| Leveraging Label Semantics and Meta-Label Refinement for Multi-Label Question Classification | Code | 0 |
| PT-MoE: An Efficient Finetuning Framework for Integrating Mixture-of-Experts into Prompt Tuning | Code | 0 |
| An algorithm to represent inbreeding trees | Code | 0 |
| What Makes Math Word Problems Challenging for LLMs? | Code | 0 |
| Leveraging Training Data in Few-Shot Prompting for Numerical Reasoning | Code | 0 |
| AgentSwift: Efficient LLM Agent Design via Value-guided Hierarchical Search | Code | 0 |
| Leveraging Web-Crawled Data for High-Quality Fine-Tuning | Code | 0 |
| StepMathAgent: A Step-Wise Agent for Evaluating Mathematical Processes through Tree-of-Error | Code | 0 |
| Self-Demos: Eliciting Out-of-Demonstration Generalizability in Large Language Models | Code | 0 |
| Library Learning Doesn't: The Curious Case of the Single-Use "Library" | Code | 0 |
| AutoMSC: Automatic Assignment of Mathematics Subject Classification Labels | Code | 0 |
| From Euler to AI: Unifying Formulas for Mathematical Constants | Code | 0 |
| A safety realignment framework via subspace-oriented model fusion for large language models | Code | 0 |
| TreeRPO: Tree Relative Policy Optimization | Code | 0 |
| A large language model-assisted education tool to provide feedback on open-ended responses | Code | 0 |
| Linguistic Generalizability of Test-Time Scaling in Mathematical Reasoning | Code | 0 |
| DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions | Code | 0 |
| Automatic Short Math Answer Grading via In-context Meta-learning | Code | 0 |
| The Matrix Calculus You Need For Deep Learning | Code | 0 |
| An extrapolated and provably convergent algorithm for nonlinear matrix decomposition with the ReLU function | Code | 0 |
| Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors | Code | 0 |
| Llama SLayer 8B: Shallow Layers Hold the Key to Knowledge Injection | Code | 0 |
| Taxonomy of Mathematical Plagiarism | Code | 0 |
| LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation | Code | 0 |
| GATE: Graph-based Adaptive Tool Evolution Across Diverse Tasks | Code | 0 |

No leaderboard results yet.