SOTAVerified|Agents Browse Leaderboard About

Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1126–1150 of 1596 papers

Title	Date	Tasks	Status	Hype
MathScale: Scaling Instruction Tuning for Mathematical Reasoning	Mar 5, 2024	GSM8KMath	CodeCode Available	0
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning	Mar 4, 2024	GSM8KMath	—Unverified	0
The Claude 3 Model Family: Opus, Sonnet, Haiku	Mar 4, 2024	1 Image, 2*2 StitchingArithmetic Reasoning	—Unverified	0
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training	Mar 4, 2024	MathPhrase Grounding	—Unverified	0
Experimenting with Generative AI: Does ChatGPT Really Increase Everyone's Productivity?	Mar 4, 2024	EconometricsMath	—Unverified	0
ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data	Mar 1, 2024	Math	—Unverified	0
PRSA: Prompt Stealing Attacks against Real-World Prompt Services	Feb 29, 2024	Math	—Unverified	0
Data Interpreter: An LLM Agent For Data Science	Feb 28, 2024	Code GenerationLanguage Modelling	—Unverified	0
Adversarial Math Word Problem Generation	Feb 27, 2024	Math	CodeCode Available	0
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning	Feb 27, 2024	8kLanguage Modeling	CodeCode Available	0
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs	Feb 26, 2024	GSM8KMath	—Unverified	0
How Do Humans Write Code? Large Models Do It the Same Way Too	Feb 24, 2024	Code GenerationMath	CodeCode Available	0
Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought Processes	Feb 23, 2024	MathMathematical Reasoning	CodeCode Available	0
MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models	Feb 20, 2024	Common Sense ReasoningContrastive Learning	—Unverified	0
LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks	Feb 18, 2024	Math	—Unverified	0
Orca-Math: Unlocking the potential of SLMs in Grade School Math	Feb 16, 2024	Arithmetic ReasoningGSM8K	—Unverified	0
Mathematical Opportunities in Digital Twins (MATH-DT)	Feb 15, 2024	Math	—Unverified	0
Language Models with Conformal Factuality Guarantees	Feb 15, 2024	Conformal PredictionLanguage Modeling	—Unverified	0
AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails	Feb 14, 2024	Language ModelingLanguage Modelling	CodeCode Available	0
Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications	Feb 14, 2024	Math	—Unverified	0
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements	Feb 13, 2024	GSM8KMath	—Unverified	0
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages	Feb 12, 2024	Automated Theorem ProvingBenchmarking	—Unverified	0
Understanding the Progression of Educational Topics via Semantic Matching	Feb 10, 2024	Math	—Unverified	0
V-STaR: Training Verifiers for Self-Taught Reasoners	Feb 9, 2024	Code GenerationMath	—Unverified	0
In-Context Principle Learning from Mistakes	Feb 8, 2024	GSM8KIn-Context Learning	CodeCode Available	0

Show:10 25 50

← PrevPage 46 of 64Next →

No leaderboard results yet.