Math

Papers

Showing 1101–1125 of 1596 papers

Title	Date	Tasks	Status	Hype
Scaling up ridge regression for brain encoding in a massive individual fMRI dataset	Mar 28, 2024	CPU, Math	Code Available	0
Large Language Models Are Struggle to Cope with Unreasonability in Math Problems	Mar 28, 2024	Math	Unverified	0
ML2SC: Deploying Machine Learning Models as Smart Contracts on the Blockchain	Mar 28, 2024	Math	Unverified	0
Few-Shot Recalibration of Language Models	Mar 27, 2024	Math, MMLU	Unverified	0
The Invalsi Benchmarks: measuring Linguistic and Mathematical understanding of Large Language Models in Italian	Mar 27, 2024	Language Modelling, Math	Unverified	0
Automate Knowledge Concept Tagging on Math Questions with LLMs	Mar 26, 2024	Few-Shot Learning, Math	Unverified	0
To Err is Machine: Vulnerability Detection Challenges LLM Reasoning	Mar 25, 2024	Code Generation, In-Context Learning	Unverified	0
From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision	Mar 21, 2024	Math	Unverified	0
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science	Mar 21, 2024	Active Learning, Math	Unverified	0
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?	Mar 21, 2024	Math, Mathematical Reasoning	Unverified	0
PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation?	Mar 20, 2024	Abstractive Text Summarization, Continual Pretraining	Unverified	0
Instructing Large Language Models to Identify and Ignore Irrelevant Conditions	Mar 19, 2024	Math, Mathematical Reasoning	Code Available	0
An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem	Mar 17, 2024	Diversity, Evolutionary Algorithms	Unverified	0
What Makes Math Word Problems Challenging for LLMs?	Mar 17, 2024	Math	Code Available	0
Incorporating Graph Attention Mechanism into Geometric Problem Solving Based on Deep Reinforcement Learning	Mar 14, 2024	Deep Reinforcement Learning, Graph Attention	Code Available	0
Sabiá-2: A New Generation of Portuguese Large Language Models	Mar 14, 2024	Math	Unverified	0
Hydrodynamics of Markets: Hidden Links Between Physics and Finance	Mar 14, 2024	Math	Unverified	0
Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks	Mar 14, 2024	Math, Skill Generalization	Unverified	0
Self-Consistency Boosts Calibration for Math Reasoning	Mar 14, 2024	GSM8K, Math	Unverified	0
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models	Mar 13, 2024	Math	Unverified	0
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models	Mar 12, 2024	Math, Mathematical Problem-Solving	Code Available	0
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models	Mar 12, 2024	Math, Mathematical Reasoning	Unverified	0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Mar 12, 2024	Arithmetic Reasoning, Code Generation	Unverified	0
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem	Mar 6, 2024	Benchmarking, Hallucination	Code Available	0
Evaluating and Optimizing Educational Content with Large Language Model Judgments	Mar 5, 2024	Language Modeling, Language Modelling	Code Available	0

