SOTAVerified
|
Agents
Browse
Leaderboard
About
Tasks
›
Math
Math
Papers
Recently Added
Most Hyped
Most Active
Needs Verification
Most Verified
Showing 1371–1380 of 1596 papers
Title
Date
Tasks
Status
Hype
Score
Feature Selection Based on Confidence Machine
Oct 20, 2014
feature selection
Math
—
Unverified
0
0
The Impact of Item-Writing Flaws on Difficulty and Discrimination in Item Response Theory
Mar 13, 2025
Math
Multiple-choice
—
Unverified
0
0
Few-Shot Recalibration of Language Models
Mar 27, 2024
Math
MMLU
—
Unverified
0
0
FG-PRM: Fine-grained Hallucination Detection and Mitigation in Language Model Mathematical Reasoning
Oct 8, 2024
GSM8K
Hallucination
—
Unverified
0
0
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models
Mar 12, 2024
Math
Mathematical Reasoning
—
Unverified
0
0
The Invalsi Benchmarks: measuring Linguistic and Mathematical understanding of Large Language Models in Italian
Mar 27, 2024
Language Modelling
Math
—
Unverified
0
0
Weakest Link in the Chain: Security Vulnerabilities in Advanced Reasoning Models
Jun 16, 2025
Math
—
Unverified
0
0
First-Step Advantage: Importance of Starting Right in Multi-Step Math Reasoning
Nov 14, 2023
GSM8K
Math
—
Unverified
0
0
Fixation probabilities for the Moran process in evolutionary games with two strategies: graph shapes and large population asymptotics
Apr 30, 2018
Math
—
Unverified
0
0
Fixation probabilities for the Moran process with three or more strategies: general and coupling results
Nov 23, 2018
Math
—
Unverified
0
0
Show:
10
25
50
← Prev
Page 138 of 160
Next →
No leaderboard results yet.