SOTAVerified

Math

Papers

Showing 12511300 of 1596 papers

TitleStatusHype
Unlocking Temporal Question Answering for Large Language Models with Tailor-Made Reasoning LogicCode0
Let GPT be a Math Tutor: Teaching Math Word Problem Solvers with Customized Exercise Generation0
Cognitive network science reveals bias in GPT-3, ChatGPT, and GPT-4 mirroring math anxiety in high-school students0
TEIMMA: The First Content Reuse Annotator for Text, Images, and MathCode0
Can ChatGPT Defend its Belief in Truth? Evaluating LLM Reasoning via Debate0
Hint of Thought prompting: an explainable and zero-shot approach to reasoning tasks with LLMs0
A quantitative study of NLP approaches to question difficulty estimationCode0
Learning Non-linguistic Skills without Sacrificing Linguistic ProficiencyCode0
CodeT5+: Open Code Large Language Models for Code Understanding and GenerationCode0
Parameterized Approximation for Robust Clustering in Discrete Geometric Spaces0
Algebra Error Classification with Large Language ModelsCode0
AI, write an essay for me: A large-scale comparison of human-written versus ChatGPT-generated essays0
Who's the Best Detective? LLMs vs. MLs in Detecting Incoherent Fourth Grade Math Answers0
Enhancing Textbooks with Visuals from the Web for Improved LearningCode0
What Makes a Good Dataset for Symbol Description Reading?0
Metric-agnostic Ranking Optimization0
Gamifying Math Education using Object Detection0
Reinforcement Learning Tutor Better Supported Lower Performers in a Math Task0
Exploring the Impact of Instruction Data Scaling on Large Language Models: An Empirical Study on Real-World Use Cases0
Reliable and Efficient Evaluation of Adversarial Robustness for Deep Hashing-Based Retrieval0
Mind meets machine: Unravelling GPT-4's cognitive psychology0
OntoMath^PRO 2.0 Ontology: Updates of the Formal Model0
Self-reinforced polynomial approximation methods for concentrated probability densities0
On the existence of minimizers in shallow residual ReLU neural network optimization landscapes0
An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP)Code0
Tighter 'uniform bounds for Black-Scholes implied volatility' and the applications to root-finding0
On the Difficulty of Characterizing Network Formation with Endogenous Behavior0
Techniques to Improve Neural Math Word Problem SolversCode0
Faithful Chain-of-Thought ReasoningCode0
The Backpropagation algorithm for a math student0
Tracing and Manipulating Intermediate Values in Neural Math Problem SolversCode0
Class Prototypes Based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational Videos0
Deterministic and Nondeterministic Particle Motion with Interaction MechanismsCode0
Asymptotic behavior of mean fixation times in the Moran process with frequency-independent fitnesses0
Constrained monotone mean-variance problem with random coefficients0
MatCha: Enhancing Visual Language Pretraining with Math Reasoning and Chart Derendering0
The Long-Term Effects of Teachers' Gender Stereotypes0
Decomposing a Recurrent Neural Network into Modules for Enabling Reusability and Replacement0
Skellam Mixture Mechanism: a Novel Approach to Federated Learning with Differential PrivacyCode0
Analogical Math Word Problems Solving with Enhanced Problem-Solution AssociationCode0
Generalizing Math Word Problem Solvers via Solution DiversificationCode0
Nonlinear and Machine Learning Analyses on High-Density EEG data of Math Experts and Novices0
Explicit Knowledge Transfer for Weakly-Supervised Code Generation0
Textual Enhanced Contrastive Learning for Solving Math Word ProblemsCode0
Robot Kinematics: Motion, Kinematics and Dynamics0
Solving math word problems with process- and outcome-based feedback0
DyRRen: A Dynamic Retriever-Reranker-Generator Model for Numerical Reasoning over Tabular and Textual DataCode0
End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics0
Self-consistent Reasoning For Solving Math Word Problems0
Structure-Unified M-Tree Coding Solver for MathWord ProblemCode0
Show:102550
← PrevPage 26 of 32Next →

No leaderboard results yet.