SOTAVerified

Math

Papers

Showing 601–650 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| Big Math and the One-Brain Barrier A Position Paper and Architecture Proposal | | 0 |
| DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models | | 0 |
| Dualformer: Controllable Fast and Slow Thinking by Learning with Randomized Reasoning Traces | | 0 |
| Accurate closed-form solution of the SIR epidemic model | | 0 |
| SelfBudgeter: Adaptive Token Allocation for Efficient LLM Reasoning | | 0 |
| LaRS: Latent Reasoning Skills for Chain-of-Thought Reasoning | | 0 |
| Biased Programmers? Or Biased Data? A Field Experiment in Operationalizing AI Ethics | | 0 |
| DrawEduMath: Evaluating Vision Language Models with Expert-Annotated Students' Hand-Drawn Math Images | | 0 |
| Do Thinking Tokens Help or Trap? Towards More Efficient Large Reasoning Model | | 0 |
| An Improved Coarse-to-Fine Method for Solving Generation Tasks | | 0 |
| A General Retrieval-Augmented Generation Framework for Multimodal Case-Based Reasoning Applications | | 0 |
| Large Language Models Can Self-Correct with Key Condition Verification | | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | | 0 |
| Done Is Better than Perfect: Unlocking Efficient Reasoning by Structured Multi-Turn Decomposition | | 0 |
| Dolphin: A Spoken Language Proficiency Assessment System for Elementary Education | | 0 |
| Beyond Sentential Semantic Parsing: Tackling the Math SAT with a Cascade of Tree Transducers | | 0 |
| Do Large Language Models Truly Grasp Mathematics? An Empirical Exploration From Cognitive Psychology | | 0 |
| Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models | | 0 |
| Does Representation Intervention Really Identify Desired Concepts and Elicit Alignment? | | 0 |
| Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model? | | 0 |
| Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets | | 0 |
| Does Reasoning Introduce Bias? A Study of Social Bias Evaluation and Mitigation in LLM Reasoning | | 0 |
| Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models | | 0 |
| Beyond Captioning: Task-Specific Prompting for Improved VLM Performance in Mathematical Reasoning | | 0 |
| LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs | | 0 |
| Large Language Models as Analogical Reasoners | | 0 |
| Large Language Models Might Not Care What You Are Saying: Prompt Format Beats Descriptions | | 0 |
| A Neural Network Implementation for Free Energy Principle | | 0 |
| dMath: Distributed Linear Algebra for DL | | 0 |
| dMath: A Scalable Linear Algebra and Math Library for Heterogeneous GP-GPU Architectures | | 0 |
| Language Models with Conformal Factuality Guarantees | | 0 |
| Divide-and-Conquer Meets Consensus: Unleashing the Power of Functions in Code Generation | | 0 |
| Better Process Supervision with Bi-directional Rewarding Signals | | 0 |
| DiversiGATE: A Comprehensive Framework for Reliable Large Language Models | | 0 |
| Benchmarking Reasoning Robustness in Large Language Models | | 0 |
| Distributed Skellam Mechanism: a Novel Approach to Federated Learning with Differential Privacy | | 0 |
| Advancing Process Verification for Large Language Models via Tree-Based Preference Learning | | 0 |
| DISK: Domain-constrained Instance Sketch for Math Word Problem Generation | | 0 |
| DISC: Dynamic Decomposition Improves LLM Inference Scaling | | 0 |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | | 0 |
| Direct Reasoning Optimization: LLMs Can Reward And Refine Their Own Reasoning for Open-Ended Tasks | | 0 |
| Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks | | 0 |
| An Efficient Merge Search Matheuristic for Maximising the Net Present Value of Project Schedules | | 0 |
| DINGO: Constrained Inference for Diffusion LLMs | | 0 |
| Dimension Reduction via Colour Refinement | | 0 |
| BeamLoRA: Beam-Constraint Low-Rank Adaptation | | 0 |
| Dimensionality reduction: theoretical perspective on practical measures | | 0 |
| Digenes: genetic algorithms to discover conjectures about directed and undirected graphs | | 0 |
| Basic concepts, definitions, and methods in D number theory | | 0 |
| Odd period cycles and ergodic properties in price dynamics for an exchange economy | | 0 |
Page 13 of 32

No leaderboard results yet.