SOTAVerified

Math

Papers

Showing 1301-1350 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| Effects of context, complexity, and clustering on evaluation for math formula retrieval | | 0 |
| Text2Math: End-to-end Parsing Text into Math Expressions | | 0 |
| Can Stories Help LLMs Reason? Curating Information Space Through Narrative | | 0 |
| WARM: A Weakly (+Semi) Supervised Math Word Problem Solver | | 0 |
| Efficient Tool Use with Chain-of-Abstraction Reasoning | | 0 |
| The Backpropagation algorithm for a math student | | 0 |
| Embedded Phase Shifting: Robust Phase Shifting With Embedded Signals | | 0 |
| Embedding Self-Correction as an Inherent Ability in Large Language Models for Enhanced Mathematical Reasoning | | 0 |
| Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students | | 0 |
| Emergent inabilities? Inverse scaling over the course of pretraining | | 0 |
| Empirical entropy, minimax regret and minimax risk | | 0 |
| Empowering Bengali Education with AI: Solving Bengali Math Word Problems through Transformer Models | | 0 |
| Enabling Massive Deep Neural Networks with the GraphBLAS | | 0 |
| End-to-End Bangla AI for Solving Math Olympiad Problem Benchmark: Leveraging Large Language Model Using Integrated Approach | | 0 |
| End-to-End Evaluation of a Spoken Dialogue System for Learning Basic Mathematics | | 0 |
| Can LLMs understand Math? -- Exploring the Pitfalls in Mathematical Reasoning | | 0 |
| Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | | 0 |
| The Claude 3 Model Family: Opus, Sonnet, Haiku | | 0 |
| Enhancing LLM Intelligence with ARM-RAG: Auxiliary Rationale Memory for Retrieval Augmented Generation | | 0 |
| Enhancing Mathematical Reasoning in LLMs with Background Operators | | 0 |
| Enhancing Math Learning in an LMS Using AI-Driven Question Recommendations | | 0 |
| GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach | | 0 |
| Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search | | 0 |
| Enigmata: Scaling Logical Reasoning in Large Language Models with Synthetic Verifiable Puzzles | | 0 |
| Ensembling Large Language Models with Process Reward-Guided Tree Search for Better Complex Reasoning | | 0 |
| Entropic Distribution Matching in Supervised Fine-tuning of LLMs: Less Overfitting and Better Diversity | | 0 |
| Entropy Adaptive Decoding: Dynamic Model Switching for Efficient Inference | | 0 |
| The Complexity of Math Problems -- Linguistic, or Computational? | | 0 |
| Entropy-Guided Watermarking for LLMs: A Test-Time Framework for Robust and Traceable Text Generation | | 0 |
| Entropy Martingale Optimal Transport and Nonlinear Pricing-Hedging Duality | | 0 |
| EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation | | 0 |
| Error Classification of Large Language Models on Math Word Problems: A Dynamically Adaptive Framework | | 0 |
| The Effect of Teacher Gender on Student Achievement in Primary School | | 0 |
| Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation | | 0 |
| The Entropic Measure Transform | | 0 |
| Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams | | 0 |
| Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics | | 0 |
| The Function Transformation Omics - Funomics | | 0 |
| Evaluating Robustness of Reward Models for Mathematical Reasoning | | 0 |
| Evaluating the Design Features of an Intelligent Tutoring System for Advanced Mathematics Learning | | 0 |
| EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages | | 0 |
| Can I understand what I create? Self-Knowledge Evaluation of Large Language Models | | 0 |
| Evolving LLMs' Self-Refinement Capability via Iterative Preference Optimization | | 0 |
| The Gap of Semantic Parsing: A Survey on Automatic Math Word Problem Solvers | | 0 |
| Examining the Behavior of LLM Architectures Within the Framework of Standardized National Exams in Brazil | | 0 |
| Examining the Robustness of Large Language Models across Language Complexity | | 0 |
| Expanding Search Space with Diverse Prompting Agents: An Efficient Sampling Approach for LLM Mathematical Reasoning | | 0 |
| Wavelet GPT: Wavelet Inspired Large Language Models | | 0 |
| Experimental Demonstration of an Optical Neural PDE Solver via On-Chip PINN Training | | 0 |
| Experimenting with Generative AI: Does ChatGPT Really Increase Everyone's Productivity? | | 0 |
Page 27 of 32

No leaderboard results yet.