SOTAVerified

Math

Papers

Showing 651–700 of 1596 papers

Title | Status | Hype
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs | Code | 0
Modeling Intra-Relation in Math Word Problems with Different Functional Multi-Head Attentions | Code | 0
Scaling up ridge regression for brain encoding in a massive individual fMRI dataset | Code | 0
Mind Scramble: Unveiling Large Language Model Psychology Via Typoglycemia | Code | 0
metboost: Exploratory regression analysis with hierarchically clustered data | Code | 0
Heteroclinic cycling and extinction in May-Leonard models with demographic stochasticity | Code | 0
ComSearch: Equation Searching with Combinatorial Strategy for Solving Math Word Problems with Weak Supervision | Code | 0
Algebra Error Classification with Large Language Models | Code | 0
Meta-Reasoning Improves Tool Use in Large Language Models | Code | 0
Helpful assistant or fruitful facilitator? Investigating how personas affect language model behavior | Code | 0
ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark | Code | 0
Computationally Identifying Funneling and Focusing Questions in Classroom Discourse | Code | 0
Hard Negative Contrastive Learning for Fine-Grained Geometric Understanding in Large Multimodal Models | Code | 0
Compositional Processing Emerges in Neural Networks Solving Math Problems | Code | 0
MIRB: Mathematical Information Retrieval Benchmark | Code | 0
HARDMath2: A Benchmark for Applied Mathematics Built by Students as Part of a Graduate Class | Code | 0
Complex Mathematical Symbol Definition Structures: A Dataset and Model for Coordination Resolution in Definition Extraction | Code | 0
Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks? | Code | 0
MAWPS: A Math Word Problem Repository | Code | 0
In-Context Principle Learning from Mistakes | Code | 0
mCoT: Multilingual Instruction Tuning for Reasoning Consistency in Language Models | Code | 0
Math Word Problem Solving by Generating Linguistic Variants of Problem Statements | Code | 0
Guided Speculative Inference for Efficient Test-Time Alignment of LLMs | Code | 0
GThinker: Towards General Multimodal Reasoning via Cue-Guided Rethinking | Code | 0
Activation Steering for Chain-of-Thought Compression | Code | 0
Combining Large Language Models with Tutoring System Intelligence: A Case Study in Caregiver Homework Support | Code | 0
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning | Code | 0
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation | Code | 0
MathScape: Evaluating MLLMs in multimodal Math Scenarios through a Hierarchical Benchmark | Code | 0
Misplaced Trust: Measuring the Interference of Machine Learning in Human Decision-Making | Code | 0
Exploring the Reliability of Large Language Models as Customized Evaluators for Diverse NLP Tasks | Code | 0
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs | Code | 0
Mathematical Reasoning in Large Language Models: Assessing Logical and Arithmetic Errors across Wide Numerical Ranges | Code | 0
A large language model-assisted education tool to provide feedback on open-ended responses | Code | 0
Mathematical Reasoning for Unmanned Aerial Vehicles: A RAG-Based Approach for Complex Arithmetic Reasoning | Code | 0
Mathematics Content Understanding for Cyberlearning via Formula Evolution Map | Code | 0
Give me a hint: Can LLMs take a hint to solve math problems? | Code | 0
CodeT5+: Open Code Large Language Models for Code Understanding and Generation | Code | 0
PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL | Code | 0
GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation | Code | 0
Coarse-grained Stochastic Model of Myosin-Driven Vesicles into Dendritic Spines | Code | 0
MARGE: Improving Math Reasoning for LLMs with Guided Exploration | Code | 0
MAMUT: A Novel Framework for Modifying Mathematical Formulas for the Generation of Specialized Datasets for Language Model Training | Code | 0
Mapping to Declarative Knowledge for Word Problem Solving | Code | 0
Lost in Cultural Translation: Do LLMs Struggle with Math Across Cultural Contexts? | Code | 0
A Context-Enhanced Framework for Sequential Graph Reasoning | Code | 0
Generalizing Math Word Problem Solvers via Solution Diversification | Code | 0
MathScale: Scaling Instruction Tuning for Mathematical Reasoning | Code | 0
Adversarial Math Word Problem Generation | Code | 0
LLM Performance for Code Generation on Noisy Tasks | Code | 0
