SOTAVerified

Math

Papers

Showing 701-750 of 1596 papers

| Title | Status | Hype |
| --- | --- | --- |
| A multi-core periphery perspective: Ranking via relative centrality | | 0 |
| Data Augmentation with In-Context Learning and Comparative Evaluation in Math Word Problem Solving | | 0 |
| IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations | | 0 |
| AutoMathKG: The automated mathematical knowledge graph based on LLM and vector database | | 0 |
| LoRASuite: Efficient LoRA Adaptation Across Large Language Model Upgrades | | 0 |
| Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thoughts Prompting | | 0 |
| Automate Knowledge Concept Tagging on Math Questions with LLMs | | 0 |
| Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination | | 0 |
| Investigating Symbolic Capabilities of Large Language Models | | 0 |
| Cross-Lingual Consistency: A Novel Inference Framework for Advancing Reasoning in Large Language Models | | 0 |
| LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks | | 0 |
| Investigating Math Word Problems using Pretrained Multilingual Language Models | | 0 |
| Solving Functional Optimization with Deep Networks and Variational Principles | | 0 |
| Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs | | 0 |
| Automated Systems For Diagnosis of Dysgraphia in Children: A Survey and Novel Framework | | 0 |
| Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist | | 0 |
| Iterative Reasoning Preference Optimization | | 0 |
| Investigating Large Language Models in Diagnosing Students' Cognitive Skills in Math Problem-solving | | 0 |
| Critique-Guided Distillation: Improving Supervised Fine-tuning via Better Distillation | | 0 |
| Accelerating Chain-of-Thought Reasoning: When Goal-Gradient Importance Meets Dynamic Skipping | | 0 |
| Critique Ability of Large Language Models | | 0 |
| Introduction to Coresets: Accurate Coresets | | 0 |
| Kappa Learning: A New Method for Measuring Similarity Between Educational Items Using Performance Data | | 0 |
| Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning | | 0 |
| Introducing the Mathematics Meme Repository | | 0 |
| Critic-CoT: Boosting the reasoning abilities of large language model via Chain-of-thoughts Critic | | 0 |
| Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains | | 0 |
| Knowledge Tagging System on Math Questions via LLMs with Flexible Demonstration Retriever | | 0 |
| Knowledge Tagging with Large Language Model based Multi-Agent System | | 0 |
| Kokoyi: Executable LaTeX for End-to-end Deep Learning | | 0 |
| L2CEval: Evaluating Language-to-Code Generation Capabilities of Large Language Models | | 0 |
| Automated LaTeX Code Generation from Handwritten Math Expressions Using Vision Transformer | | 0 |
| Intriguing Properties of Large Language and Vision Models | | 0 |
| Interpretable Math Word Problem Solution Generation Via Step-by-step Planning | | 0 |
| Interpretable Factorization for Neural Network ECG Models | | 0 |
| AdaptMI: Adaptive Skill-based In-context Math Instruction for Small Language Models | | 0 |
| Accelerated Sampling from Masked Diffusion Models via Entropy Bounded Unmasking | | 0 |
| Long Is More Important Than Difficult for Training Reasoning Models | | 0 |
| Interleaved Reasoning for Large Language Models via Reinforcement Learning | | 0 |
| Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving | | 0 |
| CRANE: Reasoning with constrained LLM generation | | 0 |
| Large Language Models Are Struggle to Cope with Unreasonability in Math Problems | | 0 |
| Large Language Models as Analogical Reasoners | | 0 |
| Automated Feedback in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses | | 0 |
| Large Language Models Can Self-Correct with Key Condition Verification | | 0 |
| Large Language Models for Mathematical Reasoning: Progresses and Challenges | | 0 |
| Local Prompt Optimization | | 0 |
| Large Language Models' Understanding of Math: Source Criticism and Extrapolation | | 0 |
| Cramer-Rao bound and absolute sensitivity in chemical reaction networks | | 0 |
| Integer Networks for Data Compression with Latent-Variable Models | | 0 |

No leaderboard results yet.