SOTAVerified

Math

Papers

Showing 901925 of 1596 papers

TitleStatusHype
Instance-adaptive Zero-shot Chain-of-Thought Prompting0
Instruction-Following Pruning for Large Language Models0
Integer Networks for Data Compression with Latent-Variable Models0
Interactive Sketchpad: A Multimodal Tutoring System for Collaborative, Visual Problem-Solving0
Interleaved Reasoning for Large Language Models via Reinforcement Learning0
Beyond Human Data: Scaling Self-Training for Problem-Solving with Language Models0
Interpretable Factorization for Neural Network ECG Models0
Interpretable Math Word Problem Solution Generation Via Step-by-step Planning0
Intriguing Properties of Large Language and Vision Models0
Introducing the Mathematics Meme Repository0
Introduction to Coresets: Accurate Coresets0
Investigating Large Language Models in Diagnosing Students' Cognitive Skills in Math Problem-solving0
Investigating Math Word Problems using Pretrained Multilingual Language Models0
Investigating Symbolic Capabilities of Large Language Models0
Investigating the Effectiveness of ChatGPT in Mathematical Reasoning and Problem Solving: Evidence from the Vietnamese National High School Graduation Examination0
Investigating the Efficacy of Large Language Models in Reflective Assessment Methods through Chain of Thoughts Prompting0
Thinking Outside the (Gray) Box: A Context-Based Score for Assessing Value and Originality in Neural Text Generation0
IsoBench: Benchmarking Multimodal Foundation Models on Isomorphic Representations0
Solving Functional Optimization with Deep Networks and Variational Principles0
Is your LLM trapped in a Mental Set? Investigative study on how mental sets affect the reasoning capabilities of LLMs0
Is Your Model Really A Good Math Reasoner? Evaluating Mathematical Reasoning with Checklist0
Iterative Reasoning Preference Optimization0
Yi-Lightning Technical Report0
Adaptive Guidance Accelerates Reinforcement Learning of Reasoning Models0
JMMMU: A Japanese Massive Multi-discipline Multimodal Understanding Benchmark for Culture-aware Evaluation0
Show:102550
← PrevPage 37 of 64Next →

No leaderboard results yet.