Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 851–900 of 1596 papers

Title	Date	Tasks	Status
Strictly monotone mean-variance preferences with applications to portfolio selection	Dec 18, 2024	ManagementMath	—Unverified
Inference-Aware Fine-Tuning for Best-of-N Sampling in Large Language Models	Dec 18, 2024	HumanEvalImitation Learning	—Unverified
LinguaLIFT: An Effective Two-stage Instruction Tuning Framework for Low-Resource Language Tasks	Dec 17, 2024	Math	—Unverified
CoinMath: Harnessing the Power of Coding Instruction for Math LLMs	Dec 16, 2024	DescriptiveMath	CodeCode Available
A Survey of Mathematical Reasoning in the Era of Multimodal Large Language Model: Benchmark, Method & Challenges	Dec 16, 2024	Language ModelingLanguage Modelling	—Unverified
Combining Large Language Models with Tutoring System Intelligence: A Case Study in Caregiver Homework Support	Dec 16, 2024	Large Language ModelMath	CodeCode Available
Dipper: Diversity in Prompts for Producing Large Language Model Ensembles in Reasoning tasks	Dec 12, 2024	DiversityGPU	—Unverified
Geo-LLaVA: A Large Multi-Modal Model for Solving Geometry Math Problems with Meta In-Context Learning	Dec 12, 2024	Geometry Problem SolvingIn-Context Learning	—Unverified
Learning to Solve Domain-Specific Calculation Problems with Knowledge-Intensive Programs Generator	Dec 12, 2024	Math	—Unverified
A Context-Enhanced Framework for Sequential Graph Reasoning	Dec 12, 2024	Math	CodeCode Available
A Graph-Based Synthetic Data Pipeline for Scaling High-Quality Reasoning Instructions	Dec 12, 2024	GSM8KKnowledge Graphs	—Unverified
Greek2MathTex: A Greek Speech-to-Text Framework for LaTeX Equations Generation	Dec 11, 2024	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available
MNIST-Fraction: Enhancing Math Education with AI-Driven Fraction Detection and Analysis	Dec 11, 2024	Math	—Unverified
LLM-as-an-Interviewer: Beyond Static Testing Through Dynamic LLM Evaluation	Dec 10, 2024	Math	CodeCode Available
When Dimensionality Reduction Meets Graph (Drawing) Theory: Introducing a Common Framework, Challenges and Opportunities	Dec 9, 2024	Dimensionality ReductionMath	—Unverified
Mining Math Conjectures from LLMs: A Pruning Approach	Dec 9, 2024	Math	—Unverified
Chimera: Improving Generalist Model with Domain-Specific Experts	Dec 8, 2024	Mathmodel	—Unverified
Neuro-Symbolic Data Generation for Math Reasoning	Dec 6, 2024	DiversityMath	—Unverified
Hard Math -- Easy UVM: Pragmatic solutions for verifying hardware algorithms using UVM	Dec 6, 2024	Math	—Unverified
Automated LaTeX Code Generation from Handwritten Math Expressions Using Vision Transformer	Dec 5, 2024	Code GenerationDecoder	—Unverified
Enhancing Mathematical Reasoning in LLMs with Background Operators	Dec 5, 2024	Data AugmentationMath	—Unverified
RedStone: Curating General, Code, Math, and QA Data for Large Language Models	Dec 4, 2024	Domain AdaptationMath	—Unverified
Unsupervised learning-based calibration scheme for Rough Bergomi model	Dec 3, 2024	Math	CodeCode Available
MALT: Improving Reasoning with Multi-Agent LLM Training	Dec 2, 2024	Common Sense ReasoningGSM8K	—Unverified
Yi-Lightning Technical Report	Dec 2, 2024	ChatbotLarge Language Model	—Unverified
Reverse Thinking Makes LLMs Stronger Reasoners	Nov 29, 2024	Data AugmentationKnowledge Distillation	—Unverified
Mars-PO: Multi-Agent Reasoning System Preference Optimization	Nov 28, 2024	MathMathematical Reasoning	—Unverified
A Lean Dataset for International Math Olympiad: Small Steps towards Writing Math Proofs for Hard Problems	Nov 28, 2024	LEMMAMath	—Unverified
Embracing AI in Education: Understanding the Surge in Large Language Model Use by Secondary Students	Nov 27, 2024	Language ModelingLanguage Modelling	—Unverified
Beyond Examples: High-level Automated Reasoning Paradigm in In-Context Learning via MCTS	Nov 27, 2024	In-Context LearningMath	CodeCode Available
Learning by Analogy: Enhancing Few-Shot Prompting for Math Word Problem Solving with Computational Graph-Based Retrieval	Nov 25, 2024	MathMath Word Problem Solving	—Unverified
Unraveling Arithmetic in Large Language Models: The Role of Algebraic Structures	Nov 25, 2024	GSM8KMath	—Unverified
Velocitune: A Velocity-based Dynamic Domain Reweighting Method for Continual Pre-training	Nov 21, 2024	Math	—Unverified
MM-Eval: A Hierarchical Benchmark for Modern Mongolian Evaluation in LLMs	Nov 14, 2024	General KnowledgeMath	CodeCode Available
RESOLVE: Relational Reasoning with Symbolic and Object-Level Features Using Vector Symbolic Processing	Nov 13, 2024	DecoderMath	CodeCode Available
OpenAI-o1 AB Testing: Does the o1 model really do good reasoning in math problem solving?	Nov 9, 2024	Logical ReasoningMath	—Unverified
VISTA: Visual Integrated System for Tailored Automation in Math Problem Generation Using LLM	Nov 8, 2024	Math	—Unverified
Meta-Reasoning Improves Tool Use in Large Language Models	Nov 7, 2024	Math	CodeCode Available
Evaluating GPT-4 at Grading Handwritten Solutions in Math Exams	Nov 7, 2024	Math	—Unverified
Self-Consistency Preference Optimization	Nov 6, 2024	GSM8KMath	—Unverified
Automatic Generation of Question Hints for Mathematics Problems using Large Language Models in Educational Technology	Nov 5, 2024	MathMisconceptions	—Unverified
Leveraging Label Semantics and Meta-Label Refinement for Multi-Label Question Classification	Nov 4, 2024	MathReranking	CodeCode Available
Dictionary Insertion Prompting for Multilingual Reasoning on Multilingual Large Language Models	Nov 2, 2024	GSM8KMath	—Unverified
STEM-POM: Evaluating Language Models Math-Symbol Reasoning in Document Parsing	Nov 1, 2024	2kIn-Context Learning	—Unverified
DynaMath: A Dynamic Visual Benchmark for Evaluating Mathematical Reasoning Robustness of Vision Language Models	Oct 29, 2024	MathMathematical Reasoning	—Unverified
Automated Feedback in Math Education: A Comparative Analysis of LLMs for Open-Ended Responses	Oct 29, 2024	MathZero-Shot Learning	—Unverified
Improving Math Problem Solving in Large Language Models Through Categorization and Strategy Tailoring	Oct 29, 2024	Math	—Unverified
EoRA: Training-free Compensation for Compressed LLM with Eigenspace Low-Rank Approximation	Oct 28, 2024	ARCMath	—Unverified
Guiding Through Complexity: What Makes Good Supervision for Hard Reasoning Tasks?	Oct 27, 2024	Data AugmentationMath	CodeCode Available
Library Learning Doesn't: The Curious Case of the Single-Use "Library"	Oct 26, 2024	MathMathematical Reasoning	CodeCode Available

Show:10 25 50

← PrevPage 18 of 32Next →

No leaderboard results yet.