Math

Papers

Title	Date	Tasks	Status	Hype
Automate Knowledge Concept Tagging on Math Questions with LLMs	Mar 26, 2024	Few-Shot Learning, Math	Unverified	0
To Err is Machine: Vulnerability Detection Challenges LLM Reasoning	Mar 25, 2024	Code Generation, In-Context Learning	Unverified	0
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?	Mar 21, 2024	Math, Mathematical Reasoning	Unverified	0
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science	Mar 21, 2024	Active Learning, Math	Unverified	0
From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision	Mar 21, 2024	Math	Unverified	0
PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation?	Mar 20, 2024	Abstractive Text Summarization, Continual Pretraining	Unverified	0
Evolutionary Optimization of Model Merging Recipes	Mar 19, 2024	Evolutionary Algorithms, Math	Code Available	5
Memory-Efficient and Secure DNN Inference on TrustZone-enabled Consumer IoT Devices	Mar 19, 2024	Math	Code Available	1
Instructing Large Language Models to Identify and Ignore Irrelevant Conditions	Mar 19, 2024	Math, Mathematical Reasoning	Code Available	0
What Makes Math Word Problems Challenging for LLMs?	Mar 17, 2024	Math	Code Available	0
An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem	Mar 17, 2024	Diversity, Evolutionary Algorithms	Unverified	0
Incorporating Graph Attention Mechanism into Geometric Problem Solving Based on Deep Reinforcement Learning	Mar 14, 2024	Deep Reinforcement Learning, Graph Attention	Code Available	0
Hydrodynamics of Markets: Hidden Links Between Physics and Finance	Mar 14, 2024	Math	Unverified	0
Self-Consistency Boosts Calibration for Math Reasoning	Mar 14, 2024	GSM8K, Math	Unverified	0
Sabiá-2: A New Generation of Portuguese Large Language Models	Mar 14, 2024	Math	Unverified	0
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision	Mar 14, 2024	Math, Reinforcement Learning (RL)	Code Available	2
The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models?	Mar 14, 2024	Hallucination, image-classification	Code Available	1
Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks	Mar 14, 2024	Math, Skill Generalization	Unverified	0
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models	Mar 13, 2024	Math	Unverified	0
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models	Mar 12, 2024	Math, Mathematical Reasoning	Unverified	0
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models	Mar 12, 2024	Math, Mathematical Problem-Solving	Code Available	0
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Mar 12, 2024	Arithmetic Reasoning, Code Generation	Unverified	0
Common 7B Language Models Already Possess Strong Math Capabilities	Mar 7, 2024	GSM8K, Math	Code Available	5
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem	Mar 6, 2024	Benchmarking, Hallucination	Code Available	0
MathScale: Scaling Instruction Tuning for Mathematical Reasoning	Mar 5, 2024	GSM8K, Math	Code Available	0