Math

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1101–1150 of 1596 papers

Title	Date	Tasks	Status
Scaling up ridge regression for brain encoding in a massive individual fMRI dataset	Mar 28, 2024	CPUMath	CodeCode Available
Large Language Models Are Struggle to Cope with Unreasonability in Math Problems	Mar 28, 2024	Math	—Unverified
ML2SC: Deploying Machine Learning Models as Smart Contracts on the Blockchain	Mar 28, 2024	Math	—Unverified
Few-Shot Recalibration of Language Models	Mar 27, 2024	MathMMLU	—Unverified
The Invalsi Benchmarks: measuring Linguistic and Mathematical understanding of Large Language Models in Italian	Mar 27, 2024	Language ModellingMath	—Unverified
Automate Knowledge Concept Tagging on Math Questions with LLMs	Mar 26, 2024	Few-Shot LearningMath	—Unverified
To Err is Machine: Vulnerability Detection Challenges LLM Reasoning	Mar 25, 2024	Code GenerationIn-Context Learning	—Unverified
From Large to Tiny: Distilling and Refining Mathematical Expertise for Math Word Problems with Weakly Supervision	Mar 21, 2024	Math	—Unverified
A Chain-of-Thought Prompting Approach with LLMs for Evaluating Students' Formative Assessment Responses in Science	Mar 21, 2024	Active LearningMath	—Unverified
MathVerse: Does Your Multi-modal LLM Truly See the Diagrams in Visual Math Problems?	Mar 21, 2024	MathMathematical Reasoning	—Unverified
PARAMANU-AYN: Pretrain from scratch or Continual Pretraining of LLMs for Legal Domain Adaptation?	Mar 20, 2024	Abstractive Text SummarizationContinual Pretraining	—Unverified
Instructing Large Language Models to Identify and Ignore Irrelevant Conditions	Mar 19, 2024	MathMathematical Reasoning	CodeCode Available
An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem	Mar 17, 2024	DiversityEvolutionary Algorithms	—Unverified
What Makes Math Word Problems Challenging for LLMs?	Mar 17, 2024	Math	CodeCode Available
Incorporating Graph Attention Mechanism into Geometric Problem Solving Based on Deep Reinforcement Learning	Mar 14, 2024	Deep Reinforcement LearningGraph Attention	CodeCode Available
Sabiá-2: A New Generation of Portuguese Large Language Models	Mar 14, 2024	Math	—Unverified
Hydrodynamics of Markets:Hidden Links Between Physics and Finance	Mar 14, 2024	Math	—Unverified
Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks	Mar 14, 2024	MathSkill Generalization	—Unverified
Self-Consistency Boosts Calibration for Math Reasoning	Mar 14, 2024	GSM8KMath	—Unverified
Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models	Mar 13, 2024	Math	—Unverified
SmallToLarge (S2L): Scalable Data Selection for Fine-tuning Large Language Models by Summarizing Training Trajectories of Small Models	Mar 12, 2024	MathMathematical Problem-Solving	CodeCode Available
FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models	Mar 12, 2024	MathMathematical Reasoning	—Unverified
Branch-Train-MiX: Mixing Expert LLMs into a Mixture-of-Experts LLM	Mar 12, 2024	Arithmetic ReasoningCode Generation	—Unverified
Benchmarking Hallucination in Large Language Models based on Unanswerable Math Word Problem	Mar 6, 2024	BenchmarkingHallucination	CodeCode Available
Evaluating and Optimizing Educational Content with Large Language Model Judgments	Mar 5, 2024	Language ModelingLanguage Modelling	CodeCode Available
MathScale: Scaling Instruction Tuning for Mathematical Reasoning	Mar 5, 2024	GSM8KMath	CodeCode Available
Key-Point-Driven Data Synthesis with its Enhancement on Mathematical Reasoning	Mar 4, 2024	GSM8KMath	—Unverified
The Claude 3 Model Family: Opus, Sonnet, Haiku	Mar 4, 2024	1 Image, 2*2 StitchingArithmetic Reasoning	—Unverified
Contrastive Region Guidance: Improving Grounding in Vision-Language Models without Training	Mar 4, 2024	MathPhrase Grounding	—Unverified
Experimenting with Generative AI: Does ChatGPT Really Increase Everyone's Productivity?	Mar 4, 2024	EconometricsMath	—Unverified
ClickTree: A Tree-based Method for Predicting Math Students' Performance Based on Clickstream Data	Mar 1, 2024	Math	—Unverified
PRSA: Prompt Stealing Attacks against Real-World Prompt Services	Feb 29, 2024	Math	—Unverified
Data Interpreter: An LLM Agent For Data Science	Feb 28, 2024	Code GenerationLanguage Modelling	—Unverified
Adversarial Math Word Problem Generation	Feb 27, 2024	Math	CodeCode Available
MATHSENSEI: A Tool-Augmented Large Language Model for Mathematical Reasoning	Feb 27, 2024	8kLanguage Modeling	CodeCode Available
MathGenie: Generating Synthetic Data with Question Back-translation for Enhancing Mathematical Reasoning of LLMs	Feb 26, 2024	GSM8KMath	—Unverified
How Do Humans Write Code? Large Models Do It the Same Way Too	Feb 24, 2024	Code GenerationMath	CodeCode Available
Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought Processes	Feb 23, 2024	MathMathematical Reasoning	CodeCode Available
MoELoRA: Contrastive Learning Guided Mixture of Experts on Parameter-Efficient Fine-Tuning for Large Language Models	Feb 20, 2024	Common Sense ReasoningContrastive Learning	—Unverified
LoRA-Flow: Dynamic LoRA Fusion for Large Language Models in Generative Tasks	Feb 18, 2024	Math	—Unverified
Orca-Math: Unlocking the potential of SLMs in Grade School Math	Feb 16, 2024	Arithmetic ReasoningGSM8K	—Unverified
Mathematical Opportunities in Digital Twins (MATH-DT)	Feb 15, 2024	Math	—Unverified
Language Models with Conformal Factuality Guarantees	Feb 15, 2024	Conformal PredictionLanguage Modeling	—Unverified
AutoTutor meets Large Language Models: A Language Model Tutor with Rich Pedagogy and Guardrails	Feb 14, 2024	Language ModelingLanguage Modelling	CodeCode Available
Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications	Feb 14, 2024	Math	—Unverified
GLoRe: When, Where, and How to Improve LLM Reasoning via Global and Local Refinements	Feb 13, 2024	GSM8KMath	—Unverified
EvoGPT-f: An Evolutionary GPT Framework for Benchmarking Formal Math Languages	Feb 12, 2024	Automated Theorem ProvingBenchmarking	—Unverified
Understanding the Progression of Educational Topics via Semantic Matching	Feb 10, 2024	Math	—Unverified
V-STaR: Training Verifiers for Self-Taught Reasoners	Feb 9, 2024	Code GenerationMath	—Unverified
In-Context Principle Learning from Mistakes	Feb 8, 2024	GSM8KIn-Context Learning	CodeCode Available

Show:10 25 50

← PrevPage 23 of 32Next →

No leaderboard results yet.