| An upper bound of the mutation probability in the genetic algorithm for general 0-1 knapsack problem | Mar 17, 2024 | DiversityEvolutionary Algorithms | —Unverified | 0 |
| Incorporating Graph Attention Mechanism into Geometric Problem Solving Based on Deep Reinforcement Learning | Mar 14, 2024 | Deep Reinforcement LearningGraph Attention | CodeCode Available | 0 |
| Hydrodynamics of Markets:Hidden Links Between Physics and Finance | Mar 14, 2024 | Math | —Unverified | 0 |
| Self-Consistency Boosts Calibration for Math Reasoning | Mar 14, 2024 | GSM8KMath | —Unverified | 0 |
| Sabiá-2: A New Generation of Portuguese Large Language Models | Mar 14, 2024 | Math | —Unverified | 0 |
| Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision | Mar 14, 2024 | MathReinforcement Learning (RL) | CodeCode Available | 2 |
| The First to Know: How Token Distributions Reveal Hidden Knowledge in Large Vision-Language Models? | Mar 14, 2024 | Hallucinationimage-classification | CodeCode Available | 1 |
| Laying the Foundation First? Investigating the Generalization from Atomic Skills to Complex Reasoning Tasks | Mar 14, 2024 | MathSkill Generalization | —Unverified | 0 |
| Mastering Text, Code and Math Simultaneously via Fusing Highly Specialized Language Models | Mar 13, 2024 | Math | —Unverified | 0 |
| FineMath: A Fine-Grained Mathematical Evaluation Benchmark for Chinese Large Language Models | Mar 12, 2024 | MathMathematical Reasoning | —Unverified | 0 |