| NUMCoT: Numerals and Units of Measurement in Chain-of-Thought Reasoning using Large Language Models | Jun 5, 2024 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| NeuralNexus at BEA 2025 Shared Task: Retrieval-Augmented Prompting for Mistake Identification in AI Tutors | Jun 12, 2025 | Language ModelingLanguage Modelling | CodeCode Available | 0 | 5 |
| Brain-Inspired Two-Stage Approach: Enhancing Mathematical Reasoning by Imitating Human Thought Processes | Feb 23, 2024 | MathMathematical Reasoning | CodeCode Available | 0 | 5 |
| Not All Votes Count! Programs as Verifiers Improve Self-Consistency of Language Models for Math Reasoning | Oct 16, 2024 | AllGSM8K | CodeCode Available | 0 | 5 |
| Position: AI Evaluation Should Learn from How We Test Humans | Jun 18, 2023 | Mathematical ReasoningPosition | CodeCode Available | 0 | 5 |
| Multilingual Mathematical Reasoning: Advancing Open-Source LLMs in Hindi and English | Dec 24, 2024 | Mathematical Reasoning | CodeCode Available | 0 | 5 |
| MultiLingPoT: Enhancing Mathematical Reasoning with Multilingual Program Fine-tuning | Dec 17, 2024 | Mathematical Reasoning | CodeCode Available | 0 | 5 |
| OmniRouter: Budget and Performance Controllable Multi-LLM Routing | Feb 27, 2025 | AI AgentMathematical Reasoning | CodeCode Available | 0 | 5 |
| Blank Collapse: Compressing CTC emission for the faster decoding | Oct 31, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | CodeCode Available | 0 | 5 |
| Multi-Agent Sampling: Scaling Inference Compute for Data Synthesis with Tree Search-Based Agentic Collaboration | Dec 22, 2024 | Decision MakingMachine Translation | CodeCode Available | 0 | 5 |