SOTAVerified

Task 2

Papers

Showing 6170 of 572 papers

TitleStatusHype
LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMsCode0
TUMS: Enhancing Tool-use Abilities of LLMs with Multi-structure Handlers0
Team ACK at SemEval-2025 Task 2: Beyond Word-for-Word Machine Translation for English-Korean Pairs0
Feature Fusion Revisited: Multimodal CTR Prediction for MMCTR ChallengeCode0
BadMoE: Backdooring Mixture-of-Experts LLMs via Optimizing Routing Triggers and Infecting Dormant Experts0
Data Augmentation Using Neural Acoustic Fields With Retrieval-Augmented Pre-training0
HausaNLP at SemEval-2025 Task 2: Entity-Aware Fine-tuning vs. Prompt Engineering in Entity-Aware Machine Translation0
Towards Universal Learning-based Model for Cardiac Image Reconstruction: Summary of the CMRxRecon2024 ChallengeCode0
Bridging vision language model (VLM) evaluation gaps with a framework for scalable and cost-effective benchmark generation0
Fine-Tuning Open-Source Large Language Models to Improve Their Performance on Radiation Oncology Tasks: A Feasibility Study to Investigate Their Potential Clinical Applications in Radiation Oncology0
Show:102550
← PrevPage 7 of 58Next →

No leaderboard results yet.