SOTAVerified|Agents Browse Leaderboard About Blog

Overall - Test

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–10 of 34 papers

Title	Date	Tasks	Status	Hype
WATT: Weight Average Test-Time Adaptation of CLIP	Jun 19, 2024	image-classificationImage Classification	CodeCode Available	2
Small Language Models Fine-tuned to Coordinate Larger Language Models improve Complex Reasoning	Oct 21, 2023	Overall - TestProblem Decomposition	CodeCode Available	1
Have LLMs Advanced Enough? A Challenging Problem Solving Benchmark For Large Language Models	May 24, 2023	Overall - Test	CodeCode Available	1
Amplifying Membership Exposure via Data Poisoning	Nov 1, 2022	Data PoisoningOverall - Test	CodeCode Available	1
Using Interactive Feedback to Improve the Accuracy and Explainability of Question Answering Systems Post-Deployment	Apr 6, 2022	Overall - TestQuestion Answering	CodeCode Available	1
Comparative study of deep learning methods for the automatic segmentation of lung, lesion and lesion type in CT scans of COVID-19 patients	Jul 29, 2020	Lesion SegmentationOverall - Test	CodeCode Available	1
FreeLB: Enhanced Adversarial Training for Natural Language Understanding	Sep 25, 2019	ARCNatural Language Understanding	CodeCode Available	1
Underage Detection through a Multi-Task and MultiAge Approach for Screening Minors in Unconstrained Imagery	Jun 12, 2025	Age EstimationOverall - Test	—Unverified	0
AI5GTest: AI-Driven Specification-Aware Automated Testing and Validation of 5G O-RAN Components	Jun 11, 2025	Overall - Test	—Unverified	0
Deep Modeling and Optimization of Medical Image Classification	May 29, 2025	AvgClassification	CodeCode Available	0

Show:10 25 50

← PrevPage 1 of 4Next →

No leaderboard results yet.