SOTAVerified

Benchmarking

Papers

Showing 771-780 of 5548 papers

Title | Status | Hype
DomainLab: A modular Python package for domain generalization in deep learning | Code | 1
RoDLA: Benchmarking the Robustness of Document Layout Analysis Models | Code | 1
Practical End-to-End Optical Music Recognition for Pianoform Music | Code | 1
MELTing point: Mobile Evaluation of Language Transformers | Code | 1
ERASE: Benchmarking Feature Selection Methods for Deep Recommender Systems | Code | 1
NovelQA: Benchmarking Question Answering on Documents Exceeding 200K Tokens | Code | 1
Align and Distill: Unifying and Improving Domain Adaptive Object Detection | Code | 1
Histo-Genomic Knowledge Distillation For Cancer Prognosis From Histopathology Whole Slide Images | Code | 1
An Improved Metric and Benchmark for Assessing the Performance of Virtual Screening Models | Code | 1
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource Languages | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified