SOTAVerified

Benchmarking

Papers

Showing 36913700 of 5548 papers

TitleStatusHype
Motion-Grounded Video Reasoning: Understanding and Perceiving Motion at Pixel Level0
A Dataset for Benchmarking Image-Based Localization0
Movie Description0
MoviePuzzle: Visual Narrative Reasoning through Multimodal Order Learning0
Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking0
MozzaVID: Mozzarella Volumetric Image Dataset0
MPCLeague: Robust MPC Platform for Privacy-Preserving Machine Learning0
MRAnnotator: multi-Anatomy and many-Sequence MRI segmentation of 44 structures0
MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization0
Towards responsible AI for education: Hybrid human-AI to confront the Elephant in the room0
Show:102550
← PrevPage 370 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified