SOTAVerified

Benchmarking

Papers

Showing 49114920 of 5548 papers

TitleStatusHype
FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNsCode0
FairX: A comprehensive benchmarking tool for model analysis using fairness, utility, and explainabilityCode0
Authentic Emotion Mapping: Benchmarking Facial Expressions in Real NewsCode0
Benchmarking performance of object detection under image distortions in an uncontrolled environmentCode0
GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal SegmentationCode0
Multimodal Benchmarking and Recommendation of Text-to-Image Generation ModelsCode0
Segmenting France Across Four CenturiesCode0
Audio Explanation Synthesis with Generative Foundation ModelsCode0
Benchmarking Tropical Cyclone Rapid Intensification with Satellite Images and Attention-based Deep ModelsCode0
FailureSensorIQ: A Multi-Choice QA Dataset for Understanding Sensor Relationships and Failure ModesCode0
Show:102550
← PrevPage 492 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified