SOTAVerified

Benchmarking

Papers

Showing 17811790 of 5548 papers

TitleStatusHype
Adaptive Visual Scene Understanding: Incremental Scene Graph GenerationCode0
Integrating Expert Knowledge into Logical Programs via LLMsCode0
Building a Large Scale Dataset for Image Emotion Recognition: The Fine Print and The BenchmarkCode0
inMOTIFin: a lightweight end-to-end simulation software for regulatory sequencesCode0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene RecognitionCode0
Bugs in the Data: How ImageNet Misrepresents BiodiversityCode0
CleanPatrick: A Benchmark for Image Data CleaningCode0
BubGAN: Bubble Generative Adversarial Networks for Synthesizing Realistic Bubbly Flow ImagesCode0
bsnsing: A decision tree induction method based on recursive optimal boolean rule compositionCode0
BSBench: will your LLM find the largest prime number?Code0
Show:102550
← PrevPage 179 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified