SOTAVerified

Benchmarking

Papers

Showing 1171–1180 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Benchmarking saliency methods for chest X-ray interpretation | Code | 1 |
| GLGENN: A Novel Parameter-Light Equivariant Neural Networks Architecture Based on Clifford Geometric Algebras | Code | 1 |
| FinDABench: Benchmarking Financial Data Analysis Ability of Large Language Models | Code | 1 |
| BiCo-Net: Regress Globally, Match Locally for Robust 6D Pose Estimation | Code | 1 |
| A GPU-accelerated Large-scale Simulator for Transportation System Optimization Benchmarking | Code | 1 |
| German's Next Language Model | Code | 1 |
| Benchmarking Multi-modal Semantic Segmentation under Sensor Failures: Missing and Noisy Modality Robustness | Code | 1 |
| Binary Code Summarization: Benchmarking ChatGPT/GPT-4 and Other Large Language Models | Code | 1 |
| Benchmarking Multimodal Variational Autoencoders: CdSprites+ Dataset and Toolkit | Code | 1 |
| Benchmarking Segmentation Models with Mask-Preserved Attribute Editing | Code | 1 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |