SOTAVerified

Benchmarking

Papers

Showing 24112420 of 5548 papers

TitleStatusHype
Class Imbalance in Object Detection: An Experimental Diagnosis and Study of Mitigation StrategiesCode0
Amharic LLaMA and LLaVA: Multimodal LLMs for Low Resource LanguagesCode1
A Holistic Framework Towards Vision-based Traffic Signal Control with Microscopic Simulation0
Leveraging Foundation Models for Content-Based Medical Image Retrieval in RadiologyCode1
Addressing Shortcomings in Fair Graph Learning Datasets: Towards a New BenchmarkCode1
Multi-GPU-Enabled Hybrid Quantum-Classical Workflow in Quantum-HPC Middleware: Applications in Quantum SimulationsCode0
Synth4bench: a framework for generating synthetic genomics data for the evaluation of tumor-only somatic variant calling algorithmsCode0
Benchmarking Micro-action Recognition: Dataset, Methods, and ApplicationsCode1
Benchmarking Large Language Models for Molecule Prediction TasksCode0
Tapilot-Crossing: Benchmarking and Evolving LLMs Towards Interactive Data Analysis AgentsCode1
Show:102550
← PrevPage 242 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified