SOTAVerified

Benchmarking

Papers

Showing 26012625 of 5548 papers

TitleStatusHype
Benchmarking Robustness of Contrastive Learning Models for Medical Image-Report Retrieval0
FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding0
FineText: Text Classification via Attention-based Language Model Fine-tuning0
Fast Training of Deep Networks with One-Class CNNs0
Fine-tuning LLaMA 2 interference: a comparative study of language implementations for optimal efficiency0
FinGPT: Instruction Tuning Benchmark for Open-Source Large Language Models in Financial Datasets0
AI-ready Snow Radar Echogram Dataset (SRED) for climate change monitoring0
A Comprehensive Benchmarking Platform for Deep Generative Models in Molecular Design0
Fast Labeling and Transcription with the Speechalyzer Toolkit0
FIORD: A Fisheye Indoor-Outdoor Dataset with LIDAR Ground Truth for 3D Scene Reconstruction and Benchmarking0
Benchmarking Quantum Hardware for Training of Fully Visible Boltzmann Machines0
FastEnsemble: Benchmarking and Accelerating Ensemble-based Uncertainty Estimation for Image-to-Image Translation0
Fast Empirical Scenarios0
FixCLR: Negative-Class Contrastive Learning for Semi-Supervised Domain Generalization0
Benchmarking Quantum Convolutional Neural Networks for Signal Classification in Simulated Gamma-Ray Burst Detection0
A Survey on Model Compression for Large Language Models0
FastDraft: How to Train Your Draft0
FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning0
FlowBench: Revisiting and Benchmarking Workflow-Guided Planning for LLM-based Agents0
Benchmarking Rotary Position Embeddings for Automatic Speech Recognition0
FlowerTune: A Cross-Domain Benchmark for Federated Fine-Tuning of Large Language Models0
FlowMind: Automatic Workflow Generation with LLMs0
AI-Powered Cow Detection in Complex Farm Environments0
Benchmarking quantized LLaMa-based models on the Brazilian Secondary School Exam0
Fast, approximate kinetics of RNA folding0
Show:102550
← PrevPage 105 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified