SOTAVerified

Benchmarking

Papers

Showing 3601-3625 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| A Survey on Preserving Fairness Guarantees in Changing Environments | | 0 |
| Self-Aligning Depth-regularized Radiance Fields for Asynchronous RGB-D Sequences | | 0 |
| A Benchmark for Out of Distribution Detection in Point Cloud 3D Semantic Segmentation | | 0 |
| A Benchmarking Dataset with 2440 Organic Molecules for Volume Distribution at Steady State | Code | 0 |
| EvEntS ReaLM: Event Reasoning of Entity States via Language Models | | 0 |
| Hyperparameter optimization in deep multi-target prediction | Code | 1 |
| Tell Your Story: Task-Oriented Dialogs for Interactive Content Creation | | 0 |
| Okapi: Generalising Better by Making Statistical Matches Match | Code | 0 |
| Common Pets in 3D: Dynamic New-View Synthesis of Real-Life Deformable Categories | | 0 |
| Improved Target-specific Stance Detection on Social Media Platforms by Delving into Conversation Threads | Code | 0 |
| The Legal Argument Reasoning Task in Civil Procedure | Code | 0 |
| EventEA: Benchmarking Entity Alignment for Event-centric Knowledge Graphs | Code | 1 |
| An approach for benchmarking the numerical solutions of stochastic compartmental models | | 0 |
| Benchmarking Quality-Diversity Algorithms on Neuroevolution for Reinforcement Learning | | 0 |
| Quantum Similarity Testing with Convolutional Neural Networks | | 0 |
| Title2Event: Benchmarking Open Event Extraction with a Large-scale Chinese Title Dataset | | 0 |
| Signing Outside the Studio: Benchmarking Background Robustness for Continuous Sign Language Recognition | Code | 0 |
| SOLAR: A Highly Optimized Data Loading Framework for Distributed Training of CNN-based Scientific Surrogates | | 0 |
| Classical ensemble of Quantum-classical ML algorithms for Phishing detection in Ethereum transaction networks | Code | 0 |
| Benchmarking Adversarial Patch Against Aerial Detection | Code | 1 |
| Benchmarking performance of object detection under image distortions in an uncontrolled environment | Code | 0 |
| Benchmarking Language Models for Code Syntax Understanding | Code | 1 |
| What's Different between Visual Question Answering for Machine "Understanding" Versus for Accessibility? | Code | 0 |
| pmuBAGE: The Benchmarking Assortment of Generated PMU Data for Power System Events | Code | 0 |
| CrisisLTLSum: A Benchmark for Local Crisis Event Timeline Extraction and Summarization | Code | 0 |
Page 145 of 222

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |