SOTAVerified

Benchmarking

Papers

Showing 40764100 of 5548 papers

TitleStatusHype
Benchmarking Subset Selection from Large Candidate Solution Sets in Evolutionary Multi-objective OptimizationCode0
A Comparative study of Hyper-Parameter Optimization Tools0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks0
Feasibility of BERT Embeddings For Domain-Specific Knowledge Mining0
Context-guided Triple Matching for Multiple Choice Question Answering0
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding0
A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-190
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics0
A Baseline Statistical Method For Robust User-Assisted Multiple SegmentationCode0
Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling0
Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software PlatformCode0
DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation0
Are we really making much progress? Revisiting, benchmarking, and refining heterogeneous graph neural networksCode1
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical StudyCode1
Leveraging Trust for Joint Multi-Objective and Multi-Fidelity OptimizationCode1
MPCLeague: Robust MPC Platform for Privacy-Preserving Machine Learning0
Benchmarking Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD)0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene RecognitionCode0
TFW2V: An Enhanced Document Similarity Method for the Morphologically Rich Finnish LanguageCode0
Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving EnvironmentCode0
CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding0
QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking ResultsCode0
Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent0
Autonomous Reinforcement Learning: Formalism and BenchmarkingCode1
Benchmarking Uncertainty Quantification on Biosignal Classification Tasks under Dataset Shift0
Show:102550
← PrevPage 164 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified