SOTAVerified

Benchmarking

Papers

Showing 40514100 of 5548 papers

TitleStatusHype
Benchmarking Deep Models for Salient Object DetectionCode1
Evaluation Methods and Measures for Causal Learning Algorithms0
RerrFact: Reduced Evidence Retrieval Representations for Scientific Claim VerificationCode0
Structured Prediction Problem ArchiveCode0
Quality Assessment of Low Light Restored Images: A Subjective Study and an Unsupervised Model0
Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization0
A quantitative method for benchmarking fair income distribution0
Black-box Bayesian inference for economic agent-based models0
When Do Flat Minima Optimizers Work?Code1
AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation0
Benchmarking Resource Usage for Efficient Distributed Deep Learning0
Benchmarking Conventional Vision Models on Neuromorphic Fall Detection and Action Recognition Dataset0
Benchmarking Robustness of 3D Point Cloud Recognition Against Common CorruptionsCode2
Benchmarking learned non-Cartesian k-space trajectories and reconstruction networks0
A Multi-rater Comparative Study of Automatic Target Localization Methods for Epilepsy Deep Brain Stimulation Procedures0
MeltpoolNet: Melt pool Characteristic Prediction in Metal Additive Manufacturing Using Machine Learning0
Jointly Learning Knowledge Embedding and Neighborhood Consensus with Relational Knowledge Distillation for Entity Alignment0
DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery -- A Focus on Affinity Prediction Problems with Noise AnnotationsCode0
Visual Object Tracking on Multi-modal RGB-D Videos: A Review0
Out of Distribution Detection on ImageNet-OCode0
Towards Private Learning on Decentralized Graphs with Local Differential Privacy0
AiTLAS: Artificial Intelligence Toolbox for Earth ObservationCode2
Individual Treatment Effect Estimation Through Controlled Neural Network Training in Two Stages0
A Simple Evolutionary Algorithm for Multi-modal Multi-objective Optimization0
High-Level Synthesis Performance Prediction using GNNs: Benchmarking, Modeling, and Advancing0
Benchmarking Subset Selection from Large Candidate Solution Sets in Evolutionary Multi-objective OptimizationCode0
A Comparative study of Hyper-Parameter Optimization Tools0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks0
Feasibility of BERT Embeddings For Domain-Specific Knowledge Mining0
Context-guided Triple Matching for Multiple Choice Question Answering0
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding0
A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-190
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics0
A Baseline Statistical Method For Robust User-Assisted Multiple SegmentationCode0
Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling0
Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software PlatformCode0
DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation0
Are we really making much progress? Revisiting, benchmarking, and refining heterogeneous graph neural networksCode1
Benchmarking Chinese Text Recognition: Datasets, Baselines, and an Empirical StudyCode1
Leveraging Trust for Joint Multi-Objective and Multi-Fidelity OptimizationCode1
MPCLeague: Robust MPC Platform for Privacy-Preserving Machine Learning0
Benchmarking Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD)0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene RecognitionCode0
TFW2V: An Enhanced Document Similarity Method for the Morphologically Rich Finnish LanguageCode0
Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving EnvironmentCode0
CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding0
QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking ResultsCode0
Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent0
Autonomous Reinforcement Learning: Formalism and BenchmarkingCode1
Benchmarking Uncertainty Quantification on Biosignal Classification Tasks under Dataset Shift0
Show:102550
← PrevPage 82 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified