SOTAVerified

Benchmarking

Papers

Showing 37013750 of 5548 papers

TitleStatusHype
Application of Machine Learning for Online Reputation Systems0
FORLORN: A Framework for Comparing Offline Methods and Reinforcement Learning for Optimization of RAN ParametersCode0
Improving plant disease classification by adaptive minimal ensembling0
Benchmarking Multimodal Variational Autoencoders: CdSprites+ Dataset and ToolkitCode1
RF Fingerprinting Needs Attention: Multi-task Approach for Real-World WiFi and Bluetooth0
Low Complexity Hybrid Beamforming for mmWave Full-Duplex Integrated Access and BackhaulCode0
Structural Bias for Aspect Sentiment Triplet ExtractionCode1
nnOOD: A Framework for Benchmarking Self-supervised Anomaly Localisation MethodsCode1
Complexity of Representations in Deep Learning0
An evaluation framework for comparing causal inference models0
AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels0
Hardware-aware mobile building block evaluation for computer vision0
Benchmarking Human Face Similarity Using Identical Twins0
TEP-GNN: Accurate Execution Time Prediction of Functional Tests using Graph Neural Networks0
Towards Benchmarking Explainable Artificial Intelligence Methods0
Bugs in the Data: How ImageNet Misrepresents BiodiversityCode0
StEduCov: An Explored and Benchmarked Dataset on Stance Detection in Tweets towards Online Education during COVID-19 Pandemic0
MechProNet: Machine Learning Prediction of Mechanical Properties in Metal Additive Manufacturing0
SIM2E: Benchmarking the Group Equivariant Capability of Correspondence Matching Algorithms0
A biologically-inspired multi-modal evaluation of molecular generative machine learning0
Wildfire Forecasting with Satellite Images and Deep Generative Model0
Benchmarking Compositionality with Formal LanguagesCode1
MultiPL-E: A Scalable and Extensible Approach to Benchmarking Neural Code GenerationCode2
The Low Emission Oil&Gas Open (LEOGO) Reference Platform of an Off-Grid Energy System for Renewable Integration Studies0
Unsupervised machine learning approach for building composite indicators with fuzzy metrics0
Sensitivity analysis and experimental evaluation of PID-like continuous sliding mode control0
Benchmarking Joint Face Spoofing and Forgery Detection with Visual and Physiological Cues0
Exact lattice-based stochastic cell culture simulation algorithms incorporating spontaneous and contact-dependent reactions0
fMRI-S4: learning short- and long-range dynamic fMRI dependencies using 1D Convolutions and State Space ModelsCode0
QSAM-Net: Rain streak removal by quaternion neural network with self-attention module0
SOMPT22: A Surveillance Oriented Multi-Pedestrian Tracking Dataset0
AstroVision: Towards Autonomous Feature Detection and Description for Missions to Small Bodies Using Deep LearningCode0
Benchmarking zero-shot and few-shot approaches for tokenization, tagging, and dependency parsing of Tagalog text0
Binary Classification with Positive Labeling Sources0
On the role of benchmarking data sets and simulations in method comparison studies0
CIPCaD-Bench: Continuous Industrial Process datasets for benchmarking Causal Discovery methodsCode1
A Multifaceted Benchmarking of Synthetic Electronic Health Record Generation ModelsCode1
ferret: a Framework for Benchmarking Explainers on TransformersCode0
A Case for Dataset Specific Profiling0
Accelerated and interpretable oblique random survival forestsCode1
On the Evaluation of User Privacy in Deep Neural Networks using Timing Side Channel0
Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression0
Vector-Based Data Improves Left-Right Eye-Tracking Classifier Performance After a Covariate Distributional ShiftCode0
PASTA: A Dataset for Modeling Participant States in Narratives0
Content-Aware Differential Privacy with Conditional Invertible Neural NetworksCode0
Benchmarking Azerbaijani Neural Machine Translation0
Towards Large-Scale Small Object Detection: Survey and Benchmarks0
Toward Transparent AI: A Survey on Interpreting the Inner Structures of Deep Neural Networks0
Tracking Every Thing in the WildCode1
ArtFID: Quantitative Evaluation of Neural Style TransferCode1
Show:102550
← PrevPage 75 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified