SOTAVerified

Benchmarking

Papers

Showing 47514800 of 5548 papers

TitleStatusHype
EASTER: Efficient and Scalable Text Recognizer0
AIPerf: Automated machine learning as an AI-HPC benchmarkCode1
From Attack to Protection: Leveraging Watermarking Attack Network for Advanced Add-on Watermarking0
Continuous Optimization Benchmarks by SimulationCode0
An AI based talent acquisition and benchmarking for job0
Short-term origin-destination demand prediction in urban rail transit systems: A channel-wise attentive split-convolutional neural network method0
Scission: Performance-driven and Context-aware Cloud-Edge Distribution of Deep Neural NetworksCode0
A critical analysis of metrics used for measuring progress in artificial intelligence0
Cross-Model Image Annotation Platform with Active Learning0
Real-World Blur Dataset for Learning and Benchmarking Deblurring Algorithms0
Beyond Monocular Deraining: Stereo Image Deraining via Semantic Understanding0
Robust Benchmarking for Machine Learning of Clinical Entity ExtractionCode0
Benchmarking and Comparing Multi-exposure Image Fusion Algorithms0
Deep Hedging of Long-Term Financial Derivatives0
dMelodies: A Music Dataset for Disentanglement LearningCode1
Realistic Video Summarization through VISIOCITY: A New Benchmark and Evaluation Framework0
Benchmarking Meta-heuristic Optimization0
From Sound Representation to Model Robustness0
Benchmarking Multivariate Time Series Classification Algorithms0
Image-Based Benchmarking and Visualization for Large-Scale Global Optimization0
A Survey on Performance Metrics for Object-Detection AlgorithmsCode3
Explainable Rumor Detection using Inter and Intra-feature Attention Networks0
DDR-ID: Dual Deep Reconstruction Networks Based Image Decomposition for Anomaly Detection0
Few-Shot Defect Segmentation Leveraging Abundant Normal Training Samples Through Normal Background Regularization and Crop-and-Paste Operation0
ImageNet performance correlates with pose estimation robustness and generalization on out-of-domain data0
WordCraft: An Environment for Benchmarking Commonsense AgentsCode1
Domain2Vec: Domain Embedding for Unsupervised Domain AdaptationCode0
Towards an Automated SOAP Note: Classifying Utterances from Medical Conversations0
CoNES: Convex Natural Evolutionary Strategies0
Are We There Yet? Evaluating State-of-the-Art Neural Network based Geoparsers Using EUPEG as a Benchmarking PlatformCode1
Emoji Prediction: Extensions and BenchmarkingCode1
Towards causal benchmarking of bias in face analysis algorithmsCode0
CheXphoto: 10,000+ Photos and Transformations of Chest X-rays for Benchmarking Deep Learning RobustnessCode1
Affine Non-negative Collaborative Representation Based Pattern ClassificationCode0
GAMA: a General Automated Machine learning AssistantCode1
VisImages: A Fine-Grained Expert-Annotated Visualization Dataset0
Enhancing spatial and textual analysis with EUPEG: an extensible and unified platform for evaluating geoparsersCode1
URSABench: Comprehensive Benchmarking of Approximate Bayesian Inference Methods for Deep Neural NetworksCode1
Quaternion Capsule NetworksCode0
RobFR: Benchmarking Adversarial Robustness on Face RecognitionCode1
IOHanalyzer: Detailed Performance Analyses for Iterative Optimization HeuristicsCode1
Benchmarking in Optimization: Best Practice and Open Issues0
Re-thinking Co-Salient Object DetectionCode1
Wiki-CS: A Wikipedia-Based Benchmark for Graph Neural NetworksCode1
Complex Human Action Recognition in Live Videos Using Hybrid FR-DL Method0
Does imputation matter? Benchmark for predictive models0
Automatic Target Recognition on Synthetic Aperture Radar Imagery: A Survey0
Building benchmarking frameworks for supporting replicability and reproducibility: spatial and textual analysis as an example0
Quo Vadis, Skeleton Action Recognition ?Code1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via MetagradientCode1
Show:102550
← PrevPage 96 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified