SOTAVerified

Benchmarking

Papers

Showing 39013950 of 5548 papers

TitleStatusHype
Improving Items and Contexts Understanding with Descriptive Graph for Conversational Recommendation0
Benchmarking the Physical-world Adversarial Robustness of Vehicle Detection0
Certifiable Black-Box Attacks with Randomized Adversarial Examples: Breaking Defenses with Provable ConfidenceCode0
ESPnet-ST-v2: Multipurpose Spoken Language Translation Toolkit0
On Evaluation of Bangla Word Analogies0
ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis0
SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented DataCode0
Benchmarking the Robustness of Quantized Models0
Probing Conceptual Understanding of Large Visual-Language ModelsCode0
Benchmarking Robustness to Text-Guided CorruptionsCode0
IHCV: Discovery of Hidden Time-Dependent Control Variables in Non-Linear Dynamical SystemsCode0
DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images0
The Saudi Privacy Policy DatasetCode0
LogoNet: a fine-grained network for instance-level logo sketch retrievalCode0
OpenContrails: Benchmarking Contrail Detection on GOES-16 ABI0
A Latent Fingerprint in the Wild Database0
LaCViT: A Label-aware Contrastive Fine-tuning Framework for Vision TransformersCode0
Benchmarking FedAvg and FedCurv for Image Classification Tasks0
Prediction of cancer driver genes and mutations: the potential of integrative computational frameworks0
Why is the winner the best?0
From Private to Public: Benchmarking GANs in the Context of Private Time Series Classification0
Open the box of digital neuromorphic processor: Towards effective algorithm-hardware co-design0
GeoNet: Benchmarking Unsupervised Adaptation across Geographies0
Hyperparameter optimization, quantum-assisted model performance prediction, and benchmarking of AI-based High Energy Physics workloads using HPC0
Exploring Continual Learning of Diffusion Models0
Balancing policy constraint and ensemble size in uncertainty-based offline reinforcement learningCode0
Vulnerability of Face Morphing Attacks: A Case Study on Lookalike and Identical Twins0
Benchmarking the Impact of Noise on Deep Learning-based Classification of Atrial Fibrillation in 12-Lead ECG0
Benchmarking the Reliability of Post-training Quantization: a Particular Focus on Worst-case Performance0
Adaptive Experimentation at Scale: A Computational Framework for Flexible Batches0
Automated deep learning segmentation of high-resolution 7 T postmortem MRI for quantitative analysis of structure-pathology correlations in neurodegenerative diseasesCode0
Benchmarking Robustness of 3D Object Detection to Common Corruptions in Autonomous DrivingCode0
A Multi-Task Deep Learning Approach for Sensor-based Human Activity Recognition and Segmentation0
NoisyHate: Mining Online Human-Written Perturbations for Realistic Robustness Benchmarking of Content Moderation Models0
DeAR: Debiasing Vision-Language Models with Additive Residuals0
ShabbyPages: A Reproducible Document Denoising and Binarization Dataset0
Joint Multi-Scale Tone Mapping and Denoising for HDR Image EnhancementCode0
From MNIST to ImageNet and Back: Benchmarking Continual Curriculum LearningCode0
DACOS-A Manually Annotated Dataset of Code Smells0
BaDLAD: A Large Multi-Domain Bengali Document Layout Analysis DatasetCode0
Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary DropoutsCode0
Multimodal Multi-User Surface Recognition with the Kernel Two-Sample TestCode0
Using Affine Combinations of BBOB Problems for Performance Assessment0
Towards Self-adaptive Mutation in Evolutionary Multi-Objective Algorithms0
Continuous Function Structured in Multilayer Perceptron for Global Optimization0
Leveraging Pre-trained AudioLDM for Sound Generation: A Benchmark Study0
Continuous-Time Gaussian Process Motion-Compensation for Event-vision Pattern Tracking with Distance Fields0
Benchmarking White Blood Cell Classification Under Domain ShiftCode0
Data-Efficient Training of CNNs and Transformers with Coresets: A Stability PerspectiveCode0
Benchmarking framework for machine learning classification from fNIRS dataCode0
Show:102550
← PrevPage 79 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified