SOTAVerified

Benchmarking

Papers

Showing 4651-4700 of 5548 papers

Title | Status | Hype
Cryo-RALib -- a modular library for accelerating alignment in cryo-EM | Code | 0
Perturbation-based exploration methods in deep reinforcement learning | | 0
Benchmarking LF-MMI, CTC and RNN-T Criteria for Streaming ASR | | 0
Characterizing Transactional Databases for Frequent Itemset Mining | | 0
Long Range Arena: A Benchmark for Efficient Transformers | Code | 1
Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations? | Code | 0
A Comprehensive Comparison of Multi-Dimensional Image Denoising Methods | Code | 0
Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty | Code | 0
InferBench: Understanding Deep Learning Inference Serving with an Automatic Benchmarking System | | 0
EEGS: A Transparent Model of Emotions | | 0
The Forchheim Image Database for Camera Identification in the Wild | | 0
Rearrangement: A Challenge for Embodied AI | | 0
Face Morphing Attack Generation & Detection: A Comprehensive Survey | | 0
IndoLEM and IndoBERT: A Benchmark Dataset and Pre-trained Language Model for Indonesian NLP | | 0
Collective Knowledge: organizing research projects as a database of reusable components and portable workflows with common APIs | Code | 1
Alibaba’s Submission for the WMT 2020 APE Shared Task: Improving Automatic Post-Editing with Pre-trained Conditional Cross-Lingual BERT | | 0
Cross-lingual sentiment classification in low-resource Bengali language | Code | 0
On the Reliability and Validity of Detecting Approval of Political Actors in Tweets | | 0
Benchmarking Meaning Representations in Neural Semantic Parsing | Code | 1
Neural Network Design: Learning from Neural Architecture Search | Code | 0
Is Transfer Learning Necessary for Protein Landscape Prediction? | | 0
A Critical Assessment of State-of-the-Art in Entity Alignment | Code | 1
Improving seasonal forecast using probabilistic deep learning | | 0
SHARP 2020: The 1st Shape Recovery from Partial Textured 3D Scans Challenge Results | | 0
Benchmarking Deep Learning Interpretability in Time Series Predictions | Code | 1
Probing Acoustic Representations for Phonetic Properties | Code | 0
Kvasir-Instrument: Diagnostic and therapeutic tool segmentation dataset in gastrointestinal endoscopy | Code | 1
KINNEWS and KIRNEWS: Benchmarking Cross-Lingual Text Classification for Kinyarwanda and Kirundi | Code | 1
CellCycleGAN: Spatiotemporal Microscopy Image Synthesis of Cell Populations using Statistical Shape Models and Conditional GANs | | 0
Learnability and Complexity of Quantum Samples | Code | 0
Exploiting News Article Structure for Automatic Corpus Generation of Entailment Datasets | Code | 1
Self-Alignment Pretraining for Biomedical Entity Representations | Code | 1
German's Next Language Model | Code | 1
On Benchmarking Iris Recognition within a Head-mounted Display for AR/VR Application | | 0
A Flatter Loss for Bias Mitigation in Cross-dataset Facial Age Estimation | | 0
Promoting High Diversity Ensemble Learning with EnsembleBench | Code | 1
How much progress have we made in neural network training? A New Evaluation Protocol for Benchmarking Optimizers | | 0
Bayesian Neural Networks with Soft Evidence | Code | 0
RobustBench: a standardized adversarial robustness benchmark | Code | 1
RADIATE: A Radar Dataset for Automotive Perception in Bad Weather | Code | 1
A Seq2Seq approach to Symbolic Regression | Code | 0
ArCOV19-Rumors: Arabic COVID-19 Twitter Dataset for Misinformation Detection | | 0
ALdataset: a benchmark for pool-based active learning | | 0
Applicability and Challenges of Deep Reinforcement Learning for Satellite Frequency Plan Design | | 0
Teaspoon: A comprehensive python package for topological signal processing | | 0
Downsampling and geometric feature methods for EEG classification tasks with CNNs | | 0
TOTOPO: Classifying univariate and multivariate time series with Topological Data Analysis | | 0
Light Field Salient Object Detection: A Review and Benchmark | Code | 1
Addressing the Real-world Class Imbalance Problem in Dermatology | | 0
Black-Box Optimization Revisited: Improving Algorithm Selection Wizards through Massive Benchmarking | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified