SOTAVerified

Benchmarking

Papers

Showing 44014450 of 5548 papers

TitleStatusHype
Beyond Emotion: A Multi-Modal Dataset for Human Desire Understanding0
A Survey on Masked Facial Detection Methods and Datasets for Fighting Against COVID-190
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics0
A Baseline Statistical Method For Robust User-Assisted Multiple SegmentationCode0
Aerial Scene Parsing: From Tile-level Scene Classification to Pixel-wise Semantic Labeling0
Standard Vs Uniform Binary Search and Their Variants in Learned Static Indexing: The Case of the Searching on Sorted Data Benchmarking Software PlatformCode0
DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation0
MPCLeague: Robust MPC Platform for Privacy-Preserving Machine Learning0
Benchmarking Pedestrian Odometry: The Brown Pedestrian Odometry Dataset (BPOD)0
TFW2V: An Enhanced Document Similarity Method for the Morphologically Rich Finnish LanguageCode0
InstaIndoor and Multi-modal Deep Learning for Indoor Scene RecognitionCode0
Evaluating the Robustness of Deep Reinforcement Learning for Autonomous Policies in a Multi-agent Urban Driving EnvironmentCode0
CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding0
QU-BraTS: MICCAI BraTS 2020 Challenge on Quantifying Uncertainty in Brain Tumor Segmentation - Analysis of Ranking Scores and Benchmarking ResultsCode0
Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent0
Logically at Factify 2022: Multimodal Fact Verification0
A Modular Workflow for Performance Benchmarking of Neuronal Network SimulationsCode0
On the Use of Quality Diversity Algorithms for The Traveling Thief Problem0
Benchmarking Safe Deep Reinforcement Learning in Aquatic Navigation0
Benchmarking Uncertainty Quantification on Biosignal Classification Tasks under Dataset Shift0
On the Value of ML Models0
GUNNEL: Guided Mixup Augmentation and Multi-View Fusion for Aquatic Animal SegmentationCode0
7th AI Driving Olympics: 1st Place Report for Panoptic Tracking0
GreenPCO: An Unsupervised Lightweight Point Cloud Odometry Method0
Reduced, Reused and Recycled: The Life of a Dataset in Machine Learning Research0
Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines0
Synthetic weather radar using hybrid quantum-classical machine learning0
An implementation of the "Guess who?" game using CLIPCode0
Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking0
HRNET: AI on Edge for mask detection and social distancingCode0
TinyML Platforms Benchmarking0
An in-depth experimental study of sensor usage and visual reasoning of robots navigating in real environments0
OOD-CV: A Benchmark for Robustness to Out-of-Distribution Shifts of Individual Nuisances in Natural Images0
3D Compositional Zero-shot Learning with DeCompositional Consensus0
EffCNet: An Efficient CondenseNet for Image Classification on NXP BlueBox0
Benchmarking Shadow Removal for Facial Landmark Detection and Beyond0
Learning to Transfer for Traffic Forecasting via Multi-task LearningCode0
Using Color To Identify Insider ThreatsCode0
A War Beyond Deepfake: Benchmarking Facial Counterfeits and Countermeasures0
A Modular Framework for Centrality and Clustering in Complex Networks0
RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR0
Filter Methods for Feature Selection in Supervised Machine Learning Applications -- Review and Benchmark0
Novel Real-Time EMT-TS Modeling Architecture for Feeder Blackstart Simulations0
CLMB: deep contrastive learning for robust metagenomic binningCode0
Benchmarking Quality-Dependent and Cost-Sensitive Score-Level Multimodal Biometric Fusion Algorithms0
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding0
MSAMSum: Towards Benchmarking Multi-lingual Dialogue Summarization0
Fantastic Questions and Where to Find Them: FairytaleQA--An Authentic Dataset for Narrative Comprehension0
Mukayese: Turkish NLP Strikes Back0
Multiclass Optimal Classification Trees with SVM-splits0
Show:102550
← PrevPage 89 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified