SOTAVerified

Benchmarking

Papers

Showing 3751–3800 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Multimodal Information Retrieval for Open World with Edit Distance Weak Supervision | | 0 |
| Benchmarking Edge Computing Devices for Grape Bunches and Trunks Detection using Accelerated Object Detection Single Shot MultiBox Deep Learning Models | | 0 |
| Benchmarking Edge AI Platforms for High-Performance ML Inference | | 0 |
| Quantum Similarity Testing with Convolutional Neural Networks | | 0 |
| Benchmarking Domain Randomisation for Visual Sim-to-Real Transfer | | 0 |
| Multimodal or Text? Retrieval or BERT? Benchmarking Classifiers for the Shared Task on Hateful Memes | | 0 |
| Multi-Modal Three-Stream Network for Action Recognition | | 0 |
| MultiON: Benchmarking Semantic Map Memory using Multi-Object Navigation | | 0 |
| Towards Spoken Mathematical Reasoning: Benchmarking Speech-based Models over Multi-faceted Math Problems | | 0 |
| LadderMIL: Multiple Instance Learning with Coarse-to-Fine Self-Distillation | | 0 |
| Towards Stable 3D Object Detection | | 0 |
| Benchmarking Domain Generalization on EEG-based Emotion Recognition | | 0 |
| MultiRobustBench: Benchmarking Robustness Against Multiple Attacks | | 0 |
| MultiSocial: Multilingual Benchmark of Machine-Generated Text Detection of Social-Media Texts | | 0 |
| AT-Drone: Benchmarking Adaptive Teaming in Multi-Drone Pursuit | | 0 |
| MultiSpider: Towards Benchmarking Multilingual Text-to-SQL Semantic Parsing | | 0 |
| Benchmarking Diverse-Modal Entity Linking with Generative Models | | 0 |
| Benchmarking Discrete Optimization Heuristics with IOHprofiler | | 0 |
| Non-linear Multitask Learning with Deep Gaussian Processes | | 0 |
| Benchmarking Differential Evolution on a Quantum Simulator | | 0 |
| Adaptive Gradient Methods with Local Guarantees | | 0 |
| Benchmarking Denoising Algorithms with Real Photographs | | 0 |
| Multivariate Stochastic Dominance via Optimal Transport and Applications to Models Benchmarking | | 0 |
| Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception? | | 0 |
| Multi-view deep learning based molecule design and structural optimization accelerates the SARS-CoV-2 inhibitor discovery | | 0 |
| MUPAX: Multidimensional Problem Agnostic eXplainable AI | | 0 |
| Benchmarking Defeasible Reasoning with Large Language Models -- Initial Experiments and Future Directions | | 0 |
| Benchmarking Deep Trackers on Aerial Videos | | 0 |
| MVS^2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry | | 0 |
| My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks | | 0 |
| N^2: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion | | 0 |
| NABU - Multilingual Graph-based Neural RDF Verbalizer | | 0 |
| Towards Toxic Positivity Detection | | 0 |
| Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series | | 0 |
| Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics | | 0 |
| Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices | | 0 |
| Benchmarking deep learning models for bearing fault diagnosis using the CWRU dataset: A multi-label approach | | 0 |
| NAS-Bench-Zero: A Large Scale Dataset for Understanding Zero-Shot Neural Architecture Search | | 0 |
| Benchmarking Deep Learning Frameworks for Automated Diagnosis of Ocular Toxoplasmosis: A Comprehensive Approach to Classification and Segmentation | | 0 |
| NA-SODINN: a deep learning algorithm for exoplanet image detection based on residual noise regimes | | 0 |
| NativQA: Multilingual Culturally-Aligned Natural Query for LLMs | | 0 |
| Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition | | 0 |
| Natural Disasters Detection in Social Media and Satellite imagery: a survey | | 0 |
| Benchmarking Deep Learning-Based Methods for Irradiance Nowcasting with Sky Images | | 0 |
| Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages | | 0 |
| NATURAL PLAN: Benchmarking LLMs on Natural Language Planning | | 0 |
| Nature-Inspired Optimization Algorithms: Challenges and Open Problems | | 0 |
| NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation | | 0 |
| What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts | | 0 |
| Benchmarking Deep Learning Architectures for Urban Vegetation Point Cloud Semantic Segmentation from MLS | | 0 |
Page 76 of 111

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |