SOTAVerified

Benchmarking

Papers

Showing 3776-3800 of 5548 papers

Title | Status | Hype
MUPAX: Multidimensional Problem Agnostic eXplainable AI | | 0
Benchmarking Defeasible Reasoning with Large Language Models -- Initial Experiments and Future Directions | | 0
Benchmarking Deep Trackers on Aerial Videos | | 0
MVS^2: Deep Unsupervised Multi-view Stereo with Multi-View Symmetry | | 0
My Boli: Code-mixed Marathi-English Corpora, Pretrained Language Models and Evaluation Benchmarks | | 0
N^2: A Unified Python Package and Test Bench for Nearest Neighbor-Based Matrix Completion | | 0
NABU - Multilingual Graph-based Neural RDF Verbalizer | | 0
Towards Toxic Positivity Detection | | 0
Benchmarking Deep Sequential Models on Volatility Predictions for Financial Time Series | | 0
Benchmarking Deep Reinforcement Learning Algorithms for Vision-based Robotics | | 0
Benchmarking Deep Learning Models for Object Detection on Edge Computing Devices | | 0
Benchmarking deep learning models for bearing fault diagnosis using the CWRU dataset: A multi-label approach | | 0
NAS-Bench-Zero: A Large Scale Dataset for Understanding Zero-Shot Neural Architecture Search | | 0
Benchmarking Deep Learning Frameworks for Automated Diagnosis of Ocular Toxoplasmosis: A Comprehensive Approach to Classification and Segmentation | | 0
NA-SODINN: a deep learning algorithm for exoplanet image detection based on residual noise regimes | | 0
NativQA: Multilingual Culturally-Aligned Natural Query for LLMs | | 0
Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition | | 0
Natural Disasters Detection in Social Media and Satellite imagery: a survey | | 0
Benchmarking Deep Learning-Based Methods for Irradiance Nowcasting with Sky Images | | 0
Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages | | 0
NATURAL PLAN: Benchmarking LLMs on Natural Language Planning | | 0
Nature-Inspired Optimization Algorithms: Challenges and Open Problems | | 0
NavBench: A Unified Robotics Benchmark for Reinforcement Learning-Based Autonomous Navigation | | 0
What Motivates You? Benchmarking Automatic Detection of Basic Needs from Short Posts | | 0
Benchmarking Deep Learning Architectures for Urban Vegetation Point Cloud Semantic Segmentation from MLS | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified