SOTAVerified

Benchmarking

Papers

Showing 18011825 of 5548 papers

TitleStatusHype
An Empirical Study of Benchmarking Chinese Aspect Sentiment Quad Prediction0
Comparative Benchmarking of Causal Discovery Techniques0
User-in-the-loop Evaluation of Multimodal LLMs for Activity Assistance0
Comparative Design Space Exploration of Dense and Semi-Dense SLAM0
Comparative evaluation of instrument segmentation and tracking methods in minimally invasive surgery0
ChatGPT vs State-of-the-Art Models: A Benchmarking Study in Keyphrase Generation Task0
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics0
Comparing Computing Platforms for Deep Learning on a Humanoid Robot0
Benchmarking Answer Verification Methods for Question Answering-Based Summarization Evaluation Metrics0
Comparing Hyper-optimized Machine Learning Models for Predicting Efficiency Degradation in Organic Solar Cells0
ChatGPT Alternative Solutions: Large Language Models Survey0
Comparison and Benchmarking of AI Models and Frameworks on Mobile Devices0
Comparison of feature extraction and dimensionality reduction methods for single channel extracellular spike sorting0
Comparison of tree-based ensemble algorithms for merging satellite and earth-observed precipitation data at the daily time scale0
An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets0
CompBench: Benchmarking Complex Instruction-guided Image Editing0
Chart-to-Experience: Benchmarking Multimodal LLMs for Predicting Experiential Impact of Charts0
CHaRNet: Conditioned Heatmap Regression for Robust Dental Landmark Localization0
Characterizing Transactional Databases for Frequent Itemset Mining0
Benchmarking and Validation of Sub-mW 30GHz VG-LNAs in 22nm FDSOI CMOS for 5G/6G Phased-Array Receivers0
Complexity of Representations in Deep Learning0
Comprehensive Benchmark Datasets for Amharic Scene Text Detection and Recognition0
Characterizing the adversarial vulnerability of speech self-supervised learning0
Characterizing Missing Information in Deep Networks Using Backpropagated Gradients0
An Empirical Investigation into Benchmarking Model Multiplicity for Trustworthy Machine Learning: A Case Study on Image Classification0
Show:102550
← PrevPage 73 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified