SOTAVerified

Benchmarking

Papers

Showing 38763900 of 5548 papers

TitleStatusHype
SAIBench: Benchmarking AI for Science0
Saliency Benchmarking Made Easy: Separating Models, Maps and Metrics0
Salient Object Detection: A Benchmark0
SAMA: Towards Multi-Turn Referential Grounded Video Chat with Large Language Models0
SAM-based instance segmentation models for the automation of structural damage detection0
Sarcasm in Sight and Sound: Benchmarking and Expansion to Improve Multimodal Sarcasm Detection0
SASSE: Scalable and Adaptable 6-DOF Pose Estimation0
SATBench: Benchmarking LLMs' Logical Reasoning via Automated Puzzle Generation from SAT Formulas0
SAWNet: A Spatially Aware Deep Neural Network for 3D Point Cloud Processing0
Scaffold Splits Overestimate Virtual Screening Performance0
Scalable and Customizable Benchmark Problems for Many-Objective Optimization0
Scalable and Hybrid Ensemble-Based Causality Discovery0
Scalable, Distributed AI Frameworks: Leveraging Cloud Computing for Enhanced Deep Learning Performance and Efficiency0
Scalable Psychological Momentum Forecasting in Esports0
Automated Coding of Communications in Collaborative Problem-solving Tasks Using ChatGPT0
ScanNeRF: a Scalable Benchmark for Neural Radiance Fields0
SCBench: A Sports Commentary Benchmark for Video LLMs0
Scenarios and Approaches for Situated Natural Language Explanations0
ScholarSearch: Benchmarking Scholar Searching Ability of LLMs0
SciDoc2Diagrammer-MAF: Towards Generation of Scientific Diagrams from Documents guided by Multi-Aspect Feedback Refinement0
Science Across Languages: Assessing LLM Multilingual Translation of Scientific Papers0
Scientific Machine Learning Benchmarks0
SciHorizon: Benchmarking AI-for-Science Readiness from Scientific Data to Large Language Models0
scMamba: A Scalable Foundation Model for Single-Cell Multi-Omics Integration Beyond Highly Variable Feature Selection0
Score-Based Generative Models for Molecule Generation0
Show:102550
← PrevPage 156 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified