SOTAVerified

Benchmarking

Papers

Showing 40014025 of 5548 papers

TitleStatusHype
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields0
Spintronics for image recognition: performance benchmarking via ultrafast data-driven simulations0
SpiralMLP: A Lightweight Vision MLP Architecture0
SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs0
Sports Intelligence: Assessing the Sports Understanding Capabilities of Language Models through Question Answering from Text to Video0
SPot: A tool for identifying operating segments in financial tables0
Spotting tell-tale visual artifacts in face swapping videos: strengths and pitfalls of CNN detectors0
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads0
Unifying Large Language Model and Deep Reinforcement Learning for Human-in-Loop Interactive Socially-aware Navigation0
SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset0
Stability Constrained OPF in Microgrids: A Chance Constrained Optimization Framework with Non-Gaussian Uncertainty0
Stabilized Self-training with Negative Sampling on Few-labeled Graph Data0
Stable Virtual Camera: Generative View Synthesis with Diffusion Models0
Staining normalization in histopathology: Method benchmarking using multicenter dataset0
Standardisation of Convex Ultrasound Data Through Geometric Analysis and Augmentation0
Standardised workflow for mass spectrometry-based single-cell proteomics data processing and analysis using the scp package0
CrisisBench: Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing0
State and Memory is All You Need for Robust and Reliable AI Agents0
State-of-the-art AI-based Learning Approaches for Deepfake Generation and Detection, Analyzing Opportunities, Threading through Pros, Cons, and Future Prospects0
State-of-the-Art in Human Scanpath Prediction0
Statistical Multicriteria Benchmarking via the GSD-Front0
Statistical Scenario Modelling and Lookalike Distributions for Multi-Variate AI Risk0
StEduCov: An Explored and Benchmarked Dataset on Stance Detection in Tweets towards Online Education during COVID-19 Pandemic0
Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation0
STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models0
Show:102550
← PrevPage 161 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified