SOTAVerified

Benchmarking

Papers

Showing 4001-4050 of 5548 papers

Title | Status | Hype
SPIn-NeRF: Multiview Segmentation and Perceptual Inpainting with Neural Radiance Fields | | 0
Spintronics for image recognition: performance benchmarking via ultrafast data-driven simulations | | 0
SpiralMLP: A Lightweight Vision MLP Architecture | | 0
SpokenNativQA: Multilingual Everyday Spoken Queries for LLMs | | 0
Sports Intelligence: Assessing the Sports Understanding Capabilities of Language Models through Question Answering from Text to Video | | 0
SPot: A tool for identifying operating segments in financial tables | | 0
Spotting tell-tale visual artifacts in face swapping videos: strengths and pitfalls of CNN detectors | | 0
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads | | 0
Unifying Large Language Model and Deep Reinforcement Learning for Human-in-Loop Interactive Socially-aware Navigation | | 0
SS3DM: Benchmarking Street-View Surface Reconstruction with a Synthetic 3D Mesh Dataset | | 0
Stability Constrained OPF in Microgrids: A Chance Constrained Optimization Framework with Non-Gaussian Uncertainty | | 0
Stabilized Self-training with Negative Sampling on Few-labeled Graph Data | | 0
Stable Virtual Camera: Generative View Synthesis with Diffusion Models | | 0
Staining normalization in histopathology: Method benchmarking using multicenter dataset | | 0
Standardisation of Convex Ultrasound Data Through Geometric Analysis and Augmentation | | 0
Standardised workflow for mass spectrometry-based single-cell proteomics data processing and analysis using the scp package | | 0
CrisisBench: Benchmarking Crisis-related Social Media Datasets for Humanitarian Information Processing | | 0
State and Memory is All You Need for Robust and Reliable AI Agents | | 0
State-of-the-art AI-based Learning Approaches for Deepfake Generation and Detection, Analyzing Opportunities, Threading through Pros, Cons, and Future Prospects | | 0
State-of-the-Art in Human Scanpath Prediction | | 0
Statistical Multicriteria Benchmarking via the GSD-Front | | 0
Statistical Scenario Modelling and Lookalike Distributions for Multi-Variate AI Risk | | 0
StEduCov: An Explored and Benchmarked Dataset on Stance Detection in Tweets towards Online Education during COVID-19 Pandemic | | 0
Steerable Pyramid Weighted Loss: Multi-Scale Adaptive Weighting for Semantic Segmentation | | 0
STEER-ME: Assessing the Microeconomic Reasoning of Large Language Models | | 0
Stochastic Spiking Neural Networks with First-to-Spike Coding | | 0
Stratify: Unifying Multi-Step Forecasting Strategies | | 0
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs | | 0
StructTest: Benchmarking LLMs' Reasoning through Compositional Structured Outputs | | 0
Structural Property Prediction | | 0
Structure-Based Experimental Datasets for Benchmarking Protein Simulation Force Fields | | 0
StylusAI: Stylistic Adaptation for Robust German Handwritten Text Generation | | 0
Sub-8-bit quantization for on-device speech recognition: a regularization-free approach | | 0
Subjective Quality Assessment of Compressed Tone-Mapped High Dynamic Range Videos | | 0
Subspace Learning Machine (SLM): Methodology and Performance | | 0
SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes | | 0
Sum Rate Maximization for Pinching Antennas Assisted RSMA System With Multiple Waveguides | | 0
Sum Secrecy Rate Maximization for Full Duplex ISAC Systems | | 0
Super-Resolution via Deep Learning | | 0
Support Vector Machines and generalisation in HEP | | 0
Surface Reconstruction from Point Clouds: A Survey and a Benchmark | | 0
SurgBench: A Unified Large-Scale Benchmark for Surgical Video Analysis | | 0
Surprise Potential as a Measure of Interactivity in Driving Scenarios | | 0
Survey of HPC in US Research Institutions | | 0
Sustainable LLM Inference for Edge AI: Evaluating Quantized LLMs for Energy Efficiency, Output Accuracy, and Inference Latency | | 0
SUTD-PRCM Dataset and Neural Architecture Search Approach for Complex Metasurface Design | | 0
SVGenius: Benchmarking LLMs in SVG Understanding, Editing and Generation | | 0
SVLTA: Benchmarking Vision-Language Temporal Alignment via Synthetic Video Situation | | 0
SWIFT: Super-fast and Robust Privacy-Preserving Machine Learning | | 0
SydneyScapes: Image Segmentation for Australian Environments | | 0
Page 81 of 111

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified