SOTAVerified

Benchmarking

Papers

Showing 39013950 of 5548 papers

TitleStatusHype
SCPO: Safe Reinforcement Learning with Safety Critic Policy Optimization0
SDFR: Synthetic Data for Face Recognition Competition0
Uncertainty in GNN Learning Evaluations: The Importance of a Consistent Benchmark for Community Detection0
SE Arena: An Interactive Platform for Evaluating Foundation Models in Software Engineering0
SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification0
SeaTurtleID2022: A long-span dataset for reliable sea turtle re-identification0
SecBench: A Comprehensive Multi-Dimensional Benchmarking Dataset for LLMs in Cybersecurity0
SecRepoBench: Benchmarking LLMs for Secure Code Generation in Real-World Repositories0
Secure Neuroimaging Analysis using Federated Learning with Homomorphic Encryption0
Securing the Skies: A Comprehensive Survey on Anti-UAV Methods, Benchmarking, and Future Directions0
Seeing in the Dark: Benchmarking Egocentric 3D Vision with the Oxford Day-and-Night Dataset0
Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction0
Segmenting Maxillofacial Structures in CBCT Volumes0
Segment Together: A Versatile Paradigm for Semi-Supervised Medical Image Segmentation0
SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios0
Selecting Differential Splicing Methods: Practical Considerations0
Selective Shot Learning for Code Explanation0
Self-supervised Benchmark Lottery on ImageNet: Do Marginal Improvements Translate to Improvements on Similar Datasets?0
Self-Supervised Speech Representation Learning: A Review0
Semantic Segmentation using Vision Transformers: A survey0
SemanticST: Spatially Informed Semantic Graph Learning for Clustering, Integration, and Scalable Analysis of Spatial Transcriptomics0
Semi and Weakly Supervised Semantic Segmentation Using Generative Adversarial Network0
Semi-implicit Continuous Newton Method for Power Flow Analysis0
Semi-supervised learning via Feedforward-Designed Convolutional Neural Networks0
Semi-supervised Learning with Graphs: Covariance Based Superpixels For Hyperspectral Image Classification0
Semi Supervised Semantic Segmentation Using Generative Adversarial Network0
SEN12-WATER: A New Dataset for Hydrological Applications and its Benchmarking0
Sensor Data for Human Activity Recognition: Feature Representation and Benchmarking0
Sentence Smith: Formally Controllable Text Transformation and its Application to Evaluation of Text Embedding Models0
SentSpace: Large-Scale Benchmarking and Evaluation of Text using Cognitively Motivated Lexical, Syntactic, and Semantic Features0
Sequence-Level Leakage Risk of Training Data in Large Language Models0
SEvoBench : A C++ Framework For Evolutionary Single-Objective Optimization Benchmarking0
SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects0
ShabbyPages: A Reproducible Document Denoising and Binarization Dataset0
SHARP 2020: The 1st Shape Recovery from Partial Textured 3D Scans Challenge Results0
Sheared Backpropagation for Fine-tuning Foundation Models0
ShiftedBronzes: Benchmarking and Analysis of Domain Fine-Grained Classification in Open-World Settings0
Short-term origin-destination demand prediction in urban rail transit systems: A channel-wise attentive split-convolutional neural network method0
SHOWMe: Benchmarking Object-agnostic Hand-Object 3D Reconstruction0
Show Some Love to Your n-grams: A Bit of Progress and Stronger n-gram Language Modeling Baselines0
SHS: Scorpion Hunting Strategy Swarm Algorithm0
Shuffle Vision Transformer: Lightweight, Fast and Efficient Recognition of Driver Facial Expression0
Benchmarking Stroke Forecasting with Stroke-Level Badminton Dataset0
SIAM: Chiplet-based Scalable In-Memory Acceleration with Mesh for Deep Neural Networks0
SIM2E: Benchmarking the Group Equivariant Capability of Correspondence Matching Algorithms0
SimBank: from Simulation to Solution in Prescriptive Process Monitoring0
SIMCOPILOT: Evaluating Large Language Models for Copilot-Style Code Generation0
Similarity-Quantized Relative Difference Learning for Improved Molecular Activity Prediction0
Simple Feedfoward Neural Networks are Almost All You Need for Time Series Forecasting0
Simulation-Based Sensitivity Analysis in Optimal Treatment Regimes and Causal Decomposition with Individualized Interventions0
Show:102550
← PrevPage 79 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified