SOTAVerified

Benchmarking

Papers

Showing 5051-5075 of 5548 papers

| Title | Status | Hype |
|---|---|---|
| Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate Gradients | Code | 0 |
| SimbaML: Connecting Mechanistic Models and Machine Learning with Augmented Data | Code | 0 |
| NSINA: A News Corpus for Sinhala | Code | 0 |
| Improving Sequential Recommendation Models with an Enhanced Loss Function | Code | 0 |
| Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional Networks | Code | 0 |
| Editing Factual Knowledge and Explanatory Ability of Medical Large Language Models | Code | 0 |
| SimBench: A Rule-Based Multi-Turn Interaction Benchmark for Evaluating an LLM's Ability to Generate Digital Twins | Code | 0 |
| A Seq2Seq approach to Symbolic Regression | Code | 0 |
| A Collection of Quality Diversity Optimization Problems Derived from Hyperparameter Optimization of Machine Learning Models | Code | 0 |
| Simitate: A Hybrid Imitation Learning Benchmark | Code | 0 |
| Echo State Networks with Self-Normalizing Activations on the Hyper-Sphere | Code | 0 |
| ECBD: Evidence-Centered Benchmark Design for NLP | Code | 0 |
| A Continuous Optimisation Benchmark Suite from Neural Network Regression | Code | 0 |
| An Evaluation of Machine Learning Approaches for Early Diagnosis of Autism Spectrum Disorder | Code | 0 |
| Dyport: Dynamic Importance-based Hypothesis Generation Benchmarking Technique | Code | 0 |
| DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning | Code | 0 |
| DynamoRep: Trajectory-Based Population Dynamics for Classification of Black-box Optimization Problems | Code | 0 |
| Simple GNNs with Low Rank Non-parametric Aggregators | Code | 0 |
| Effective Stabilized Self-Training on Few-Labeled Graph Data | Code | 0 |
| Simulated Contextual Bandits for Personalization Tasks from Recommendation Datasets | Code | 0 |
| A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market | Code | 0 |
| DyKgChat: Benchmarking Dialogue Generation Grounding on Dynamic Knowledge Graphs | Code | 0 |
| DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Code | 0 |
| Referenced Thermodynamic Integration for Bayesian Model Selection: Application to COVID-19 Model Selection | Code | 0 |
| Simulation-based Benchmarking for Causal Structure Learning in Gene Perturbation Experiments | Code | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |