SOTAVerified

Benchmarking

Papers

Showing 49264950 of 5548 papers

TitleStatusHype
Exploiting Out-of-Domain Parallel Data through Multilingual Transfer Learning for Low-Resource Neural Machine TranslationCode0
Zero-shot generation of synthetic neurosurgical data with large language modelsCode0
Benchmarking Pathology Foundation Models: Adaptation Strategies and ScenariosCode0
Three Revisits to Node-Level Graph Anomaly Detection: Outliers, Message Passing and Hyperbolic Neural NetworksCode0
Multiple Instance Learning: A Survey of Problem Characteristics and ApplicationsCode0
Self-Adjusting Weighted Expected Improvement for Bayesian OptimizationCode0
Multiple Light Source Dataset for Colour ResearchCode0
Experimental Analysis of Large-scale Learnable Vector Storage CompressionCode0
Benchmarking Parameter Control Methods in Differential Evolution for Mixed-Integer Black-Box OptimizationCode0
ThrowBench: Benchmarking LLMs by Predicting Runtime ExceptionsCode0
Benchmarking Domain Adaptation for Chemical Processes on the Tennessee Eastman ProcessCode0
AttackSeqBench: Benchmarking Large Language Models' Understanding of Sequential Patterns in Cyber AttacksCode0
Expecting The Unexpected: Towards Broad Out-Of-Distribution DetectionCode0
exHarmony: Authorship and Citations for Benchmarking the Reviewer Assignment ProblemCode0
Benchmarking optimality of time series classification methods in distinguishing diffusionsCode0
ExEBench: Benchmarking Foundation Models on Extreme Earth EventsCode0
MULTITAT: Benchmarking Multilingual Table-and-Text Question AnsweringCode0
Evolving Evolutionary Algorithms with PatternsCode0
Semantic Hilbert Space for Text Representation LearningCode0
A Continuous Information Gain Measure to Find the Most Discriminatory Problems for AI BenchmarkingCode0
Timage -- A Robust Time Series Classification PipelineCode0
AttackNet: Enhancing Biometric Security via Tailored Convolutional Neural Network Architectures for Liveness DetectionCode0
EvoLearner: Learning Description Logics with Evolutionary AlgorithmsCode0
Evidential Deep Learning for Uncertainty Quantification and Out-of-Distribution Detection in Jet Identification using Deep Neural NetworksCode0
Integrating Large Language Models and Knowledge Graphs for Extraction and Validation of Textual Test DataCode0
Show:102550
← PrevPage 198 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified