SOTAVerified

Benchmarking

Papers

Showing 28812890 of 5548 papers

TitleStatusHype
Soft-Hard Attention U-Net Model and Benchmark Dataset for Multiscale Image Shadow Removal0
Online Model-based Anomaly Detection in Multivariate Time Series: Taxonomy, Survey, Research Challenges and Future Directions0
Benchmarking In-the-wild Multimodal Disease Recognition and A Versatile Baseline0
From LLMs to LLM-based Agents for Software Engineering: A Survey of Current, Challenges and Future0
LMEMs for post-hoc analysis of HPO BenchmarkingCode0
MaterioMiner -- An ontology-based text mining dataset for extraction of process-structure-property entities0
SPINEX-TimeSeries: Similarity-based Predictions with Explainable Neighbors Exploration for Time Series and Forecasting Problems0
User-in-the-loop Evaluation of Multimodal LLMs for Activity Assistance0
Deep Reinforcement Learning for Dynamic Order Picking in Warehouse Operations0
Integrating Large Language Models and Knowledge Graphs for Extraction and Validation of Textual Test DataCode0
Show:102550
← PrevPage 289 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified