Benchmarking

Papers

Showing 4191–4200 of 5548 papers

Title	Date	Tasks	Status	Hype
Immunofluorescence Capillary Imaging Segmentation: Cases Study	Jul 14, 2022	Benchmarking, Image Segmentation	Code Available	0
Automated Detection of Label Errors in Semantic Segmentation Datasets via Deep Learning and Uncertainty Quantification	Jul 13, 2022	Benchmarking, Label Error Detection	Code Available	0
Slot Filling for Extracting Reskilling and Upskilling Options from the Web	Jul 11, 2022	Benchmarking, Entity Linking	Code Available	0
A novel evaluation methodology for supervised Feature Ranking algorithms	Jul 9, 2022	Benchmarking, Feature Importance	Code Available	0
Ensemble random forest filter: An alternative to the ensemble Kalman filter for inverse modeling	Jul 8, 2022	Benchmarking	Unverified	0
OVQA: A Clinically Generated Visual Question Answering Dataset	Jul 7, 2022	Benchmarking, Medical Visual Question Answering	Unverified	0
Benefits and Challenges of Dynamic Modelling of Cascading Failures in Power Systems	Jul 7, 2022	Benchmarking	Unverified	0
Identifying the Context Shift between Test Benchmarks and Production Data	Jul 3, 2022	Benchmarking, BIG-bench Machine Learning	Unverified	0
Towards Toxic Positivity Detection	Jul 1, 2022	Benchmarking, Classification	Unverified	0
DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles	Jul 1, 2022	Abstractive Text Summarization, Articles	Unverified	0

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	GPT-4 Turbo	ACC	0.56	—	Unverified