SOTAVerified

Benchmarking

Papers

Showing 45614570 of 5548 papers

TitleStatusHype
BLESS: Benchmarking Large Language Models on Sentence SimplificationCode0
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair PredictionCode0
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
BanglaNLP at BLP-2023 Task 2: Benchmarking different Transformer Models for Sentiment Analysis of Bangla Social Media PostsCode0
LLM Performance for Code Generation on Noisy TasksCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
A Dataset for Web-Scale Knowledge Base PopulationCode0
The Devil is in the Prompts: De-Identification Traces Enhance Memorization Risks in Synthetic Chest X-Ray GenerationCode0
Impact of ImageNet Model Selection on Domain AdaptationCode0
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
Show:102550
← PrevPage 457 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified