SOTAVerified

Benchmarking

Papers

Showing 18511875 of 5548 papers

TitleStatusHype
inMOTIFin: a lightweight end-to-end simulation software for regulatory sequencesCode0
BONES: a Benchmark fOr Neural Estimation of Shapley valuesCode0
BOLD: Dataset and Metrics for Measuring Biases in Open-Ended Language GenerationCode0
Benchmarking Instance-Centric Counterfactual Algorithms for XAI: From White Box to Black BoxCode0
Individual Fairness Guarantees for Neural NetworksCode0
BN-AuthProf: Benchmarking Machine Learning for Bangla Author Profiling on Social Media TextsCode0
InDL: A New Dataset and Benchmark for In-Diagram Logic Interpretation based on Visual IllusionCode0
Improving the Perturbation-Based Explanation of Deepfake Detectors Through the Use of Adversarially-Generated SamplesCode0
Benchmark data and method for real-time people counting in cluttered scenes using depth sensorsCode0
BLESS: Benchmarking Large Language Models on Sentence SimplificationCode0
Anomaly Detection in Large-Scale Cloud Systems: An Industry Case and DatasetCode0
Multiple Instance Learning: A Survey of Problem Characteristics and ApplicationsCode0
A Benchmarking Dataset with 2440 Organic Molecules for Volume Distribution at Steady StateCode0
IndiBias: A Benchmark Dataset to Measure Social Biases in Language Models for Indian ContextCode0
KArSL: Arabic Sign Language DatabaseCode0
LexSumm and LexT5: Benchmarking and Modeling Legal Summarization Tasks in EnglishCode0
OpenML Benchmarking SuitesCode0
Advancing and Benchmarking Personalized Tool Invocation for LLMsCode0
Importance of Disjoint Sampling in Conventional and Transformer Models for Hyperspectral Image ClassificationCode0
ImpliRet: Benchmarking the Implicit Fact Retrieval ChallengeCode0
Improved Multilingual Language Model Pretraining for Social Media Text via Translation Pair PredictionCode0
Immunofluorescence Capillary Imaging Segmentation: Cases StudyCode0
Impact of ImageNet Model Selection on Domain AdaptationCode0
BioVFM-21M: Benchmarking and Scaling Self-Supervised Vision Foundation Models for Biomedical Image AnalysisCode0
BioSentVec: creating sentence embeddings for biomedical textsCode0
Show:102550
← PrevPage 75 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified