SOTAVerified

Benchmarking

Papers

Showing 44764500 of 5548 papers

TitleStatusHype
Dynabench: Rethinking Benchmarking in NLP0
Efficient and Accurate In-Database Machine Learning with SQL Code Generation in Python0
Robust Semantic Interpretability: Revisiting Concept Activation VectorsCode1
CBench: Towards Better Evaluation of Question Answering Over Knowledge GraphsCode1
What Will it Take to Fix Benchmarking in Natural Language Understanding?0
The Multi-speaker Multi-style Voice Cloning Challenge 20210
Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy ReasoningCode0
An Empirical Evaluation of Cost-based Federated SPARQL Query Processing EnginesCode0
Benchmarking Transformer-based Language Models for Arabic Sentiment and Sarcasm Detection0
Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared TaskCode0
Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada0
Benchmarking a transformer-FREE model for ad-hoc retrievalCode0
Remote Sensing Image Classification with the SEN12MS DatasetCode1
Generalized Conflict-directed Search for Optimal Ordering Problems0
Simultaneous Navigation and Construction Benchmarking EnvironmentsCode1
Benchmarks for Deep Off-Policy EvaluationCode1
Unsupervised Learning of 3D Object Categories from Videos in the Wild0
3D AffordanceNet: A Benchmark for Visual Object Affordance UnderstandingCode1
Benchmarking Representation Learning for Natural World Image CollectionsCode0
RAN-GNNs: breaking the capacity limits of graph neural networks0
Deep Image Compositing0
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic EventsCode1
Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks0
Marine Snow Removal Benchmarking DatasetCode1
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design0
Show:102550
← PrevPage 180 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified