SOTAVerified

Benchmarking

Papers

Showing 4451-4500 of 5548 papers

Title | Status | Hype
A Benchmarking on Cloud based Speech-To-Text Services for French Speech and Background Noise Effect | | 0
PathBench: A Benchmarking Platform for Classical and Learned Path Planning Algorithms | | 0
Event Camera Simulator Design for Modeling Attention-based Inference Architectures | | 0
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal Processing | Code | 1
A Complementarity Analysis of the COCO Benchmark Problems and Artificially Generated Problems | | 0
2.5D Visual Relationship Detection | Code | 1
OPTION: OPTImization Algorithm Benchmarking ONtology | | 0
Towards Trustworthy Deception Detection: Benchmarking Model Robustness across Domains, Modalities, and Languages | | 0
Knodle: Modular Weakly Supervised Learning with PyTorch | Code | 1
Measuring what Really Matters: Optimizing Neural Networks for TinyML | Code | 0
Model-predictive control and reinforcement learning in multi-energy system case studies | | 0
Benchmarking the Benchmark -- Analysis of Synthetic NIDS Datasets | | 0
FedNLP: Benchmarking Federated Learning Methods for Natural Language Processing Tasks | Code | 0
The Impact of ASR on the Automatic Analysis of Linguistic Complexity and Sophistication in Spontaneous L2 Speech | | 0
BEIR: A Heterogenous Benchmark for Zero-shot Evaluation of Information Retrieval Models | Code | 2
Towards Standardising Reinforcement Learning Approaches for Production Scheduling Problems | Code | 1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series Data | Code | 1
Jointly Modeling and Clustering Tensors in High Dimensions | | 0
On the Assessment of Benchmark Suites for Algorithm Comparison | | 0
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning Interpretability | Code | 1
Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm Optimization | Code | 1
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style Transfer | Code | 1
A Probabilistic Framework for Lexicon-based Keyword Spotting in Handwritten Text Images | | 0
Benchmarking Scene Text Recognition in Devanagari, Telugu and Malayalam | | 0
BERT-based Chinese Text Classification for Emergency Domain with a Novel Loss Function | | 0
Dynabench: Rethinking Benchmarking in NLP | | 0
Efficient and Accurate In-Database Machine Learning with SQL Code Generation in Python | | 0
Robust Semantic Interpretability: Revisiting Concept Activation Vectors | Code | 1
CBench: Towards Better Evaluation of Question Answering Over Knowledge Graphs | Code | 1
What Will it Take to Fix Benchmarking in Natural Language Understanding? | | 0
The Multi-speaker Multi-style Voice Cloning Challenge 2021 | | 0
Improving Pretrained Models for Zero-shot Multi-label Text Classification through Reinforced Label Hierarchy Reasoning | Code | 0
An Empirical Evaluation of Cost-based Federated SPARQL Query Processing Engines | Code | 0
Benchmarking Transformer-based Language Models for Arabic Sentiment and Sarcasm Detection | | 0
Benchmarking Pre-trained Language Models for Multilingual NER: TraSpaS at the BSNLP2021 Shared Task | Code | 0
Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada | | 0
Benchmarking a transformer-FREE model for ad-hoc retrieval | Code | 0
Remote Sensing Image Classification with the SEN12MS Dataset | Code | 1
Generalized Conflict-directed Search for Optimal Ordering Problems | | 0
Simultaneous Navigation and Construction Benchmarking Environments | Code | 1
Benchmarks for Deep Off-Policy Evaluation | Code | 1
Unsupervised Learning of 3D Object Categories from Videos in the Wild | | 0
3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding | Code | 1
Benchmarking Representation Learning for Natural World Image Collections | Code | 0
RAN-GNNs: breaking the capacity limits of graph neural networks | | 0
Deep Image Compositing | | 0
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic Events | Code | 1
Exploiting Adam-like Optimization Algorithms to Improve the Performance of Convolutional Neural Networks | | 0
Marine Snow Removal Benchmarking Dataset | Code | 1
Enabling Design Methodologies and Future Trends for Edge AI: Specialization and Co-design | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified