SOTAVerified

Benchmarking

Papers

Showing 19762000 of 5548 papers

TitleStatusHype
Data-driven surrogate modelling and benchmarking for process equipment0
Data-Driven Target Localization: Benchmarking Gradient Descent Using the Cramer-Rao Bound0
Benchmarking Federated Machine Unlearning methods for Tabular Data0
ChakmaNMT: A Low-resource Machine Translation On Chakma Language0
Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning0
Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese0
End-to-End Neural Ranking for eCommerce Product Search: an application of task models and textual embeddings0
C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System0
CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking0
Benchmarking and Improving Generator-Validator Consistency of Language Models0
Certifying almost all quantum states with few single-qubit measurements0
A Platform for Event Extraction in Hindi0
DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition0
DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation0
Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines0
An efficient and perceptually motivated auditory neural encoding and decoding algorithm for spiking neural networks0
DDR-ID: Dual Deep Reconstruction Networks Based Image Decomposition for Anomaly Detection0
CellCycleGAN: Spatiotemporal Microscopy Image Synthesis of Cell Populations using Statistical Shape Models and Conditional GANs0
DeAR: Debiasing Vision-Language Models with Additive Residuals0
CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark0
DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis0
An efficiency analysis of Spanish airports0
Decentralized Federated Learning on the Edge over Wireless Mesh Networks0
1-D Convlutional Neural Networks for the Analysis of Pupil Size Variations in Scotopic Conditions0
Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption0
Show:102550
← PrevPage 80 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified