SOTAVerified

Benchmarking

Papers

Showing 1651-1675 of 5548 papers

Title | Status | Hype
Benchmarking VLMs' Reasoning About Persuasive Atypical Images | | 0
Benchmarking Large Language Model Uncertainty for Prompt Optimization | Code | 0
Benchmarking LLMs in Political Content Text-Annotation: Proof-of-Concept with Toxicity and Incivility Data | | 0
Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering | | 0
LLM-Powered Grapheme-to-Phoneme Conversion: Benchmark and Case Study | | 0
Text-To-Speech Synthesis In The Wild | | 0
ODAQ: Open Dataset of Audio Quality - Benchmark on GitHub | Code | 1
Introducing CausalBench: A Flexible Benchmark Framework for Causal Analysis and Machine Learning | | 0
Linear energy storage and flexibility model with ramp rate, ramping, deadline and capacity constraints | Code | 0
Enhancing Q&A Text Retrieval with Ranking Models: Benchmarking, fine-tuning and deploying Rerankers for RAG | | 0
Online vs Offline: A Comparative Study of First-Party and Third-Party Evaluations of Social Chatbots | | 0
Efficient Sparse Coding with the Adaptive Locally Competitive Algorithm for Speech Classification | | 0
The JPEG Pleno Learning-based Point Cloud Coding Standard: Serving Man and Machine | | 0
Improve Machine Learning carbon footprint using Nvidia GPU and Mixed Precision training for classification models -- Part I | Code | 0
The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal | | 0
Benchmarking and Validation of Sub-mW 30GHz VG-LNAs in 22nm FDSOI CMOS for 5G/6G Phased-Array Receivers | | 0
Understanding Foundation Models: Are We Back in 1924? | | 0
Unsupervised Novelty Detection Methods Benchmarking with Wavelet Decomposition | Code | 0
Benchmarking 2D Egocentric Hand Pose Datasets | | 0
VoiceWukong: Benchmarking Deepfake Voice Detection | | 0
Mahalanobis k-NN: A Statistical Lens for Robust Point-Cloud Registrations | Code | 0
Benchmarking Sub-Genre Classification For Mainstage Dance Music | | 0
MIP-GAF: A MLLM-annotated Benchmark for Most Important Person Localization and Group Context Understanding | Code | 0
Ransomware Detection Using Machine Learning in the Linux Kernel | | 0
Selecting Differential Splicing Methods: Practical Considerations | | 0
Page 67 of 222

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified