SOTAVerified

Benchmarking

Papers

Showing 41814190 of 5548 papers

TitleStatusHype
Towards Benchmarking and Assessing the Safety and Robustness of Autonomous Driving on Safety-critical Scenarios0
Towards Benchmarking and Evaluating Deepfake Detection0
Towards Benchmarking Explainable Artificial Intelligence Methods0
Towards Benchmarking Scene Background Initialization0
Towards Benchmarking the Utility of Explanations for Model Debugging0
Towards Class-agnostic Tracking Using Feature Decorrelation in Point Clouds0
Towards Effective Disambiguation for Machine Translation with Large Language Models0
Towards Efficient Multi-LLM Inference: Characterization and Analysis of LLM Routing and Hierarchical Techniques0
Towards Explainability and Fairness in Swiss Judgement Prediction: Benchmarking on a Multilingual Dataset0
Towards Explainable Network Intrusion Detection using Large Language Models0
Show:102550
← PrevPage 419 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified