SOTAVerified

Benchmarking

Papers

Showing 25512600 of 5548 papers

TitleStatusHype
Handwritten Text Recognition: A Survey0
Benchmarking Resource Usage for Efficient Distributed Deep Learning0
FedHPO-B: A Benchmark Suite for Federated Hyperparameter Optimization0
HaN-Seg: The head and neck organ-at-risk CT and MR segmentation dataset0
A Survey on Vision Autoregressive Model0
Federated Deconfounding and Debiasing Learning for Out-of-Distribution Generalization0
A Survey on Temporal Sentence Grounding in Videos0
FedAD-Bench: A Unified Benchmark for Federated Unsupervised Anomaly Detection in Tabular Data0
FeDa4Fair: Client-Level Federated Datasets for Fairness Evaluation0
Benchmarking Reinforcement Learning Methods for Dexterous Robotic Manipulation with a Three-Fingered Gripper0
4Seasons: Benchmarking Visual SLAM and Long-Term Localization for Autonomous Driving in Challenging Conditions0
Hardware and Software Optimizations for Accelerating Deep Neural Networks: Survey of Current Trends, Challenges, and the Road Ahead0
HA-VLN: A Benchmark for Human-Aware Navigation in Discrete-Continuous Environments with Dynamic Multi-Human Interactions, Real-World Validation, and an Open Leaderboard0
Imitation Learning Datasets: A Toolkit For Creating Datasets, Training Agents and Benchmarking0
FederatedScope-LLM: A Comprehensive Package for Fine-tuning Large Language Models in Federated Learning0
FedEval: A Holistic Evaluation Framework for Federated Learning0
FeB4RAG: Evaluating Federated Search in the Context of Retrieval Augmented Generation0
Feature selection in linear SVMs via a hard cardinality constraint: a scalable SDP decomposition approach0
A Survey on Semi-Supervised Learning for Delayed Partially Labelled Data Streams0
Feature Selection and Classification of Hyperspectral Images With Support Vector Machines0
Featuremetric benchmarking: Quantum computer benchmarks based on circuit features0
Airport Capacity and Performance in Europe -- A study of transport economics, service quality and sustainability0
A Survey on Preserving Fairness Guarantees in Changing Environments0
Benchmarking Reasoning Robustness in Large Language Models0
Feature Encodings for Gradient Boosting with Automunge0
gSuite: A Flexible and Framework Independent Benchmark Suite for Graph Neural Network Inference on GPUs0
Feature-based Evolutionary Diversity Optimization of Discriminating Instances for Chance-constrained Optimization Problems0
Benchmarking real-time monitoring strategies for ethanol production from lignocellulosic biomass0
FERA 2017 - Addressing Head Pose in the Third Facial Expression Recognition and Analysis Challenge0
FER-C: Benchmarking Out-of-Distribution Soft Calibration for Facial Expression Recognition0
GTP-4o: Modality-prompted Heterogeneous Graph Learning for Omni-modal Biomedical Representation0
Feasibility of BERT Embeddings For Domain-Specific Knowledge Mining0
Benchmarking real-time algorithms for in-phase auditory stimulation of low amplitude slow waves with wearable EEG devices during sleep0
F-Bench: Rethinking Human Preference Evaluation Metrics for Benchmarking Face Generation, Customization, and Restoration0
FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding0
Few-Shot Defect Segmentation Leveraging Abundant Normal Training Samples Through Normal Background Regularization and Crop-and-Paste Operation0
Benchmarking Randomized Optimization Algorithms on Binary, Permutation, and Combinatorial Problem Landscapes0
FAVOR-Bench: A Comprehensive Benchmark for Fine-Grained Video Motion Understanding0
Benchmarking Robustness in Neural Radiance Fields0
Fiber Bundle Morphisms as a Framework for Modeling Many-to-Many Maps0
Fast Training of Deep Networks with One-Class CNNs0
AI-ready Snow Radar Echogram Dataset (SRED) for climate change monitoring0
A Comprehensive Benchmarking Platform for Deep Generative Models in Molecular Design0
Guidelines for Fine-grained Sentence-level Arabic Readability Annotation0
Fast Labeling and Transcription with the Speechalyzer Toolkit0
Benchmarking Quantum Hardware for Training of Fully Visible Boltzmann Machines0
FastEnsemble: Benchmarking and Accelerating Ensemble-based Uncertainty Estimation for Image-to-Image Translation0
Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada0
Fast Empirical Scenarios0
Benchmarking Quantum Convolutional Neural Networks for Signal Classification in Simulated Gamma-Ray Burst Detection0
Show:102550
← PrevPage 52 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified