SOTAVerified

Benchmarking

Papers

Showing 34513500 of 5548 papers

TitleStatusHype
A Novel Hybrid Ordinal Learning Model with Health Care Application0
SPEAL: Skeletal Prior Embedded Attention Learning for Cross-Source Point Cloud Registration0
EventAid: Benchmarking Event-aided Image/Video Enhancement Algorithms with Real-captured Hybrid Dataset0
Efficiently Quantifying Individual Agent Importance in Cooperative MARL0
Meta-survey on outlier and anomaly detectionCode0
Benchmarking Deep Learning Classifiers for SAR Automatic Target Recognition0
Benchmarking Pretrained Vision Embeddings for Near- and Duplicate Detection in Medical Images0
Watchog: A Light-weight Contrastive Learning based Framework for Column Annotation0
Implementing hosting capacity analysis in distribution networks: Practical considerations, advancements and future directions0
Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection0
Benchmarking of Query Strategies: Towards Future Deep Active LearningCode0
Graph-based Prediction and Planning Policy Network (GP3Net) for scalable self-driving in dynamic environments using Deep Reinforcement Learning0
Forecasting Lithium-Ion Battery Longevity with Limited Data Availability: Benchmarking Different Machine Learning Algorithms0
An Experimental Study: Assessing the Combined Framework of WavLM and BEST-RQ for Text-to-Speech Synthesis0
Perspectives on the State and Future of Deep Learning -- 20230
Multiview Aerial Visual Recognition (MAVREC): Can Multi-view Improve Aerial Visual Perception?0
KhabarChin: Automatic Detection of Important News in the Persian LanguageCode0
Dyport: Dynamic Importance-based Hypothesis Generation Benchmarking TechniqueCode0
Benchmarking Continual Learning from Cognitive Perspectives0
SPOC: Imitating Shortest Paths in Simulation Enables Effective Navigation and Manipulation in the Real World0
Semi-implicit Continuous Newton Method for Power Flow Analysis0
Liquid State Genetic Programming0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation0
BenchMARL: Benchmarking Multi-Agent Reinforcement Learning0
An Empirical Study of Automated Mislabel Detection in Real World Vision Datasets0
Evetac: An Event-based Optical Tactile Sensor for Robotic Manipulation0
Identifying patterns and recommendations of and for sustainable open data initiatives: a benchmarking-driven analysis of open government data initiatives among European countries0
Analyzing the Impact of Fake News on the Anticipated Outcome of the 2024 Election Ahead of Time0
Benchmarking Multi-Domain Active Learning on Image Classification0
Event-based Continuous Color Video Decompression from Single Frames0
Benchmarking and Enhancing Disentanglement in Concept-Residual Models0
Seg2Reg: Differentiable 2D Segmentation to 1D Regression Rendering for 360 Room Layout Reconstruction0
LucidDreaming: Controllable Object-Centric 3D Generation0
Z_2 Z_2 Equivariant Quantum Neural Networks: Benchmarking against Classical Neural NetworksCode0
A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval0
ROBBIE: Robust Bias Evaluation of Large Generative Language Models0
Mixed-Precision Quantization for Federated Learning on Resource-Constrained Heterogeneous Devices0
SAIBench: A Structural Interpretation of AI for Science Through Benchmarks0
TransOpt: Transformer-based Representation Learning for Optimization Problem Classification0
Enhancing Post-Hoc Explanation Benchmark Reliability for Image Classification0
PAWS-VMK: A Unified Approach To Semi-Supervised Learning And Out-of-Distribution Detection0
UniIR: Training and Benchmarking Universal Multimodal Information Retrievers0
Riemannian Self-Attention Mechanism for SPD Networks0
Syn3DWound: A Synthetic Dataset for 3D Wound Bed Analysis0
Comprehensive Benchmarking of Entropy and Margin Based Scoring Metrics for Data Selection0
FakeWatch ElectionShield: A Benchmarking Framework to Detect Fake News for Credible US Elections0
Experimental Analysis of Large-scale Learnable Vector Storage CompressionCode0
Lightly Weighted Automatic Audio Parameter Extraction for the Quality Assessment of Consensus Auditory-Perceptual Evaluation of Voice0
Benchmarking Large Language Model Volatility0
ASI: Accuracy-Stability Index for Evaluating Deep Learning Models0
Show:102550
← PrevPage 70 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified