SOTAVerified

Benchmarking

Papers

Showing 27512775 of 5548 papers

TitleStatusHype
Context-guided Triple Matching for Multiple Choice Question Answering0
Contextual Metric Meta-Evaluation by Measuring Local Metric Accuracy0
Exploring the Practicality of Generative Retrieval on Dynamic Corpora0
Continuous Function Structured in Multilayer Perceptron for Global Optimization0
Continuous-Time Gaussian Process Motion-Compensation for Event-vision Pattern Tracking with Distance Fields0
Continuous U-Net: Faster, Greater and Noiseless0
Contrastive Learning-Based Spectral Knowledge Distillation for Multi-Modality and Missing Modality Scenarios in Semantic Segmentation0
Contribution à l'Optimisation d'un Comportement Collectif pour un Groupe de Robots Autonomes0
Contributions of the Petabyte Scale Sequence Search Codeathon toward efforts to scale sequence-based searches on SRA0
ConvBench: A Comprehensive Benchmark for 2D Convolution Primitive Evaluation0
ConvCodeWorld: Benchmarking Conversational Code Generation in Reproducible Feedback Environments0
Convolutional and Deep Learning based techniques for Time Series Ordinal Classification0
COPA: Comparing the Incomparable to Explore the Pareto Front0
CORE: A Knowledge Graph Entity Type Prediction Method via Complex Space Regression and Embedding0
CORE: Benchmarking LLMs Code Reasoning Capabilities through Static Analysis Tasks0
Cornac: A Comparative Framework for Multimodal Recommender Systems0
COSET: A Benchmark for Evaluating Neural Program Embeddings0
CoSy: Evaluating Textual Explanations of Neurons0
Countering Backdoor Attacks in Image Recognition: A Survey and Evaluation of Mitigation Strategies0
COUNTS: Benchmarking Object Detectors and Multimodal Large Language Models under Distribution Shifts0
Coupling volume-excluding compartment-based models of diffusion at different scales: Voronoi and pseudo-compartment approaches0
Covariance Matrix Adaptation Evolution Strategy Assisted by Principal Component Analysis0
Creating a Data Collection for Evaluating Rich Speech Retrieval0
CRF-based Single-stage Acoustic Modeling with CTC Topology0
CroCoDL: Cross-device Collaborative Dataset for Localization0
Show:102550
← PrevPage 111 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified