SOTAVerified

Benchmarking

Papers

Showing 47264750 of 5548 papers

TitleStatusHype
GRATIS: GeneRAting TIme Series with diverse and controllable characteristicsCode0
Understanding the World's Museums through Vision-Language ReasoningCode0
RUPBench: Benchmarking Reasoning Under Perturbations for Robustness Evaluation in Large Language ModelsCode0
Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes ProsthesisCode0
Benchmarking the Fairness of Image Upsampling MethodsCode0
Graph-theoretical approach to robust 3D normal extraction of LiDAR dataCode0
A Modular Workflow for Performance Benchmarking of Neuronal Network SimulationsCode0
Messing Up 3D Virtual Environments: Transferable Adversarial 3D ObjectsCode0
Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral PerspectiveCode0
Meta-Black-Box-Optimization through Offline Q-function LearningCode0
Learning Conjoint Attentions for Graph Neural NetsCode0
Graph Convolutional Networks Meet with High Dimensionality ReductionCode0
Benchmarking the Attribution Quality of Vision ModelsCode0
MetaFaith: Faithful Natural Language Uncertainty Expression in LLMsCode0
GPT4Graph: Can Large Language Models Understand Graph Structured Data ? An Empirical Evaluation and BenchmarkingCode0
MetaGreen: Meta-Learning Inspired Transformer Selection for Green Semantic CommunicationCode0
S3Simulator: A benchmarking Side Scan Sonar Simulator dataset for Underwater Image AnalysisCode0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation dataCode0
GOAL: Towards Benchmarking Few-Shot Sports Game SummarizationCode0
SACoD: Sensor Algorithm Co-Design Towards Efficient CNN-powered Intelligent PhlatCamCode0
GNNMerge: Merging of GNN Models Without Accessing Training DataCode0
Meta-survey on outlier and anomaly detectionCode0
The Legal Argument Reasoning Task in Civil ProcedureCode0
A Modular Benchmarking Infrastructure for High-Performance and Reproducible Deep LearningCode0
Safe Multi-Agent Navigation guided by Goal-Conditioned Safe Reinforcement LearningCode0
Show:102550
← PrevPage 190 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified