SOTAVerified

Benchmarking

Papers

Showing 2351–2375 of 5548 papers

Title | Status | Hype
DynCIM: Dynamic Curriculum for Imbalanced Multimodal Learning | Code | 0
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and Gazebo | Code | 0
Heterogeneous Datasets for Federated Survival Analysis Simulation | Code | 0
DynamoRep: Trajectory-Based Population Dynamics for Classification of Black-box Optimization Problems | Code | 0
Effective Stabilized Self-Training on Few-Labeled Graph Data | Code | 0
Graph Neural Networks Are More Than Filters: Revisiting and Benchmarking from A Spectral Perspective | Code | 0
Graph-theoretical approach to robust 3D normal extraction of LiDAR data | Code | 0
Graph Convolutional Networks Meet with High Dimensionality Reduction | Code | 0
A Deep Reinforcement Learning Framework for Dynamic Portfolio Optimization: Evidence from China's Stock Market | Code | 0
Learning Conjoint Attentions for Graph Neural Nets | Code | 0
GOAL: Towards Benchmarking Few-Shot Sports Game Summarization | Code | 0
Good at captioning, bad at counting: Benchmarking GPT-4V on Earth observation data | Code | 0
GPT4Graph: Can Large Language Models Understand Graph Structured Data ? An Empirical Evaluation and Benchmarking | Code | 0
Benchmarking LLM-based Relevance Judgment Methods | Code | 0
Global Prediction of COVID-19 Variant Emergence Using Dynamics-Informed Graph Neural Networks | Code | 0
GNNMerge: Merging of GNN Models Without Accessing Training Data | Code | 0
Grasp Pre-shape Selection by Synthetic Training: Eye-in-hand Shared Control on the Hannes Prosthesis | Code | 0
DyKgChat: Benchmarking Dialogue Generation Grounding on Dynamic Knowledge Graphs | Code | 0
Benchmarking Linguistic Diversity of Large Language Models | Code | 0
Geological Inference from Textual Data using Word Embeddings | Code | 0
GiantHunter: Accurate detection of giant virus in metagenomic data using reinforcement-learning and Monte Carlo tree search | Code | 0
DVFL-Net: A Lightweight Distilled Video Focal Modulation Network for Spatio-Temporal Action Recognition | Code | 0
Ducho meets Elliot: Large-scale Benchmarks for Multimodal Recommendation | Code | 0
Are Synthetic Corruptions A Reliable Proxy For Real-World Corruptions? | Code | 0
Flexible Generation of Preference Data for Recommendation Analysis | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified