SOTAVerified

Benchmarking

Papers

Showing 31013150 of 5548 papers

TitleStatusHype
Benchmarking fixed-length Fingerprint Representations across different Embedding Sizes and Sensor Types0
Machine Learning for Ranking f-wave Extraction Methods in Single-Lead ECGs0
Efficient and Accurate Optimal Transport with Mirror Descent and Conjugate GradientsCode0
EasyTPP: Towards Open Benchmarking Temporal Point ProcessesCode2
Revisiting Implicit Models: Sparsity Trade-offs Capability in Weight-tied Model for Vision Tasks0
GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease DetectionCode1
Joint Batching and Scheduling for High-Throughput Multiuser Edge AI with Asynchronous Task Arrivals0
Benchmarking the Effectiveness of Classification Algorithms and SVM Kernels for Dry Beans0
Benchmarking Explanatory Models for Inertia Forecasting using Public Data of the Nordic Area0
Challenge Results Are Not Reproducible0
A Dynamic Points Removal Benchmark in Point Cloud MapsCode2
IntelliGraphs: Datasets for Benchmarking Knowledge Graph GenerationCode1
Robotic Manipulation Datasets for Offline Compositional Reinforcement LearningCode1
Pathway: a fast and flexible unified stream data processing framework for analytical and Machine Learning applications0
A Comprehensive Overview of Large Language ModelsCode1
Deep Generative Models for Physiological Signals: A Systematic Literature Review0
AnuraSet: A dataset for benchmarking Neotropical anuran calls identification in passive acoustic monitoringCode1
Temporal Graphs Anomaly Emergence Detection: Benchmarking For Social Media Interactions0
Benchmarking Algorithms for Federated Domain GeneralizationCode1
Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation0
A Call to Reflect on Evaluation Practices for Age Estimation: Comparative Analysis of the State-of-the-Art and a Unified BenchmarkCode1
Assessing the efficacy of large language models in generating accurate teacher responses0
Fairness-Aware Graph Neural Networks: A Survey0
Fast Empirical Scenarios0
Benchmarking Test-Time Adaptation against Distribution Shifts in Image ClassificationCode1
Structural Property Prediction0
Performance Modeling of Data Storage Systems using Generative ModelsCode0
Unsupervised Spectral Demosaicing with Lightweight Spectral Attention Networks0
ClimateLearn: Benchmarking Machine Learning for Weather and Climate ModelingCode2
OpenSiteRec: An Open Dataset for Site Recommendation0
A Synthetic Benchmarking Pipeline to Compare Camera Calibration Algorithms0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity0
SysNoise: Exploring and Benchmarking Training-Deployment System Inconsistency0
InstructEval: Systematic Evaluation of Instruction Selection Methods0
Learning Environment Models with Continuous Stochastic Dynamics0
Benchmarking Large Language Model Capabilities for Conditional Generation0
Principles and Guidelines for Evaluating Social Robot Navigation Algorithms0
Generative AI for Programming Education: Benchmarking ChatGPT, GPT-4, and Human Tutors0
Uncovering the Limits of Machine Learning for Automatic Vulnerability DetectionCode1
Benchmarking Zero-Shot Recognition with Vision-Language Models: Challenges on Granularity and Specificity0
Effective Transfer of Pretrained Large Visual Model for Fabric Defect Segmentation via Specifc Knowledge Injection0
Emotion Analysis of Tweets Banning Education in Afghanistan0
Paradigm Shift in Sustainability Disclosure Analysis: Empowering Stakeholders with CHATREPORT, a Language Model-Based Tool0
Pulse Shape-Aided Multipath Delay Estimation for Fine-Grained WiFi Sensing0
Benchmarking Stroke Forecasting with Stroke-Level Badminton Dataset0
Enhancing Navigation Benchmarking and Perception Data Generation for Row-based Crops in Simulation0
SCENEREPLICA: Benchmarking Real-World Robot Manipulation by Creating Replicable ScenesCode1
InterCode: Standardizing and Benchmarking Interactive Coding with Execution FeedbackCode2
Improving Reference-based Distinctive Image Captioning with Contrastive Rewards0
Hybrid Precoder and Combiner Designs for Decentralized Parameter Estimation in mmWave MIMO Wireless Sensor Networks0
Show:102550
← PrevPage 63 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified