SOTAVerified

Benchmarking

Papers

Showing 3651-3675 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| Perona: Robust Infrastructure Fingerprinting for Resource-Efficient Big Data Analytics | | 0 |
| PerSEval: Assessing Personalization in Text Summarizers | | 0 |
| Personalised Feedback Framework for Online Education Programmes Using Generative AI | | 0 |
| Personalized Multimodal Large Language Models: A Survey | | 0 |
| Personalized On-Device E-health Analytics with Decentralized Block Coordinate Descent | | 0 |
| Person Re-Identification by Unsupervised Video Matching | | 0 |
| Person Re-Identification in Identity Regression Space | | 0 |
| Person Re-identification in the Wild | | 0 |
| Person Search by Multi-Scale Matching | | 0 |
| Person Search by Multi-Scale Matching | | 0 |
| Perspective on recent developments and challenges in regulatory and systems genomics | | 0 |
| Perspectives on the State and Future of Deep Learning -- 2023 | | 0 |
| Perturbation-based exploration methods in deep reinforcement learning | | 0 |
| PGLearn -- An Open-Source Learning Toolkit for Optimal Power Flow | | 0 |
| PGLib-CO2: A Power Grid Library for Computing and Optimizing Carbon Emissions | | 0 |
| PhD Thesis on Code Modulated Interferometric Imaging System using Phased Arrays | | 0 |
| Phi-3 Safety Post-Training: Aligning Language Models with a "Break-Fix" Cycle | | 0 |
| PhilHumans: Benchmarking Machine Learning for Personal Health | | 0 |
| PhysBench: Benchmarking and Enhancing Vision-Language Models for Physical World Understanding | | 0 |
| PhySense: Principle-Based Physics Reasoning Benchmarking for Large Language Models | | 0 |
| Physics-Learning AI Datamodel (PLAID) datasets: a collection of physics simulations for machine learning | | 0 |
| PhytoSynth: Leveraging Multi-modal Generative Models for Crop Disease Data Generation with Novel Benchmarking and Prompt Engineering Approach | | 0 |
| PieTrack: An MOT solution based on synthetic data training and self-supervised domain adaptation | | 0 |
| PISTOL: Dataset Compilation Pipeline for Structural Unlearning of LLMs | | 0 |
| Pitfalls of topology-aware image segmentation | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |