SOTAVerified

Benchmarking

Papers

Showing 3401-3425 of 5548 papers

Title | Status | Hype
TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding |  | 0
A Reinforcement Learning Environment for Directed Quantum Circuit Synthesis |  | 0
Lifelogging As An Extreme Form of Personal Information Management -- What Lessons To Learn |  | 0
Knowledge Sharing in Manufacturing using Large Language Models: User Evaluation and Model Benchmarking |  | 0
Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics |  | 0
Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs Dataset |  | 0
TransportationGames: Benchmarking Transportation Knowledge of (Multimodal) Large Language Models |  | 0
MST: Adaptive Multi-Scale Tokens Guided Interactive Segmentation | Code | 0
SoK: Systematization and Benchmarking of Deepfake Detectors in a Unified Framework |  | 0
Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning |  | 0
Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking |  | 0
NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds |  | 0
Global Prediction of COVID-19 Variant Emergence Using Dynamics-Informed Graph Neural Networks | Code | 0
Using Multi-Temporal Sentinel-1 and Sentinel-2 data for water bodies mapping |  | 0
Benchmarking PathCLIP for Pathology Image Analysis |  | 0
Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNN | Code | 0
Nodule detection and generation on chest X-rays: NODE21 Challenge |  | 0
AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets |  | 0
Sheared Backpropagation for Fine-tuning Foundation Models |  | 0
Temporal Validity Change Prediction |  | 0
AM-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One |  | 0
FISBe: A Real-World Benchmark Dataset for Instance Segmentation of Long-Range Thin Filamentous Structures |  | 0
Hyperbolic Anomaly Detection |  | 0
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos |  | 0
FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning |  | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 |  | Unverified