SOTAVerified

Benchmarking

Papers

Showing 2821–2830 of 5548 papers

Title | Status | Hype
1st Place Winner of the 2024 Pixel-level Video Understanding in the Wild (CVPR'24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation | | 0
Better Bill GPT: Comparing Large Language Models against Legal Invoice Reviewers | | 0
BestServe: Serving Strategies with Optimal Goodput in Collocation and Disaggregation Architectures | | 0
The CLC-UKET Dataset: Benchmarking Case Outcome Prediction for the UK Employment Tribunal | | 0
Best Practices in Pool-based Active Learning for Image Classification | | 0
Abasy Atlas v2.2: The most comprehensive and up-to-date inventory of meta-curated, historical, bacterial regulatory networks, their completeness and system-level characterization | | 0
The Convergent Ethics of AI? Analyzing Moral Foundation Priorities in Large Language Models with a Multi-Framework Approach | | 0
BERT-GT: Cross-sentence n-ary relation extraction with BERT and Graph Transformer | | 0
Greening AI-enabled Systems with Software Engineering: A Research Agenda for Environmentally Sustainable AI Practices | | 0
Grid Search Hyperparameter Benchmarking of BERT, ALBERT, and LongFormer on DuoRC | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified