SOTAVerified

Benchmarking

Papers

Showing 2611-2620 of 5548 papers

| Title | Status | Hype |
| --- | --- | --- |
| BongLLaMA: LLaMA for Bangla Language | | 0 |
| AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? | Code | 0 |
| Exploring Capabilities of Time Series Foundation Models in Building Analytics | | 0 |
| Project MPG: towards a generalized performance benchmark for LLM capabilities | | 0 |
| Hierarchical Knowledge Graph Construction from Images for Scalable E-Commerce | | 0 |
| Sequential Large Language Model-Based Hyper-parameter Optimization | Code | 0 |
| Multi-input Multi-output Loewner Framework for Vibration-based Damage Detection on a Trainer Jet | | 0 |
| AutoMIR: Effective Zero-Shot Medical Information Retrieval without Relevance Labels | Code | 0 |
| SFTrack: A Robust Scale and Motion Adaptive Algorithm for Tracking Small and Fast Moving Objects | | 0 |
| MMDocBench: Benchmarking Large Vision-Language Models for Fine-Grained Visual Document Understanding | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| --- | --- | --- | --- | --- | --- |
| 1 | GPT-4 Turbo | ACC | 0.56 | | Unverified |