SOTAVerified

Benchmarking

Papers

Showing 2711-2720 of 5548 papers

Title | Status | Hype
IoT-LLM: Enhancing Real-World IoT Task Reasoning with Large Language Models | | 0
MANTRA: The Manifold Triangulations Assemblage | Code | 0
Repurposing Foundation Model for Generalizable Medical Time Series Classification | | 0
Large Language Model for Multi-Domain Translation: Benchmarking and Domain CoT Fine-tuning | | 0
Deep learning for action spotting in association football videos | | 0
ConServe: Harvesting GPUs for Low-Latency and High-Throughput Large Language Model Serving | | 0
CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations | | 0
The Labyrinth of Links: Navigating the Associative Maze of Multi-modal LLMs | | 0
Emo3D: Metric and Benchmarking Dataset for 3D Facial Expression Generation from Emotion Description | | 0
A Real Benchmark Swell Noise Dataset for Performing Seismic Data Denoising via Deep Learning | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | GPT-4 Turbo | ACC | 0.56 | | Unverified