SOTAVerified

Benchmarking

Papers

Showing 23012310 of 5548 papers

TitleStatusHype
RoofDiffusion: Constructing Roofs from Severely Corrupted Point Data via DiffusionCode1
Towards Sim-to-Real Industrial Parts Classification with Synthetic DatasetCode1
Practical Guidelines for Cell Segmentation Models Under Optical Aberrations in Microscopy0
OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer EnvironmentsCode7
Exploring the Decentraland Economy: Multifaceted Parcel Attributes, Key Insights, and Benchmarking0
DyKnow: Dynamically Verifying Time-Sensitive Factual Knowledge in LLMsCode0
Certifying almost all quantum states with few single-qubit measurements0
Implicit Multi-Spectral Transformer: An Lightweight and Effective Visible to Infrared Image Translation ModelCode1
GoodDrag: Towards Good Practices for Drag Editing with Diffusion Models0
Accel-NASBench: Sustainable Benchmarking for Accelerator-Aware NASCode0
Show:102550
← PrevPage 231 of 555Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified