SOTAVerified

Benchmarking

Papers

Showing 13011325 of 5548 papers

TitleStatusHype
Benchmarking TinyML Systems: Challenges and DirectionCode1
Curious Hierarchical Actor-Critic Reinforcement LearningCode1
Arctique: An artificial histopathological dataset unifying realism and controllability for uncertainty quantificationCode1
Image Matching across Wide Baselines: From Paper to PracticeCode1
Benchmarking Robustness of Text-Image Composed RetrievalCode1
Benchmarking Robustness to Adversarial Image ObfuscationsCode1
Benchmarking Vision-Language Models on Optical Character Recognition in Dynamic Video EnvironmentsCode1
Multi-Mask Aggregators for Graph Neural NetworksCode1
A framework for benchmarking class-out-of-distribution detection and its application to ImageNetCode1
Multimodal Fusion via Teacher-Student Network for Indoor Action RecognitionCode1
BEND: Benchmarking DNA Language Models on biologically meaningful tasksCode1
ImageNet-D: Benchmarking Neural Network Robustness on Diffusion Synthetic ObjectCode1
IndoNLU: Benchmark and Resources for Evaluating Indonesian Natural Language UnderstandingCode1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series DataCode1
Benchmarking the Robustness of Spatial-Temporal Models Against CorruptionsCode1
IDToolkit: A Toolkit for Benchmarking and Developing Inverse Design Algorithms in NanophotonicsCode1
Benchmarking the Robustness of Deep Neural Networks to Common Corruptions in Digital PathologyCode1
Benchmarking Segmentation Models with Mask-Preserved Attribute EditingCode1
A Comprehensive Study on Large-Scale Graph Training: Benchmarking and RethinkingCode1
Benchmarking Self-Supervised Learning on Diverse Pathology DatasetsCode1
Data Splits and Metrics for Method Benchmarking on Surgical Action Triplet DatasetsCode1
Mutual-Information Based Few-Shot ClassificationCode1
Dataset and Benchmark: Novel Sensors for Autonomous Vehicle PerceptionCode1
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object DetectionCode1
Benchmarking Implicit Neural Representation and Geometric Rendering in Real-Time RGB-D SLAMCode1
Show:102550
← PrevPage 53 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified