SOTAVerified

Benchmarking

Papers

Showing 13261350 of 5548 papers

TitleStatusHype
D2S: Document-to-Slide Generation Via Query-Based Text SummarizationCode1
Open Radar Initiative: Large Scale Dataset for Benchmarking of micro-Doppler Recognition AlgorithmsCode1
dEchorate: a Calibrated Room Impulse Response Database for Echo-aware Signal ProcessingCode1
2.5D Visual Relationship DetectionCode1
Knodle: Modular Weakly Supervised Learning with PyTorchCode1
Data Generating Process to Evaluate Causal Discovery Techniques for Time Series DataCode1
Towards Standardising Reinforcement Learning Approaches for Production Scheduling ProblemsCode1
Is Multi-Hop Reasoning Really Explainable? Towards Benchmarking Reasoning InterpretabilityCode1
Safety-enhanced UAV Path Planning with Spherical Vector-based Particle Swarm OptimizationCode1
StylePTB: A Compositional Benchmark for Fine-grained Controllable Text Style TransferCode1
Robust Semantic Interpretability: Revisiting Concept Activation VectorsCode1
CBench: Towards Better Evaluation of Question Answering Over Knowledge GraphsCode1
Remote Sensing Image Classification with the SEN12MS DatasetCode1
Simultaneous Navigation and Construction Benchmarking EnvironmentsCode1
Benchmarks for Deep Off-Policy EvaluationCode1
3D AffordanceNet: A Benchmark for Visual Object Affordance UnderstandingCode1
SUTD-TrafficQA: A Question Answering Benchmark and an Efficient Network for Video Reasoning over Traffic EventsCode1
Marine Snow Removal Benchmarking DatasetCode1
Learning to Optimize: A Primer and A BenchmarkCode1
Neural Multi-Hop Reasoning With Logical Rules on Biomedical Knowledge GraphsCode1
SHARP: Environment and Person Independent Activity Recognition with Commodity IEEE 802.11 Access PointsCode1
A Large-Scale Dataset for Benchmarking Elevator Button Segmentation and Character RecognitionCode1
The Effect of Domain and Diacritics in Yorùbá-English Neural Machine TranslationCode1
Recent Advances on Neural Network Pruning at InitializationCode1
A Computed Tomography Vertebral Segmentation Dataset with Anatomical Variations and Multi-Vendor Scanner DataCode1
Show:102550
← PrevPage 54 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified