SOTAVerified

Benchmarking

Papers

Showing 25512575 of 5548 papers

TitleStatusHype
Recognizing Object Affordances to Support Scene Reasoning for Manipulation TasksCode0
From MNIST to ImageNet and Back: Benchmarking Continual Curriculum LearningCode0
Motley: Benchmarking Heterogeneity and Personalization in Federated LearningCode0
Feature interpretability in BCIs: exploring the role of network lateralizationCode0
From Knowledge to Reasoning: Evaluating LLMs for Ionic Liquids Research in Chemical and Biological EngineeringCode0
From Past to Present: A Survey of Malicious URL Detection Techniques, Datasets and Code RepositoriesCode0
Benchmarking Reinforcement Learning Algorithms on Real-World RobotsCode0
Detecting critical treatment effect bias in small subgroupsCode0
FRAMES-VQA: Benchmarking Fine-Tuning Robustness across Multi-Modal Shifts in Visual Question AnsweringCode0
Benchmarking Image Perturbations for Testing Automated Driving Assistance SystemsCode0
FR-MRInet: A Deep Convolutional Encoder-Decoder for Brain Tumor Segmentation with Relu-RGB and Sliding-windowCode0
Affine Non-negative Collaborative Representation Based Pattern ClassificationCode0
DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual DesignCode0
Design and implementation of intelligent packet filtering in IoT microcontroller-based devicesCode0
Accurate Peak Detection in Multimodal Optimization via Approximated Landscape LearningCode0
From Bytes to Borsch: Fine-Tuning Gemma and Mistral for the Ukrainian Language RepresentationCode0
A quantum-classical reinforcement learning model to play Atari gamesCode0
Dermatological Diagnosis Explainability Benchmark for Convolutional Neural NetworksCode0
Benchmarking Human and Automated Prompting in the Segment Anything ModelCode0
Depth Functions for Partial Orders with a Descriptive Analysis of Machine Learning AlgorithmsCode0
Benchmarking histopathology foundation models in a multi-center dataset for skin cancer subtypingCode0
From Modern CNNs to Vision Transformers: Assessing the Performance, Robustness, and Classification Strategies of Deep Learning Models in HistopathologyCode0
Forecasting Across Time Series Databases using Recurrent Neural Networks on Groups of Similar Series: A Clustering ApproachCode0
Benchmarking HillVallEA for the GECCO 2019 Competition on Multimodal OptimizationCode0
Forecasting Future International Events: A Reliable Dataset for Text-Based Event ModelingCode0
Show:102550
← PrevPage 103 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified