SOTAVerified

Benchmarking

Papers

Showing 34013450 of 5548 papers

TitleStatusHype
TACO: Benchmarking Generalizable Bimanual Tool-ACtion-Object Understanding0
A Reinforcement Learning Environment for Directed Quantum Circuit Synthesis0
Lifelogging As An Extreme Form of Personal Information Management -- What Lessons To Learn0
Knowledge Sharing in Manufacturing using Large Language Models: User Evaluation and Model Benchmarking0
Latency-aware Road Anomaly Segmentation in Videos: A Photorealistic Dataset and New Metrics0
Benchmark Analysis of Various Pre-trained Deep Learning Models on ASSIRA Cats and Dogs Dataset0
TransportationGames: Benchmarking Transportation Knowledge of (Multimodal) Large Language Models0
MST: Adaptive Multi-Scale Tokens Guided Interactive SegmentationCode0
SoK: Systematization and Benchmarking of Deepfake Detectors in a Unified Framework0
Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning0
Attention versus Contrastive Learning of Tabular Data -- A Data-centric Benchmarking0
NovelGym: A Flexible Ecosystem for Hybrid Planning and Learning Agents Designed for Open Worlds0
Global Prediction of COVID-19 Variant Emergence Using Dynamics-Informed Graph Neural NetworksCode0
Using Multi-Temporal Sentinel-1 and Sentinel-2 data for water bodies mapping0
Benchmarking PathCLIP for Pathology Image Analysis0
Enhancing 3D-Air Signature by Pen Tip Tail Trajectory Awareness: Dataset and Featuring by Novel Spatio-temporal CNNCode0
Nodule detection and generation on chest X-rays: NODE21 Challenge0
AstroLLaMA-Chat: Scaling AstroLLaMA with Conversational and Diverse Datasets0
Sheared Backpropagation for Fine-tuning Foundation Models0
Temporal Validity Change Prediction0
AM-RADIO: Agglomerative Vision Foundation Model Reduce All Domains Into One0
FISBe: A Real-World Benchmark Dataset for Instance Segmentation of Long-Range Thin Filamentous Structures0
Hyperbolic Anomaly Detection0
Benchmarking Audio Visual Segmentation for Long-Untrimmed Videos0
FLHetBench: Benchmarking Device and State Heterogeneity in Federated Learning0
Benchmarking Hebbian learning rules for associative memory0
Pushing Boundaries: Exploring Zero Shot Object Classification with Large Multimodal Models0
TSPP: A Unified Benchmarking Tool for Time-series ForecastingCode0
Knowledge Enhanced Conditional Imputation for Healthcare Time-seriesCode0
FALCON: Feature-Label Constrained Graph Net Collapse for Memory Efficient GNNsCode0
Combining SNNs with Filtering for Efficient Neural Decoding in Implantable Brain-Machine Interfaces0
RDF-star2Vec: RDF-star Graph Embeddings for Data MiningCode0
Data needs and challenges for quantum dot devices automation0
Benchmarking Evolutionary Community Detection Algorithms in Dynamic Networks0
ARBiBench: Benchmarking Adversarial Robustness of Binarized Neural Networks0
Incorporating Human Flexibility through Reward Preferences in Human-AI Teaming0
Benchmarking and Analyzing In-context Learning, Fine-tuning and Supervised Learning for Biomedical Knowledge Curation: a focused study on chemical entities of biological interest0
Scaling Compute Is Not All You Need for Adversarial RobustnessCode0
Comparing Machine Learning Algorithms by Union-Free Generic DepthCode0
Review and experimental benchmarking of machine learning algorithms for efficient optimization of cold atom experiments0
Perception Test 2023: A Summary of the First Challenge And Outcome0
Neural feels with neural fields: Visuo-tactile perception for in-hand manipulation0
AN ELIXIR FOR BLOCKCHAIN SCALABILITY WITH CHANNEL BASED CLUSTERED SHARDING0
MA-BBOB: A Problem Generator for Black-Box Optimization Using Affine Combinations and Shifts0
QDA^2: A principled approach to automatically annotating charge stability diagrams0
Bio-Image Informatics Index BIII: A unique database of image analysis tools and workflows for and by the bioimaging community0
Code Ownership in Open-Source AI Software SecurityCode0
FER-C: Benchmarking Out-of-Distribution Soft Calibration for Facial Expression Recognition0
Enabling Accelerators for Graph Computing0
ChemTime: Rapid and Early Classification for Multivariate Time Series Classification of Chemical Sensors0
Show:102550
← PrevPage 69 of 111Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified