SOTAVerified

Benchmarking

Papers

Showing 22262250 of 5548 papers

TitleStatusHype
How Far Are We from Optimal Reasoning Efficiency?Code0
HopaDIFF: Holistic-Partial Aware Fourier Conditioned Diffusion for Referring Human Action Segmentation in Multi-Person ScenariosCode0
HOEG: A New Approach for Object-Centric Predictive Process MonitoringCode0
3D fluorescence microscopy data synthesis for segmentation and benchmarkingCode0
How to Manage Tiny Machine Learning at Scale: An Industrial PerspectiveCode0
Hi Guys or Hi Folks? Benchmarking Gender-Neutral Machine Translation with the GeNTE CorpusCode0
High-Quality, ROS Compatible Video Encoding and Decoding for High-Definition DatasetsCode0
BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed GraphsCode0
High-Dynamic-Range Imaging for Cloud SegmentationCode0
Hierarchical Neural Networks for Sequential Sentence Classification in Medical Scientific AbstractsCode0
HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability predictionCode0
ASR Benchmarking: Need for a More Representative Conversational DatasetCode0
Benchmarking Neural Machine Translation for Southern African LanguagesCode0
Benchmarking neural embeddings for link prediction in knowledge graphs under semantic and structural changesCode0
Heterogeneous Datasets for Federated Survival Analysis SimulationCode0
Harnessing Orthogonality to Train Low-Rank Neural NetworksCode0
Harmonization Benchmarking Tool for Neuroimaging DatasetsCode0
Aspect-based Sentiment Classification with Aspect-specific Graph Convolutional NetworksCode0
Hardware Aware Neural Network Architectures using FbNetCode0
HATE-ITA: New Baselines for Hate Speech Detection in ItalianCode0
HammerBench: Fine-Grained Function-Calling Evaluation in Real Mobile Device ScenariosCode0
Dynamic Neighborhood Construction for Structured Large Discrete Action SpacesCode0
gym-gazebo2, a toolkit for reinforcement learning using ROS 2 and GazeboCode0
Hard-Label Cryptanalytic Extraction of Neural Network ModelsCode0
Hi-EF: Benchmarking Emotion Forecasting in Human-interactionCode0
Show:102550
← PrevPage 90 of 222Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1GPT-4 TurboACC0.56Unverified