SOTAVerified

Binary Classification

Papers

Showing 150 of 2574 papers

TitleStatusHype
Better than classical? The subtle art of benchmarking quantum machine learning modelsCode7
PyTorch Frame: A Modular Framework for Multi-Modal Tabular LearningCode4
Speech Segmentation Optimization using Segmented Bilingual Speech Corpus for End-to-end Speech TranslationCode4
Meta-Chunking: Learning Text Segmentation and Semantic Completion via Logical PerceptionCode3
GuardT2I: Defending Text-to-Image Models from Adversarial PromptsCode3
Common Sense Reasoning for Deepfake DetectionCode3
GlyphNet: Homoglyph domains dataset and detection using attention-based Convolutional Neural NetworksCode3
UCF: Uncovering Common Features for Generalizable Deepfake DetectionCode3
Benchmarking Multimodal AutoML for Tabular Data with Text FieldsCode3
Is deep learning necessary for simple classification tasks?Code3
The Hateful Memes Challenge: Detecting Hate Speech in Multimodal MemesCode3
Rethinking Vision-Language Model in Face Forensics: Multi-Modal Interpretable Forged Face DetectorCode2
DeTeCtive: Detecting AI-generated Text via Multi-Level Contrastive LearningCode2
LTNtorch: PyTorch Implementation of Logic Tensor NetworksCode2
AXIAL: Attention-based eXplainability for Interpretable Alzheimer's Localized Diagnosis using 2D CNNs on 3D MRI brain scansCode2
Split-and-Fit: Learning B-Reps via Structure-Aware Voronoi PartitioningCode2
SpecDETR: A Transformer-based Hyperspectral Point Object Detection NetworkCode2
FakeBench: Probing Explainable Fake Image Detection via Large Multimodal ModelsCode2
Understanding the Ranking Loss for Recommendation with Sparse User FeedbackCode2
A Survey on LLM-Generated Text Detection: Necessity, Methods, and Future DirectionsCode2
Detecting and Grounding Multi-Modal Media Manipulation and BeyondCode2
Detect Everything with Few ExamplesCode2
VadCLIP: Adapting Vision-Language Models for Weakly Supervised Video Anomaly DetectionCode2
DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature MatchingCode2
Maintaining Plasticity in Deep Continual LearningCode2
Detecting and Grounding Multi-Modal Media ManipulationCode2
DeepDTA: Deep Drug-Target Binding Affinity PredictionCode2
EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial StatementsCode1
BusterX: MLLM-Powered AI-Generated Video Forgery Detection and ExplanationCode1
VenusX: Unlocking Fine-Grained Functional Understanding of ProteinsCode1
Cal or No Cal? -- Real-Time Miscalibration Detection of LiDAR and Camera SensorsCode1
Architecture for Trajectory-Based Fishing Ship Classification with AIS DataCode1
Learning to Filter Outlier Edges in Global SfMCode1
Progressive Boundary Guided Anomaly Synthesis for Industrial Anomaly DetectionCode1
Interactive Classification Metrics: A graphical application to build robust intuition for classification model evaluationCode1
Lie-Equivariant Quantum Graph Neural NetworksCode1
How EEG preprocessing shapes decoding performanceCode1
Feature Selection Gates with Gradient Routing for Endoscopic Image ComputingCode1
TabKANet: Tabular Data Modeling with Kolmogorov-Arnold Network and TransformerCode1
PMLBmini: A Tabular Classification Benchmark Suite for Data-Scarce ApplicationsCode1
LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text DetectionCode1
Hard-Attention Gates with Gradient Routing for Endoscopic Image ComputingCode1
OxonFair: A Flexible Toolkit for Algorithmic FairnessCode1
Pairwise Difference Learning for ClassificationCode1
LLM Critics Help Catch Bugs in Mathematics: Towards a Better Mathematical Verifier with Natural Language FeedbackCode1
Probing the Decision Boundaries of In-context Learning in Large Language ModelsCode1
LogiCode: an LLM-Driven Framework for Logical Anomaly DetectionCode1
Modeling PROTAC Degradation Activity with Machine LearningCode1
DistilDIRE: A Small, Fast, Cheap and Lightweight Diffusion Synthesized Deepfake DetectionCode1
SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text DetectionCode1
Show:102550
← PrevPage 1 of 52Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1Trompt + OpenAI embeddingAUROC0.98Unverified
2LightGBM + OpenAI embeddingAUROC0.97Unverified
3FTTransformer + RoBERTa fintuneAUROC0.96Unverified
4LightGBM + RoBERTa embeddingAUROC0.95Unverified
5FTTransformer + RoBERTa embeddingAUROC0.94Unverified
6ResNet + RoBERTa embeddingAUROC0.93Unverified
7ResNet + OpenAI embeddingAUROC0.92Unverified
8FTTransformer + OpenAI embeddingAUROC0.91Unverified
#ModelMetricClaimedVerifiedStatus
1Trompt + OpenAI embeddingAUROC0.81Unverified
2Multimodal-Net All-TextAUROC0.8Unverified
3ResNet + RoBERTa finetuneAUROC0.79Unverified
4LightGBM + RoBERTa embeddingAUROC0.77Unverified
#ModelMetricClaimedVerifiedStatus
1Attention based CNNF1 score0.93Unverified
#ModelMetricClaimedVerifiedStatus
1XGBoostF1-Score98.79Unverified