SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1385113900 of 474278 papers

TitleStatusHype
Response Quality Assessment for Retrieval-Augmented Generation via Conditional Conformal FactualityCode0
Boosting Generative Adversarial Transferability with Self-supervised Vision Transformer FeaturesCode0
DBConformer: Dual-Branch Convolutional Transformer for EEG DecodingCode2
Curve-Aware Gaussian Splatting for 3D Parametric Curve ReconstructionCode2
CovDocker: Benchmarking Covalent Drug Design with Tasks, Datasets, and SolutionsCode1
How Good Are Synthetic Requirements ? Evaluating LLM-Generated Datasets for AI4RECode0
Learning to Skip the Middle Layers of TransformersCode1
Patch2Loc: Learning to Localize Patches for Unsupervised Brain Lesion DetectionCode0
Recursive KalmanNet: Analyse des capacités de généralisation d'un réseau de neurones récurrent guidé par un filtre de KalmanCode1
FOCUS: Internal MLLM Representations for Efficient Fine-Grained Visual Question Answering0
IRanker: Towards Ranking Foundation ModelCode1
AUTOMATIC ROOM LIGHT CONTROLLER MANAGEMENT SYSTEM.0
Visual-Semantic Knowledge Conflicts in Operating Rooms: Synthetic Data Curation for Surgical Risk Perception in Multimodal Large Language ModelsCode0
Omniwise: Predicting GPU Kernels Performance with LLMs0
Multi-lingual Functional Evaluation for Large Language Models0
Multiple Streams of Relation Extraction: Enriching and Recalling in Transformers0
Divide, Specialize, and Route: A New Approach to Efficient Ensemble Learning0
VOICE CONTROL ROBOT USING ARDUINO MANAGEMENT SYSTEM PROJECT.0
Towards Probabilistic Question Answering Over Tabular Data0
Stochastic Parameter DecompositionCode2
On Context-Content Uncertainty Principle0
Uncovering Hidden Violent Tendencies in LLMs: A Demographic Analysis via Behavioral Vignettes0
Decide less, communicate more: On the construct validity of end-to-end fact-checking in medicineCode0
On the Necessity of Output Distribution Reweighting for Effective Class Unlearning0
SEZ-HARN: Self-Explainable Zero-shot Human Activity Recognition NetworkCode0
E-ABIN: an Explainable module for Anomaly detection in BIological NetworksCode0
Demystifying Distributed Training of Graph Neural Networks for Link PredictionCode0
Learning-Based Resource Management in Integrated Sensing and Communication Systems0
FR-CapsNet: Enhancing Low-Resolution Image Classification via Frequency Routed CapsulesCode0
Integrating Pharmacokinetics and Pharmacodynamics Modeling with Quantum Regression for Predicting Herbal Compound Toxicity0
Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management0
Predicting Readiness to Engage in Psychotherapy of People with Chronic Pain Based on their Pain-Related Narratives Saar0
Diffusion Tree Sampling: Scalable inference-time alignment of diffusion models0
AI-Driven MRI-based Brain Tumour Segmentation Benchmarking0
Enhancing Ambiguous Dynamic Facial Expression Recognition with Soft Label-based Data Augmentation0
THIRDEYE: Cue-Aware Monocular Depth Estimation via Brain-Inspired Multi-Stage Fusion0
MultiHuman-Testbench: Benchmarking Image Generation for Multiple Humans0
MAGPIE: A dataset for Multi-AGent contextual PrIvacy Evaluation0
inMOTIFin: a lightweight end-to-end simulation software for regulatory sequencesCode0
Revisiting CHAMPAGNE: Sparse Bayesian Learning as Reweighted Sparse Coding0
Brains and language models converge on a shared conceptual space across different languagesCode0
Differential Transformer-driven 6G Physical Layer for Collaborative Perception Enhancement0
FixCLR: Negative-Class Contrastive Learning for Semi-Supervised Domain Generalization0
Dynamic Context-Aware Prompt Recommendation for Domain-Specific AI Applications0
Joint Quantization and Pruning Neural Networks Approach: A Case Study on FSO Receivers0
scMamba: A Scalable Foundation Model for Single-Cell Multi-Omics Integration Beyond Highly Variable Feature Selection0
Evaluating PDE discovery methods for multiscale modeling of biological signals0
Empirical estimator of diversification quotient0
Towards Two-Stage Counterfactual Learning to Rank0
StereoDiff: Stereo-Diffusion Synergy for Video Depth Estimation0
Show:102550
← PrevPage 278 of 9486Next →