SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 63766400 of 474278 papers

TitleStatusHype
Gene-DML: Dual-Pathway Multi-Level Discrimination for Gene Expression Prediction from Histopathology ImagesCode0
Democratic or Authoritarian? Probing a New Dimension of Political Biases in Large Language ModelsCode0
Unleashing the Intrinsic Visual Representation Capability of Multimodal Large Language ModelsCode0
Importance-aware Topic Modeling for Discovering Public Transit Risk from Noisy Social MediaCode0
FronTalk: Benchmarking Front-End Development as Conversational Code Generation with Multi-Modal FeedbackCode0
SegAssess: Panoramic quality mapping for robust and transferable unsupervised segmentation assessmentCode0
Evolving Deep Learning OptimizersCode0
The MICCAI Federated Tumor Segmentation (FeTS) Challenge 2024: Efficient and Robust Aggregation Methods for Federated LearningCode0
LYNX: Learning Dynamic Exits for Confidence-Controlled Reasoning0
CookAnything: A Framework for Flexible and Consistent Multi-Step Recipe Image Generation0
COOPER: A Unified Model for Cooperative Perception and Reasoning in Spatial Intelligence0
Semantics Lead the Way: Harmonizing Semantic and Texture Modeling with Asynchronous Latent Diffusion0
Taxonomy-Adaptive Moderation Model with Robust Guardrails for Large Language Models0
Smart Timing for Mining: A Deep Learning Framework for Bitcoin Hardware ROI PredictionCode0
LDLT L-Lipschitz Network: Generalized Deep End-To-End Lipschitz Network Construction0
EditThinker: Unlocking Iterative Reasoning for Any Image Editor0
EgoEdit: Dataset, Real-Time Streaming Model, and Benchmark for Egocentric Video Editing0
Empathy by Design: Aligning Large Language Models for Healthcare DialogueCode0
Exploring Ordinal Bias in Action Recognition for Instructional Videos0
Beyond Data Filtering: Knowledge Localization for Capability Removal in LLMs0
Variational Quantum Rainbow Deep Q-Network for Optimizing Resource Allocation ProblemCode0
Curvature-Regularized Variational Autoencoder for 3D Scene Reconstruction from Sparse DepthCode0
CausalKANs: interpretable treatment effect estimation with Kolmogorov-Arnold networksCode0
Guided Query Refinement: Multimodal Hybrid Retrieval with Test-Time OptimizationCode0
OpenMMReasoner: Pushing the Frontiers for Multimodal Reasoning with an Open and General RecipeCode0
Show:102550
← PrevPage 256 of 18972Next →