SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 66516700 of 661570 papers

TitleStatusHype
SAIF: A Stability-Aware Inference Framework for Medical Image Segmentation with Segment Anything Model0
Exploring label correlations using decision templates for ensemble of classifier chains0
NumColor: Precise Numeric Color Control in Text-to-Image Generation0
Semantic Aware Feature Extraction for Enhanced 3D Reconstruction0
Task-Oriented Wireless Transmission of 3D Point Clouds: Geometric Versus Semantic Robustness0
State Algebra for Probabilistic Logic0
An Empirical Investigation of Pre-Trained Deep Learning Model Reuse in the Scientific Process0
Volumetric Radar Echo Motion Estimation Using Physics-Informed Deep Learning: A Case Study Over Slovakia0
Opportunistic Cardiac Health Assessment: Estimating Phenotypes from Localizer MRI through Multi-Modal Representations0
The Equivalence Theorem: First-Class Relationships for Structurally Complete Database Systems0
Orla: A Library for Serving LLM-Based Multi-Agent Systems0
Robust Sequential Tracking via Bounded Information Geometry and Non-Parametric Field Actions0
Egocentric World Model for Photorealistic Hand-Object Interaction Synthesis0
Locatability-Guided Adaptive Reasoning for Image Geo-Localization with Vision-Language Models0
Widespread Gender and Pronoun Bias in Moral Judgments Across LLMs0
SemRep: Generative Code Representation Learning with Code Transformations0
Causal Attribution via Activation Patching0
Privacy Preserving Topic-wise Sentiment Analysis of the Iran Israel USA Conflict Using Federated Transformer Models0
State-space models through the lens of ensemble control0
Out of Sight, Out of Mind? Evaluating State Evolution in Video World Models0
RobotArena : Scalable Robot Benchmarking via Real-to-Sim Translation0
Steve-Evolving: Open-World Embodied Self-Evolution via Fine-Grained Diagnosis and Dual-Track Knowledge Distillation0
Standard Acquisition Is Sufficient for Asynchronous Bayesian Optimization0
A Decision-Theoretic Formalisation of Steganography With Applications to LLM Monitoring0
Developing the PsyCogMetrics AI Lab to Evaluate Large Language Models and Advance Cognitive Science -- A Three-Cycle Action Design Science Study0
EPIC-EuroParl-UdS: Information-Theoretic Perspectives on Translation and Interpreting0
ActionPlan: Future-Aware Streaming Motion Synthesis via Frame-Level Action Planning0
Self-Flow-Matching assisted Full Waveform Inversion0
Scalable Classification of Course Information Sheets Using Large Language Models: A Reusable Institutional Method for Academic Quality Assurance0
EnterpriseOps-Gym: Environments and Evaluations for Stateful Agentic Planning and Tool Use in Enterprise Settings2
Executable Archaeology: Reanimating the Logic Theorist from its IPL-V Source0
A survey of diversity quantification in natural language processing: The why, what, where and how0
Neural-Quantum-States Impurity Solver for Quantum Embedding Problems0
LLM Unlearning with LLM Beliefs0
SpaceControl: Introducing Test-Time Spatial Control to 3D Generative Modeling0
EgoGrasp: World-Space Hand-Object Interaction Estimation from Egocentric Videos0
Failure Detection in Chemical Processes Using Symbolic Machine Learning: A Case Study on Ethylene Oxidation0
MIBench: Evaluating LMMs on Multimodal Interaction0
Technical Case Study of Privacy-Enhancing Technologies (PETs) for Public Health0
Diffusion-based Generative Machine Learning Model for Predicting Crack Propagation in Aluminum Nitride at the Atomic Scale0
DiT-IC: Aligned Diffusion Transformer for Efficient Image Compression0
Ref-DGS: Reflective Dual Gaussian Splatting0
Interpretable Semantic Gradients in SSD: A PCA Sweep Approach and a Case Study on AI Discourse0
CAST: Cross-Attentive Spatio-Temporal feature fusion for deepfake detection0
Channel Selected Stratified Nested Cross Validation for Clinically Relevant EEG Based Parkinsons Disease Detection0
FoV-Net: Rotation-Invariant CAD B-rep Learning via Field-of-View Ray Casting0
Dynamic Mixture-of-Experts for Visual Autoregressive Model0
Evolution and compression in LLMs: On the emergence of human-aligned categorization0
LLMs Can Infer Political Alignment from Online Conversations0
Are General-Purpose Vision Models All We Need for 2D Medical Image Segmentation? A Cross-Dataset Empirical Study0
Show:102550
← PrevPage 134 of 13232Next →