SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1260112650 of 474278 papers

TitleStatusHype
MedGemma Technical Report0
When Small Guides Large: Cross-Model Co-Learning for Test-Time AdaptationCode0
DeltaSHAP: Explaining Prediction Evolutions in Online Patient Monitoring with Shapley ValuesCode0
Harnessing Text-to-Image Diffusion Models for Point Cloud Self-Supervised LearningCode0
CoVAE: Consistency Training of Variational AutoencodersCode0
Mind the Gap: Preserving and Compensating for the Modality Gap in CLIP-Based Continual LearningCode0
RAMA: Retrieval-Augmented Multi-Agent Framework for Misinformation Detection in Multimodal Fact-CheckingCode0
Optimizing Basis Function Selection in Constructive Wavelet Neural Networks and Its ApplicationsCode0
Cross Knowledge Distillation between Artificial and Spiking Neural NetworksCode0
Geometric Generative Modeling with Noise-Conditioned Graph NetworksCode0
ESPFormer: Doubly-Stochastic Attention with Expected Sliced Transport PlansCode0
DLBAcalib: Robust Extrinsic Calibration for Non-Overlapping LiDARs Based on Dual LBACode0
Ambiguity-Aware and High-Order Relation Learning for Multi-Grained Image-Text MatchingCode0
AlphaVAE: Unified End-to-End RGBA Image Reconstruction and Generation with Alpha-Aware Representation LearningCode0
DS@GT at Touché: Large Language Models for Retrieval-Augmented DebateCode0
DTECT: Dynamic Topic Explorer & Context TrackerCode0
ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow MatchingCode4
CompassJudger-2: Towards Generalist Judge Model via Verifiable RewardsCode2
Meta-autoencoders: An approach to discovery and representation of relationships between dynamically evolving classes0
Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distribution Shift0
Adversarial Activation Patching: A Framework for Detecting and Mitigating Emergent Deception in Safety-Aligned Transformers0
LLM-Stackelberg Games: Conjectural Reasoning Equilibria and Their Applications to Spearphishing0
SnapMoGen: Human Motion Generation from Expressive Texts0
Continual Reinforcement Learning by Planning with Online World Models0
RoHOI: Robustness Benchmark for Human-Object Interaction DetectionCode0
ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark EvaluationCode0
I^2-World: Intra-Inter Tokenization for Efficient Dynamic 4D Scene ForecastingCode2
PoseLLM: Enhancing Language-Guided Human Pose Estimation with MLP AlignmentCode0
Deep Reinforcement Learning with Gradient Eligibility TracesCode1
Generative Latent Kernel Modeling for Blind Motion DeblurringCode0
Robust Spatiotemporal Epidemic Modeling with Integrated Adaptive Outlier DetectionCode0
PanoDiff-SR: Synthesizing Dental Panoramic Radiographs using Diffusion and Super-resolutionCode0
BayesTTA: Continual-Temporal Test-Time Adaptation for Vision-Language Models via Gaussian Discriminant AnalysisCode0
Visual Semantic Description Generation with MLLMs for Image-Text MatchingCode0
PRISM: Reducing Spurious Implicit Biases in Vision-Language Models with LLM-Guided Embedding ProjectionCode0
Spectral Manifold Harmonization for Graph Imbalanced RegressionCode0
Multimodal Cardiovascular Risk Profiling Using Self-Supervised Learning of PolysomnographyCode0
Leanabell-Prover-V2: Verifier-integrated Reasoning for Formal Theorem Proving via Reinforcement LearningCode0
Fair-FLIP: Fair Deepfake Detection with Fairness-Oriented Final Layer Input PrioritisingCode0
Vision-Language-Vision Auto-Encoder: Scalable Knowledge Distillation from Diffusion Models0
Single-Step Latent Diffusion for Underwater Image Restoration0
Cycle Context Verification for In-Context Medical Image SegmentationCode0
One-Pass to Reason: Token Duplication and Block-Sparse Mask for Efficient Fine-Tuning on Multi-Turn ReasoningCode0
Predicting Air Pollution in Cork, Ireland Using Machine LearningCode0
Transfer Learning and Mixup for Fine-Grained Few-Shot Fungi ClassificationCode0
LLaPa: A Vision-Language Model Framework for Counterfactual-Aware Procedural PlanningCode0
OpenCodeReasoning-II: A Simple Test Time Scaling Approach via Self-Critique0
Multilingual Multimodal Software Developer for Code Generation0
Droid: A Resource Suite for AI-Generated Code Detection0
Conformation-Aware Structure Prediction of Antigen-Recognizing Immune ProteinsCode1
Show:102550
← PrevPage 253 of 9486Next →