SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1485114900 of 474278 papers

TitleStatusHype
InftyThink+: Effective and Efficient Infinite-Horizon Reasoning via Reinforcement Learning1
Large Multimodal Models as General In-Context Classifiers1
CodeOCR: On the Effectiveness of Vision Language Models in Code Understanding1
Optimal Scaling Needs Optimal Norm1
Spider-Sense: Intrinsic Risk Sensing for Efficient Agent Defense with Hierarchical Adaptive Screening1
Embed-RL: Reinforcement Learning for Reasoning-Driven Multimodal Embeddings1
SeeThrough3D: Occlusion Aware 3D Control in Text-to-Image Generation1
TADA! Tuning Audio Diffusion Models through Activation Steering1
Vlaser: Vision-Language-Action Model with Synergistic Embodied Reasoning1
HSImul3R: Physics-in-the-Loop Reconstruction of Simulation-Ready Human-Scene Interactions1
DSGym: A Holistic Framework for Evaluating and Training Data Science Agents1
SQuTR: A Robustness Benchmark for Spoken Query to Text Retrieval under Acoustic Noise1
MedCLIPSeg: Probabilistic Vision-Language Adaptation for Data-Efficient and Generalizable Medical Image Segmentation1
DREAM: Where Visual Understanding Meets Text-to-Image Generation1
CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance1
See Less, See Right: Bi-directional Perceptual Shaping For Multimodal Reasoning1
PluRel: Synthetic Data unlocks Scaling Laws for Relational Foundation Models1
Learning to Configure Agentic AI Systems1
Benchmarking Vision-Language Models for French PDF-to-Markdown Conversion1
Planning in 8 Tokens: A Compact Discrete Tokenizer for Latent World Model1
ProRL Agent: Rollout-as-a-Service for RL Training of Multi-Turn LLM Agents1
CASA: Cross-Attention over Self-Attention for Efficient Vision-Language Fusion1
BandPO: Bridging Trust Regions and Ratio Clipping via Probability-Aware Bounds for LLM Reinforcement Learning1
HEARTS: Benchmarking LLM Reasoning on Health Time Series1
CodeV: Code with Images for Faithful Visual Reasoning via Tool-Aware Policy Optimization1
AdaptVision: Efficient Vision-Language Models via Adaptive Visual Acquisition1
U6G XL-MIMO Radiomap Prediction: Multi-Config Dataset and Beam Map Approach1
Epistemic Diversity and Knowledge Collapse in Large Language Models1
OCRVerse: Towards Holistic OCR in End-to-End Vision-Language Models1
NOVA: Sparse Control, Dense Synthesis for Pair-Free Video Editing1
How2Everything: Mining the Web for How-To Procedures to Evaluate and Improve LLMs1
MergeMix: A Unified Augmentation Paradigm for Visual and Multi-Modal Understanding1
-Reasoner: LLM Reasoning via Test-Time Gradient Descent in Latent Space1
Sparking Scientific Creativity via LLM-Driven Interdisciplinary Inspiration1
Pix2Shape: Towards Unsupervised Learning of 3D Scenes from Images using a View-based RepresentationCode1
PeeledHuman: Robust Shape Representation for Textured 3D Human Body ReconstructionCode1
Learning Graph Regularisation for Guided Super-ResolutionCode1
SpeechNet: A Universal Modularized Model for Speech Processing TasksCode1
CNN-Based Image Reconstruction Method for Ultrafast Ultrasound ImagingCode1
Peeking inside the Black Box: Interpreting Deep Learning Models for Exoplanet Atmospheric RetrievalsCode1
Uncrowded Hypervolume-based Multi-objective Optimization with Gene-pool Optimal MixingCode1
A Discourse-Aware Attention Model for Abstractive Summarization of Long DocumentsCode1
Can Q-Learning with Graph Networks Learn a Generalizable Branching Heuristic for a SAT Solver?Code1
Fairwashing Explanations with Off-Manifold DetergentCode1
Emo-DNA: Emotion Decoupling and Alignment Learning for Cross-Corpus Speech Emotion RecognitionCode1
Anisotropic 3D Multi-Stream CNN for Accurate Prostate Segmentation from Multi-Planar MRICode1
Imposing Relation Structure in Language-Model Embeddings Using Contrastive LearningCode1
Multi-Task Learning for Dense Prediction Tasks: A SurveyCode1
Smooth activations and reproducibility in deep networksCode1
Homomorphism Autoencoder -- Learning Group Structured Representations from Observed TransitionsCode1
Show:102550
← PrevPage 298 of 9486Next →