SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1675116800 of 474278 papers

TitleStatusHype
Socratic-MCTS: Test-Time Visual Reasoning by Asking the Right Questions0
Diffuse and Disperse: Image Generation with Representation Regularization0
An Adaptive Method Stabilizing Activations for Enhanced GeneralizationCode0
The interplay of robustness and generalization in quantum machine learningCode0
A Survey of Link Prediction in N-ary Knowledge GraphsCode0
Modular Recurrence in Contextual MDPs for Universal Morphology Control0
Approaching Dialogue State Tracking via Aligning Speech Encoders and LLMs0
MD-ViSCo: A Unified Model for Multi-Directional Vital Sign Waveform ConversionCode0
Data Augmentation For Small Object using Fast AutoAugment0
Protriever: End-to-End Differentiable Protein Homology Search for Fitness Prediction0
SakugaFlow: A Stagewise Illustration Framework Emulating the Human Drawing Process and Providing Interactive Tutoring for Novice Drawing Skills0
UD-KSL Treebank v1.3: A semi-automated framework for aligning XPOS-extracted units with UPOS tags0
NysAct: A Scalable Preconditioned Gradient Descent using Nystrom ApproximationCode0
SDTagNet: Leveraging Text-Annotated Navigation Maps for Online HD Map ConstructionCode1
MOBODY: Model Based Off-Dynamics Offline Reinforcement LearningCode0
TableDreamer: Progressive and Weakness-guided Data Synthesis from Scratch for Table Instruction TuningCode0
Employing self-supervised learning models for cross-linguistic child speech maturity classificationCode0
Enhancing Reasoning Capabilities of Small Language Models with Blueprints and Prompt Template Search0
Adapting Vision-Language Foundation Model for Next Generation Medical Ultrasound Image AnalysisCode1
Towards Secure and Private Language Models for Nuclear Power Plants0
Variational Autoencoder-Based Approach to Latent Feature Analysis on Efficient Representation of Power Load Monitoring Data0
Brevity is the soul of sustainability: Characterizing LLM response lengthsCode0
Systematic and Efficient Construction of Quadratic Unconstrained Binary Optimization Forms for High-order and Dense Interactions0
Summarization for Generative Relation Extraction in the Microbiome Domain0
Spatial Transcriptomics Expression Prediction from Histopathology Based on Cross-Modal Mask Reconstruction and Contrastive Learning0
LeanTutor: A Formally-Verified AI Tutor for Mathematical Proofs0
ORFS-agent: Tool-Using Agents for Chip Design Optimization0
Single-Node Trigger Backdoor Attacks in Graph-Based Recommendation Systems0
SHIELD: Multi-task Multi-distribution Vehicle Routing Solver with Sparsity and Hierarchy0
A Survey on Large Language Models for Mathematical Reasoning0
Hybrid Reasoning for Perception, Explanation, and Autonomous Action in Manufacturing0
How Much To Guide: Revisiting Adaptive Guidance in Classifier-Free Guidance Text-to-Vision Diffusion Models0
Reinforcement Learning Teachers of Test Time Scaling0
How to Provably Improve Return Conditioned Supervised Learning?0
Robust Evolutionary Multi-Objective Network Architecture Search for Reinforcement Learning (EMNAS-RL)0
MasHost Builds It All: Autonomous Multi-Agent System Directed by Reinforcement Learning0
Flow Matching Meets PDEs: A Unified Framework for Physics-Constrained Generation0
Exploration by Random Reward Perturbation0
MEMETRON: Metaheuristic Mechanisms for Test-time Response Optimization of Large Language Models0
Comparing human and LLM proofreading in L2 writing: Impact on lexical and syntactic features0
Boosting Gradient Leakage Attacks: Data Reconstruction in Realistic FL Settings0
DeepForm: Reasoning Large Language Model for Communication System Formulation0
TS-PIELM: Time-Stepping Physics-Informed Extreme Learning Machine Facilitates Soil Consolidation Analyses0
PerfTracker: Online Performance Troubleshooting for Large-scale Model Training in Production0
Locating Tennis Ball Impact on the Racket in Real Time Using an Event Camera0
Convergence of Spectral Principal Paths: How Deep Networks Distill Linear Representations from Noisy Inputs0
A Probability-guided Sampler for Neural Implicit Surface Rendering0
Orientation Matters: Making 3D Generative Models Orientation-Aligned0
ATAS: Any-to-Any Self-Distillation for Enhanced Open-Vocabulary Dense Prediction0
TraGraph-GS: Trajectory Graph-based Gaussian Splatting for Arbitrary Large-Scale Scene Rendering0
Show:102550
← PrevPage 336 of 9486Next →