SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 18951-19000 of 474278 papers

Title | Status | Hype
Writing-Zero: Bridge the Gap Between Non-verifiable Tasks and Verifiable Rewards | - | 0
Empirical Validation of the Independent Chip Model | - | 0
Multi-Analyte, Swab-based Automated Wound Monitor with AI | - | 0
Artificial Empathy: AI based Mental Health | - | 0
PersianMedQA: Language-Centric Evaluation of LLMs in the Persian Medical Domain | - | 0
A Reinforcement Learning-Based Telematic Routing Protocol for the Internet of Underwater Things | - | 0
RoboMoRe: LLM-based Robot Co-design via Joint Optimization of Morphology and Reward | - | 0
Ctrl-Crash: Controllable Diffusion for Realistic Car Crashes | - | 0
Understanding while Exploring: Semantics-driven Active Mapping | - | 0
Visual Embodied Brain: Let Multimodal Large Language Models See, Think, and Control in Spaces | - | 0
Interactive Imitation Learning for Dexterous Robotic Manipulation: Challenges and Perspectives -- A Survey | - | 0
Hi-Dyna Graph: Hierarchical Dynamic Scene Graph for Robotic Autonomy in Human-Centric Environments | - | 0
Supporting architecture evaluation for ATAM scenarios with LLMs | - | 0
Applying Large Language Models to Issue Classification: Revisiting with Extended Data and New Models | - | 0
A Causation-Based Framework for Pricing and Cost Allocation of Energy, Reserves, and Transmission in Modern Power Systems | - | 0
MRDust: Wireless Implant Data Uplink & Localization via Magnetic Resonance Image Modulation | - | 0
Tensor Network for Anomaly Detection in the Latent Space of Proton Collision Events at the LHC | Code | 0
Input-Power-to-State Stability of Time-Varying Systems | - | 0
MOFGPT: Generative Design of Metal-Organic Frameworks using Language Models | Code | 0
Sorrel: A simple and flexible framework for multi-agent reinforcement learning | Code | 1
Vedavani: A Benchmark Corpus for ASR on Vedic Sanskrit Poetry | Code | 0
Generator Based Inference (GBI) | Code | 0
Pushing the Limits of Beam Search Decoding for Transducer-based ASR models | - | 0
Applying Vision Transformers on Spectral Analysis of Astronomical Objects | Code | 0
Beyond Atomic Geometry Representations in Materials Science: A Human-in-the-Loop Multimodal Framework | Code | 0
Hierarchical Level-Wise News Article Clustering via Multilingual Matryoshka Embeddings | - | 0
SoundSculpt: Direction and Semantics Driven Ambisonic Target Sound Extraction | - | 0
Structure-Aware Fill-in-the-Middle Pretraining for Code | Code | 0
Optimal Weighted Convolution for Classification and Denosing | Code | 2
Segmenting France Across Four Centuries | Code | 0
GARLIC: GAussian Representation LearnIng for spaCe partitioning | - | 0
Tackling View-Dependent Semantics in 3D Language Gaussian Splatting | Code | 2
un^2CLIP: Improving CLIP's Visual Detail Capturing Ability via Inverting unCLIP | Code | 1
Model-Guided Network with Cluster-Based Operators for Spatio-Spectral Super-Resolution | Code | 0
Category-Level 6D Object Pose Estimation in Agricultural Settings Using a Lattice-Deformation Framework and Diffusion-Augmented Synthetic Data | - | 0
UniGeo: Taming Video Diffusion for Unified Consistent Geometry Estimation | - | 0
3D Gaussian Splat Vulnerabilities | Code | 1
Pretraining Deformable Image Registration Networks with Random Images | Code | 0
Consistent line clustering using geometric hypergraphs | - | 0
6D Pose Estimation on Point Cloud Data through Prior Knowledge Integration: A Case Study in Autonomous Disassembly | - | 0
ComposeAnything: Composite Object Priors for Text-to-Image Generation | - | 0
Threading Keyframe with Narratives: MLLMs as Strong Long Video Comprehenders | - | 0
50 Years of Automated Face Recognition | - | 0
Harnessing Foundation Models for Robust and Generalizable 6-DOF Bronchoscopy Localization | - | 0
Out of Sight, Not Out of Context? Egocentric Spatial Reasoning in VLMs Across Disjoint Frames | - | 0
LLM-powered Query Expansion for Enhancing Boundary Prediction in Language-driven Action Localization | - | 0
Progressive Class-level Distillation | - | 0
Leadership Assessment in Pediatric Intensive Care Unit Team Training | - | 0
InteractAnything: Zero-shot Human Object Interaction Synthesis via LLM Feedback and Object Affordance Parsing | - | 0
VUDG: A Dataset for Video Understanding Domain Generalization | - | 0
Page 380 of 9486