SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 34513500 of 659983 papers

TitleStatusHype
Noisy Data is Destructive to Reinforcement Learning with Verifiable Rewards0
Structure-Aware Multimodal LLM Framework for Trustworthy Near-Field Beam Prediction0
Deep Adaptive Model-Based Design of Experiments0
Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism0
Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery0
3D tomography of exchange phase in a Si/SiGe quantum dot device0
POaaS: Minimal-Edit Prompt Optimization as a Service to Lift Accuracy and Cut Hallucinations on On-Device sLLMs0
The Era of End-to-End Autonomy: Transitioning from Rule-Based Driving to Large Driving Models0
Volumetrically Consistent Implicit Atlas Learning via Neural Diffeomorphic Flow for Placenta MRI0
A Context Alignment Pre-processor for Enhancing the Coherence of Human-LLM Dialog0
Safe Distributionally Robust Feature Selection under Covariate Shift0
Diffusion Models for Joint Audio-Video Generation0
Reevaluating the Intra-Modal Misalignment Hypothesis in CLIP0
ViT-AdaLA: Adapting Vision Transformers with Linear Attention0
Adaptive regularization parameter selection for high-dimensional inverse problems: A Bayesian approach with Tucker low-rank constraints0
Structured prototype regularization for synthetic-to-real driving scene parsing0
Attribution Upsampling should Redistribute, Not Interpolate0
SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia0
ClaimFlow: Tracing the Evolution of Scientific Claims in NLP0
Interact3D: Compositional 3D Generation of Interactive Objects0
Parallel In-context Learning for Large Vision Language Models0
NanoGS: Training-Free Gaussian Splat Simplification0
Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization0
ASDA: Automated Skill Distillation and Adaptation for Financial Reasoning0
Out-of-Distribution Object Detection in Street Scenes via Synthetic Outlier Exposure and Transfer Learning0
Functorial Neural Architectures from Higher Inductive Types0
The Finetuner's Fallacy: When to Pretrain with Your Finetuning Data0
Boosting Quantitive and Spatial Awareness for Zero-Shot Object Counting0
DualPrim: Compact 3D Reconstruction with Positive and Negative Primitives0
Communication-Aware Multi-Agent Reinforcement Learning for Decentralized Cooperative UAV Deployment0
GATS: Gaussian Aware Temporal Scaling Transformer for Invariant 4D Spatio-Temporal Point Cloud Representation0
DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay0
Segmentation-before-Staining Improves Structural Fidelity in Virtual IHC-to-Multiplex IF Translation0
SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation0
SignNav: Leveraging Signage for Semantic Visual Navigation in Large-Scale Indoor Environments0
360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method0
KidsNanny: A Two-Stage Multimodal Content Moderation Pipeline Integrating Visual Classification, Object Detection, OCR, and Contextual Reasoning for Child Safety0
Sample-Efficient Adaptation of Drug-Response Models to Patient Tumors under Strong Biological Domain Shift0
Are Large Language Models Truly Smarter Than Humans?0
A Scoping Review of AI-Driven Digital Interventions in Mental Health Care: Mapping Applications Across Screening, Support, Monitoring, Prevention, and Clinical Education0
Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning0
Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation0
SpecSteer: Synergizing Local Context and Global Reasoning for Efficient Personalized Generation0
Ground Reaction Inertial Poser: Physics-based Human Motion Capture from Sparse IMUs and Insole Pressure Sensors0
Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting0
RASLF: Representation-Aware State Space Model for Light Field Super-Resolution0
More Rounds, More Noise: Why Multi-Turn Review Fails to Improve Cross-Context Verification0
Visual Prompt Discovery via Semantic Exploration0
When Thinking Hurts: Mitigating Visual Forgetting in Video Reasoning via Frame Repetition0
Is Semi-Automatic Transcription Useful in Corpus Creation? Preliminary Considerations on the KIParla Corpus0
Show:102550
← PrevPage 70 of 13200Next →