SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 5001-5050 of 661,570 papers

Title | Status | Hype
Masked BRep Autoencoder via Hierarchical Graph Transformer | | 0
Analyzing Error Sources in Global Feature Effect Estimation | | 0
Physics-Informed Neural Systems for the Simulation of EUV Electromagnetic Wave Diffraction from a Lithography Mask | | 0
Tracking the Discriminative Axis: Dual Prototypes for Test-Time OOD Detection Under Covariate Shift | | 0
SAGE: Multi-Agent Self-Evolution for LLM Reasoning | | 0
Noisy Data is Destructive to Reinforcement Learning with Verifiable Rewards | | 0
Structure-Aware Multimodal LLM Framework for Trustworthy Near-Field Beam Prediction | | 0
Deep Adaptive Model-Based Design of Experiments | | 0
Dual Consensus: Escaping from Spurious Majority in Unsupervised RLVR via Two-Stage Vote Mechanism | | 0
Speak, Segment, Track, Navigate: An Interactive System for Video-Guided Skull-Base Surgery | | 0
3D tomography of exchange phase in a Si/SiGe quantum dot device | | 0
POaaS: Minimal-Edit Prompt Optimization as a Service to Lift Accuracy and Cut Hallucinations on On-Device sLLMs | | 0
The Era of End-to-End Autonomy: Transitioning from Rule-Based Driving to Large Driving Models | | 0
Volumetrically Consistent Implicit Atlas Learning via Neural Diffeomorphic Flow for Placenta MRI | | 0
A Context Alignment Pre-processor for Enhancing the Coherence of Human-LLM Dialog | | 0
Safe Distributionally Robust Feature Selection under Covariate Shift | | 0
Diffusion Models for Joint Audio-Video Generation | | 0
Reevaluating the Intra-Modal Misalignment Hypothesis in CLIP | | 0
ViT-AdaLA: Adapting Vision Transformers with Linear Attention | | 0
Adaptive regularization parameter selection for high-dimensional inverse problems: A Bayesian approach with Tucker low-rank constraints | | 0
Structured prototype regularization for synthetic-to-real driving scene parsing | | 0
Attribution Upsampling should Redistribute, Not Interpolate | | 0
SEAHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Southeast Asia | | 0
ClaimFlow: Tracing the Evolution of Scientific Claims in NLP | | 0
Interact3D: Compositional 3D Generation of Interactive Objects | | 0
Parallel In-context Learning for Large Vision Language Models | | 0
NanoGS: Training-Free Gaussian Splat Simplification | | 0
Frequency Matters: Fast Model-Agnostic Data Curation for Pruning and Quantization | | 0
ASDA: Automated Skill Distillation and Adaptation for Financial Reasoning | | 0
Out-of-Distribution Object Detection in Street Scenes via Synthetic Outlier Exposure and Transfer Learning | | 0
Functorial Neural Architectures from Higher Inductive Types | | 0
The Finetuner's Fallacy: When to Pretrain with Your Finetuning Data | | 0
Boosting Quantitive and Spatial Awareness for Zero-Shot Object Counting | | 0
DualPrim: Compact 3D Reconstruction with Positive and Negative Primitives | | 0
Communication-Aware Multi-Agent Reinforcement Learning for Decentralized Cooperative UAV Deployment | | 0
GATS: Gaussian Aware Temporal Scaling Transformer for Invariant 4D Spatio-Temporal Point Cloud Representation | | 0
DyJR: Preserving Diversity in Reinforcement Learning with Verifiable Rewards via Dynamic Jensen-Shannon Replay | | 0
Segmentation-before-Staining Improves Structural Fidelity in Virtual IHC-to-Multiplex IF Translation | | 0
SQL-ASTRA: Alleviating Sparse Feedback in Agentic SQL via Column-Set Matching and Trajectory Aggregation | | 0
SignNav: Leveraging Signage for Semantic Visual Navigation in Large-Scale Indoor Environments | | 0
360° Image Perception with MLLMs: A Comprehensive Benchmark and a Training-Free Method | | 0
KidsNanny: A Two-Stage Multimodal Content Moderation Pipeline Integrating Visual Classification, Object Detection, OCR, and Contextual Reasoning for Child Safety | | 0
Sample-Efficient Adaptation of Drug-Response Models to Patient Tumors under Strong Biological Domain Shift | | 0
Are Large Language Models Truly Smarter Than Humans? | | 0
A Scoping Review of AI-Driven Digital Interventions in Mental Health Care: Mapping Applications Across Screening, Support, Monitoring, Prevention, and Clinical Education | | 0
Offline Exploration-Aware Fine-Tuning for Long-Chain Mathematical Reasoning | | 0
Leveling3D: Leveling Up 3D Reconstruction with Feed-Forward 3D Gaussian Splatting and Geometry-Aware Generation | | 0
SpecSteer: Synergizing Local Context and Global Reasoning for Efficient Personalized Generation | | 0
Ground Reaction Inertial Poser: Physics-based Human Motion Capture from Sparse IMUs and Insole Pressure Sensors | | 0
Exclusivity-Guided Mask Learning for Semi-Supervised Crowd Instance Segmentation and Counting | | 0