SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 90019050 of 661570 papers

TitleStatusHype
Toward Robust LLM-Based Judges: Taxonomic Bias Evaluation and Debiasing Optimization0
DC-W2S: Dual-Consensus Weak-to-Strong Training for Reliable Process Reward Modeling in Biological Reasoning0
Invisible Safety Threat: Malicious Finetuning for LLM via Steganography0
SAMoE-VLA: A Scene Adaptive Mixture-of-Experts Vision-Language-Action Model for Autonomous Driving0
SaiVLA-0: Cerebrum--Pons--Cerebellum Tripartite Architecture for Compute-Aware Vision-Language-Action0
Ramsa: A Large Sociolinguistically Rich Emirati Arabic Speech Corpus for ASR and TTS0
Foley-Flow: Coordinated Video-to-Audio Generation with Masked Audio-Visual Alignment and Dynamic Conditional Flows0
Gradually Excavating External Knowledge for Implicit Complex Question Answering0
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery5
TRIAGE: Type-Routed Interventions via Aleatoric-Epistemic Gated Estimation in Robotic Manipulation and Adaptive Perception -- Don't Treat All Uncertainty the Same0
Explainable Condition Monitoring via Probabilistic Anomaly Detection Applied to Helicopter Transmissions0
UniGround: Universal 3D Visual Grounding via Training-Free Scene Parsing0
Fast Low-light Enhancement and Deblurring for 3D Dark Scenes0
VesselFusion: Diffusion Models for Vessel Centerline Extraction from 3D CT Images0
Mitigating Homophily Disparity in Graph Anomaly Detection: A Scalable and Adaptive Approach0
DARC: Disagreement-Aware Alignment via Risk-Constrained Decoding0
MV-Fashion: Towards Enabling Virtual Try-On and Size Estimation with Multi-View Paired Data0
Edged USLAM: Edge-Aware Event-Based SLAM with Learning-Based Depth Priors0
Gender Bias in MT for a Genderless Language: New Benchmarks for Basque0
Outlier-robust Autocovariance Least Square Estimation via Iteratively Reweighted Least Square0
Unifying On- and Off-Policy Variance Reduction Methods0
An explainable hybrid deep learning-enabled intelligent fault detection and diagnosis approach for automotive software systems validation0
Evidence-Driven Reasoning for Industrial Maintenance Using Heterogeneous Data0
AutoAdapt: An Automated Domain Adaptation Framework for LLMs0
TildeOpen LLM: Leveraging Curriculum Learning to Achieve Equitable Language Representation0
SERQ: Saliency-Aware Low-Rank Error Reconstruction for LLM Quantization0
Sequential Service Region Design with Capacity-Constrained Investment and Spillover Effect0
Supporting Workflow Reproducibility by Linking Bioinformatics Tools across Papers and Executable Code0
Fusion-Poly: A Polyhedral Framework Based on Spatial-Temporal Fusion for 3D Multi-Object Tracking0
MM-TS: Multi-Modal Temperature and Margin Schedules for Contrastive Learning with Long-Tail Data0
Alignment-Aware and Reliability-Gated Multimodal Fusion for Unmanned Aerial Vehicle Detection Across Heterogeneous Thermal-Visual Sensors0
Multi-Objective Evolutionary Optimization of Chance-Constrained Multiple-Choice Knapsack Problems with Implicit Probability Distributions0
SplitAgent: A Privacy-Preserving Distributed Architecture for Enterprise-Cloud Agent Collaboration0
GarmentPainter: Efficient 3D Garment Texture Synthesis with Character-Guided Diffusion Model0
Optimising antibiotic switching via forecasting of patient physiology0
The Struggle Between Continuation and Refusal: A Mechanistic Analysis of the Continuation-Triggered Jailbreak in LLMs0
Prototype-Guided Concept Erasure in Diffusion Models0
Exploring Deep Learning and Ultra-Widefield Imaging for Diabetic Retinopathy and Macular Edema0
Fibration Policy Optimization0
Sensivity of LLMs' Explanations to the Training Randomness:Context, Class & Task Dependencies0
DynamicVGGT: Learning Dynamic Point Maps for 4D Scene Reconstruction in Autonomous Driving0
Event-based Motion & Appearance Fusion for 6D Object Pose Tracking0
FlowTouch: View-Invariant Visuo-Tactile Prediction0
WaDi: Weight Direction-aware Distillation for One-step Image Synthesis0
Airborne Magnetic Anomaly Navigation with Neural-Network-Augmented Online Calibration0
Towards a more efficient bias detection in financial language models0
SCL-GNN: Towards Generalizable Graph Neural Networks via Spurious Correlation Learning0
AdaCultureSafe: Adaptive Cultural Safety Grounded by Cultural Knowledge in Large Language Models0
TA-RNN-Medical-Hybrid: A Time-Aware and Interpretable Framework for Mortality Risk Prediction0
OSCAR: Occupancy-based Shape Completion via Acoustic Neural Implicit Representations0
Show:102550
← PrevPage 181 of 13232Next →