SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1135111400 of 177340 papers

TitleStatusHype
GREC: Generalized Referring Expression ComprehensionCode2
MixSup: Mixed-grained Supervision for Label-efficient LiDAR-based 3D Object DetectionCode2
PointSea: Point Cloud Completion via Self-structure AugmentationCode2
Emma-X: An Embodied Multimodal Action Model with Grounded Chain of Thought and Look-ahead Spatial ReasoningCode2
Semantic Human Mesh Reconstruction with TexturesCode2
Vision Matters: Simple Visual Perturbations Can Boost Multimodal Math ReasoningCode2
CoLaDa: A Collaborative Label Denoising Framework for Cross-lingual Named Entity RecognitionCode2
ALBench: A Framework for Evaluating Active Learning in Object DetectionCode2
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous EnvironmentsCode2
Bilateral Propagation Network for Depth CompletionCode2
The Gender-GAP Pipeline: A Gender-Aware Polyglot Pipeline for Gender Characterisation in 55 LanguagesCode2
Fino1: On the Transferability of Reasoning Enhanced LLMs to FinanceCode2
HUGS: Human Gaussian SplatsCode2
NusaCrowd: Open Source Initiative for Indonesian NLP ResourcesCode2
Structural Entropy Guided Agent for Detecting and Repairing Knowledge Deficiencies in LLMsCode2
Do Llamas Work in English? On the Latent Language of Multilingual TransformersCode2
Leveraging Rust types for modular specification and verificationCode2
DVLO: Deep Visual-LiDAR Odometry with Local-to-Global Feature Fusion and Bi-Directional Structure AlignmentCode2
Graph Language ModelsCode2
A Short Survey of Viewing Large Language Models in Legal AspectCode2
Stylized Face Sketch Extraction via Generative Prior with Limited DataCode2
Towards Efficient and Scale-Robust Ultra-High-Definition Image DemoireingCode2
RGBDS-SLAM: A RGB-D Semantic Dense SLAM Based on 3D Multi Level Pyramid Gaussian SplattingCode2
An Electrocardiogram Foundation Model Built on over 10 Million Recordings with External Evaluation across Multiple DomainsCode2
SPARS3R: Semantic Prior Alignment and Regularization for Sparse 3D ReconstructionCode2
AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic MovementsCode2
Deep Visual Geo-localization BenchmarkCode2
AtomThink: A Slow Thinking Framework for Multimodal Mathematical ReasoningCode2
VLKEB: A Large Vision-Language Model Knowledge Editing BenchmarkCode2
Probabilistic Language-Image Pre-TrainingCode2
R-AIF: Solving Sparse-Reward Robotic Tasks from Pixels with Active Inference and World ModelsCode2
Zero-Shot Scene Change DetectionCode2
Simultaneously Recovering Multi-Person Meshes and Multi-View Cameras with Human SemanticsCode2
Unraveling Molecular Structure: A Multimodal Spectroscopic Dataset for ChemistryCode2
Discrete Prior-based Temporal-coherent Content Prediction for Blind Face Video RestorationCode2
KBNet: Kernel Basis Network for Image RestorationCode2
Physical Plausibility-aware Trajectory Prediction via Locomotion EmbodimentCode2
Strong Baseline: Multi-UAV Tracking via YOLOv12 with BoT-SORT-ReIDCode2
L-PR: Exploiting LiDAR Fiducial Marker for Unordered Low Overlap Multiview Point Cloud RegistrationCode2
PCP-MAE: Learning to Predict Centers for Point Masked AutoencodersCode2
Protein Conformation Generation via Force-Guided SE(3) Diffusion ModelsCode2
LLMParser: An Exploratory Study on Using Large Language Models for Log ParsingCode2
CARLA2Real: a tool for reducing the sim2real gap in CARLA simulatorCode2
RC-MVSNet: Unsupervised Multi-View Stereo with Neural RenderingCode2
ADELIE: Aligning Large Language Models on Information ExtractionCode2
Web-Shepherd: Advancing PRMs for Reinforcing Web AgentsCode2
DeepPrivacy2: Towards Realistic Full-Body AnonymizationCode2
Pre-training Enhanced Spatial-temporal Graph Neural Network for Multivariate Time Series ForecastingCode2
EasyText: Controllable Diffusion Transformer for Multilingual Text RenderingCode2
Graph Condensation: A SurveyCode2
Show:102550
← PrevPage 228 of 3547Next →