SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 1350113550 of 474278 papers

TitleStatusHype
NavMorph: A Self-Evolving World Model for Vision-and-Language Navigation in Continuous EnvironmentsCode2
L0: Reinforcement Learning to Become General AgentsCode3
Improve Underwater Object Detection through YOLOv12 Architecture and Physics-informed AugmentationCode1
RAG-R1 : Incentivize the Search and Reasoning Capabilities of LLMs through Multi-query ParallelismCode5
Flexibility-Conditioned Protein Structure Design with Flow MatchingCode0
Accurate Parameter-Efficient Test-Time Adaptation for Time Series ForecastingCode0
Endo-4DGX: Robust Endoscopic Scene Reconstruction and Illumination Correction with Gaussian SplattingCode0
Learning Counterfactually Decoupled Attention for Open-World Model AttributionCode0
MoCa: Modality-aware Continual Pre-training Makes Better Bidirectional Multimodal Embeddings0
Token Activation Map to Visually Explain Multimodal LLMs0
IR3D-Bench: Evaluating Vision-Language Model Scene Understanding as Agentic Inverse Rendering0
Teaching a Language Model to Speak the Language of Tools0
Frequency-enhanced Multi-granularity Context Network for Efficient Vertebrae SegmentationCode0
Forget-MI: Machine Unlearning for Forgetting Multimodal Information in Healthcare SettingsCode0
External Data-Enhanced Meta-Representation for Adaptive Probabilistic Load ForecastingCode0
UrbanLLaVA: A Multi-modal Large Language Model for Urban Intelligence with Spatial Reasoning and UnderstandingCode0
High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level AnnotationCode0
Boosting LLM's Molecular Structure Elucidation with Knowledge Enhanced Tree Search ReasoningCode0
Dynamic Contrastive Learning for Hierarchical Retrieval: A Case Study of Distance-Aware Cross-View Geo-LocalizationCode0
Are Large Language Models Capable of Deep Relational Reasoning? Insights from DeepSeek-R1 and Benchmark ComparisonsCode0
RiverText: A Python Library for Training and Evaluating Incremental Word Embeddings from Text Data StreamsCode0
SIEDD: Shared-Implicit Encoder with Discrete DecodersCode0
Sub-MoE: Efficient Mixture-of-Expert LLMs Compression via Subspace Expert MergingCode0
TVG-SLAM: Robust Gaussian Splatting SLAM with Tri-view Geometric Constraints0
DDL: A Dataset for Interpretable Deepfake Detection and Localization in Real-World Scenarios0
DC-TTA: Divide-and-Conquer Framework for Test-Time Adaptation of Interactive Segmentation0
TOMI: Transforming and Organizing Music Ideas for Multi-Track Compositions with Full-Song StructureCode1
CycleVAR: Repurposing Autoregressive Model for Unsupervised One-Step Image TranslationCode1
Datasets for Fairness in Language Models: An In-Depth SurveyCode1
Double-Diffusion: Diffusion Conditioned Diffusion Probabilistic Model For Air Quality Prediction0
Where, What, Why: Towards Explainable Driver Attention PredictionCode1
SurgTPGS: Semantic 3D Surgical Scene Understanding with Text Promptable Gaussian SplattingCode1
Ovis-U1 Technical ReportCode3
Revisiting Z Transform Laplace Inversion: To Correct flaws in Signal and System Theory0
Dare to Plagiarize? Plagiarized Painting Recognition and Retrieval0
Context-Driven Knowledge Graph Completion with Semantic-Aware Relational Message Passing0
MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow EstimationCode2
ANN-Based Grid Impedance Estimation for Adaptive Gain Scheduling in VSG Under Dynamic Grid ConditionsCode0
Computer-Aided Multi-Stroke Character Simplification by Stroke RemovalCode0
Advanced Financial Reasoning at Scale: A Comprehensive Evaluation of Large Language Models on CFA Level III0
FinAI-BERT: A Transformer-Based Model for Sentence-Level Detection of AI Disclosures in Financial ReportsCode0
FedRef: Communication-Efficient Bayesian Fine Tuning with Reference ModelCode0
VolumetricSMPL: A Neural Volumetric Body Model for Efficient Interactions, Contacts, and CollisionsCode2
CRISP-SAM2: SAM2 with Cross-Modal Interaction and Semantic Prompting for Multi-Organ SegmentationCode1
RoboScape: Physics-informed Embodied World ModelCode0
MOTOR: Multimodal Optimal Transport via Grounded Retrieval in Medical Visual Question AnsweringCode0
Degradation-Modeled Multipath Diffusion for Tunable Metalens Photography0
STR-Match: Matching SpatioTemporal Relevance Score for Training-Free Video Editing0
MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning0
Confident Splatting: Confidence-Based Compression of 3D Gaussian Splatting via Learnable Beta DistributionsCode0
Show:102550
← PrevPage 271 of 9486Next →