SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 82018250 of 661570 papers

TitleStatusHype
Prompt-Free Diffusion: Taking "Text" out of Text-to-Image Diffusion ModelsCode2
ScoreFlow: Mastering LLM Agent Workflows via Score-based Preference OptimizationCode2
SalM2: An Extremely Lightweight Saliency Mamba Model for Real-Time Cognitive Awareness of Driver AttentionCode2
TritonBench: Benchmarking Large Language Model Capabilities for Generating Triton OperatorsCode2
A Survey of Safety on Large Vision-Language Models: Attacks, Defenses and EvaluationsCode2
Sanity Checking Causal Representation Learning on a Simple Real-World SystemCode2
Enhanced Contrastive Learning with Multi-view Longitudinal Data for Chest X-ray Report GenerationCode2
A Training-free LLM-based Approach to General Chinese Character Error CorrectionCode2
SemiSAM+: Rethinking Semi-Supervised Medical Image Segmentation in the Era of Foundation ModelsCode2
Neural Posterior Estimation for Cataloging Astronomical Images with Spatially Varying Backgrounds and Point Spread FunctionsCode2
AnalogGenie: A Generative Engine for Automatic Discovery of Analog Circuit TopologiesCode2
Patch-wise Structural Loss for Time Series ForecastingCode2
Find First, Track Next: Decoupling Identification and Propagation in Referring Video Object SegmentationCode2
MM-OR: A Large Multimodal Operating Room Dataset for Semantic Understanding of High-Intensity Surgical EnvironmentsCode2
PromptPex: Automatic Test Generation for Language Model PromptsCode2
MPA: MultiPath++ Based Architecture for Motion PredictionCode2
Real-time Spatial-temporal Traversability Assessment via Feature-based Sparse Gaussian ProcessCode2
DriveLMM-o1: A Step-by-Step Reasoning Dataset and Large Multimodal Model for Driving Scenario UnderstandingCode2
Bayesian Prompt Flow Learning for Zero-Shot Anomaly DetectionCode2
Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian SplattingCode2
Rapid patient-specific neural networks for intraoperative X-ray to volume registrationCode2
Generalized Few-shot 3D Point Cloud Segmentation with Vision-Language ModelCode2
Tokenize Image as a SetCode2
HahahaCode2
WaveFM: A High-Fidelity and Efficient Vocoder Based on Flow MatchingCode2
MaSS13K: A Matting-level Semantic Segmentation BenchmarkCode2
STEVE: A Step Verification Pipeline for Computer-use Agent TrainingCode2
COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian SplittingCode2
SkySenseGPT: A Fine-Grained Instruction Tuning Dataset and Model for Remote Sensing Vision-Language UnderstandingCode2
OntologyRAG: Better and Faster Biomedical Code Mapping with Retrieval-Augmented Generation (RAG) Leveraging Ontology Knowledge Graphs and Large Language ModelsCode2
SALT: A Flexible Semi-Automatic Labeling Tool for General LiDAR Point Clouds with Cross-Scene Adaptability and 4D ConsistencyCode2
TextCrafter: Accurately Rendering Multiple Texts in Complex Visual ScenesCode2
Force-Free Molecular Dynamics Through Autoregressive Equivariant NetworksCode2
Graph ODEs and Beyond: A Comprehensive Survey on Integrating Differential Equations with Graph Neural NetworksCode2
GPG: A Simple and Strong Reinforcement Learning Baseline for Model ReasoningCode2
CrackSQL: A Hybrid SQL Dialect Translation System Powered by Large Language ModelsCode2
SpaceR: Reinforcing MLLMs in Video Spatial ReasoningCode2
RWKVTTS: Yet another TTS based on RWKV-7Code2
Sleep-time Compute: Beyond Inference Scaling at Test-timeCode2
Seurat: From Moving Points to DepthCode2
RWKV-X: A Linear Complexity Hybrid Language ModelCode2
Representation Learning for Tabular Data: A Comprehensive SurveyCode2
Test-Time Domain Generalization via Universe Learning: A Multi-Graph Matching Approach for Medical Image SegmentationCode2
DTGBrepGen: A Novel B-rep Generative Model through Decoupling Topology and GeometryCode2
Learning to Detect Multi-class Anomalies with Just One Normal Image PromptCode2
Beyond 'Aha!': Toward Systematic Meta-Abilities Alignment in Large Reasoning ModelsCode2
Relational Graph TransformerCode2
AdaptThink: Reasoning Models Can Learn When to ThinkCode2
AD-AGENT: A Multi-agent Framework for End-to-end Anomaly DetectionCode2
FlightGPT: Towards Generalizable and Interpretable UAV Vision-and-Language Navigation with Vision-Language ModelsCode2
Show:102550
← PrevPage 165 of 13232Next →