SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 94769500 of 474278 papers

TitleStatusHype
UI-UG: A Unified MLLM for UI Understanding and GenerationCode0
A Multi-purpose Tracking Framework for Salmon Welfare Monitoring in Challenging EnvironmentsCode0
Multi-modal Liver Segmentation and Fibrosis Staging Using Real-world MRI ImagesCode0
Type-Less yet Type-Aware Inductive Link Prediction with Pretrained Language ModelsCode0
Feedback Forensics: A Toolkit to Measure AI PersonalityCode0
Automated and Scalable SEM Image Analysis of Perovskite Solar Cell Materials via a Deep Segmentation FrameworkCode0
DeepScientist: Advancing Frontier-Pushing Scientific Findings ProgressivelyCode0
Text-to-CT Generation via 3D Latent Diffusion Model with Contrastive Vision-Language PretrainingCode0
LoRAFusion: Efficient LoRA Fine-Tuning for LLMsCode0
Retrieval-Augmented Generation for Electrocardiogram-Language ModelsCode0
AVCD: Mitigating Hallucinations in Audio-Visual Large Language Models through Contrastive DecodingCode0
U-Mamba2-SSL for Semi-Supervised Tooth and Pulp Segmentation in CBCTCode0
EchoingECG: An Electrocardiogram Cross-Modal Model for Echocardiogram TasksCode0
CIMNAS: A Joint Framework for Compute-In-Memory-Aware Neural Architecture SearchCode0
DeepJSONEval: Benchmarking Complex Nested JSON Data Mining for Large Language ModelsCode0
GeoLink: Empowering Remote Sensing Foundation Model with OpenStreetMap DataCode0
SGS: Segmentation-Guided Scoring for Global Scene InconsistenciesCode0
MEDAKA: Construction of Biomedical Knowledge Graphs Using Large Language ModelsCode0
Towards Continual Expansion of Data Coverage: Automatic Text-guided Edge-case SynthesisCode0
Generalized Fine-Grained Category Discovery with Multi-Granularity Conceptual ExpertsCode0
Refine Drugs, Don't Complete Them: Uniform-Source Discrete Flows for Fragment-Based Drug DiscoveryCode0
Attention over Scene Graphs: Indoor Scene Representations Toward CSAI ClassificationCode0
AccidentBench: Benchmarking Multimodal Understanding and Reasoning in Vehicle Accidents and BeyondCode0
Stitch: Training-Free Position Control in Multimodal Diffusion TransformersCode0
FakeChain: Exposing Shallow Cues in Multi-Step Deepfake DetectionCode0
Show:102550
← PrevPage 380 of 18972Next →