SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 88768900 of 474278 papers

TitleStatusHype
RV-HATE: Reinforced Multi-Module Voting for Implicit Hate Speech DetectionCode0
Compositional Zero-Shot Learning: A SurveyCode0
Class Prototypes based Contrastive Learning for Classifying Multi-Label and Fine-Grained Educational VideosCode0
AwareCompiler: Agentic Context-Aware Compiler Optimization via a Synergistic Knowledge-Data Driven FrameworkCode0
Query-Specific GNN: A Comprehensive Graph Representation Learning Method for Retrieval Augmented GenerationCode0
Diffusion-DFL: Decision-focused Diffusion Models for Stochastic OptimizationCode0
Reproducibility: The New Frontier in AI GovernanceCode0
DiT360: High-Fidelity Panoramic Image Generation via Hybrid TrainingCode0
DTEA: Dynamic Topology Weaving and Instability-Driven Entropic Attenuation for Medical Image SegmentationCode0
Hierarchical Balance Packing: Towards Efficient Supervised Fine-tuning for Long-Context LLMCode0
SAVeD: Learning to Denoise Low-SNR Video for Improved Downstream PerformanceCode0
Learning Diffusion Models with Flexible Representation GuidanceCode0
Making Mathematical Reasoning AdaptiveCode0
Ontolearn-A Framework for Large-scale OWL Class Expression Learning in PythonCode0
Evaluating Line-level Localization Ability of Learning-based Code Vulnerability Detection ModelsCode0
Towards Real-Time Fake News Detection under Evidence ScarcityCode0
Investigating Large Language Models' Linguistic Abilities for Text PreprocessingCode0
Scaling Language-Centric Omnimodal Representation LearningCode0
CodePlot-CoT: Mathematical Visual Reasoning by Thinking with Code-Driven ImagesCode0
Gains: Fine-grained Federated Domain Adaptation in Open SetCode0
VNJPTranslate: A comprehensive pipeline for Vietnamese-Japanese translation0
oMeBench: Towards Robust Benchmarking of LLMs in Organic Mechanism Elucidation and Reasoning0
AVoCaDO: An Audiovisual Video Captioner Driven by Temporal Orchestration0
Rethinking LLM Evaluation: Can We Evaluate LLMs with 200x Less Data?0
VeritasFi: An Adaptable, Multi-tiered RAG Framework for Multi-modal Financial Question AnsweringCode0
Show:102550
← PrevPage 356 of 18972Next →