SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers, 248,104 code links, 4,818 tasks

Papers

Showing 1801-1825 of 177,339 papers

Title | Status | Hype
The WMDP Benchmark: Measuring and Reducing Malicious Use With Unlearning | Code | 4
Aequitas Flow: Streamlining Fair ML Experimentation | Code | 4
Efficient Part-level 3D Object Generation via Dual Volume Packing | Code | 4
Character Region Awareness for Text Detection | Code | 4
Video-LaVIT: Unified Video-Language Pre-training with Decoupled Visual-Motional Tokenization | Code | 4
AnnoLLM: Making Large Language Models to Be Better Crowdsourced Annotators | Code | 4
Advancing Parsimonious Deep Learning Weather Prediction using the HEALPix Mesh | Code | 4
SurveyX: Academic Survey Automation via Large Language Models | Code | 4
Cameras as Rays: Pose Estimation via Ray Diffusion | Code | 4
Scaling Up Biomedical Vision-Language Models: Fine-Tuning, Instruction Tuning, and Multi-Modal Learning | Code | 4
Internet of Agents: Weaving a Web of Heterogeneous Agents for Collaborative Intelligence | Code | 4
V*: Guided Visual Search as a Core Mechanism in Multimodal LLMs | Code | 4
MiraData: A Large-Scale Video Dataset with Long Durations and Structured Captions | Code | 4
Conformalized Physics-Informed Neural Networks | Code | 4
RETSim: Resilient and Efficient Text Similarity | Code | 4
RSAR: Restricted State Angle Resolver and Rotated SAR Benchmark | Code | 4
Data-Prep-Kit: getting your data ready for LLM application development | Code | 4
NeMo-Aligner: Scalable Toolkit for Efficient Model Alignment | Code | 4
Retrieval-Augmented Generation for Large Language Models: A Survey | Code | 4
Learning the Beauty in Songs: Neural Singing Voice Beautifier | Code | 4
HVI: A New color space for Low-light Image Enhancement | Code | 4
Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models | Code | 4
Grokking: Generalization Beyond Overfitting on Small Algorithmic Datasets | Code | 4
Graspness Discovery in Clutters for Fast and Accurate Grasp Detection | Code | 4
OMG: Occlusion-friendly Personalized Multi-concept Generation in Diffusion Models | Code | 4
Page 73 of 7094