SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers, 248,326 code links, 4,818 tasks

Papers

Showing 1776-1800 of 661,570 papers

Title | Status | Hype
A Foundation Model for Zero-shot Logical Query Reasoning | Code | 4
FLEX: FLEXible Federated Learning Framework | Code | 4
Matching 2D Images in 3D: Metric Relative Pose from Metric Correspondences | Code | 4
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation | Code | 4
Sailor: Open Language Models for South-East Asia | Code | 4
AutoWebGLM: A Large Language Model-based Web Navigating Agent | Code | 4
MiniGPT4-Video: Advancing Multimodal LLMs for Video Understanding with Interleaved Visual-Textual Tokens | Code | 4
ChangeMamba: Remote Sensing Change Detection With Spatiotemporal State Space Model | Code | 4
The largest EEG-based BCI reproducibility study for open science: the MOABB benchmark | Code | 4
SnAG: Scalable and Accurate Video Grounding | Code | 4
Symbolic Prompt Program Search: A Structure-Aware Approach to Efficient Compile-Time Prompt Optimization | Code | 4
CameraCtrl: Enabling Camera Control for Text-to-Video Generation | Code | 4
A Survey on Large Language Model-Based Game Agents | Code | 4
End-to-End Autonomous Driving through V2X Cooperation | Code | 4
PyTorch Frame: A Modular Framework for Multi-Modal Tabular Learning | Code | 4
Dataverse: Open-Source ETL (Extract, Transform, Load) Pipeline for Large Language Models | Code | 4
Croissant: A Metadata Format for ML-Ready Datasets | Code | 4
Tiny Machine Learning: Progress and Futures | Code | 4
JailbreakBench: An Open Robustness Benchmark for Jailbreaking Large Language Models | Code | 4
Long-form factuality in large language models | Code | 4
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text | Code | 4
DN-Splatter: Depth and Normal Priors for Gaussian Splatting and Meshing | Code | 4
Deepfake Generation and Detection: A Benchmark and Survey | Code | 4
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos | Code | 4
Octree-GS: Towards Consistent Real-time Rendering with LOD-Structured 3D Gaussians | Code | 4