SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 1501-1525 of 177339 papers

Title | Status | Hype
LESS: Selecting Influential Data for Targeted Instruction Tuning | Code | 4
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search | Code | 4
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation | Code | 4
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors | Code | 4
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation | Code | 4
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner | Code | 4
UniTok: A Unified Tokenizer for Visual Generation and Understanding | Code | 4
LangCell: Language-Cell Pre-training for Cell Identity Understanding | Code | 4
RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow Estimation | Code | 4
Kwai Keye-VL Technical Report | Code | 4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Code | 4
Towards One-shot Federated Learning: Advances, Challenges, and Future Directions | Code | 4
s3: You Don't Need That Much Data to Train a Search Agent via RL | Code | 4
lmgame-Bench: How Good are LLMs at Playing Games? | Code | 4
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation | Code | 4
DemoFusion: Democratising High-Resolution Image Generation With No $ | Code | 4
Look Once to Hear: Target Speech Hearing with Noisy Examples | Code | 4
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Code | 4
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay | Code | 4
Eureka: Human-Level Reward Design via Coding Large Language Models | Code | 4
High Fidelity Neural Audio Compression | Code | 4
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis | Code | 4
Qiskit Machine Learning: an open-source library for quantum machine learning tasks at scale on quantum hardware and classical simulators | Code | 4
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis | Code | 4
CoTracker: It is Better to Track Together | Code | 4
Page 61 of 7094