SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 17511775 of 659983 papers

TitleStatusHype
SimPO: Simple Preference Optimization with a Reference-Free RewardCode4
FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical TrainingCode4
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video GenerationCode4
ParkingE2E: Camera-based End-to-end Parking Network, from Images to PlanningCode4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and ChallengesCode4
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different ModalitiesCode4
LESS: Selecting Influential Data for Targeted Instruction TuningCode4
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree SearchCode4
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene SegmentationCode4
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent BehaviorsCode4
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented GenerationCode4
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry RefinerCode4
UniTok: A Unified Tokenizer for Visual Generation and UnderstandingCode4
LangCell: Language-Cell Pre-training for Cell Identity UnderstandingCode4
RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow EstimationCode4
Kwai Keye-VL Technical ReportCode4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMsCode4
Towards One-shot Federated Learning: Advances, Challenges, and Future DirectionsCode4
s3: You Don't Need That Much Data to Train a Search Agent via RLCode4
lmgame-Bench: How Good are LLMs at Playing Games?Code4
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video GenerationCode4
DemoFusion: Democratising High-Resolution Image Generation With No $Code4
Look Once to Hear: Target Speech Hearing with Noisy ExamplesCode4
The All-Seeing Project V2: Towards General Relation Comprehension of the Open WorldCode4
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human InterplayCode4
Show:102550
← PrevPage 71 of 26400Next →