SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 1751–1800 of 659,983 papers

Title | Status | Hype
SimPO: Simple Preference Optimization with a Reference-Free Reward | Code | 4
FedML Parrot: A Scalable Federated Learning System via Heterogeneity-aware Scheduling on Sequential and Hierarchical Training | Code | 4
Uni3C: Unifying Precisely 3D-Enhanced Camera and Human Motion Controls for Video Generation | Code | 4
ParkingE2E: Camera-based End-to-end Parking Network, from Images to Planning | Code | 4
A Survey of State of the Art Large Vision Language Models: Alignment, Benchmark, Evaluations and Challenges | Code | 4
TencentPretrain: A Scalable and Flexible Toolkit for Pre-training Models of Different Modalities | Code | 4
LESS: Selecting Influential Data for Targeted Instruction Tuning | Code | 4
ReST-MCTS*: LLM Self-Training via Process Reward Guided Tree Search | Code | 4
No Time to Train: Empowering Non-Parametric Networks for Few-shot 3D Scene Segmentation | Code | 4
AgentVerse: Facilitating Multi-Agent Collaboration and Exploring Emergent Behaviors | Code | 4
RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation | Code | 4
CraftsMan3D: High-fidelity Mesh Generation with 3D Native Generation and Interactive Geometry Refiner | Code | 4
UniTok: A Unified Tokenizer for Visual Generation and Understanding | Code | 4
LangCell: Language-Cell Pre-training for Cell Identity Understanding | Code | 4
RAPIDFlow: Recurrent Adaptable Pyramids with Iterative Decoding for Efficient Optical Flow Estimation | Code | 4
Kwai Keye-VL Technical Report | Code | 4
Regularizing Hidden States Enables Learning Generalizable Reward Model for LLMs | Code | 4
Towards One-shot Federated Learning: Advances, Challenges, and Future Directions | Code | 4
s3: You Don't Need That Much Data to Train a Search Agent via RL | Code | 4
lmgame-Bench: How Good are LLMs at Playing Games? | Code | 4
OpenS2V-Nexus: A Detailed Benchmark and Million-Scale Dataset for Subject-to-Video Generation | Code | 4
DemoFusion: Democratising High-Resolution Image Generation With No $ | Code | 4
Look Once to Hear: Target Speech Hearing with Noisy Examples | Code | 4
The All-Seeing Project V2: Towards General Relation Comprehension of the Open World | Code | 4
APIGen-MT: Agentic Pipeline for Multi-Turn Data Generation via Simulated Agent-Human Interplay | Code | 4
Eureka: Human-Level Reward Design via Coding Large Language Models | Code | 4
High Fidelity Neural Audio Compression | Code | 4
MIGC++: Advanced Multi-Instance Generation Controller for Image Synthesis | Code | 4
Qiskit Machine Learning: an open-source library for quantum machine learning tasks at scale on quantum hardware and classical simulators | Code | 4
StudioGAN: A Taxonomy and Benchmark of GANs for Image Synthesis | Code | 4
CoTracker: It is Better to Track Together | Code | 4
PromptSource: An Integrated Development Environment and Repository for Natural Language Prompts | Code | 4
Leveraging Speculative Sampling and KV-Cache Optimizations Together for Generative AI using OpenVINO | Code | 4
Context-Aware Drift Detection | Code | 4
A Lightweight Instrument-Agnostic Model for Polyphonic Note Transcription and Multipitch Estimation | Code | 4
Detectron2 Object Detection & Manipulating Images using Cartoonization | Code | 4
GCoNet+: A Stronger Group Collaborative Co-Salient Object Detector | Code | 4
RLlib: Abstractions for Distributed Reinforcement Learning | Code | 4
Vision + Language Applications: A Survey | Code | 4
A Survey on Large Language Model-Based Game Agents | Code | 4
Z-Code++: A Pre-trained Language Model Optimized for Abstractive Summarization | Code | 4
R^3LIVE++: A Robust, Real-time, Radiance reconstruction package with a tightly-coupled LiDAR-Inertial-Visual state Estimator | Code | 4
Recent Advances in RecBole: Extensions with more Practical Considerations | Code | 4
Tune-A-Video: One-Shot Tuning of Image Diffusion Models for Text-to-Video Generation | Code | 4
Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Vision-Language Models | Code | 4
AudioLDM: Text-to-Audio Generation with Latent Diffusion Models | Code | 4
LORE: Lagrangian-Optimized Robust Embeddings for Visual Encoders | Code | 4
Transcoders Beat Sparse Autoencoders for Interpretability | Code | 4
Generalized Neighborhood Attention: Multi-dimensional Sparse Attention at the Speed of Light | Code | 4
Memory-aided Contrastive Consensus Learning for Co-salient Object Detection | Code | 4