SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 676-700 of 659,983 papers

Title | Status | Hype
Inpaint Anything: Segment Anything Meets Image Inpainting | Code | 5
Extreme Compression of Large Language Models via Additive Quantization | Code | 5
Structural Generalization in Autonomous Cyber Incident Response with Message-Passing Neural Networks and Reinforcement Learning | Code | 5
FlexLLM: A System for Co-Serving Large Language Model Inference and Parameter-Efficient Finetuning | Code | 5
CPsyCoun: A Report-based Multi-turn Dialogue Reconstruction and Evaluation Framework for Chinese Psychological Counseling | Code | 5
Unified Multimodal Understanding and Generation Models: Advances, Challenges, and Opportunities | Code | 5
Towards Better Instruction Following Language Models for Chinese: Investigating the Impact of Training Data and Evaluation | Code | 5
MarS: a Financial Market Simulation Engine Powered by Generative Foundation Model | Code | 5
Mulberry: Empowering MLLM with o1-like Reasoning and Reflection via Collective Monte Carlo Tree Search | Code | 5
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models | Code | 5
Arbitrary-steps Image Super-resolution via Diffusion Inversion | Code | 5
SQUAT: Stateful Quantization-Aware Training in Recurrent Spiking Neural Networks | Code | 5
SymbolicAI: A framework for logic-based approaches combining generative models and solvers | Code | 5
That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design | Code | 5
GAPartManip: A Large-scale Part-centric Dataset for Material-Agnostic Articulated Object Manipulation | Code | 5
Very Low Complexity Speech Synthesis Using Framewise Autoregressive GAN (FARGAN) with Pitch Prediction | Code | 5
A quantum semantic framework for natural language processing | Code | 5
Single-seed generation of Brownian paths and integrals for adaptive and high order SDE solvers | Code | 5
The Path To Autonomous Cyber Defense | Code | 5
CityGaussian: Real-time High-quality Large-Scale Scene Rendering with Gaussians | Code | 5
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions | Code | 5
Re-imagine the Negative Prompt Algorithm: Transform 2D Diffusion into 3D, alleviate Janus problem and Beyond | Code | 5
Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values | Code | 5
Magic Clothing: Controllable Garment-Driven Image Synthesis | Code | 5
MedCare: Advancing Medical LLMs through Decoupling Clinical Alignment and Knowledge Aggregation | Code | 5