SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers | 248,104 code links | 4,818 tasks

Papers

Showing 1401-1450 of 177339 papers

Title | Status | Hype
SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models | Code | 4
Mutual Reasoning Makes Smaller LLMs Stronger Problem-Solvers | Code | 4
Data quality dimensions for fair AI | Code | 4
AnyText: Multilingual Visual Text Generation And Editing | Code | 4
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders | Code | 4
BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation | Code | 4
SEED-Data-Edit Technical Report: A Hybrid Dataset for Instructional Image Editing | Code | 4
TDMPBC: Self-Imitative Reinforcement Learning for Humanoid Robot Control | Code | 4
CFG-Zero*: Improved Classifier-Free Guidance for Flow Matching Models | Code | 4
Kubric: A scalable dataset generator | Code | 4
Medical Graph RAG: Towards Safe Medical Large Language Model via Graph Retrieval-Augmented Generation | Code | 4
R^2-Gaussian: Rectifying Radiative Gaussian Splatting for Tomographic Reconstruction | Code | 4
AgentGym: Evolving Large Language Model-based Agents across Diverse Environments | Code | 4
RecBole 2.0: Towards a More Up-to-Date Recommendation Library | Code | 4
ReasonFlux: Hierarchical LLM Reasoning via Scaling Thought Templates | Code | 4
IGEV++: Iterative Multi-range Geometry Encoding Volumes for Stereo Matching | Code | 4
Long Context Transfer from Language to Vision | Code | 4
RealisDance: Equip controllable character animation with realistic hands | Code | 4
NNsight and NDIF: Democratizing Access to Open-Weight Foundation Model Internals | Code | 4
Unified Multimodal Chain-of-Thought Reward Model through Reinforcement Fine-Tuning | Code | 4
TokenFormer: Rethinking Transformer Scaling with Tokenized Model Parameters | Code | 4
A Closer Look at Deep Learning Methods on Tabular Datasets | Code | 4
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking | Code | 4
Magicoder: Empowering Code Generation with OSS-Instruct | Code | 4
Zero-Shot Whole-Body Humanoid Control via Behavioral Foundation Models | Code | 4
Latent Consistency Models: Synthesizing High-Resolution Images with Few-Step Inference | Code | 4
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL | Code | 4
VM-UNet: Vision Mamba UNet for Medical Image Segmentation | Code | 4
FedCP: Separating Feature Information for Personalized Federated Learning via Conditional Policy | Code | 4
Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering | Code | 4
NExT-GPT: Any-to-Any Multimodal LLM | Code | 4
Co-Evolving LLM Coder and Unit Tester via Reinforcement Learning | Code | 4
Eliminating Domain Bias for Federated Learning in Representation Space | Code | 4
MotionClone: Training-Free Motion Cloning for Controllable Video Generation | Code | 4
Recent Advances in Large Langauge Model Benchmarks against Data Contamination: From Static to Dynamic Evaluation | Code | 4
GIM: Learning Generalizable Image Matcher From Internet Videos | Code | 4
Pixel-level and Semantic-level Adjustable Super-resolution: A Dual-LoRA Approach | Code | 4
Pearl: A Production-ready Reinforcement Learning Agent | Code | 4
Towards All-in-One Medical Image Re-Identification | Code | 4
LocAgent: Graph-Guided LLM Agents for Code Localization | Code | 4
GPFL: Simultaneously Learning Global and Personalized Feature Information for Personalized Federated Learning | Code | 4
Data-centric Artificial Intelligence: A Survey | Code | 4
Seg-Zero: Reasoning-Chain Guided Segmentation via Cognitive Reinforcement | Code | 4
KeyPoint Relative Position Encoding for Face Recognition | Code | 4
Diffusion Model-Based Image Editing: A Survey | Code | 4
Planning-oriented Autonomous Driving | Code | 4
Generation of Training Data from HD Maps in the Lanelet2 Framework | Code | 4
NAFSSR: Stereo Image Super-Resolution Using NAFNet | Code | 4
Visual Mamba: A Survey and New Outlooks | Code | 4
Weighted-Reward Preference Optimization for Implicit Model Fusion | Code | 4
Page 29 of 3547