SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Showing 83768400 of 474278 papers

TitleStatusHype
Compositional Coordination for Multi-Robot Teams with Large Language Models0
Bag of Tricks for Subverting Reasoning-based Safety Guardrails0
From Denoising to Refining: A Corrective Framework for Vision-Language Diffusion Model0
DaMo: Data Mixing Optimizer in Fine-tuning Multimodal LLMs for Mobile Phone AgentsCode0
Text or Pixels? It Takes Half: On the Token Efficiency of Visual Text Inputs in Multimodal LLMs0
Re-evaluating Minimum Bayes Risk Decoding for Automatic Speech RecognitionCode0
BLiSS 1.0: Evaluating Bilingual Learner Competence in Second Language Small Language Models0
Latent Space Factorization in LoRACode0
QoQ-Med: Building Multimodal Clinical Foundation Models with Domain-Aware GRPO TrainingCode0
Do LLMs Really Forget? Evaluating Unlearning with Knowledge Correlation and Confidence AwarenessCode0
Chiron-o1: Igniting Multimodal Large Language Models towards Generalizable Medical Reasoning via Mentor-Intern Collaborative SearchCode0
Q-Palette: Fractional-Bit Quantizers Toward Optimal Bit Allocation for Efficient LLM DeploymentCode0
The Open Syndrome DefinitionCode0
Rethinking Hebbian Principle: Low-Dimensional Structural Projection for Unsupervised LearningCode0
3D-GSRD: 3D Molecular Graph Auto-Encoder with Selective Re-mask DecodingCode0
SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large SpreadsheetsCode0
AegisRF: Adversarial Perturbations Guided with Sensitivity for Protecting Intellectual Property of Neural Radiance FieldsCode0
Lookahead Routing for Large Language ModelsCode0
MedReason-R1: Learning to Reason for CT Diagnosis with Reinforcement Learning and Local ZoomCode0
LyriCAR: A Difficulty-Aware Curriculum Reinforcement Learning Framework For Controllable Lyric TranslationCode0
Latent Diffusion Models with Masked AutoEncodersCode0
SEMPO: Lightweight Foundation Models for Time Series ForecastingCode0
FnRGNN: Distribution-aware Fairness in Graph Neural NetworkCode0
Data-Adaptive Transformed Bilateral Tensor Low-Rank Representation for ClusteringCode0
Filter-Based Reconstruction of Images from EventsCode0
Show:102550
← PrevPage 336 of 18972Next →