SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 35263550 of 177340 papers

TitleStatusHype
Relational Multi-Task Learning: Modeling Relations between Data and TasksCode3
Selective Reflection-Tuning: Student-Selected Data Recycling for LLM Instruction-TuningCode3
Deciphering Oracle Bone Language with Diffusion ModelsCode3
MS-Diffusion: Multi-subject Zero-shot Image Personalization with Layout GuidanceCode3
Robust Watermarking Using Generative Priors Against Image Editing: From Benchmarking to AdvancesCode3
LLaVA-MoD: Making LLaVA Tiny via MoE Knowledge DistillationCode3
Deep Photo Style TransferCode3
Anatomically-Controllable Medical Image Generation with Segmentation-Guided Diffusion ModelsCode3
Generalized Decoding for Pixel, Image, and LanguageCode3
Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive AttacksCode3
SAM Fails to Segment Anything? -- SAM-Adapter: Adapting SAM in Underperformed Scenes: Camouflage, Shadow, Medical Image Segmentation, and MoreCode3
Learning to Speak Fluently in a Foreign Language: Multilingual Speech Synthesis and Cross-Language Voice CloningCode3
StyleGaussian: Instant 3D Style Transfer with Gaussian SplattingCode3
GaMeS: Mesh-Based Adapting and Modification of Gaussian SplattingCode3
REPLUG: Retrieval-Augmented Black-Box Language ModelsCode3
Query-Based Adversarial Prompt GenerationCode3
GRAG: Graph Retrieval-Augmented GenerationCode3
Conformer: Convolution-augmented Transformer for Speech RecognitionCode3
Producing and Leveraging Online Map Uncertainty in Trajectory PredictionCode3
Efficient Inference for Large Reasoning Models: A SurveyCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic PatternsCode3
RF-Diffusion: Radio Signal Generation via Time-Frequency DiffusionCode3
DiLightNet: Fine-grained Lighting Control for Diffusion-based Image GenerationCode3
EXP-Bench: Can AI Conduct AI Research Experiments?Code3
Show:102550
← PrevPage 142 of 7094Next →