SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 40514075 of 661570 papers

TitleStatusHype
Evolution of Heuristics: Towards Efficient Automatic Algorithm Design Using Large Language ModelCode3
Text2MDT: Extracting Medical Decision Trees from Medical TextsCode3
LEAP-VO: Long-term Effective Any Point Tracking for Visual OdometryCode3
CharacterEval: A Chinese Benchmark for Role-Playing Conversational Agent EvaluationCode3
EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG SignalsCode3
SEED-Bench: Benchmarking Multimodal Large Language ModelsCode3
Towards Modern Image Manipulation Localization: A Large-Scale Dataset and Novel MethodsCode3
LL3DA: Visual Interactive Instruction Tuning for Omni-3D Understanding Reasoning and PlanningCode3
Intelligent Grimm - Open-ended Visual Storytelling via Latent Diffusion ModelsCode3
Inversion-Free Image Editing with Language-Guided Diffusion ModelsCode3
Towards Automatic Power Battery Detection: New Challenge Benchmark Dataset and BaselineCode3
Exploring Regional Clues in CLIP for Zero-Shot Semantic SegmentationCode3
Stronger Fewer & Superior: Harnessing Vision Foundation Models for Domain Generalized Semantic SegmentationCode3
Improving Text Embeddings with Large Language ModelsCode3
Fairness in Serving Large Language ModelsCode3
EMAGE: Towards Unified Holistic Co-Speech Gesture Generation via Expressive Masked Audio Gesture ModelingCode3
Large Language Models for Generative Information Extraction: A SurveyCode3
TinyGPT-V: Efficient Multimodal Large Language Model via Small BackbonesCode3
MobileVLM : A Fast, Strong and Open Vision Language Assistant for Mobile DevicesCode3
LangSplat: 3D Language Gaussian SplattingCode3
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning LibraryCode3
SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-ScalingCode3
emotion2vec: Self-Supervised Pre-Training for Speech Emotion RepresentationCode3
DriveLM: Driving with Graph Visual Question AnsweringCode3
Generative Multimodal Models are In-Context LearnersCode3
Show:102550
← PrevPage 163 of 26463Next →