SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers · 248,104 code links · 4,818 tasks

Papers

Showing 2901–2950 of 659,983 papers

Title | Status | Hype
R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning | Code | 3
MiLoRA: Harnessing Minor Singular Components for Parameter-Efficient LLM Finetuning | Code | 3
A Survey on the Optimization of Large Language Model-based Agents | Code | 3
SOAP: Style-Omniscient Animatable Portraits | Code | 3
wgatools: an ultrafast toolkit for manipulating whole genome alignments | Code | 3
Detecting Twenty-thousand Classes using Image-level Supervision | Code | 3
VidTok: A Versatile and Open-Source Video Tokenizer | Code | 3
Transformers Can Do Arithmetic with the Right Embeddings | Code | 3
StableVITON: Learning Semantic Correspondence with Latent Diffusion Model for Virtual Try-On | Code | 3
A General Framework for Inference-time Scaling and Steering of Diffusion Models | Code | 3
Meissonic: Revitalizing Masked Generative Transformers for Efficient High-Resolution Text-to-Image Synthesis | Code | 3
GEM: Empowering MLLM for Grounded ECG Understanding with Time Series and Images | Code | 3
Unified Source-Free Domain Adaptation | Code | 3
A Python library for efficient computation of molecular fingerprints | Code | 3
Long-VITA: Scaling Large Multi-modal Models to 1 Million Tokens with Leading Short-Context Accuracy | Code | 3
SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM | Code | 3
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector Quantization | Code | 3
Language-Codec: Bridging Discrete Codec Representations and Speech Language Models | Code | 3
ROLAND: Graph Learning Framework for Dynamic Graphs | Code | 3
DiC: Rethinking Conv3x3 Designs in Diffusion Models | Code | 3
LongMemEval: Benchmarking Chat Assistants on Long-Term Interactive Memory | Code | 3
BiLLM: Pushing the Limit of Post-Training Quantization for LLMs | Code | 3
MotionGPT: Human Motion as a Foreign Language | Code | 3
HELMET: How to Evaluate Long-Context Language Models Effectively and Thoroughly | Code | 3
AiOS: All-in-One-Stage Expressive Human Pose and Shape Estimation | Code | 3
Efficient Agent Training for Computer Use | Code | 3
Agent Workflow Memory | Code | 3
LaViDa: A Large Diffusion Language Model for Multimodal Understanding | Code | 3
Aquila2 Technical Report | Code | 3
The Flan Collection: Designing Data and Methods for Effective Instruction Tuning | Code | 3
DUFOMap: Efficient Dynamic Awareness Mapping | Code | 3
Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play | Code | 3
UnMarker: A Universal Attack on Defensive Image Watermarking | Code | 3
AlphaEdit: Null-Space Constrained Knowledge Editing for Language Models | Code | 3
PaliGemma 2: A Family of Versatile VLMs for Transfer | Code | 3
AppWorld: A Controllable World of Apps and People for Benchmarking Interactive Coding Agents | Code | 3
5%>100%: Breaking Performance Shackles of Full Fine-Tuning on Visual Recognition Tasks | Code | 3
Nexus-Gen: A Unified Model for Image Understanding, Generation, and Editing | Code | 3
StableIdentity: Inserting Anybody into Anywhere at First Sight | Code | 3
WebGLM: Towards An Efficient Web-Enhanced Question Answering System with Human Preferences | Code | 3
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation | Code | 3
From Sora What We Can See: A Survey of Text-to-Video Generation | Code | 3
Diffusion-TS: Interpretable Diffusion for General Time Series Generation | Code | 3
TapeAgents: a Holistic Framework for Agent Development and Optimization | Code | 3
MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters | Code | 3
DataSentinel: A Game-Theoretic Detection of Prompt Injection Attacks | Code | 3
Adversarial Cheap Talk | Code | 3
Flash3D: Feed-Forward Generalisable 3D Scene Reconstruction from a Single Image | Code | 3
EscherNet: A Generative Model for Scalable View Synthesis | Code | 3
3DIS-FLUX: simple and efficient multi-instance generation with DiT rendering | Code | 3
Page 59 of 13,200