SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 3301-3350 of 177,340 papers

Title | Status | Hype
CrossOver: 3D Scene Cross-Modal Alignment | Code | 3
Harnessing Multiple Large Language Models: A Survey on LLM Ensemble | Code | 3
BatteryLife: A Comprehensive Dataset and Benchmark for Battery Life Prediction | Code | 3
GoalFlow: Goal-Driven Flow Matching for Multimodal Trajectories Generation in End-to-End Autonomous Driving | Code | 3
Reinforcement Learning Outperforms Supervised Fine-Tuning: A Case Study on Audio Question Answering | Code | 3
Falcon: A Remote Sensing Vision-Language Foundation Model | Code | 3
A Survey on Latent Reasoning | Code | 3
Vision-Speech Models: Teaching Speech Models to Converse about Images | Code | 3
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency | Code | 3
Vision-to-Music Generation: A Survey | Code | 3
A Survey of Efficient Reasoning for Large Reasoning Models: Language, Multimodality, and Beyond | Code | 3
AI2Agent: An End-to-End Framework for Deploying AI Projects as Autonomous Agents | Code | 3
Perception-R1: Pioneering Perception Policy with Reinforcement Learning | Code | 3
Learning to Reason under Off-Policy Guidance | Code | 3
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation | Code | 3
DS-Agent: Automated Data Science by Empowering Large Language Models with Case-Based Reasoning | Code | 3
Causal-learn: Causal Discovery in Python | Code | 3
Memory Layers at Scale | Code | 3
MoE-Infinity: Efficient MoE Inference on Personal Machines with Sparsity-Aware Expert Cache | Code | 3
Addressing the Abstraction and Reasoning Corpus via Procedural Example Generation | Code | 3
A Unified Framework for Rank-based Evaluation Metrics for Link Prediction in Knowledge Graphs | Code | 3
Emergent World Models and Latent Variable Estimation in Chess-Playing Language Models | Code | 3
GiT: Towards Generalist Vision Transformer through Universal Language Interface | Code | 3
Champion Solution for the WSDM2023 Toloka VQA Challenge | Code | 3
EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars | Code | 3
On Noise Injection in Generative Adversarial Networks | Code | 3
When Large Language Models Meet Vector Databases: A Survey | Code | 3
PyText: A Seamless Path from NLP research to production | Code | 3
Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog | Code | 3
Breaking reCAPTCHAv2 | Code | 3
AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning | Code | 3
BackdoorLLM: A Comprehensive Benchmark for Backdoor Attacks and Defenses on Large Language Models | Code | 3
Deep learning in motion deblurring: current status, benchmarks and future prospects | Code | 3
LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation | Code | 3
RT-1: Robotics Transformer for Real-World Control at Scale | Code | 3
AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs | Code | 3
SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation | Code | 3
Elucidating the Design Space of Multimodal Protein Language Models | Code | 3
Locate 3D: Real-World Object Localization via Self-Supervised Learning in 3D | Code | 3
Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment | Code | 3
Golden Gemini is All You Need: Finding the Sweet Spots for Speaker Verification | Code | 3
CausalML: Python Package for Causal Machine Learning | Code | 3
Stable-Makeup: When Real-World Makeup Transfer Meets Diffusion Model | Code | 3
Evolve Cost-aware Acquisition Functions Using Large Language Models | Code | 3
SlowFast-LLaVA: A Strong Training-Free Baseline for Video Large Language Models | Code | 3
MMed-RAG: Versatile Multimodal RAG System for Medical Vision Language Models | Code | 3
Atomic Convolutional Networks for Predicting Protein-Ligand Binding Affinity | Code | 3
Personalize Segment Anything Model with One Shot | Code | 3
SimpleRecon: 3D Reconstruction Without 3D Convolutions | Code | 3
Cyber-Attack Technique Classification Using Two-Stage Trained Large Language Models | Code | 3