SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 28012850 of 659983 papers

TitleStatusHype
The Breeze 2 Herd of Models: Traditional Chinese LLMs Based on Llama with Vision-Aware and Function-Calling CapabilitiesCode3
HAC++: Towards 100X Compression of 3D Gaussian SplattingCode3
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language ModelCode3
How Well Do Supervised 3D Models Transfer to Medical Imaging Tasks?Code3
CatV2TON: Taming Diffusion Transformers for Vision-Based Virtual Try-On with Temporal ConcatenationCode3
The OpenLAM ChallengesCode3
CoverM: Read alignment statistics for metagenomicsCode3
Infrared and Visible Image Fusion: From Data Compatibility to Task AdaptionCode3
Universal Actions for Enhanced Embodied Foundation ModelsCode3
A Survey on LLM Test-Time Compute via Search: Tasks, LLM Profiling, Search Algorithms, and Relevant FrameworksCode3
X-Dyna: Expressive Dynamic Human Image AnimationCode3
Foundations of Large Language ModelsCode3
OmniThink: Expanding Knowledge Boundaries in Machine Writing through ThinkingCode3
DEFOM-Stereo: Depth Foundation Model Based Stereo MatchingCode3
Karatsuba Matrix Multiplication and its Efficient Custom Hardware ImplementationsCode3
FramePainter: Endowing Interactive Image Editing with Video Diffusion PriorsCode3
Do generative video models understand physical principles?Code3
In-situ graph reasoning and knowledge expansion using Graph-PReFLexORCode3
Lifelong Learning of Large Language Model based Agents: A RoadmapCode3
A General Framework for Inference-time Scaling and Steering of Diffusion ModelsCode3
ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing systemCode3
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMsCode3
Valley2: Exploring Multimodal Models with Scalable Vision-Language DesignCode3
BRIGHT: A globally distributed multimodal building damage assessment dataset with very-high-resolution for all-weather disaster responseCode3
Relative Pose Estimation through Affine Corrections of Monocular Depth PriorsCode3
3DIS-FLUX: simple and efficient multi-instance generation with DiT renderingCode3
RadGPT: Constructing 3D Image-Text Tumor DatasetsCode3
GLiREL -- Generalist Model for Zero-Shot Relation ExtractionCode3
LangFair: A Python Package for Assessing Bias and Fairness in Large Language Model Use CasesCode3
Visual Large Language Models for Generalized and Specialized ApplicationsCode3
The Tabular Foundation Model TabPFN Outperforms Specialized Time Series Forecasting Models Based on Simple FeaturesCode3
Depth Any Camera: Zero-Shot Metric Depth Estimation from Any CameraCode3
UAVs Meet LLMs: Overviews and Perspectives Toward Agentic Low-Altitude MobilityCode3
ROLO-SLAM: Rotation-Optimized LiDAR-Only SLAM in Uneven Terrain with Ground VehicleCode3
Cold-Start Recommendation towards the Era of Large Language Models (LLMs): A Comprehensive Survey and RoadmapCode3
JoyGen: Audio-Driven 3D Depth-Aware Talking-Face Video EditingCode3
CryptoMamba: Leveraging State Space Models for Accurate Bitcoin Price PredictionCode3
MuQ: Self-Supervised Music Representation Learning with Mel Residual Vector QuantizationCode3
VISTA3D: A Unified Segmentation Foundation Model For 3D Medical ImagingCode3
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline SummarizationCode3
Dataset Distillation with Neural Characteristic Function: A Minmax PerspectiveCode3
VideoRefer Suite: Advancing Spatial-Temporal Object Understanding with Video LLMCode3
DiC: Rethinking Conv3x3 Designs in Diffusion ModelsCode3
STORM: Spatio-Temporal Reconstruction Model for Large-Scale Outdoor ScenesCode3
Efficiently Serving LLM Reasoning Programs with CertaindexCode3
SM3Det: A Unified Model for Multi-Modal Remote Sensing Object DetectionCode3
VisionReward: Fine-Grained Multi-Dimensional Human Preference Learning for Image and Video GenerationCode3
Towards Visual Grounding: A SurveyCode3
Calibre: Towards Fair and Accurate Personalized Federated Learning with Self-Supervised LearningCode3
DrivingWorld: Constructing World Model for Autonomous Driving via Video GPTCode3
Show:102550
← PrevPage 57 of 13200Next →