SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 4251-4300 of 177340 papers

| Title | Status | Hype |
| --- | --- | --- |
| PhysTwin: Physics-Informed Reconstruction and Simulation of Deformable Objects from Videos | Code | 3 |
| Deep Learning and LLM-based Methods Applied to Stellar Lightcurve Classification | Code | 3 |
| Detecting hallucinations in large language models using semantic entropy | Code | 3 |
| The Curse of Multi-Modalities: Evaluating Hallucinations of Large Multimodal Models across Language, Visual, and Audio | Code | 3 |
| BoostTrack: boosting the similarity measure and detection confidence for improved multiple object tracking | Code | 3 |
| SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling | Code | 3 |
| PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360° | Code | 3 |
| VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning | Code | 3 |
| Improving Dictionary Learning with Gated Sparse Autoencoders | Code | 3 |
| Open3D: A Modern Library for 3D Data Processing | Code | 3 |
| ATPrompt: Textual Prompt Learning with Embedded Attributes | Code | 3 |
| N-BEATS: Neural basis expansion analysis for interpretable time series forecasting | Code | 3 |
| Mip-Splatting: Alias-free 3D Gaussian Splatting | Code | 3 |
| Video Prediction Policy: A Generalist Robot Policy with Predictive Visual Representations | Code | 3 |
| A Vision-Language Foundation Model to Enhance Efficiency of Chest X-ray Interpretation | Code | 3 |
| Scaling Rectified Flow Transformers for High-Resolution Image Synthesis | Code | 3 |
| MegaPairs: Massive Data Synthesis For Universal Multimodal Retrieval | Code | 3 |
| BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond | Code | 3 |
| Towards CausalGPT: A Multi-Agent Approach for Faithful Knowledge Reasoning via Promoting Causal Consistency in LLMs | Code | 3 |
| WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion | Code | 3 |
| Block-NeRF: Scalable Large Scene Neural View Synthesis | Code | 3 |
| SemantiCodec: An Ultra Low Bitrate Semantic Audio Codec for General Sound | Code | 3 |
| Vision Transformers for Dense Prediction | Code | 3 |
| RepViT: Revisiting Mobile CNN From ViT Perspective | Code | 3 |
| MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model | Code | 3 |
| CRAG -- Comprehensive RAG Benchmark | Code | 3 |
| Major TOM: Expandable Datasets for Earth Observation | Code | 3 |
| Uni-QSAR: an Auto-ML Tool for Molecular Property Prediction | Code | 3 |
| Optimal Variable Speed Limit Control Strategy on Freeway Segments under Fog Conditions | Code | 3 |
| Towards General-purpose Infrastructure for Protecting Scientific Data Under Study | Code | 3 |
| L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning | Code | 3 |
| Genie: Generative Interactive Environments | Code | 3 |
| Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation | Code | 3 |
| Efficiently Serving LLM Reasoning Programs with Certaindex | Code | 3 |
| SPO: Sequential Monte Carlo Policy Optimisation | Code | 3 |
| AgentStudio: A Toolkit for Building General Virtual Agents | Code | 3 |
| Is Value Learning Really the Main Bottleneck in Offline RL? | Code | 3 |
| DANA: Domain-Aware Neurosymbolic Agents for Consistency and Accuracy | Code | 3 |
| Compact 3D Gaussian Splatting for Static and Dynamic Radiance Fields | Code | 3 |
| MAGiC-SLAM: Multi-Agent Gaussian Globally Consistent SLAM | Code | 3 |
| Gemma Scope: Open Sparse Autoencoders Everywhere All At Once on Gemma 2 | Code | 3 |
| DPLM-2: A Multimodal Diffusion Protein Language Model | Code | 3 |
| Automated Formulaic Alpha Generation for Quantitative Investing using Evolutionary Algorithms | Code | 3 |
| The False Promise of Imitating Proprietary LLMs | Code | 3 |
| Visual Geometry Grounded Deep Structure From Motion | Code | 3 |
| A Foundation Model for the Earth System | Code | 3 |
| DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning | Code | 3 |
| Human-level play in the game of Diplomacy by combining language models with strategic reasoning | Code | 3 |
| Improving Text Embeddings with Large Language Models | Code | 3 |
| Performance Analysis of Open Source Machine Learning Frameworks for Various Parameters in Single-Threaded and Multi-Threaded Modes | Code | 3 |