SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Showing 13011350 of 659983 papers

TitleStatusHype
Building a Culture of Reproducibility in Academic ResearchCode4
A deep learning framework for efficient pathology image analysisCode4
Story-Adapter: A Training-free Iterative Framework for Long Story VisualizationCode4
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse AttentionCode4
CRUXEval: A Benchmark for Code Reasoning, Understanding and ExecutionCode4
VideoEval-Pro: Robust and Realistic Long Video Understanding EvaluationCode4
CitationMap: A Python Tool to Identify and Visualize Your Google Scholar Citations Around the WorldCode4
Real-time volumetric rendering of dynamic humansCode4
Improving Parallel Program Performance with LLM Optimizers via Agent-System InterfacesCode4
DeepFakes and Beyond: A Survey of Face Manipulation and Fake DetectionCode4
Inductive Moment MatchingCode4
Polysemous codesCode4
SWE-bench: Can Language Models Resolve Real-World GitHub Issues?Code4
RUMI: Rummaging Using Mutual InformationCode4
ChatGPT Outperforms Crowd-Workers for Text-Annotation TasksCode4
A General Theoretical Paradigm to Understand Learning from Human PreferencesCode4
Self-Supervised Geometry-Guided Initialization for Robust Monocular Visual OdometryCode4
MUSE: Machine Unlearning Six-Way Evaluation for Language ModelsCode4
Stock Price Prediction via Discovering Multi-Frequency Trading PatternsCode4
The Model Openness Framework: Promoting Completeness and Openness for Reproducibility, Transparency, and Usability in Artificial IntelligenceCode4
Fast Transformer Decoding: One Write-Head is All You NeedCode4
OpenMathInstruct-2: Accelerating AI for Math with Massive Open-Source Instruction DataCode4
DisCo-DSO: Coupling Discrete and Continuous Optimization for Efficient Generative Design in Hybrid SpacesCode4
Ideas in Inference-time Scaling can Benefit Generative Pre-training AlgorithmsCode4
Tiny-PULP-Dronets: Squeezing Neural Networks for Faster and Lighter Inference on Multi-Tasking Autonomous Nano-DronesCode4
ReARTeR: Retrieval-Augmented Reasoning with Trustworthy Process RewardingCode4
PointVLA: Injecting the 3D World into Vision-Language-Action ModelsCode4
ViViD: Video Virtual Try-on using Diffusion ModelsCode4
GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single ImageCode4
GeneFace: Generalized and High-Fidelity Audio-Driven 3D Talking Face SynthesisCode4
Navigation World ModelsCode4
Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language ModelsCode4
Diffusion-Based Planning for Autonomous Driving with Flexible GuidanceCode4
Efficient LoFTR: Semi-Dense Local Feature Matching with Sparse-Like SpeedCode4
VideoChat: Chat-Centric Video UnderstandingCode4
HaGRIDv2: 1M Images for Static and Dynamic Hand Gesture RecognitionCode4
Contextual Multilingual Spellchecker for User QueriesCode4
Panoptic Feature Pyramid NetworksCode4
Evolution Transformer: In-Context Evolutionary OptimizationCode4
Segment and Track AnythingCode4
SmoothGrad: removing noise by adding noiseCode4
A Comprehensive Survey on 3D Content GenerationCode4
Autoregressive Models in Vision: A SurveyCode4
SynCamMaster: Synchronizing Multi-Camera Video Generation from Diverse ViewpointsCode4
Ray: A Distributed Framework for Emerging AI ApplicationsCode4
RegNet: Self-Regulated Network for Image ClassificationCode4
MVSGaussian: Fast Generalizable Gaussian Splatting Reconstruction from Multi-View StereoCode4
CrossWOZ: A Large-Scale Chinese Cross-Domain Task-Oriented Dialogue DatasetCode4
On the Contribution of Per-ICD Attention Mechanisms to Classify Health Records in Languages with Fewer Resources than EnglishCode4
Dive into Deep LearningCode4
Show:102550
← PrevPage 27 of 13200Next →