SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 59515975 of 177340 papers

TitleStatusHype
Salesforce CausalAI Library: A Fast and Scalable Framework for Causal Analysis of Time Series and Tabular DataCode2
PosterLLaVa: Constructing a Unified Multi-modal Layout Generator with LLMCode2
GraphGPT: Graph Instruction Tuning for Large Language ModelsCode2
Making LLaMA SEE and Draw with SEED TokenizerCode2
Unlocking Feature Visualization for Deeper Networks with MAgnitude Constrained OptimizationCode2
Hokoff: Real Game Dataset from Honor of Kings and its Offline Reinforcement Learning BenchmarksCode2
RGBAvatar: Reduced Gaussian Blendshapes for Online Modeling of Head AvatarsCode2
ReliableSwap: Boosting General Face Swapping Via Reliable SupervisionCode2
Structure-Aware Transformer for Graph Representation LearningCode2
High-Order Control Barrier Functions: Insights and a Truncated Taylor-Based FormulationCode2
Contrastive Learning of Asset Embeddings from Financial Time SeriesCode2
Pix2NeRF: Unsupervised Conditional p-GAN for Single Image to Neural Radiance Fields TranslationCode2
CHiSafetyBench: A Chinese Hierarchical Safety Benchmark for Large Language ModelsCode2
Graph Neural Network-based surrogate model for granular flowsCode2
Chasing Low-Carbon Electricity for Practical and Sustainable DNN TrainingCode2
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuningCode2
GSplatLoc: Ultra-Precise Camera Localization via 3D Gaussian SplattingCode2
Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive DiffusionCode2
SpeechTokenizer: Unified Speech Tokenizer for Speech Large Language ModelsCode2
Video Polyp Segmentation: A Deep Learning PerspectiveCode2
P2P: Part-to-Part Motion Cues Guide a Strong Tracking Framework for LiDAR Point CloudsCode2
PolygloToxicityPrompts: Multilingual Evaluation of Neural Toxic Degeneration in Large Language ModelsCode2
mdCATH: A Large-Scale MD Dataset for Data-Driven Computational BiophysicsCode2
BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task TuningCode2
Melting Pot 2.0Code2
Show:102550
← PrevPage 239 of 7094Next →