SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7101-7150 of 661,570 papers

Title | Status | Hype
Aligning to Thousands of Preferences via System Message Generalization | Code | 2
Learning Manipulation by Predicting Interaction | Code | 2
HiFuse: Hierarchical Multi-Scale Feature Fusion Network for Medical Image Classification | Code | 2
Turning the Tables: Biased, Imbalanced, Dynamic Tabular Datasets for ML Evaluation | Code | 2
Revealing Single Frame Bias for Video-and-Language Learning | Code | 2
OctFormer: Octree-based Transformers for 3D Point Clouds | Code | 2
MobileFaceSwap: A Lightweight Framework for Video Face Swapping | Code | 2
Vision Language Models in Autonomous Driving: A Survey and Outlook | Code | 2
Architectures of Topological Deep Learning: A Survey of Message-Passing Topological Neural Networks | Code | 2
NeuroClips: Towards High-fidelity and Smooth fMRI-to-Video Reconstruction | Code | 2
Beyond Prompts: Dynamic Conversational Benchmarking of Large Language Models | Code | 2
DeepCore: A Comprehensive Library for Coreset Selection in Deep Learning | Code | 2
A Self-Attention Ansatz for Ab-initio Quantum Chemistry | Code | 2
SmartEdit: Exploring Complex Instruction-based Image Editing with Multimodal Large Language Models | Code | 2
Deep Hierarchical Semantic Segmentation | Code | 2
A Comprehensive Survey on Graph Reduction: Sparsification, Coarsening, and Condensation | Code | 2
EmoTalk: Speech-Driven Emotional Disentanglement for 3D Face Animation | Code | 2
SelfReg-UNet: Self-Regularized UNet for Medical Image Segmentation | Code | 2
YuLan-OneSim: Towards the Next Generation of Social Simulator with Large Language Models | Code | 2
Learning Causally Invariant Representations for Out-of-Distribution Generalization on Graphs | Code | 2
Learning Multi-Agent Communication from Graph Modeling Perspective | Code | 2
WorldPM: Scaling Human Preference Modeling | Code | 2
Turning a CLIP Model into a Scene Text Spotter | Code | 2
Beyond Matryoshka: Revisiting Sparse Coding for Adaptive Representation | Code | 2
Learning Multiple Probabilistic Decisions from Latent World Model in Autonomous Driving | Code | 2
Adapting Pre-Trained Vision Models for Novel Instance Detection and Segmentation | Code | 2
Benchmarking Representations for Speech, Music, and Acoustic Events | Code | 2
FontDiffuser: One-Shot Font Generation via Denoising Diffusion with Multi-Scale Content Aggregation and Style Contrastive Learning | Code | 2
SelfPose3d: Self-Supervised Multi-Person Multi-View 3d Pose Estimation | Code | 2
ZeroGUI: Automating Online GUI Learning at Zero Human Cost | Code | 2
Customizable Perturbation Synthesis for Robust SLAM Benchmarking | Code | 2
Deep Learning Recommendation Model for Personalization and Recommendation Systems | Code | 2
Learning Truncated Causal History Model for Video Restoration | Code | 2
ASAM: Adaptive Sharpness-Aware Minimization for Scale-Invariant Learning of Deep Neural Networks | Code | 2
Enhancing Visual-Language Modality Alignment in Large Vision Language Models via Self-Improvement | Code | 2
Zoology: Measuring and Improving Recall in Efficient Language Models | Code | 2
One for All: Towards Training One Graph Model for All Classification Tasks | Code | 2
Modern Methods in Associative Memory | Code | 2
State-specific protein-ligand complex structure prediction with a multi-scale deep generative model | Code | 2
Spacing Loss for Discovering Novel Categories | Code | 2
The RoboDepth Challenge: Methods and Advancements Towards Robust Depth Estimation | Code | 2
OmniVid: A Generative Framework for Universal Video Understanding | Code | 2
Swarm-SLAM : Sparse Decentralized Collaborative Simultaneous Localization and Mapping Framework for Multi-Robot Systems | Code | 2
Token-Budget-Aware LLM Reasoning | Code | 2
Reference-based Video Super-Resolution Using Multi-Camera Video Triplets | Code | 2
Conformal Prediction for Deep Classifier via Label Ranking | Code | 2
Localizing Task Information for Improved Model Merging and Compression | Code | 2
Towards Bidirectional Human-AI Alignment: A Systematic Review for Clarifications, Framework, and Future Directions | Code | 2
FASTopic: Pretrained Transformer is a Fast, Adaptive, Stable, and Transferable Topic Model | Code | 2
MTR-A: 1st Place Solution for 2022 Waymo Open Dataset Challenge -- Motion Prediction | Code | 2