SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 9026-9050 of 474,278 papers

| Title | Status | Hype |
| --- | --- | --- |
| Linearly-evolved Transformer for Pan-sharpening | Code | 2 |
| Large Language Models for Next Point-of-Interest Recommendation | Code | 2 |
| decoupleQ: Towards 2-bit Post-Training Uniform Quantization via decoupling Parameters into Integer and Floating Points | Code | 2 |
| Introducing v0.5 of the AI Safety Benchmark from MLCommons | Code | 2 |
| Aligning language models with human preferences | Code | 2 |
| GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction | Code | 2 |
| An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Code | 2 |
| Token-level Direct Preference Optimization | Code | 2 |
| Partial Large Kernel CNNs for Efficient Super-Resolution | Code | 2 |
| SPIdepth: Strengthened Pose Information for Self-supervised Monocular Depth Estimation | Code | 2 |
| 6Img-to-3D: Few-Image Large-Scale Outdoor Driving Scene Reconstruction | Code | 2 |
| Physics-informed active learning for accelerating quantum chemical simulations | Code | 2 |
| Point-In-Context: Understanding Point Cloud via In-Context Learning | Code | 2 |
| Transformer tricks: Removing weights for skipless transformers | Code | 2 |
| MolCRAFT: Structure-Based Drug Design in Continuous Parameter Space | Code | 2 |
| ShadowRefiner: Towards Mask-free Shadow Removal via Fast Fourier Transformer | Code | 2 |
| Concept Induction: Analyzing Unstructured Text with High-Level Concepts Using LLooM | Code | 2 |
| Model-free quantification of completeness, uncertainties, and outliers in atomistic machine learning using information theory | Code | 2 |
| Partial-to-Partial Shape Matching with Geometric Consistency | Code | 2 |
| LongEmbed: Extending Embedding Models for Long Context Retrieval | Code | 2 |
| Large Language Models meet Collaborative Filtering: An Efficient All-round LLM-based Recommender System | Code | 2 |
| Variational Bayesian Last Layers | Code | 2 |
| Bias and Unfairness in Information Retrieval Systems: New Challenges in the LLM Era | Code | 2 |
| VBR: A Vision Benchmark in Rome | Code | 2 |
| Behavior Alignment: A New Perspective of Evaluating LLM-based Conversational Recommender Systems | Code | 2 |
Page 362 of 18972