SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7751-7775 of 177340 papers

Title | Status | Hype
Color Shift Estimation-and-Correction for Image Enhancement | Code | 2
Matcher: Segment Anything with One Shot Using All-Purpose Feature Matching | Code | 2
Dirichlet Flow Matching with Applications to DNA Sequence Design | Code | 2
ViewFusion: Towards Multi-View Consistency via Interpolated Denoising | Code | 2
M3: 3D-Spatial MultiModal Memory | Code | 2
Sparse Instance Activation for Real-Time Instance Segmentation | Code | 2
Transformers are Sample-Efficient World Models | Code | 2
Towards Cross-Modality Modeling for Time Series Analytics: A Survey in the LLM Era | Code | 2
A Judge-free LLM Open-ended Generation Benchmark Based on the Distributional Hypothesis | Code | 2
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition | Code | 2
An Egocentric Vision-Language Model based Portable Real-time Smart Assistant | Code | 2
Fourier Neural Operator for Parametric Partial Differential Equations | Code | 2
LlamaFactory: Unified Efficient Fine-Tuning of 100+ Language Models | Code | 2
UniRL: Self-Improving Unified Multimodal Models via Supervised and Reinforcement Learning | Code | 2
Model soups: averaging weights of multiple fine-tuned models improves accuracy without increasing inference time | Code | 2
GraphMAE: Self-Supervised Masked Graph Autoencoders | Code | 2
PET-MAD, a universal interatomic potential for advanced materials modeling | Code | 2
BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning | Code | 2
BTS: Building Timeseries Dataset: Empowering Large-Scale Building Analytics | Code | 2
MOOSE-Chem: Large Language Models for Rediscovering Unseen Chemistry Scientific Hypotheses | Code | 2
Source-Free Domain Adaptation with Frozen Multimodal Foundation Model | Code | 2
CRA-PCN: Point Cloud Completion with Intra- and Inter-level Cross-Resolution Transformers | Code | 2
TimeLMs: Diachronic Language Models from Twitter | Code | 2
string2string: A Modern Python Library for String-to-String Algorithms | Code | 2
Advancing the Evaluation of Traditional Chinese Language Models: Towards a Comprehensive Benchmark Suite | Code | 2
Page 311 of 7094