SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 2551-2575 of 661,570 papers

Title | Status | Hype
Simplifying Deep Temporal Difference Learning | Code | 3
GFM-RAG: Graph Foundation Model for Retrieval Augmented Generation | Code | 3
XAttention: Block Sparse Attention with Antidiagonal Scoring | Code | 3
4M: Massively Multimodal Masked Modeling | Code | 3
Unifying Flow, Stereo and Depth Estimation | Code | 3
EgoLife: Towards Egocentric Life Assistant | Code | 3
AlpacaFarm: A Simulation Framework for Methods that Learn from Human Feedback | Code | 3
Planning with Diffusion for Flexible Behavior Synthesis | Code | 3
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUs | Code | 3
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs | Code | 3
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation Models | Code | 3
BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and Alignment | Code | 3
Text-guided Sparse Voxel Pruning for Efficient 3D Visual Grounding | Code | 3
Data Engineering for Scaling Language Models to 128K Context | Code | 3
A Multiscale Visualization of Attention in the Transformer Model | Code | 3
Beyond A*: Better Planning with Transformers via Search Dynamics Bootstrapping | Code | 3
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object Detection | Code | 3
Streaming Deep Reinforcement Learning Finally Works | Code | 3
CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding | Code | 3
NeuMan: Neural Human Radiance Field from a Single Video | Code | 3
Agent KB: Leveraging Cross-Domain Experience for Agentic Problem Solving | Code | 3
One Transformer Fits All Distributions in Multi-Modal Diffusion at Scale | Code | 3
Spikformer V2: Join the High Accuracy Club on ImageNet with an SNN Ticket | Code | 3
Exploring Progress in Multivariate Time Series Forecasting: Comprehensive Benchmarking and Heterogeneity Analysis | Code | 3
Bridging Language and Items for Retrieval and Recommendation | Code | 3