SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 24262450 of 177340 papers

TitleStatusHype
From Explicit CoT to Implicit CoT: Learning to Internalize CoT Step by StepCode3
Diffusion Feedback Helps CLIP See BetterCode3
HtmlRAG: HTML is Better Than Plain Text for Modeling Retrieved Knowledge in RAG SystemsCode3
CAX: Cellular Automata Accelerated in JAXCode3
Can Long-Context Language Models Subsume Retrieval, RAG, SQL, and More?Code3
Anything-3D: Towards Single-view Anything Reconstruction in the WildCode3
Unlock Pose Diversity: Accurate and Efficient Implicit Keypoint-based Spatiotemporal Diffusion for Audio-driven Talking PortraitCode3
Simplifying Deep Temporal Difference LearningCode3
GFM-RAG: Graph Foundation Model for Retrieval Augmented GenerationCode3
XAttention: Block Sparse Attention with Antidiagonal ScoringCode3
4M: Massively Multimodal Masked ModelingCode3
Unifying Flow, Stereo and Depth EstimationCode3
EgoLife: Towards Egocentric Life AssistantCode3
AlpacaFarm: A Simulation Framework for Methods that Learn from Human FeedbackCode3
Planning with Diffusion for Flexible Behavior SynthesisCode3
TorchSparse++: Efficient Training and Inference Framework for Sparse Convolution on GPUsCode3
MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMsCode3
C-Eval: A Multi-Level Multi-Discipline Chinese Evaluation Suite for Foundation ModelsCode3
BasicVSR++: Improving Video Super-Resolution with Enhanced Propagation and AlignmentCode3
Text-guided Sparse Voxel Pruning for Efficient 3D Visual GroundingCode3
Data Engineering for Scaling Language Models to 128K ContextCode3
A Multiscale Visualization of Attention in the Transformer ModelCode3
Beyond A*: Better Planning with Transformers via Search Dynamics BootstrappingCode3
RCBEVDet: Radar-camera Fusion in Bird's Eye View for 3D Object DetectionCode3
Streaming Deep Reinforcement Learning Finally WorksCode3
Show:102550
← PrevPage 98 of 7094Next →