SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers

Showing 3226–3250 of 177340 papers

Title | Status | Hype
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models | Code | 3
Conceptual Framework for Autonomous Cognitive Entities | Code | 3
NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration | Code | 3
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization | Code | 3
Sequential Modeling Enables Scalable Learning for Large Vision Models | Code | 3
UniGS: Unified Representation for Image Generation and Segmentation | Code | 3
Physical Symbolic Optimization | Code | 3
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Code | 3
Universal Time-Series Representation Learning: A Survey | Code | 3
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent | Code | 3
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment | Code | 3
Marabou 2.0: A Versatile Formal Analyzer of Neural Networks | Code | 3
V-IRL: Grounding Virtual Intelligence in Real Life | Code | 3
Sequoia: Scalable, Robust, and Hardware-aware Speculative Decoding | Code | 3
Visual Style Prompting with Swapping Self-Attention | Code | 3
Where Visual Speech Meets Language: VSP-LLM Framework for Efficient and Context-Aware Visual Speech Processing | Code | 3
Web-Bench: A LLM Code Benchmark Based on Web Standards and Frameworks | Code | 3
3DGStream: On-the-Fly Training of 3D Gaussians for Efficient Streaming of Photo-Realistic Free-Viewpoint Videos | Code | 3
PreRoutGNN for Timing Prediction with Order Preserving Partition: Global Circuit Pre-training, Local Delay Learning and Attentional Cell Modeling | Code | 3
DNGaussian: Optimizing Sparse-View 3D Gaussian Radiance Fields with Global-Local Depth Normalization | Code | 3
BAD-Gaussians: Bundle Adjusted Deblur Gaussian Splatting | Code | 3
HAC: Hash-grid Assisted Context for 3D Gaussian Splatting Compression | Code | 3
Benchmarks and Challenges in Pose Estimation for Egocentric Hand Interactions with Objects | Code | 3
Diffusion-4K: Ultra-High-Resolution Image Synthesis with Latent Diffusion Models | Code | 3
Sparse Feature Circuits: Discovering and Editing Interpretable Causal Graphs in Language Models | Code | 3