SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers | 248,326 code links | 4,818 tasks

Papers

Showing 7451 to 7475 of 474,278 papers

Title | Status | Hype
Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting | Code | 2
Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models | Code | 2
GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling | Code | 2
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization | Code | 2
AutoVerus: Automated Proof Generation for Rust Code | Code | 2
Training Language Models to Self-Correct via Reinforcement Learning | Code | 2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning | Code | 2
All-in-one foundational models learning across quantum chemical levels | Code | 2
Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks | Code | 2
A Controlled Study on Long Context Extension and Generalization in LLMs | Code | 2
TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning | Code | 2
Recent Advances in OOD Detection: Problems and Approaches | Code | 2
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework | Code | 2
Large Language Models are Strong Audio-Visual Speech Recognition Learners | Code | 2
Vista3D: Unravel the 3D Darkside of a Single Image | Code | 2
PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba | Code | 2
Multi-Domain Data Aggregation for Axon and Myelin Segmentation in Histology Images | Code | 2
SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction | Code | 2
A mmWave Software-Defined Array Platform for Wireless Experimentation at 24-29.5 GHz | Code | 2
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models | Code | 2
Multi-Document Grounded Multi-Turn Synthetic Dialog Generation | Code | 2
Advances in APPFL: A Comprehensive and Extensible Federated Learning Framework | Code | 2
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation | Code | 2
Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models | Code | 2
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse | Code | 2