SOTAVerified

The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Showing 54765500 of 177340 papers

TitleStatusHype
Diffusion Models and Representation Learning: A SurveyCode2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image PriorsCode2
XMainframe: A Large Language Model for Mainframe ModernizationCode2
Learning Generative Interactive Environments By Trained Agent ExplorationCode2
Learning Efficient and Effective Trajectories for Differential Equation-based Image RestorationCode2
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language ModelsCode2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMsCode2
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image ClassificationCode2
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught ReasonersCode2
Sparse Autoencoders Learn Monosemantic Features in Vision-Language ModelsCode2
ImageDream: Image-Prompt Multi-view Diffusion for 3D GenerationCode2
MMLongBench-Doc: Benchmarking Long-context Document Understanding with VisualizationsCode2
Saving 77% of the Parameters in Large Language Models Technical ReportCode2
Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point cloudsCode2
RARE: Retrieval-Augmented Reasoning ModelingCode2
Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator DesignCode2
Data Science Education in Undergraduate Physics: Lessons Learned from a Community of PracticeCode2
Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language UnderstandingCode2
Adaptive Multi-Agent Reasoning via Automated Workflow GenerationCode2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAXCode2
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual ScenariosCode2
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban EnvironmentsCode2
Interactive and Explainable Region-guided Radiology Report GenerationCode2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential RecommendationCode2
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence ActCode2
Show:102550
← PrevPage 220 of 7094Next →