The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers


Showing 2,876–2,900 of 177,340 papers

Title	Date	Tasks	Status	Hype	Score
The Hidden Dimensions of LLM Alignment: A Multi-Dimensional Safety Analysis	Feb 13, 2025	Safety Alignment	Code Available	3	5
Unfolding the Headline: Iterative Self-Questioning for News Retrieval and Timeline Summarization	Jan 1, 2025	News Retrieval, Retrieval	Code Available	3	5
Deformable 3D Gaussians for High-Fidelity Monocular Dynamic Scene Reconstruction	Sep 22, 2023	Dynamic Reconstruction, Neural Rendering	Code Available	3	5
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library	Oct 11, 2022	Multi-agent Reinforcement Learning, reinforcement-learning	Code Available	3	5
UNetFormer: A Unified Vision Transformer Model and Pre-Training Framework for 3D Medical Image Segmentation	Apr 1, 2022	Brain Tumor Segmentation, Image Segmentation	Code Available	3	5
GraphNeuralNetworks.jl: Deep Learning on Graphs with Julia	Dec 9, 2024	Deep Learning, GPU	Code Available	3	5
ArCHer: Training Language Model Agents via Hierarchical Multi-Turn RL	Feb 29, 2024	Language Modeling, Language Modelling	Code Available	3	5
A Simple Framework for Open-Vocabulary Segmentation and Detection	Mar 14, 2023	Instance Segmentation, Panoptic Segmentation	Code Available	3	5
LinFusion: 1 GPU, 1 Minute, 16K Image	Sep 3, 2024	16k, Causal Inference	Code Available	3	5
CHESS: Contextual Harnessing for Efficient SQL Synthesis	May 27, 2024	Large Language Model, Privacy Preserving	Code Available	3	5
Flexible and Scalable Deep Learning with MMLSpark	Apr 11, 2018	Deep Learning, Distributed Computing	Code Available	3	5
A Comprehensive Survey of Small Language Models in the Era of Large Language Models: Techniques, Enhancements, Applications, Collaboration with LLMs, and Trustworthiness	Nov 4, 2024	Question Answering, Text Generation	Code Available	3	5
Why Transformers Need Adam: A Hessian Perspective	Feb 26, 2024		Code Available	3	5
LiftFeat: 3D Geometry-Aware Local Feature Matching	May 6, 2025	3D geometry, Depth Estimation	Code Available	3	5
An Empirical Study on Prompt Compression for Large Language Models	Apr 24, 2025	Articles, Math	Code Available	3	5
This Time is Different: An Observability Perspective on Time Series Foundation Models	May 20, 2025	Decoder, Multivariate Time Series Forecasting	Code Available	3	5
Image and Video Tokenization with Binary Spherical Quantization	Jun 11, 2024	Decoder, Image Generation	Code Available	3	5
VoiceStar: Robust Zero-Shot Autoregressive TTS with Duration Control and Extrapolation	May 26, 2025	Decoder, Language Modeling	Code Available	3	5
Distilling LLM Agent into Small Models with Retrieval and Code Tools	May 23, 2025	Action Generation, Domain Generalization	Code Available	3	5
Highly Compressed Tokenizer Can Generate Without Training	Jun 9, 2025	Image Generation, Quantization	Code Available	3	5
When to use Graphs in RAG: A Comprehensive Analysis for Graph Retrieval-Augmented Generation	Jun 6, 2025	RAG, Retrieval	Code Available	3	5
Machine Mental Imagery: Empower Multimodal Reasoning with Latent Visual Tokens	Jun 20, 2025	Image Generation, Multimodal Reasoning	Code Available	3	5
Discrete Diffusion in Large Language and Multimodal Models: A Survey	Jun 16, 2025	Denoising	Code Available	3	5
Efficient and Generalizable Speaker Diarization via Structured Pruning of Self-Supervised Models	Jun 23, 2025	Domain Adaptation, GPU	Code Available	3	5
FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language	Jun 26, 2025	All	Code Available	3	5