The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7451–7475 of 474278 papers

Title	Date	Tasks	Status	Hype
Hier-SLAM: Scaling-up Semantics in SLAM with a Hierarchically Categorical Gaussian Splatting	Sep 19, 2024	Scene UnderstandingSemantic Segmentation	CodeCode Available	2
Linguistic Minimal Pairs Elicit Linguistic Similarity in Large Language Models	Sep 19, 2024	Semantic SimilaritySemantic Textual Similarity	CodeCode Available	2
GStex: Per-Primitive Texturing of 2D Gaussian Splatting for Decoupled Appearance and Geometry Modeling	Sep 19, 2024	Novel View Synthesis	CodeCode Available	2
Scaling Smart: Accelerating Large Language Model Pre-training with Small Model Initialization	Sep 19, 2024	GPULanguage Modeling	CodeCode Available	2
AutoVerus: Automated Proof Generation for Rust Code	Sep 19, 2024	Code GenerationLanguage Modeling	CodeCode Available	2
Training Language Models to Self-Correct via Reinforcement Learning	Sep 19, 2024	HumanEvalMath	CodeCode Available	2
Iteration of Thought: Leveraging Inner Dialogue for Autonomous Large Language Model Reasoning	Sep 19, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
All-in-one foundational models learning across quantum chemical levels	Sep 18, 2024	AllCloud Computing	CodeCode Available	2
Gradient-Driven 3D Segmentation and Affordance Transfer in Gaussian Splatting Using 2D Masks	Sep 18, 2024	3DGSSegmentation	CodeCode Available	2
A Controlled Study on Long Context Extension and Generalization in LLMs	Sep 18, 2024	In-Context Learning	CodeCode Available	2
TART: An Open-Source Tool-Augmented Framework for Explainable Table-based Reasoning	Sep 18, 2024	Fact VerificationQuestion Answering	CodeCode Available	2
Recent Advances in OOD Detection: Problems and Approaches	Sep 18, 2024	Out-of-Distribution DetectionOut of Distribution (OOD) Detection	CodeCode Available	2
RockTrack: A 3D Robust Multi-Camera-Ken Multi-Object Tracking Framework	Sep 18, 2024	3D Multi-Object Tracking3D Object Detection	CodeCode Available	2
Large Language Models are Strong Audio-Visual Speech Recognition Learners	Sep 18, 2024	Audio-Visual Speech RecognitionAutomatic Speech Recognition	CodeCode Available	2
Vista3D: Unravel the 3D Darkside of a Single Image	Sep 18, 2024	3D GenerationDiversity	CodeCode Available	2
PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba	Sep 18, 2024	MambaState Space Models	CodeCode Available	2
Multi-Domain Data Aggregation for Axon and Myelin Segmentation in Histology Images	Sep 17, 2024	Segmentation	CodeCode Available	2
SplatFields: Neural Gaussian Splats for Sparse 3D and 4D Reconstruction	Sep 17, 2024	3DGS4D reconstruction	CodeCode Available	2
A mmWave Software-Defined Array Platform for Wireless Experimentation at 24-29.5 GHz	Sep 17, 2024		CodeCode Available	2
Promptriever: Instruction-Trained Retrievers Can Be Prompted Like Language Models	Sep 17, 2024	Information RetrievalRetrieval	CodeCode Available	2
Multi-Document Grounded Multi-Turn Synthetic Dialog Generation	Sep 17, 2024		CodeCode Available	2
Advances in APPFL: A Comprehensive and Extensible Federated Learning Framework	Sep 17, 2024	BenchmarkingFederated Learning	CodeCode Available	2
BAD: Bidirectional Auto-regressive Diffusion for Text-to-Motion Generation	Sep 17, 2024	Human motion predictionMotion Forecasting	CodeCode Available	2
Guess What I Think: Streamlined EEG-to-Image Generation with Latent Diffusion Models	Sep 17, 2024	Brain Computer InterfaceEEG	CodeCode Available	2
Measuring and Enhancing Trustworthiness of LLMs in RAG through Grounded Attributions and Learning to Refuse	Sep 17, 2024	In-Context LearningRAG	CodeCode Available	2