The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers · 248,326 code links · 4,818 tasks

Papers


Showing 9,851–9,900 of 661,570 papers

Title	Date	Tasks	Status	Hype
UGPhysics: A Comprehensive Benchmark for Undergraduate Physics Reasoning with Large Language Models	Feb 1, 2025	Math	Code Available	2
Mathematical Introduction to Deep Learning: Methods, Implementations, and Theory	Oct 31, 2023	Deep Learning	Code Available	2
Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements	Oct 14, 2024	State Estimation, Time Series	Code Available	2
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts	Oct 14, 2024	Mixture-of-Experts	Code Available	2
Enhancing Vectorized Map Perception with Historical Rasterized Maps	Sep 1, 2024	Autonomous Driving	Code Available	2
RoboBERT: An End-to-end Multimodal Robotic Manipulation Model	Feb 11, 2025	Data Augmentation	Code Available	2
Baleen: Robust Multi-Hop Reasoning at Scale via Condensed Retrieval	Jan 2, 2021	Claim Verification, Question Answering	Code Available	2
AtomGS: Atomizing Gaussian Splatting for High-Fidelity Radiance Field	May 20, 2024	3DGS, Novel View Synthesis	Code Available	2
Selective Hourglass Mapping for Universal Image Restoration Based on Diffusion Model	Mar 17, 2024	Image Restoration, Zero-shot Generalization	Code Available	2
Integrating Artificial Intelligence and Augmented Reality in Robotic Surgery: An Initial dVRK Study Using a Surgical Education Scenario	Jan 2, 2022		Code Available	2
VMBench: A Benchmark for Perception-Aligned Video Motion Generation	Mar 13, 2025	Motion Generation, Video Generation	Code Available	2
SyntheX: Scaling Up Learning-based X-ray Image Analysis Through In Silico Experiments	Jun 13, 2022	Domain Generalization, Lesion Segmentation	Code Available	2
DiscoSG: Towards Discourse-Level Text Scene Graph Parsing through Iterative Graph Refinement	Jun 18, 2025	Graph Generation, Hallucination	Code Available	2
pyPESTO: A modular and scalable tool for parameter estimation for dynamic models	May 2, 2023	parameter estimation, Uncertainty Quantification	Code Available	2
PyTopo3D: A Python Framework for 3D SIMP-based Topology Optimization	Apr 8, 2025		Code Available	2
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models	May 26, 2025		Code Available	2
Scaling Data Generation in Vision-and-Language Navigation	Jul 28, 2023	Imitation Learning, Vision and Language Navigation	Code Available	2
HLSFactory: A Framework Empowering High-Level Synthesis Datasets for Machine Learning and Beyond	May 1, 2024	Benchmarking, High-Level Synthesis	Code Available	2
Geomstats: A Python Package for Riemannian Geometry in Machine Learning	Apr 7, 2020	BIG-bench Machine Learning, Clustering	Code Available	2
AnyAnomaly: Zero-Shot Customizable Video Anomaly Detection with LVLM	Mar 6, 2025	Anomaly Detection, Language Modeling	Code Available	2
Large Continual Instruction Assistant	Oct 8, 2024	Question Answering, Semantic Similarity	Code Available	2
FedBiOT: LLM Local Fine-tuning in Federated Learning without Full Model	Jun 25, 2024	Federated Learning	Code Available	2
Diffusion Posterior Sampling for General Noisy Inverse Problems	Sep 29, 2022	Deblurring, Retrieval	Code Available	2
A vision-based autonomous UAV inspection framework for unknown tunnel construction sites with dynamic obstacles	Jan 20, 2023	Navigate	Code Available	2
Multitask Prompted Training Enables Zero-Shot Task Generalization	Oct 15, 2021	Benchmarking, Decoder	Code Available	2
An Empirical Study of Data Ability Boundary in LLMs' Math Reasoning	Feb 23, 2024	Arithmetic Reasoning, Automated Theorem Proving	Code Available	2
Affordable Generative Agents	Feb 3, 2024		Code Available	2
SynthSoM: A synthetic intelligent multi-modal sensing-communication dataset for Synesthesia of Machines (SoM)	Jan 13, 2025		Code Available	2
MagicDec: Breaking the Latency-Throughput Tradeoff for Long Context Generation with Speculative Decoding	Aug 20, 2024		Code Available	2
Protein Large Language Models: A Comprehensive Survey	Feb 21, 2025	Articles, Protein Structure Prediction	Code Available	2
Statewide Visual Geolocalization in the Wild	Sep 25, 2024		Code Available	2
Continuous-Time vs. Discrete-Time Vision-based SLAM: A Comparative Study	Feb 17, 2022		Code Available	2
SIU3R: Simultaneous Scene Understanding and 3D Reconstruction Beyond Feature Alignment	Jul 3, 2025	3D Reconstruction, Scene Understanding	Code Available	2
Graph Prompt Learning: A Comprehensive Survey and Beyond	Nov 28, 2023	Prompt Learning, Survey	Code Available	2
G1: Bootstrapping Perception and Reasoning Abilities of Vision-Language Model via Reinforcement Learning	May 19, 2025	Language Modeling, Language Modelling	Code Available	2
Position: What Can Large Language Models Tell Us about Time Series Analysis	Feb 5, 2024	Decision Making, Position	Code Available	2
Cloud2BIM: An open-source automatic pipeline for efficient conversion of large-scale point clouds into IFC format	Mar 14, 2025		Code Available	2
Efficient Online Reinforcement Learning Fine-Tuning Need Not Retain Offline Data	Dec 10, 2024	Offline RL, Reinforcement Learning (RL)	Code Available	2
Continuous Temporal Domain Generalization	May 25, 2024	Domain Generalization	Code Available	2
Map-Relative Pose Regression for Visual Re-Localization	Apr 15, 2024	Novel View Synthesis, regression	Code Available	2
LLM-A*: Large Language Model Enhanced Incremental Heuristic Search on Path Planning	Jun 20, 2024	Autonomous Navigation, Heuristic Search	Code Available	2
Aligning Language Models with Demonstrated Feedback	Jun 2, 2024	Articles, Avg	Code Available	2
A Call for Collaborative Intelligence: Why Human-Agent Systems Should Precede AI Autonomy	Jun 11, 2025		Code Available	2
ClimODE: Climate and Weather Forecasting with Physics-informed Neural ODEs	Apr 15, 2024	Uncertainty Quantification, Weather Forecasting	Code Available	2
Can AI Assistants Know What They Don't Know?	Jan 24, 2024	Math, Open-Domain Question Answering	Code Available	2
WildFusion: Individual Animal Identification with Calibrated Similarity Fusion	Aug 23, 2024		Code Available	2
X-Avatar: Expressive Human Avatars	Mar 8, 2023	3D Human Reconstruction	Code Available	2
SliM-LLM: Salience-Driven Mixed-Precision Quantization for Large Language Models	May 23, 2024	Natural Language Understanding, Quantization	Code Available	2
An L-BFGS-B approach for linear and nonlinear system identification under ℓ1 and group-Lasso regularization	Mar 6, 2024	State Space Models, subspace methods	Code Available	2
Model Quantization and Hardware Acceleration for Vision Transformers: A Comprehensive Survey	May 1, 2024	Quantization	Code Available	2