The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers


Showing 8,651–8,675 of 177,340 papers

Title	Date	Tasks	Status	Hype	Score
SegVol: Universal and Interactive Volumetric Medical Image Segmentation	Nov 22, 2023	Computed Tomography (CT), Image Segmentation	Code Available	2	5
FinMem: A Performance-Enhanced LLM Trading Agent with Layered Memory and Character Design	Nov 23, 2023	Decision Making, Language Modelling	Code Available	2	5
OccWorld: Learning a 3D Occupancy World Model for Autonomous Driving	Nov 27, 2023	Autonomous Driving	Code Available	2	5
Adapter is All You Need for Tuning Visual Tasks	Nov 25, 2023	All, image-classification	Code Available	2	5
Photo-SLAM: Real-time Simultaneous Localization and Photorealistic Mapping for Monocular, Stereo, and RGB-D Cameras	Nov 28, 2023	Neural Rendering	Code Available	2	5
H2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models	Sep 21, 2023		Code Available	2	5
Achieving Cross Modal Generalization with Multimodal Unified Representation	Sep 21, 2023		Code Available	2	5
M^4: A Unified XAI Benchmark for Faithfulness Evaluation of Feature Attribution Methods across Metrics, Modalities and Models	Sep 26, 2023		Code Available	2	5
Language Models can Solve Computer Tasks	Mar 30, 2023	Language Modelling, Large Language Model	Code Available	2	5
Driving into the Future: Multiview Visual Forecasting and Planning with World Model for Autonomous Driving	Nov 29, 2023	Autonomous Driving, Autonomous Vehicles	Code Available	2	5
Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation	Jun 29, 2023	3D Shape Generation, Decoder	Code Available	2	5
Spike-driven Transformer	Jul 4, 2023		Code Available	2	5
StyleCrafter: Enhancing Stylized Text-to-Video Generation with Style Adapter	Dec 1, 2023	Disentanglement, Text-to-Video Generation	Code Available	2	5
TimeChat: A Time-sensitive Multimodal Large Language Model for Long Video Understanding	Dec 4, 2023	Dense Captioning, Highlight Detection	Code Available	2	5
Aligning and Prompting Everything All at Once for Universal Visual Perception	Dec 4, 2023	All, Object	Code Available	2	5
Let's Think Outside the Box: Exploring Leap-of-Thought in Large Language Models with Creative Humor Generation	Dec 5, 2023	Logical Reasoning	Code Available	2	5
GauHuman: Articulated Gaussian Splatting from Monocular Human Videos	Dec 5, 2023	Generalizable Novel View Synthesis, NeRF	Code Available	2	5
DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving	Jun 18, 2024	Arithmetic Reasoning, Math	Code Available	2	5
Exposing flaws of generative model evaluation metrics and their unfair treatment of diffusion models	Jun 7, 2023	Diversity, Image Generation	Code Available	2	5
Pgx: Hardware-Accelerated Parallel Game Simulators for Reinforcement Learning	Mar 29, 2023	GPU, reinforcement-learning	Code Available	2	5
AnimateZero: Video Diffusion Models are Zero-Shot Image Animators	Dec 6, 2023	Image Animation, Video Generation	Code Available	2	5
Mind2Web: Towards a Generalist Agent for the Web	Jun 9, 2023		Code Available	2	5
ClimateLearn: Benchmarking Machine Learning for Weather and Climate Modeling	Jul 4, 2023	Benchmarking, Weather Forecasting	Code Available	2	5
When Do Transformers Shine in RL? Decoupling Memory from Credit Assignment	Jul 7, 2023	Reinforcement Learning (RL)	Code Available	2	5
UDiffText: A Unified Framework for High-quality Text Synthesis in Arbitrary Images via Character-aware Diffusion Models	Dec 8, 2023	Image Generation, Scene Text Editing	Code Available	2	5