The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 5601–5650 of 661570 papers

Title	Date	Tasks	Status	Hype
Coercing LLMs to do and reveal (almost) anything	Feb 21, 2024		CodeCode Available	2
Beyond Efficiency: A Systematic Survey of Resource-Efficient Large Language Models	Jan 1, 2024	Survey	CodeCode Available	2
Learning-Rate-Free Learning by D-Adaptation	Jan 18, 2023		CodeCode Available	2
Efficient Diffusion Transformer Policies with Mixture of Expert Denoisers for Multitask Learning	Dec 17, 2024	Denoising	CodeCode Available	2
X-Pose: Detecting Any Keypoints	Oct 12, 2023	2D Human Pose Estimation2D Pose Estimation	CodeCode Available	2
A Survey on Detection of LLMs-Generated Content	Oct 24, 2023	Survey	CodeCode Available	2
Predict, Refine, Synthesize: Self-Guiding Diffusion Models for Probabilistic Time Series Forecasting	Jul 21, 2023	ImputationProbabilistic Time Series Forecasting	CodeCode Available	2
COLMAP-Free 3D Gaussian Splatting	Dec 12, 2023	3DGSCamera Pose Estimation	CodeCode Available	2
Delicate Textured Mesh Recovery from NeRF via Adaptive Surface Refinement	Mar 3, 2023	3D ReconstructionNeRF	CodeCode Available	2
Super Monotonic Alignment Search	Sep 12, 2024	CPUGPU	CodeCode Available	2
RaBitQ: Quantizing High-Dimensional Vectors with a Theoretical Error Bound for Approximate Nearest Neighbor Search	May 21, 2024	Quantization	CodeCode Available	2
M^3CoT: A Novel Benchmark for Multi-Domain Multi-step Multi-modal Chain-of-Thought	May 26, 2024		CodeCode Available	2
Hello Again! LLM-powered Personalized Agent for Long-term Dialogue	Jun 9, 2024	Response GenerationRetrieval	CodeCode Available	2
Linguistic-Aware Patch Slimming Framework for Fine-grained Cross-Modal Alignment	Jan 1, 2024	cross-modal alignmentCross-Modal Retrieval	CodeCode Available	2
Diffusion Models and Representation Learning: A Survey	Jun 30, 2024	DenoisingRepresentation Learning	CodeCode Available	2
HybridDepth: Robust Metric Depth Fusion by Leveraging Depth from Focus and Single-Image Priors	Jul 26, 2024	Depth EstimationGPU	CodeCode Available	2
XMainframe: A Large Language Model for Mainframe Modernization	Aug 5, 2024	Code SummarizationLanguage Modeling	CodeCode Available	2
Learning Generative Interactive Environments By Trained Agent Exploration	Sep 10, 2024		CodeCode Available	2
Learning Efficient and Effective Trajectories for Differential Equation-based Image Restoration	Oct 7, 2024	Image RestorationNavigate	CodeCode Available	2
DyCoke: Dynamic Compression of Tokens for Fast Video Large Language Models	Nov 22, 2024		CodeCode Available	2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMs	Dec 1, 2024	Causal Inferencecounterfactual	CodeCode Available	2
2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification	Dec 1, 2024	Computational Efficiencyimage-classification	CodeCode Available	2
B-STaR: Monitoring and Balancing Exploration and Exploitation in Self-Taught Reasoners	Dec 23, 2024	Mathematical Reasoning	CodeCode Available	2
Sparse Autoencoders Learn Monosemantic Features in Vision-Language Models	Apr 3, 2025		CodeCode Available	2
ImageDream: Image-Prompt Multi-view Diffusion for 3D Generation	Dec 2, 2023	3D GenerationObject	CodeCode Available	2
MMLongBench-Doc: Benchmarking Long-context Document Understanding with Visualizations	Jul 1, 2024	Benchmarkingdocument understanding	CodeCode Available	2
Saving 77% of the Parameters in Large Language Models Technical Report	Feb 9, 2025	GPUText Generation	CodeCode Available	2
Tensor field networks: Rotation- and translation-equivariant neural networks for 3D point clouds	Feb 22, 2018	Data AugmentationTranslation	CodeCode Available	2
RARE: Retrieval-Augmented Reasoning Modeling	Mar 30, 2025	HallucinationMemorization	CodeCode Available	2
Torch2Chip: An End-to-end Customizable Deep Neural Network Compression and Deployment Toolkit for Prototype Hardware Accelerator Design	May 2, 2024	Model CompressionNeural Network Compression	CodeCode Available	2
Data Science Education in Undergraduate Physics: Lessons Learned from a Community of Practice	Mar 1, 2024		CodeCode Available	2
Synthesize Diagnose and Optimize: Towards Fine-Grained Vision-Language Understanding	Jan 1, 2024	Attribute	CodeCode Available	2
Adaptive Multi-Agent Reasoning via Automated Workflow Generation	Jul 18, 2025		CodeCode Available	2
JaxMARL: Multi-Agent RL Environments and Algorithms in JAX	Nov 16, 2023	CPUGPU	CodeCode Available	2
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual Scenarios	Mar 7, 2024	Audio-visual Question AnsweringAudio-Visual Question Answering (AVQA)	CodeCode Available	2
LLM-Assisted Light: Leveraging Large Language Model Capabilities for Human-Mimetic Traffic Signal Control in Complex Urban Environments	Mar 13, 2024	Decision MakingLanguage Modeling	CodeCode Available	2
LightGNN: Simple Graph Neural Network for Recommendation	Jan 6, 2025	Computational EfficiencyGraph Neural Network	CodeCode Available	2
Interactive and Explainable Region-guided Radiology Report Generation	Apr 17, 2023	Medical Report Generation	CodeCode Available	2
LLMEmb: Large Language Model Can Be a Good Embedding Generator for Sequential Recommendation	Sep 30, 2024	AttributeCollaborative Filtering	CodeCode Available	2
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act	Oct 10, 2024	BenchmarkingFairness	CodeCode Available	2
Reliable and Efficient Concept Erasure of Text-to-Image Diffusion Models	Jul 17, 2024	BenchmarkingRed Teaming	CodeCode Available	2
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1	Mar 31, 2025	Logical ReasoningMultiple-choice	CodeCode Available	2
Scattertext: a Browser-Based Tool for Visualizing how Corpora Differ	Jul 1, 2017		CodeCode Available	2
ScaleKD: Strong Vision Transformers Could Be Excellent Teachers	Nov 11, 2024	image-classificationImage Classification	CodeCode Available	2
CheXpert Plus: Augmenting a Large Chest X-ray Dataset with Text Radiology Reports, Patient Demographics and Additional Image Formats	May 29, 2024	De-identificationFairness	CodeCode Available	2
NeRF-RPN: A general framework for object detection in NeRFs	Nov 21, 2022	NeRFobject-detection	CodeCode Available	2
HallusionBench: An Advanced Diagnostic Suite for Entangled Language Hallucination and Visual Illusion in Large Vision-Language Models	Oct 23, 2023	DiagnosticHallucination	CodeCode Available	2
Automatic Differentiation-based Full Waveform Inversion with Flexible Workflows	Nov 30, 2024	Dynamic Time Warping	CodeCode Available	2
AirMorph: Topology-Preserving Deep Learning for Pulmonary Airway Analysis	Dec 15, 2024	AnatomyDeep Learning	CodeCode Available	2
Attacks, Defenses and Evaluations for LLM Conversation Safety: A Survey	Feb 14, 2024	Survey	CodeCode Available	2