The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8601–8650 of 661570 papers

Title	Date	Tasks	Status	Hype
Aligning language models with human preferences	Apr 18, 2024	Bayesian Inference	CodeCode Available	2
Towards Universal Sequence Representation Learning for Recommender Systems	Jun 13, 2022	Mixture-of-ExpertsRecommendation Systems	CodeCode Available	2
Agent Planning with World Knowledge Model	May 23, 2024	modelWorld Knowledge	CodeCode Available	2
HAKE: A Knowledge Engine Foundation for Human Activity Understanding	Feb 14, 2022	Action RecognitionHuman-Object Interaction Detection	CodeCode Available	2
SfM-Free 3D Gaussian Splatting via Hierarchical Training	Dec 2, 2024	3DGSNovel View Synthesis	CodeCode Available	2
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations	Apr 14, 2025		CodeCode Available	2
CellViT++: Energy-Efficient and Adaptive Cell Segmentation and Classification Using Foundation Models	Jan 9, 2025	Cell SegmentationDataset Generation	CodeCode Available	2
Fast Feedforward Networks	Aug 28, 2023	Mixture-of-Experts	CodeCode Available	2
Practical Membership Inference Attacks against Fine-tuned Large Language Models via Self-prompt Calibration	Nov 10, 2023	Inference AttackMembership Inference Attack	CodeCode Available	2
GradeADreamer: Enhanced Text-to-3D Generation Using Gaussian Splatting and Multi-View Diffusion	Jun 14, 2024	3D GenerationGPU	CodeCode Available	2
Ensembling Prioritized Hybrid Policies for Multi-agent Pathfinding	Mar 12, 2024	Multi-Agent Path FindingMulti-agent Reinforcement Learning	CodeCode Available	2
Flow of Reasoning:Training LLMs for Divergent Problem Solving with Minimal Examples	Jun 9, 2024	ARCDiversity	CodeCode Available	2
FACT: Frame-Action Cross-Attention Temporal Modeling for Efficient Action Segmentation	Jan 1, 2024	Action SegmentationSegmentation	CodeCode Available	2
The Decades Progress on Code-Switching Research in NLP: A Systematic Survey on Trends and Challenges	Dec 19, 2022		CodeCode Available	2
Generative replay with feedback connections as a general strategy for continual learning	Sep 27, 2018	Continual LearningLifelong learning	CodeCode Available	2
DiffCSE: Difference-based Contrastive Learning for Sentence Embeddings	Apr 21, 2022	Contrastive LearningLanguage Modeling	CodeCode Available	2
Smooth Diffusion: Crafting Smooth Latent Spaces in Diffusion Models	Dec 7, 2023		CodeCode Available	2
UniMoMo: Unified Generative Modeling of 3D Molecules for De Novo Binder Design	Mar 25, 2025	Drug DiscoveryLatent Diffusion Model for 3D	CodeCode Available	2
Cluster and Predict Latents Patches for Improved Masked Image Modeling	Feb 12, 2025	Representation Learning	CodeCode Available	2
NTIRE 2025 Challenge on Cross-Domain Few-Shot Object Detection: Methods and Results	Apr 14, 2025	Cross-Domain Few-ShotCross-Domain Few-Shot Object Detection	CodeCode Available	2
Parting with Misconceptions about Learning-based Vehicle Motion Planning	Jun 13, 2023	MisconceptionsMotion Planning	CodeCode Available	2
CVT-Occ: Cost Volume Temporal Fusion for 3D Occupancy Prediction	Sep 20, 2024	Depth EstimationPrediction	CodeCode Available	2
An AI-Ready Multiplex Staining Dataset for Reproducible and Accurate Characterization of Tumor Immune Microenvironment	May 25, 2023	Style Transfer	CodeCode Available	2
A New Frontier of AI: On-Device AI Training and Personalization	Jun 9, 2022	Efficient Neural Networkspeech-recognition	CodeCode Available	2
MetaOpenFOAM 2.0: Large Language Model Driven Chain of Thought for Automating CFD Simulation and Post-Processing	Feb 1, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
Towards Understanding and Boosting Adversarial Transferability from a Distribution Perspective	Oct 9, 2022		CodeCode Available	2
EA-LSS: Edge-aware Lift-splat-shot Framework for 3D BEV Object Detection	Mar 31, 2023	3D Object DetectionDepth Estimation	CodeCode Available	2
Self-playing Adversarial Language Game Enhances LLM Reasoning	Apr 16, 2024		CodeCode Available	2
A Pytorch Reproduction of Masked Generative Image Transformer	Oct 22, 2023	Image Generation	CodeCode Available	2
Unlocking the Capabilities of Thought: A Reasoning Boundary Framework to Quantify and Optimize Chain-of-Thought	Oct 8, 2024		CodeCode Available	2
Unsupervised Post-Training for Multi-Modal LLM Reasoning via GRPO	May 28, 2025	MathReinforcement Learning (RL)	CodeCode Available	2
DiffCLIP: Differential Attention Meets CLIP	Mar 9, 2025	Language ModelingLanguage Modelling	CodeCode Available	2
Compute-Constrained Data Selection	Oct 21, 2024		CodeCode Available	2
VAU-R1: Advancing Video Anomaly Understanding via Reinforcement Fine-Tuning	May 29, 2025	Anomaly DetectionDescriptive	CodeCode Available	2
Neural Rendering for Stereo 3D Reconstruction of Deformable Tissues in Robotic Surgery	Jun 30, 2022	3D ReconstructionNeural Rendering	CodeCode Available	2
MultiCorrupt: A Multi-Modal Robustness Dataset and Benchmark of LiDAR-Camera Fusion for 3D Object Detection	Feb 18, 2024	3D Object DetectionDataset Generation	CodeCode Available	2
Large Language Model Guided Tree-of-Thought	May 15, 2023	Language ModelingLanguage Modelling	CodeCode Available	2
HalOmi: A Manually Annotated Benchmark for Multilingual Hallucination and Omission Detection in Machine Translation	May 19, 2023	HallucinationMachine Translation	CodeCode Available	2
Tamper-Resistant Safeguards for Open-Weight LLMs	Aug 1, 2024	Red TeamingTAR	CodeCode Available	2
Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot Navigation	May 25, 2024	Autonomous NavigationDeep Reinforcement Learning	CodeCode Available	2
SC-GS: Sparse-Controlled Gaussian Splatting for Editable Dynamic Scenes	Dec 4, 2023	Novel View Synthesis	CodeCode Available	2
Fast ODE-based Sampling for Diffusion Models in Around 5 Steps	Nov 30, 2023	Image Generation	CodeCode Available	2
CleanMel: Mel-Spectrogram Enhancement for Improving Both Speech Quality and ASR	Feb 27, 2025	Automatic Speech RecognitionAutomatic Speech Recognition (ASR)	CodeCode Available	2
FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement	Mar 23, 2022	Speech Enhancement	CodeCode Available	2
MLLM-Tool: A Multimodal Large Language Model For Tool Agent Learning	Jan 19, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Accelerated Quality-Diversity through Massive Parallelism	Feb 2, 2022	DiversityGPU	CodeCode Available	2
Anomaly Detection with Conditioned Denoising Diffusion Models	May 25, 2023	Anomaly DetectionDenoising	CodeCode Available	2
VQGAN-CLIP: Open Domain Image Generation and Editing with Natural Language Guidance	Apr 18, 2022	Image Generation	CodeCode Available	2
MetaDrive: Composing Diverse Driving Scenarios for Generalizable Reinforcement Learning	Sep 26, 2021	BenchmarkingDecision Making	CodeCode Available	2
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training	Sep 13, 2024	Quantization	CodeCode Available	2