The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers


Showing 9,751–9,775 of 474,278 papers

Title	Date	Tasks	Status	Hype
Benchmarking Uncertainty Disentanglement: Specialized Uncertainties for Specialized Tasks	Feb 29, 2024	Benchmarking, Disentanglement	Code Available	2
ViewFusion: Towards Multi-View Consistency via Interpolated Denoising	Feb 29, 2024	Denoising, Image Generation	Code Available	2
Training Generative Image Super-Resolution Models by Wavelet-Domain Losses Enables Better Control of Artifacts	Feb 29, 2024	Image Super-Resolution, Super-Resolution	Code Available	2
GSM-Plus: A Comprehensive Benchmark for Evaluating the Robustness of LLMs as Mathematical Problem Solvers	Feb 29, 2024	GSM8K, Math	Code Available	2
CricaVPR: Cross-image Correlation-aware Representation Learning for Visual Place Recognition	Feb 29, 2024	Representation Learning, Visual Place Recognition	Code Available	2
Learning Commonality, Divergence and Variety for Unsupervised Visible-Infrared Person Re-identification	Feb 29, 2024	Contrastive Learning, Person Re-Identification	Code Available	2
Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation	Feb 28, 2024	Clustering, Diversity	Code Available	2
Pre-training Differentially Private Models with Limited Public Data	Feb 28, 2024	TAG	Code Available	2
Unsupervised Information Refinement Training of Large Language Models for Retrieval-Augmented Generation	Feb 28, 2024	Code Generation, In-Context Learning	Code Available	2
Separate and Conquer: Decoupling Co-occurrence via Decomposition and Representation for Weakly Supervised Semantic Segmentation	Feb 28, 2024	Semantic Segmentation, TAG	Code Available	2
Boosting Neural Representations for Videos with a Conditional Decoder	Feb 28, 2024	Decoder	Code Available	2
ProtLLM: An Interleaved Protein-Language LLM with Protein-as-Word Pre-Training	Feb 28, 2024	In-Context Learning, Language Modeling	Code Available	2
The First Place Solution of WSDM Cup 2024: Leveraging Large Language Models for Conversational Multi-Doc QA	Feb 28, 2024	Natural Language Understanding, Question Answering	Code Available	2
Making Them Ask and Answer: Jailbreaking Large Language Models in Few Queries via Disguise and Reconstruction	Feb 28, 2024	Chatbot, Reconstruction Attack	Code Available	2
SparseLLM: Towards Global Pruning for Pre-trained Language Models	Feb 28, 2024	Computational Efficiency, Problem Decomposition	Code Available	2
Evaluating Quantized Large Language Models	Feb 28, 2024	Mamba, Quantization	Code Available	2
Misalignment-Robust Frequency Distribution Loss for Image Transformation	Feb 28, 2024	Image Enhancement, Style Transfer	Code Available	2
Trends, Applications, and Challenges in Human Attention Modelling	Feb 28, 2024	Language Modelling	Code Available	2
DecisionNCE: Embodied Multimodal Representations via Implicit Preference Learning	Feb 28, 2024	Contrastive Learning, Decision Making	Code Available	2
Arithmetic Control of LLMs for Diverse User Preferences: Directional Preference Alignment with Multi-Objective Rewards	Feb 28, 2024		Code Available	2
RAVEL: Evaluating Interpretability Methods on Disentangling Language Model Representations	Feb 27, 2024	Attribute, Language Modeling	Code Available	2
Unsupervised Zero-Shot Reinforcement Learning via Functional Reward Encodings	Feb 27, 2024	Diversity, Offline RL	Code Available	2
Sinkhorn Distance Minimization for Knowledge Distillation	Feb 27, 2024	Decoder, Knowledge Distillation	Code Available	2
Retrieval is Accurate Generation	Feb 27, 2024	Language Modeling, Language Modelling	Code Available	2
BioT5+: Towards Generalized Biological Understanding with IUPAC Integration and Multi-task Tuning	Feb 27, 2024	Drug Discovery, Forward reaction prediction	Code Available	2