The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Showing 19,901–19,950 of 474,278 papers

Title	Date	Tasks	Status	Hype
MetaMetrics-MT: Tuning Meta-Metrics for Machine Translation via Human Preference Calibration	Nov 1, 2024	Bayesian Optimization, Gaussian Processes	Code Available	1
TaxaBind: A Unified Embedding Space for Ecological Applications	Nov 1, 2024	Audio Classification, Cross-Modal Retrieval	Code Available	1
PatternBoost: Constructions in Mathematics with a Little Help from AI	Nov 1, 2024		Code Available	1
Rationale-Guided Retrieval Augmented Generation for Medical Question Answering	Nov 1, 2024	Medical Question Answering, Question Answering	Code Available	1
A Lorentz-Equivariant Transformer for All of the LHC	Nov 1, 2024	All	Code Available	1
Automated Classification of Cell Shapes: A Comparative Evaluation of Shape Descriptors	Nov 1, 2024	Instance Segmentation, Semantic Segmentation	Code Available	1
Contrasting with Symile: Simple Model-Agnostic Representation Learning for Unlimited Modalities	Nov 1, 2024	Contrastive Learning, Representation Learning	Code Available	1
A Survey on Bundle Recommendation: Methods, Applications, and Challenges	Nov 1, 2024	Recommendation Systems, Representation Learning	Code Available	1
Abstracted Shapes as Tokens -- A Generalizable and Interpretable Model for Time-series Classification	Nov 1, 2024	Quantization, Representation Learning	Code Available	1
Identify Backdoored Model in Federated Learning via Individual Unlearning	Nov 1, 2024	Anomaly Detection, Federated Learning	Code Available	1
Constant Acceleration Flow	Nov 1, 2024		Code Available	1
LogiCity: Advancing Neuro-Symbolic AI with Abstract Urban Simulation	Nov 1, 2024	Logical Reasoning, Sequential Decision Making	Code Available	1
KAN-AD: Time Series Anomaly Detection with Kolmogorov-Arnold Networks	Nov 1, 2024	Anomaly Detection, Kolmogorov-Arnold Networks	Code Available	1
Beyond Utility: Evaluating LLM as Recommender	Nov 1, 2024	Position, Re-Ranking	Code Available	1
MIRFLEX: Music Information Retrieval Feature Library for Extraction	Nov 1, 2024	Benchmarking, Information Retrieval	Code Available	1
Nearest Neighbor Normalization Improves Multimodal Retrieval	Oct 31, 2024	Cross-Modal Retrieval, Image Captioning	Code Available	1
COSNet: A Novel Semantic Segmentation Network using Enhanced Boundaries in Cluttered Scenes	Oct 31, 2024	Segmentation, Semantic Segmentation	Code Available	1
Pedestrian Trajectory Prediction with Missing Data: Datasets, Imputation, and Benchmarking	Oct 31, 2024	Benchmarking, Imputation	Code Available	1
Scaling Up Membership Inference: When and How Attacks Succeed on Large Language Models	Oct 31, 2024		Code Available	1
FRoundation: Are Foundation Models Ready for Face Recognition?	Oct 31, 2024	Face Recognition	Code Available	1
Instruction-Tuning Llama-3-8B Excels in City-Scale Mobility Prediction	Oct 31, 2024	Disaster Response, Language Modeling	Code Available	1
EMGBench: Benchmarking Out-of-Distribution Generalization and Adaptation for Electromyography	Oct 31, 2024	Benchmarking, Electromyography (EMG)	Code Available	1
EDT: An Efficient Diffusion Transformer Framework Inspired by Human-like Sketching	Oct 31, 2024	Image Generation, Relation	Code Available	1
Enhancing Motion in Text-to-Video Generation with Decomposed Encoding and Conditioning	Oct 31, 2024	Motion Synthesis, Text-to-Video Generation	Code Available	1
Enhancing Chess Reinforcement Learning with Graph Representation	Oct 31, 2024	Atari Games, Graph Attention	Code Available	1
GlotCC: An Open Broad-Coverage CommonCrawl Corpus and Pipeline for Minority Languages	Oct 31, 2024	Language Identification	Code Available	1
DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion	Oct 31, 2024	Scene Generation	Code Available	1
Constraint Back-translation Improves Complex Instruction Following of Large Language Models	Oct 31, 2024	Instruction Following, Translation	Code Available	1
Reinforcement Learning Gradients as Vitamin for Online Finetuning Decision Transformers	Oct 31, 2024	reinforcement-learning, Reinforcement Learning	Code Available	1
Graph Learning for Numeric Planning	Oct 31, 2024	Graph Learning, Interpretable Machine Learning	Code Available	1
Self-Ensembling Gaussian Splatting for Few-Shot Novel View Synthesis	Oct 31, 2024	3DGS, Novel View Synthesis	Code Available	1
Muscles in Time: Learning to Understand Human Motion by Simulating Muscle Activations	Oct 31, 2024		Code Available	1
Automatically Learning Hybrid Digital Twins of Dynamical Systems	Oct 31, 2024		Code Available	1
Local Superior Soups: A Catalyst for Model Merging in Cross-Silo Federated Learning	Oct 31, 2024	Federated Learning	Code Available	1
RAGraph: A General Retrieval-Augmented Graph Learning Framework	Oct 31, 2024	Graph Classification, Graph Learning	Code Available	1
PSL: Rethinking and Improving Softmax Loss from Pairwise Perspective for Recommendation	Oct 31, 2024	Recommendation Systems	Code Available	1
Can Language Models Perform Robust Reasoning in Chain-of-thought Prompting with Noisy Rationales?	Oct 31, 2024	Denoising, In-Context Learning	Code Available	1
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments	Oct 31, 2024	Quantization	Code Available	1
Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model	Oct 31, 2024	Semantic Segmentation, Specificity	Code Available	1
EZ-HOI: VLM Adaptation via Guided Prompt Learning for Zero-Shot HOI Detection	Oct 31, 2024	Human-Object Interaction Detection, Large Language Model	Code Available	1
SeafloorAI: A Large-scale Vision-Language Dataset for Seafloor Geological Survey	Oct 31, 2024		Code Available	1
Prospective Learning: Learning for a Dynamic Future	Oct 31, 2024	PAC learning	Code Available	1
MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation	Oct 31, 2024	Image Segmentation, Mamba	Code Available	1
AllClear: A Comprehensive Dataset and Benchmark for Cloud Removal in Satellite Imagery	Oct 31, 2024	Benchmarking, Cloud Removal	Code Available	1
LLaMo: Large Language Model-based Molecular Graph Assistant	Oct 31, 2024	Instruction Following, IUPAC Name Prediction	Code Available	1
AlphaTrans: A Neuro-Symbolic Compositional Approach for Repository-Level Code Translation and Validation	Oct 31, 2024	Code Translation, Translation	Code Available	1
Understanding Generalizability of Diffusion Models Requires Rethinking the Hidden Gaussian Structure	Oct 31, 2024	Inductive Bias, Memorization	Code Available	1
Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone Connectivity	Oct 31, 2024	MuJoCo, Q-Learning	Code Available	1
SambaMixer: State of Health Prediction of Li-ion Batteries using Mamba State Space Models	Oct 31, 2024	Li-ion State of Health Estimation, Mamba	Code Available	1
Show Me What and Where has Changed? Question Answering and Grounding for Remote Sensing Change Detection	Oct 31, 2024	Change Detection, Question Answering	Code Available	1