The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers · 248,326 code links · 4,818 tasks

Papers

Title	Date	Tasks	Status	Hype
GreenLight-Gym: Reinforcement learning benchmark environment for control of greenhouse production systems	Oct 6, 2024	Numerical Integration, Reinforcement Learning (RL)	Code Available	1
Large Scale MRI Collection and Segmentation of Cirrhotic Liver	Oct 6, 2024	Benchmarking, Diagnostic	Code Available	1
Algorithmic Capabilities of Random Transformers	Oct 6, 2024	Text Generation	Code Available	1
CogDevelop2K: Reversed Cognitive Development in Multimodal Large Language Models	Oct 6, 2024		Code Available	1
MC-CoT: A Modular Collaborative CoT Framework for Zero-shot Medical-VQA with LLM and MLLM Integration	Oct 6, 2024	Medical Visual Question Answering, Question Answering	Code Available	1
Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion	Oct 6, 2024		Code Available	1
Where are we in audio deepfake detection? A systematic analysis over generative and detection models	Oct 6, 2024	Audio Deepfake Detection, Audio Synthesis	Code Available	1
Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF	Oct 6, 2024		Code Available	1
Wrong-of-Thought: An Integrated Reasoning Framework with Multi-Perspective Verification and Wrong Information	Oct 6, 2024		Code Available	1
Towards Secure Tuning: Mitigating Security Risks Arising from Benign Instruction Fine-Tuning	Oct 6, 2024		Code Available	1
From Reading to Compressing: Exploring the Multi-document Reader for Prompt Compression	Oct 5, 2024	Decoder	Code Available	1
Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback	Oct 5, 2024	Data Visualization	Code Available	1
IceCloudNet: 3D reconstruction of cloud ice from Meteosat SEVIRI	Oct 5, 2024	3D Reconstruction	Code Available	1
Improving Temporal Link Prediction via Temporal Walk Matrix Projection	Oct 5, 2024	Computational Efficiency, Graph Neural Network	Code Available	1
DB-SAM: Delving into High Quality Universal Medical Image Segmentation	Oct 5, 2024	Image Segmentation, Medical Image Segmentation	Code Available	1
IV-Mixed Sampler: Leveraging Image Diffusion Models for Enhanced Video Synthesis	Oct 5, 2024	Text-to-Video Generation	Code Available	1
AI as Humanity's Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text	Oct 5, 2024	Text Detection	Code Available	1
LongGenBench: Long-context Generation Benchmark	Oct 5, 2024	Language Modelling, Retrieval	Code Available	1
Beyond Language: Applying MLX Transformers to Engineering Physics	Oct 5, 2024		Code Available	1
Hyperbolic Fine-tuning for Large Language Models	Oct 5, 2024		Code Available	1
Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning	Oct 5, 2024	Benchmarking, Drug Design	Code Available	1
Embrace rejection: Kernel matrix approximation by accelerated randomly pivoted Cholesky	Oct 4, 2024	Computational chemistry	Code Available	1
ECHOPulse: ECG controlled echocardio-grams video generation	Oct 4, 2024	Video Generation	Code Available	1
Autoregressive Moving-average Attention Mechanism for Time Series Forecasting	Oct 4, 2024	Decoder, Time Series	Code Available	1
Entanglement-induced provable and robust quantum learning advantages	Oct 4, 2024		Code Available	1
CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios	Oct 4, 2024	Clinical Knowledge, Diagnostic	Code Available	1
Enriching Music Descriptions with a Finetuned-LLM and Metadata for Text-to-Music Retrieval	Oct 4, 2024	Descriptive, Language Modeling	Code Available	1
MDMP: Multi-modal Diffusion for supervised Motion Predictions with uncertainty	Oct 4, 2024	Motion Forecasting, Motion Generation	Code Available	1
Geometric Representation Condition Improves Equivariant Molecule Generation	Oct 4, 2024	Drug Design, scientific discovery	Code Available	1
LANTERN: Accelerating Visual Autoregressive Models with Relaxed Speculative Decoding	Oct 4, 2024	Image Generation	Code Available	1
Variational Language Concepts for Interpreting Foundation Language Models	Oct 4, 2024		Code Available	1
Robust Barycenter Estimation using Semi-Unbalanced Neural Optimal Transport	Oct 4, 2024		Code Available	1
Cayley Graph Propagation	Oct 4, 2024		Code Available	1
Human-aligned Chess with a Bit of Search	Oct 4, 2024		Code Available	1
Manikin-Recorded Cardiopulmonary Sounds Dataset Using Digital Stethoscope	Oct 4, 2024	Audio Signal Processing, Sound Classification	Code Available	1
Test-time Adaptation for Regression by Subspace Alignment	Oct 4, 2024	regression, Test-time Adaptation	Code Available	1
Aligning LLMs with Individual Preferences via Interaction	Oct 4, 2024		Code Available	1
TrustEMG-Net: Using Representation-Masking Transformer with U-Net for Surface Electromyography Enhancement	Oct 4, 2024	Denoising	Code Available	1
Tadashi: Enabling AI-Based Automated Code Generation With Guaranteed Correctness	Oct 4, 2024	Code Generation	Code Available	1
Gradient-based Jailbreak Images for Multimodal Fusion Models	Oct 4, 2024		Code Available	1
Learning Code Preference via Synthetic Evolution	Oct 4, 2024	Code Generation	Code Available	1
Not All Diffusion Model Activations Have Been Evaluated as Discriminative Features	Oct 4, 2024	All, feature selection	Code Available	1
Redefining Temporal Modeling in Video Diffusion: The Vectorized Timestep Approach	Oct 4, 2024	Image Generation, Image to Video Generation	Code Available	1
Predictive Coding for Decision Transformer	Oct 4, 2024	Decision Making, Reinforcement Learning (RL)	Code Available	1
Can Watermarked LLMs be Identified by Users via Crafted Prompts?	Oct 4, 2024		Code Available	1
Variational Bayes Gaussian Splatting	Oct 4, 2024	Continual Learning, Variational Inference	Code Available	1
Diffusion State-Guided Projected Gradient for Inverse Problems	Oct 4, 2024	Image Restoration	Code Available	1
RFBoost: Understanding and Boosting Deep WiFi Sensing via Physical Data Augmentation	Oct 4, 2024	Data Augmentation	Code Available	1
Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization	Oct 4, 2024	Deep Reinforcement Learning, Quantization	Code Available	1
Real-World Benchmarks Make Membership Inference Attacks Fail on Diffusion Models	Oct 4, 2024		Code Available	1