The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers, 248,326 code links, 4,818 tasks

Papers

Showing 17951–18000 of 474278 papers

Title	Date	Tasks	Status	Hype
SAMGPT: Text-free Graph Foundation Model for Multi-domain Pre-training and Cross-domain Adaptation	Feb 8, 2025	Domain Adaptation	Code Available	1
OntoTune: Ontology-Driven Self-training for Aligning Large Language Models	Feb 8, 2025	Hypernym Discovery, In-Context Learning	Code Available	1
ShiftySpeech: A Large-Scale Synthetic Speech Dataset with Distribution Shifts	Feb 8, 2025	Benchmarking, Self-Supervised Learning	Code Available	1
Unsupervised Self-Prior Embedding Neural Representation for Iterative Sparse-View CT Reconstruction	Feb 8, 2025	CT Reconstruction	Code Available	1
A Comprehensive Review of Protein Language Models	Feb 8, 2025		Code Available	1
QuantumDNA: A Python Package for Analyzing Quantum Charge Dynamics in DNA and Exploring Its Biological Relevance	Feb 8, 2025		Code Available	1
UniCMs: A Unified Consistency Model For Efficient Multimodal Generation and Understanding	Feb 8, 2025	Denoising, Image Generation	Code Available	1
Graph Neural Networks for Efficient AC Power Flow Prediction in Power Grids	Feb 8, 2025	Management	Code Available	1
APE: Faster and Longer Context-Augmented Generation via Adaptive Parallel Encoding	Feb 8, 2025	RAG	Code Available	1
Bridging Traffic State and Trajectory for Dynamic Road Network and Trajectory Representation Learning	Feb 8, 2025	Graph Attention, Representation Learning	Code Available	1
ParaSurf: A Surface-Based Deep Learning Approach for Paratope-Antigen Interaction Prediction	Feb 8, 2025	Antibody-antigen binding prediction	Code Available	1
ProofWala: Multilingual Proof Data Synthesis and Theorem-Proving	Feb 7, 2025	Automated Theorem Proving	Code Available	1
SurGen: 1020 H&E-stained Whole Slide Images With Survival and Genetic Markers	Feb 7, 2025	Diagnostic, whole slide images	Code Available	1
Swin-MSTP: Swin transformer with multi-scale temporal perception for continuous sign language recognition	Feb 7, 2025	Sign Language Recognition	Code Available	1
Multi-Class Segmentation of Aortic Branches and Zones in Computed Tomography Angiography: The AortaSeg24 Challenge	Feb 7, 2025	Data Augmentation, Segmentation	Code Available	1
Gemstones: A Model Suite for Multi-Faceted Scaling Laws	Feb 7, 2025	Experimental Design, Language Modeling	Code Available	1
Cached Multi-Lora Composition for Multi-Concept Image Generation	Feb 7, 2025	Computational Efficiency, Denoising	Code Available	1
3DMolFormer: A Dual-channel Framework for Structure-based Drug Discovery	Feb 7, 2025	Drug Design, Drug Discovery	Code Available	1
Position-aware Automatic Circuit Discovery	Feb 7, 2025	Language Modeling, Language Modelling	Code Available	1
EigenLoRAx: Recycling Adapters to Find Principal Subspaces for Resource-Efficient Adaptation and Inference	Feb 7, 2025		Code Available	1
Towards LLM Unlearning Resilient to Relearning Attacks: A Sharpness-Aware Minimization Perspective and Beyond	Feb 7, 2025		Code Available	1
Tolerance-Aware Deep Optics	Feb 7, 2025		Code Available	1
LLM-Supported Natural Language to Bash Translation	Feb 7, 2025	In-Context Learning, Translation	Code Available	1
MELON: Provable Defense Against Indirect Prompt Injection Attacks in AI Agents	Feb 7, 2025		Code Available	1
Oracular Programming: A Modular Foundation for Building LLM-Enabled Software	Feb 7, 2025	Navigate	Code Available	1
No Task Left Behind: Isotropic Model Merging with Common and Task-Specific Subspaces	Feb 7, 2025		Code Available	1
SSMLoRA: Enhancing Low-Rank Adaptation with State Space Model	Feb 7, 2025	parameter-efficient fine-tuning	Code Available	1
FlightForge: Advancing UAV Research with Procedural Generation of High-Fidelity Simulation and Integrated Autonomy	Feb 7, 2025	Autonomous Navigation, Collision Avoidance	Code Available	1
Otter: Generating Tests from Issues to Validate SWE Patches	Feb 7, 2025	test driven development	Code Available	1
DuoGuard: A Two-Player RL-Driven Framework for Multilingual LLM Guardrails	Feb 7, 2025	Reinforcement Learning (RL), Synthetic Data Generation	Code Available	1
Chest X-ray Foundation Model with Global and Local Representations Integration	Feb 7, 2025	Mortality Prediction	Code Available	1
Transforming Science with Large Language Models: A Survey on AI-assisted Scientific Discovery, Experimentation, Content Generation, and Evaluation	Feb 7, 2025	scientific discovery, Survey	Code Available	1
nvAgent: Automated Data Visualization from Natural Language via Collaborative Agent Workflow	Feb 7, 2025	Code Generation, Code Translation	Code Available	1
pytopicgram: A library for data extraction and topic modeling from Telegram channels	Feb 7, 2025	Retrieval	Code Available	1
M-IFEval: Multilingual Instruction-Following Evaluation	Feb 7, 2025	Instruction Following	Code Available	1
An Extended Benchmarking of Multi-Agent Reinforcement Learning Algorithms in Complex Fully Cooperative Tasks	Feb 7, 2025	Benchmarking, Multi-agent Reinforcement Learning	Code Available	1
Distillation and Pruning for Scalable Self-Supervised Representation-Based Speech Quality Assessment	Feb 7, 2025		Code Available	1
Wavelet-Assisted Multi-Frequency Attention Network for Pansharpening	Feb 7, 2025	Pansharpening	Code Available	1
Mitigating Unintended Memorization with LoRA in Federated Learning for LLMs	Feb 7, 2025	Federated Learning, Medical Question Answering	Code Available	1
Decoding Human Attentive States from Spatial-temporal EEG Patches Using Transformers	Feb 6, 2025	Brain Computer Interface, EEG	Code Available	1
Great Models Think Alike and this Undermines AI Oversight	Feb 6, 2025	Language Modeling, Language Modelling	Code Available	1
ImprovNet -- Generating Controllable Musical Improvisations with Iterative Corruption Refinement	Feb 6, 2025	Music Generation, Rhythm	Code Available	1
LR0.FM: Low-Res Benchmark and Improving Robustness for Zero-Shot Classification in Foundation Models	Feb 6, 2025	zero-shot-classification, Zero-shot Generalization	Code Available	1
Every Call is Precious: Global Optimization of Black-Box Functions with Unknown Lipschitz Constants	Feb 6, 2025	global-optimization	Code Available	1
Towards Unified Music Emotion Recognition across Dimensional and Categorical Models	Feb 6, 2025	Emotion Recognition, Knowledge Distillation	Code Available	1
Generative Autoregressive Transformers for Model-Agnostic Federated MRI Reconstruction	Feb 6, 2025	Federated Learning, MRI Reconstruction	Code Available	1
Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances	Feb 6, 2025	object-detection, Object Detection	Code Available	1
Division-of-Thoughts: Harnessing Hybrid Language Model Synergy for Efficient On-Device Agents	Feb 6, 2025	Language Modeling, Language Modelling	Code Available	1
PixFoundation: Are We Heading in the Right Direction with Pixel-level Vision Foundation Models?	Feb 6, 2025	Question Answering, Referring Expression	Code Available	1
Variational Control for Guidance in Diffusion Models	Feb 6, 2025	Specificity, Variational Inference	Code Available	1