The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21001–21050 of 474278 papers

Title	Date	Tasks	Status	Hype
QuForge: A Library for Qudits Simulation	Sep 26, 2024	Quantum Machine Learning	CodeCode Available	1
Infer Human's Intentions Before Following Natural Language Instructions	Sep 26, 2024	Instruction Following	CodeCode Available	1
CodonMPNN for Organism Specific and Codon Optimal Inverse Folding	Sep 25, 2024		CodeCode Available	1
Train Once, Deploy Anywhere: Matryoshka Representation Learning for Multimodal Recommendation	Sep 25, 2024	Multimodal RecommendationRecommendation Systems	CodeCode Available	1
Robust Scene Change Detection Using Visual Foundation Models and Cross-Attention Mechanisms	Sep 25, 2024	Change DetectionScene Change Detection	CodeCode Available	1
Topological SLAM in colonoscopies leveraging deep features and topological priors	Sep 25, 2024		CodeCode Available	1
Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction	Sep 25, 2024	DiversityRed Teaming	CodeCode Available	1
HazeSpace2M: A Dataset for Haze Aware Single Image Dehazing	Sep 25, 2024	BenchmarkingImage Dehazing	CodeCode Available	1
Pix2Next: Leveraging Vision Foundation Models for RGB to NIR Image Translation	Sep 25, 2024	DecoderImage Generation	CodeCode Available	1
Generative Object Insertion in Gaussian Splatting with a Multi-View Diffusion Model	Sep 25, 2024	3D ReconstructionObject	CodeCode Available	1
Counterfactual Token Generation in Large Language Models	Sep 25, 2024	Bias Detectioncounterfactual	CodeCode Available	1
Navigating the Maze of Explainable AI: A Systematic Approach to Evaluating Methods and Metrics	Sep 25, 2024	Selection bias	CodeCode Available	1
Moner: Motion Correction in Undersampled Radial MRI with Unsupervised Neural Representation	Sep 25, 2024	Model Optimization	CodeCode Available	1
Beyond Redundancy: Information-aware Unsupervised Multiplex Graph Structure Learning	Sep 25, 2024	Contrastive LearningGraph Learning	CodeCode Available	1
ControlCity: A Multimodal Diffusion Model Based Approach for Accurate Geospatial Data Generation and Urban Morphology Analysis	Sep 25, 2024		CodeCode Available	1
HDFlow: Enhancing LLM Complex Problem-Solving with Hybrid Thinking and Dynamic Workflows	Sep 25, 2024	Computational Efficiency	CodeCode Available	1
BitQ: Tailoring Block Floating Point Precision for Improved DNN Efficiency on Resource-Constrained Devices	Sep 25, 2024	image-classificationImage Classification	CodeCode Available	1
Search for Efficient Large Language Models	Sep 25, 2024	GPUModel Compression	CodeCode Available	1
First Place Solution to the ECCV 2024 BRAVO Challenge: Evaluating Robustness of Vision Foundation Models for Semantic Segmentation	Sep 25, 2024	DecoderSemantic Segmentation	CodeCode Available	1
FineZip : Pushing the Limits of Large Language Models for Practical Lossless Text Compression	Sep 25, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
Dashing for the Golden Snitch: Multi-Drone Time-Optimal Motion Planning with Multi-Agent Reinforcement Learning	Sep 25, 2024	Collision AvoidanceMotion Planning	CodeCode Available	1
CodeInsight: A Curated Dataset of Practical Coding Solutions from Stack Overflow	Sep 25, 2024	Code Generation	CodeCode Available	1
Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification	Sep 25, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
PACE: Marrying generalization in PArameter-efficient fine-tuning with Consistency rEgularization	Sep 25, 2024	8kDomain Adaptation	CodeCode Available	1
CaBRNet, an open-source library for developing and evaluating Case-Based Reasoning Models	Sep 25, 2024	Explainable Models	CodeCode Available	1
Towards General Text-guided Image Synthesis for Customized Multimodal Brain MRI Generation	Sep 25, 2024	Contrastive LearningImage Generation	CodeCode Available	1
Plurals: A System for Guiding LLMs Via Simulated Social Ensembles	Sep 25, 2024		CodeCode Available	1
GraphLoRA: Structure-Aware Contrastive Low-Rank Adaptation for Cross-Graph Transfer Learning	Sep 25, 2024	Transfer Learning	CodeCode Available	1
Prompt Sliders for Fine-Grained Control, Editing and Erasing of Concepts in Diffusion Models	Sep 25, 2024	Image Generation	CodeCode Available	1
Scalable Multi-Robot Informative Path Planning for Target Mapping via Deep Reinforcement Learning	Sep 25, 2024	Collision AvoidanceDeep Reinforcement Learning	CodeCode Available	1
Inline Photometrically Calibrated Hybrid Visual SLAM	Sep 25, 2024		CodeCode Available	1
Semi-LLIE: Semi-supervised Contrastive Learning with Mamba-based Low-light Image Enhancement	Sep 25, 2024	Contrastive LearningImage Enhancement	CodeCode Available	1
DALDA: Data Augmentation Leveraging Diffusion Model and LLM with Adaptive Guidance Scaling	Sep 25, 2024	Data AugmentationDiversity	CodeCode Available	1
Enhancing Nighttime UAV Tracking with Light Distribution Suppression	Sep 25, 2024	Object Trackingparameter estimation	CodeCode Available	1
Training Language Models to Win Debates with Self-Play Improves Judge Accuracy	Sep 25, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
DRIM: Learning Disentangled Representations from Incomplete Multimodal Healthcare Data	Sep 25, 2024	Contrastive LearningPrognosis	CodeCode Available	1
EventHallusion: Diagnosing Event Hallucinations in Video LLMs	Sep 25, 2024	HallucinationInstruction Following	CodeCode Available	1
SDCL: Students Discrepancy-Informed Correction Learning for Semi-supervised Medical Image Segmentation	Sep 25, 2024	Image SegmentationLeft Atrium Segmentation	CodeCode Available	1
Face Forgery Detection with Elaborate Backbone	Sep 25, 2024	DeepFake DetectionFace Generation	CodeCode Available	1
HVT: A Comprehensive Vision Framework for Learning in Non-Euclidean Space	Sep 25, 2024	image-classificationImage Classification	CodeCode Available	1
TiM4Rec: An Efficient Sequential Recommendation Model Based on Time-Aware Structured State Space Duality Model	Sep 24, 2024	Computational EfficiencyMamba	CodeCode Available	1
In-Context Ensemble Learning from Pseudo Labels Improves Video-Language Models for Low-Level Workflow Understanding	Sep 24, 2024	Ensemble LearningIn-Context Learning	CodeCode Available	1
Fine-Tuning is Fine, if Calibrated	Sep 24, 2024		CodeCode Available	1
TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based Models	Sep 24, 2024	ClassificationData Augmentation	CodeCode Available	1
FLEX: Expert-level False-Less EXecution Metric for Reliable Text-to-SQL Benchmark	Sep 24, 2024	Text to SQLText-To-SQL	CodeCode Available	1
PDT: Uav Target Detection Dataset for Pests and Diseases Tree	Sep 24, 2024	object-detectionObject Detection	CodeCode Available	1
MM-CamObj: A Comprehensive Multimodal Dataset for Camouflaged Object Scenarios	Sep 24, 2024	Instruction Following	CodeCode Available	1
Lessons and Insights from a Unifying Study of Parameter-Efficient Fine-Tuning (PEFT) in Visual Recognition	Sep 24, 2024	parameter-efficient fine-tuningTransfer Learning	CodeCode Available	1
Efficient Motion Prediction: A Lightweight & Accurate Trajectory Prediction Model With Fast Training and Inference Speed	Sep 24, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	1
XTRUST: On the Multilingual Trustworthiness of Large Language Models	Sep 24, 2024	EthicsFairness	CodeCode Available	1