The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8051–8100 of 661570 papers

Title	Date	Tasks	Status	Hype
LLMs as Hackers: Autonomous Linux Privilege Escalation Attacks	Oct 17, 2023	In-Context Learning	CodeCode Available	2
ChemVLM: Exploring the Power of Multimodal Large Language Models in Chemistry Area	Aug 14, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Text2BIM: Generating Building Models Using a Large Language Model-based Multi-Agent Framework	Aug 15, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
FastCPH: Efficient Survival Analysis for Neural Networks	Aug 21, 2022	Survival Analysis	CodeCode Available	2
C2P-CLIP: Injecting Category Common Prompt in CLIP to Enhance Generalization in Deepfake Detection	Aug 19, 2024	DeepFake DetectionFace Swapping	CodeCode Available	2
PerturBench: Benchmarking Machine Learning Models for Cellular Perturbation Analysis	Aug 20, 2024	Benchmarking	CodeCode Available	2
BearLLM: A Prior Knowledge-Enhanced Bearing Health Management Framework with Unified Vibration Signal Representation	Aug 21, 2024	Fault DiagnosisManagement	CodeCode Available	2
Scalable Autoregressive Image Generation with Mamba	Aug 22, 2024	Image GenerationMamba	CodeCode Available	2
SPIRAL: Self-Play on Zero-Sum Games Incentivizes Reasoning via Multi-Agent Multi-Turn Reinforcement Learning	Jun 30, 2025	MathMulti-agent Reinforcement Learning	CodeCode Available	2
MLR-Copilot: Autonomous Machine Learning Research based on Large Language Models Agents	Aug 26, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
LLMs as Zero-shot Graph Learners: Alignment of GNN Representations with LLM Token Embeddings	Aug 25, 2024	Language ModellingLink Prediction	CodeCode Available	2
Leveraging Hallucinations to Reduce Manual Prompt Dependency in Promptable Segmentation	Aug 27, 2024	Camouflaged Object SegmentationCamouflaged Object Segmentation with a Single Task-generic Prompt	CodeCode Available	2
Stochastic Parameter Decomposition	Jun 25, 2025		CodeCode Available	2
Enhancing Privacy in Federated Learning: Secure Aggregation for Real-World Healthcare Applications	Sep 2, 2024	CPUFederated Learning	CodeCode Available	2
Boosting Vision-Language Models for Histopathology Classification: Predict all at once	Sep 3, 2024	Allzero-shot-classification	CodeCode Available	2
FunctionChat-Bench: Comprehensive Evaluation of Language Models' Generative Capabilities in Korean Tool-use Dialogs	Nov 21, 2024	Relevance Detection	CodeCode Available	2
Make Your ViT-based Multi-view 3D Detectors Faster via Token Compression	Sep 1, 2024	Autonomous Driving	CodeCode Available	2
Towards a Unified View of Preference Learning for Large Language Models: A Survey	Sep 4, 2024		CodeCode Available	2
UniDet3D: Multi-dataset Indoor 3D Object Detection	Sep 6, 2024	3D Object DetectionObject	CodeCode Available	2
A Pair Programming Framework for Code Generation via Multi-Plan Exploration and Feedback-Driven Refinement	Sep 8, 2024	Code Generation	CodeCode Available	2
Assessing SPARQL capabilities of Large Language Models	Sep 9, 2024	BenchmarkingKnowledge Graphs	CodeCode Available	2
DiffusionPen: Towards Controlling the Style of Handwritten Text Generation	Sep 9, 2024	DiversityHTR	CodeCode Available	2
ThermalGaussian: Thermal 3D Gaussian Splatting	Sep 11, 2024	3DGSNeRF	CodeCode Available	2
What is the Relationship between Tensor Factorizations and Circuits (and How Can We Exploit it)?	Sep 12, 2024		CodeCode Available	2
Recent Trends of Multimodal Affective Computing: A Survey from NLP Perspective	Sep 11, 2024	Aspect-Based Sentiment AnalysisEmotion Recognition	CodeCode Available	2
EZIGen: Enhancing zero-shot personalized image generation with precise subject encoding and decoupled guidance	Sep 12, 2024	DenoisingImage Generation	CodeCode Available	2
SSR-Speech: Towards Stable, Safe and Robust Zero-shot Text-based Speech Editing and Synthesis	Sep 11, 2024	DecoderSpeech Synthesis	CodeCode Available	2
Fit and Prune: Fast and Training-free Visual Token Pruning for Multi-modal Large Language Models	Sep 16, 2024		CodeCode Available	2
Large Language Models are Strong Audio-Visual Speech Recognition Learners	Sep 18, 2024	Audio-Visual Speech RecognitionAutomatic Speech Recognition	CodeCode Available	2
HSIGene: A Foundation Model For Hyperspectral Image Generation	Sep 19, 2024	Data AugmentationDenoising	CodeCode Available	2
Small Language Models: Survey, Measurements, and Insights	Sep 24, 2024	BenchmarkingDecoder	CodeCode Available	2
Archon: An Architecture Search Framework for Inference-Time Techniques	Sep 23, 2024	Hyperparameter OptimizationInstruction Following	CodeCode Available	2
Youku-mPLUG: A 10 Million Large-scale Chinese Video-Language Dataset for Pre-training and Benchmarks	Jun 7, 2023	Cross-Modal RetrievalLanguage Modelling	CodeCode Available	2
PointSAM: Pointly-Supervised Segment Anything Model for Remote Sensing Images	Sep 20, 2024	Image SegmentationSemantic Segmentation	CodeCode Available	2
LTNtorch: PyTorch Implementation of Logic Tensor Networks	Sep 24, 2024	Binary ClassificationLogical Reasoning	CodeCode Available	2
Occupancy-Based Dual Contouring	Sep 20, 2024	3D ReconstructionGPU	CodeCode Available	2
Revisiting the Solution of Meta KDD Cup 2024: CRAG	Sep 9, 2024	RAGRetrieval	CodeCode Available	2
Source-Free Domain Adaptation for YOLO Object Detection	Sep 25, 2024	Domain AdaptationModel Selection	CodeCode Available	2
Game4Loc: A UAV Geo-Localization Benchmark from Game Data	Sep 25, 2024	Drone-view target localizationgeo-localization	CodeCode Available	2
E.T. Bench: Towards Open-Ended Event-Level Video-Language Understanding	Sep 26, 2024	Question AnsweringVideo Understanding	CodeCode Available	2
Resolving Multi-Condition Confusion for Finetuning-Free Personalized Image Generation	Sep 26, 2024	Image GenerationObject	CodeCode Available	2
Rethinking the Power of Timestamps for Robust Time Series Forecasting: A Global-Local Fusion Perspective	Sep 27, 2024	Time SeriesTime Series Forecasting	CodeCode Available	2
Melody-Guided Music Generation	Sep 30, 2024	cross-modal alignmentMusic Generation	CodeCode Available	2
Restore Anything with Masks: Leveraging Mask Image Modeling for Blind All-in-One Image Restoration	Sep 28, 2024	AllAttribute	CodeCode Available	2
GSPR: Multimodal Place Recognition Using 3D Gaussian Splatting for Autonomous Driving	Oct 1, 2024	Autonomous DrivingAutonomous Vehicles	CodeCode Available	2
EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control	Oct 1, 2024	Emotional Speech SynthesisSpeech Synthesis	CodeCode Available	2
WAFT: Warping-Alone Field Transforms for Optical Flow	Jun 26, 2025	Optical Flow EstimationZero-shot Generalization	CodeCode Available	2
Selective Aggregation for Low-Rank Adaptation in Federated Learning	Oct 2, 2024	Federated LearningGeneral Knowledge	CodeCode Available	2
StickyLand: Breaking the Linear Presentation of Computational Notebooks	Feb 22, 2022		CodeCode Available	2
Look Twice Before You Answer: Memory-Space Visual Retracing for Hallucination Mitigation in Multimodal Large Language Models	Oct 4, 2024	DecoderHallucination	CodeCode Available	2