The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 20151–20200 of 474278 papers

Title	Date	Tasks	Status	Hype
Cross-model Control: Improving Multiple Large Language Models in One-time Training	Oct 23, 2024	Instruction FollowingLanguage Modeling	CodeCode Available	1
Value Residual Learning For Alleviating Attention Concentration In Transformers	Oct 23, 2024		CodeCode Available	1
ADEM-VL: Adaptive and Embedded Fusion for Efficient Vision-Language Tuning	Oct 23, 2024	Image CaptioningInstruction Following	CodeCode Available	1
PlantCamo: Plant Camouflage Detection	Oct 23, 2024	object-detectionObject Detection	CodeCode Available	1
VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning	Oct 23, 2024	Question AnsweringSpeech Recognition	CodeCode Available	1
WAGLE: Strategic Weight Attribution for Effective and Modular Unlearning in Large Language Models	Oct 23, 2024		CodeCode Available	1
Entity-based Reinforcement Learning for Autonomous Cyber Defence	Oct 23, 2024	Deep Reinforcement Learningreinforcement-learning	CodeCode Available	1
Federated Transformer: Multi-Party Vertical Federated Learning on Practical Fuzzily Linked Data	Oct 23, 2024	Entity AlignmentFederated Learning	CodeCode Available	1
Graphusion: A RAG Framework for Knowledge Graph Construction with a Global Perspective	Oct 23, 2024	graph constructionKnowledge Graphs	CodeCode Available	1
ROCKET-1: Mastering Open-World Interaction with Visual-Temporal Context Prompting	Oct 23, 2024	Decision MakingMinecraft	CodeCode Available	1
Vehicle Dynamics Parameter Estimation Methodology for Virtual Automated Driving Testing	Oct 23, 2024	parameter estimation	CodeCode Available	1
PyTSC: A Unified Platform for Multi-Agent Reinforcement Learning in Traffic Signal Control	Oct 23, 2024	ManagementMulti-agent Reinforcement Learning	CodeCode Available	1
SpeakGer: A meta-data enriched speech corpus of German state and federal parliaments	Oct 23, 2024	DescriptiveSentiment Analysis	CodeCode Available	1
Neural Cover Selection for Image Steganography	Oct 23, 2024	Image Steganography	CodeCode Available	1
CLEAR: Character Unlearning in Textual and Visual Modalities	Oct 23, 2024	Machine Unlearning	CodeCode Available	1
Physics-informed Neural Networks for Functional Differential Equations: Cylindrical Approximation and Its Convergence Guarantees	Oct 23, 2024		CodeCode Available	1
GraphTeam: Facilitating Large Language Model-based Graph Analysis via Multi-Agent Collaboration	Oct 23, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
Mapping the Media Landscape: Predicting Factual Reporting and Political Bias Through Web Interactions	Oct 23, 2024		CodeCode Available	1
Multi-scale feature reconstruction network for industrial anomaly detection	Oct 23, 2024	Anomaly DetectionUnsupervised Anomaly Detection	CodeCode Available	1
Personalized Instance-based Navigation Toward User-Specific Objects in Realistic Environments	Oct 23, 2024	ObjectVisual Navigation	CodeCode Available	1
Gaze-Assisted Medical Image Segmentation	Oct 23, 2024	DiagnosticImage Segmentation	CodeCode Available	1
Att2CPC: Attention-Guided Lossy Attribute Compression of Point Clouds	Oct 23, 2024	Attribute	CodeCode Available	1
DisenGCD: A Meta Multigraph-assisted Disentangled Graph Learning Framework for Cognitive Diagnosis	Oct 23, 2024	cognitive diagnosisDiagnostic	CodeCode Available	1
Leveraging Skills from Unlabeled Prior Data for Efficient Online Exploration	Oct 23, 2024	Efficient ExplorationReinforcement Learning (RL)	CodeCode Available	1
Scalable Random Feature Latent Variable Models	Oct 23, 2024	Bayesian InferenceComputational Efficiency	CodeCode Available	1
Diffusion Priors for Variational Likelihood Estimation and Image Denoising	Oct 23, 2024	DenoisingImage Denoising	CodeCode Available	1
Spiking Graph Neural Network on Riemannian Manifolds	Oct 23, 2024	Graph Neural Network	CodeCode Available	1
Fire and Smoke Detection with Burning Intensity Representation	Oct 22, 2024	object-detectionObject Detection	CodeCode Available	1
LiNo: Advancing Recursive Residual Decomposition of Linear and Nonlinear Patterns for Robust Time Series Forecasting	Oct 22, 2024	Time SeriesTime Series Forecasting	CodeCode Available	1
Benchmarking Multi-Scene Fire and Smoke Detection	Oct 22, 2024	Benchmarking	CodeCode Available	1
Progressive Compositionality In Text-to-Image Generative Models	Oct 22, 2024	AttributeContrastive Learning	CodeCode Available	1
GALA: Graph Diffusion-based Alignment with Jigsaw for Source-free Domain Adaptation	Oct 22, 2024	Domain AdaptationGRAPH DOMAIN ADAPTATION	CodeCode Available	1
Publishing Neural Networks in Drug Discovery Might Compromise Training Data Privacy	Oct 22, 2024	Drug DiscoveryMolecular Property Prediction	CodeCode Available	1
Scalable Influence and Fact Tracing for Large Language Model Pretraining	Oct 22, 2024	AttributeLanguage Modeling	CodeCode Available	1
EEG-DIF: Early Warning of Epileptic Seizures through Generative Diffusion Model-based Multi-channel EEG Signals Forecasting	Oct 22, 2024	DiagnosticEEG	CodeCode Available	1
SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition	Oct 22, 2024	Action RecognitionAutonomous Driving	CodeCode Available	1
Math Neurosurgery: Isolating Language Models' Math Reasoning Abilities Using Only Forward Passes	Oct 22, 2024	GSM8KLanguage Modeling	CodeCode Available	1
Emphasizing Discriminative Features for Dataset Distillation in Complex Scenarios	Oct 22, 2024	Dataset Distillation	CodeCode Available	1
Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning	Oct 22, 2024	RetrievalRetrieval-augmented Generation	CodeCode Available	1
Aligning Large Language Models via Self-Steering Optimization	Oct 22, 2024		CodeCode Available	1
TopoDiffusionNet: A Topology-aware Diffusion Model	Oct 22, 2024	Denoisingmodel	CodeCode Available	1
Automated Spinal MRI Labelling from Reports Using a Large Language Model	Oct 22, 2024	Language ModelingLanguage Modelling	CodeCode Available	1
Do Vision-Language Models Represent Space and How? Evaluating Spatial Frame of Reference Under Ambiguities	Oct 22, 2024	Spatial Reasoning	CodeCode Available	1
Joint Point Cloud Upsampling and Cleaning with Octree-based CNNs	Oct 22, 2024	point cloud upsampling	CodeCode Available	1
ETHIC: Evaluating Large Language Models on Long-Context Tasks with High Information Coverage	Oct 22, 2024		CodeCode Available	1
Multi-Layer Gaussian Splatting for Immersive Anatomy Visualization	Oct 22, 2024	AnatomyDiagnostic	CodeCode Available	1
Towards Automated Penetration Testing: Introducing LLM Benchmark, Analysis, and Improvements	Oct 22, 2024		CodeCode Available	1
LiNeS: Post-training Layer Scaling Prevents Forgetting and Enhances Model Merging	Oct 22, 2024	Out-of-Distribution Generalization	CodeCode Available	1
Meaning Typed Prompting: A Technique for Efficient, Reliable Structured Output Generation	Oct 22, 2024	Large Language ModelMultimodal Large Language Model	CodeCode Available	1
Non-myopic Generation of Language Models for Reasoning and Planning	Oct 22, 2024	Computational EfficiencyLanguage Modelling	CodeCode Available	1