The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 13151–13200 of 474278 papers

Title	Date	Tasks	Status	Hype
Voyaging into Unbounded Dynamic Scenes from a Single View	Jul 5, 2025	Scene Generation	—Unverified	0
NRSeg: Noise-Resilient Learning for BEV Semantic Segmentation via Driving World Models	Jul 5, 2025	Autonomous DrivingBEV Segmentation	CodeCode Available	0
A Survey on Proactive Defense Strategies Against Misinformation in Large Language Models	Jul 5, 2025	Misinformation	—Unverified	0
Graph Collaborative Attention Network for Link Prediction in Knowledge Graphs	Jul 5, 2025	Graph Neural NetworkInformation Retrieval	CodeCode Available	0
Combining Graph Neural Networks and Mixed Integer Linear Programming for Molecular Inference under the Two-Layered Model	Jul 5, 2025	Molecular Property PredictionProperty Prediction	—Unverified	0
LLMThinkBench: Towards Basic Math Reasoning and Overthinking in Large Language Models	Jul 5, 2025	BenchmarkingGPU	CodeCode Available	1
CTR-Guided Generative Query Suggestion in Conversational Search	Jul 5, 2025	Conversational SearchDiversity	—Unverified	0
Addressing The Devastating Effects Of Single-Task Data Poisoning In Exemplar-Free Continual Learning	Jul 5, 2025	Continual LearningData Poisoning	CodeCode Available	0
Taming Anomalies with Down-Up Sampling Networks: Group Center Preserving Reconstruction for 3D Anomaly Detection	Jul 5, 2025	3D Anomaly DetectionAnomaly Detection	—Unverified	0
Quantum Stochastic Walks for Portfolio Optimization: Theory and Implementation on Financial Networks	Jul 5, 2025	Portfolio Optimization	—Unverified	0
Stochastic Human Motion Prediction with Memory of Action Transition and Action Characteristic	Jul 5, 2025	Human motion predictionmotion prediction	CodeCode Available	0
Temporal Continual Learning with Prior Compensation for Human Motion Prediction	Jul 5, 2025	Continual LearningHuman motion prediction	CodeCode Available	0
Learning Disentangled Stain and Structural Representations for Semi-Supervised Histopathology Segmentation	Jul 5, 2025	PrognosisSegmentation	CodeCode Available	0
skfolio: Portfolio Optimization in Python	Jul 5, 2025	ManagementPortfolio Optimization	CodeCode Available	5
Taylor-Model Physics-Informed Neural Networks (PINNs) for Ordinary Differential Equations	Jul 5, 2025		CodeCode Available	0
PresentAgent: Multimodal Agent for Presentation Video Generation	Jul 5, 2025	text-to-speechText to Speech	CodeCode Available	2
All-atom inverse protein folding through discrete flow matching	Jul 4, 2025		CodeCode Available	0
Open-Vocabulary Object Detection in UAV Imagery: A Review and Future Perspectives	Jul 4, 2025		CodeCode Available	0
Low-Light Enhancement via Encoder-Decoder Network with Illumination Guidance	Jul 4, 2025		CodeCode Available	0
Team RAS in 9th ABAW Competition: Multimodal Compound Expression Recognition Approach	Jul 4, 2025		—Unverified	0
Four Shades of Life Sciences: A Dataset for Disinformation Detection in the Life Sciences	Jul 4, 2025		CodeCode Available	0
Chat2SPaT: A Large Language Model Based Tool for Automating Traffic Signal Control Plan Management	Jul 4, 2025		CodeCode Available	0
SciVid: Cross-Domain Evaluation of Video Models in Scientific Applications	Jul 4, 2025		CodeCode Available	0
ObjectRL: An Object-Oriented Reinforcement Learning Codebase	Jul 4, 2025		CodeCode Available	0
MLASDO: a software tool to detect and explain clinical and omics inconsistencies applied to the Parkinson's Progression Markers Initiative cohort	Jul 4, 2025		CodeCode Available	0
On the rankability of visual embeddings	Jul 4, 2025		CodeCode Available	0
MGSfM: Multi-Camera Geometry Driven Global Structure-from-Motion	Jul 4, 2025		CodeCode Available	0
Masked Temporal Interpolation Diffusion for Procedure Planning in Instructional Videos	Jul 4, 2025		CodeCode Available	0
Task-Specific Generative Dataset Distillation with Difficulty-Guided Sampling	Jul 4, 2025	Dataset Distillation	—Unverified	0
GRAFT: A Graph-based Flow-aware Agentic Framework for Document-level Machine Translation	Jul 4, 2025	Document Level Machine TranslationDocument Translation	—Unverified	0
Agent-Based Detection and Resolution of Incompleteness and Ambiguity in Interactions with Large Language Models	Jul 4, 2025	Question Answering	—Unverified	0
LTLCrit: A Temporal Logic-based LLM Critic for Safe and Efficient Embodied Agents	Jul 4, 2025	Decision MakingFormal Logic	—Unverified	0
Recon, Answer, Verify: Agents in Search of Truth	Jul 4, 2025	Fact Checking	—Unverified	0
Dyn-O: Building Structured World Models with Object-Centric Representations	Jul 4, 2025	Object	—Unverified	0
Be the Change You Want to See: Revisiting Remote Sensing Change Detection Practices	Jul 4, 2025	Change Detection	CodeCode Available	1
LRM-1B: Towards Large Routing Model	Jul 4, 2025	Combinatorial Optimizationmodel	—Unverified	0
Transforming Calabi-Yau Constructions: Generating New Calabi-Yau Manifolds with Transformers	Jul 4, 2025	Language ModelingLanguage Modelling	—Unverified	0
EvoAgentX: An Automated Framework for Evolving Agentic Workflows	Jul 4, 2025	Code GenerationMath	CodeCode Available	7
AI-VaxGuide: An Agentic RAG-Based LLM for Vaccination Decisions	Jul 4, 2025	Question AnsweringRAG	—Unverified	0
CoreCodeBench: A Configurable Multi-Scenario Repository-Level Benchmark	Jul 4, 2025	Bug fixingCode Generation	CodeCode Available	1
GDGB: A Benchmark for Generative Dynamic Text-Attributed Graph Learning	Jul 4, 2025	BenchmarkingGraph Generation	CodeCode Available	2
Behaviour Space Analysis of LLM-driven Meta-heuristic Discovery	Jul 4, 2025	Large Language Model	—Unverified	0
Bridging Domain Generalization to Multimodal Domain Generalization via Unified Representations	Jul 4, 2025	DisentanglementDomain Generalization	—Unverified	0
Outdoor Monocular SLAM with Global Scale-Consistent 3D Gaussian Pointmaps	Jul 4, 2025	3DGSNovel View Synthesis	—Unverified	0
Evaluating the Evaluators: Trust in Adversarial Robustness Tests	Jul 4, 2025	Adversarial Robustness	—Unverified	0
Helping CLIP See Both the Forest and the Trees: A Decomposition and Description Approach	Jul 4, 2025	AttributeContrastive Learning	—Unverified	0
SAMed-2: Selective Memory Enhanced Medical Segment Anything Model	Jul 4, 2025	Continual LearningImage Segmentation	CodeCode Available	1
Causal-SAM-LLM: Large Language Models as Causal Reasoners for Robust Medical Segmentation	Jul 4, 2025	AnatomyDisentanglement	—Unverified	0
Communication Efficient, Differentially Private Distributed Optimization using Correlation-Aware Sketching	Jul 4, 2025	Distributed OptimizationFederated Learning	—Unverified	0
Large Language Models for Combinatorial Optimization: A Systematic Review	Jul 4, 2025	Combinatorial Optimization	—Unverified	0