The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 7151–7200 of 661570 papers

Title	Date	Tasks	Status	Hype
Adaptive Probabilistic ODE Solvers Without Adaptive Memory Requirements	Oct 14, 2024	State EstimationTime Series	CodeCode Available	2
Free Video-LLM: Prompt-guided Visual Perception for Efficient Training-free Video LLMs	Oct 14, 2024	Computational EfficiencyQuestion Answering	CodeCode Available	2
TextCtrl: Diffusion-based Scene Text Editing with Prior Guidance Control	Oct 14, 2024	DisentanglementImage Generation	CodeCode Available	2
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes	Oct 14, 2024	Motion GenerationMotion Synthesis	CodeCode Available	2
Tex4D: Zero-shot 4D Scene Texturing with Video Diffusion Models	Oct 14, 2024	3D geometryDenoising	CodeCode Available	2
Your Mixture-of-Experts LLM Is Secretly an Embedding Model For Free	Oct 14, 2024	Mixture-of-Experts	CodeCode Available	2
Efficiently Democratizing Medical LLMs for 50 Languages via a Mixture of Language Family Experts	Oct 14, 2024	Mixture-of-Experts	CodeCode Available	2
A Scalable Communication Protocol for Networks of Large Language Models	Oct 14, 2024		CodeCode Available	2
Learning to Optimize for Mixed-Integer Non-linear Programming with Feasibility Guarantees	Oct 14, 2024		CodeCode Available	2
Queryable Prototype Multiple Instance Learning with Vision-Language Models for Incremental Whole Slide Image Classification	Oct 14, 2024	Classificationimage-classification	CodeCode Available	2
TRESTLE: A Model of Concept Formation in Structured Domains	Oct 14, 2024	Attribute	CodeCode Available	2
Text4Seg: Reimagining Image Segmentation as Text Generation	Oct 13, 2024	Image SegmentationReferring Expression	CodeCode Available	2
Large Scale Longitudinal Experiments: Estimation and Inference	Oct 13, 2024	Computational Efficiency	CodeCode Available	2
Bayesian Enhancement Models for One-to-Many Mapping in Image Enhancement	Oct 13, 2024	Image EnhancementLow-Light Image Enhancement	CodeCode Available	2
LLM-Based Multi-Agent Systems are Scalable Graph Generative Models	Oct 13, 2024	BenchmarkingGraph Generation	CodeCode Available	2
Training-Free Adaptive Diffusion with Bounded Difference Approximation Strategy	Oct 13, 2024	DenoisingPrediction	CodeCode Available	2
Learning Pattern-Specific Experts for Time Series Forecasting Under Patch-level Distribution Shift	Oct 13, 2024	Time SeriesTime Series Forecasting	CodeCode Available	2
LibEER: A Comprehensive Benchmark and Algorithm Library for EEG-based Emotion Recognition	Oct 13, 2024	EEGEmotion Recognition	CodeCode Available	2
SimBa: Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning	Oct 13, 2024	Computational EfficiencyDeep Reinforcement Learning	CodeCode Available	2
Reconstructive Visual Instruction Tuning	Oct 12, 2024	Denoising	CodeCode Available	2
ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras	Oct 12, 2024	motion predictionPose Tracking	CodeCode Available	2
Toward General Instruction-Following Alignment for Retrieval-Augmented Generation	Oct 12, 2024	Instruction FollowingRAG	CodeCode Available	2
Many Heads Are Better Than One: Improved Scientific Idea Generation by A LLM-Based Multi-Agent System	Oct 12, 2024	Experimental Designscientific discovery	CodeCode Available	2
Look Gauss, No Pose: Novel View Synthesis using Gaussian Splatting without Accurate Pose Initialization	Oct 11, 2024	Camera Pose EstimationNovel View Synthesis	CodeCode Available	2
StructRAG: Boosting Knowledge Intensive Reasoning of LLMs via Inference-time Hybrid Information Structurization	Oct 11, 2024	RAGRetrieval-augmented Generation	CodeCode Available	2
pyhgf: A neural network library for predictive coding	Oct 11, 2024	Causal DiscoveryMeta-Learning	CodeCode Available	2
On the State of NLP Approaches to Modeling Depression in Social Media: A Post-COVID-19 Outlook	Oct 11, 2024	EthicsFairness	CodeCode Available	2
Enhancing Multi-Step Reasoning Abilities of Language Models through Direct Q-Function Optimization	Oct 11, 2024	GSM8KLanguage Modeling	CodeCode Available	2
JAILJUDGE: A Comprehensive Jailbreak Judge Benchmark with Multi-Agent Enhanced Explanation Evaluation Framework	Oct 11, 2024		CodeCode Available	2
Retriever-and-Memory: Towards Adaptive Note-Enhanced Retrieval-Augmented Generation	Oct 11, 2024	Open-Domain Question AnsweringQuestion Answering	CodeCode Available	2
radarODE-MTL: A Multi-Task Learning Framework with Eccentric Gradient Alignment for Robust Radar-Based ECG Reconstruction	Oct 11, 2024	Multi-Task Learning	CodeCode Available	2
DelTA: An Online Document-Level Translation Agent Based on Multi-Level Memory	Oct 10, 2024	Document TranslationMachine Translation	CodeCode Available	2
Deconstructing equivariant representations in molecular systems	Oct 10, 2024	Property Prediction	CodeCode Available	2
IncEventGS: Pose-Free Gaussian Splatting from a Single Event Camera	Oct 10, 2024	Motion EstimationNeRF	CodeCode Available	2
From Exploration to Mastery: Enabling LLMs to Master Tools via Self-Driven Interactions	Oct 10, 2024	Diversity	CodeCode Available	2
Poison-splat: Computation Cost Attack on 3D Gaussian Splatting	Oct 10, 2024	3DGS	CodeCode Available	2
VibeCheck: Discover and Quantify Qualitative Differences in Large Language Models	Oct 10, 2024	Math	CodeCode Available	2
MotionGS: Exploring Explicit Motion Guidance for Deformable 3D Gaussian Splatting	Oct 10, 2024	3D ReconstructionDynamic Reconstruction	CodeCode Available	2
Heating Up Quasi-Monte Carlo Graph Random Features: A Diffusion Kernel Perspective	Oct 10, 2024		CodeCode Available	2
PointOBB-v2: Towards Simpler, Faster, and Stronger Single Point Supervised Oriented Object Detection	Oct 10, 2024	object-detectionObject Detection	CodeCode Available	2
Progressive Autoregressive Video Diffusion Models	Oct 10, 2024	DenoisingVideo Denoising	CodeCode Available	2
TurboRAG: Accelerating Retrieval-Augmented Generation with Precomputed KV Caches for Chunked Text	Oct 10, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
MathCoder2: Better Math Reasoning from Continued Pretraining on Model-translated Mathematical Code	Oct 10, 2024	MathMathematical Reasoning	CodeCode Available	2
OneRef: Unified One-tower Expression Grounding and Segmentation with Mask Referring Modeling	Oct 10, 2024	Language ModelingLanguage Modelling	CodeCode Available	2
Efficiently Learning at Test-Time: Active Fine-Tuning of LLMs	Oct 10, 2024	Active LearningLanguage Modeling	CodeCode Available	2
Reversible Decoupling Network for Single Image Reflection Removal	Oct 10, 2024	Reflection Removal	CodeCode Available	2
Doob's Lagrangian: A Sample-Efficient Variational Approach to Transition Path Sampling	Oct 10, 2024	Protein Folding	CodeCode Available	2
Benchmarking Agentic Workflow Generation	Oct 10, 2024	Benchmarking	CodeCode Available	2
COMPL-AI Framework: A Technical Interpretation and LLM Benchmarking Suite for the EU Artificial Intelligence Act	Oct 10, 2024	BenchmarkingFairness	CodeCode Available	2
VoxelPrompt: A Vision-Language Agent for Grounded Medical Image Analysis	Oct 10, 2024	Medical Image AnalysisQuestion Answering	CodeCode Available	2