The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 15501–15550 of 474278 papers

Title	Date	Tasks	Status	Hype
CLIPGaussian: Universal and Multimodal Style Transfer Based on Gaussian Splatting	May 28, 2025	Style Transfer	CodeCode Available	1
IMTS is Worth Time Channel Patches: Visual Masked Autoencoders for Irregular Multivariate Time Series Prediction	May 28, 2025	Missing ValuesSelf-Supervised Learning	CodeCode Available	1
Scalable Parameter and Memory Efficient Pretraining for LLM: Recent Algorithmic Advances and Benchmarking	May 28, 2025	Benchmarking	CodeCode Available	1
Do You See Me : A Multidimensional Benchmark for Evaluating Visual Perception in Multimodal LLMs	May 28, 2025		CodeCode Available	1
Large Language Models for Depression Recognition in Spoken Language Integrating Psychological Knowledge	May 28, 2025	Depression DetectionDiagnostic	CodeCode Available	1
GoMatching++: Parameter- and Data-Efficient Arbitrary-Shaped Video Text Spotting and Benchmarking	May 28, 2025	BenchmarkingText Spotting	CodeCode Available	1
Update Your Transformer to the Latest Release: Re-Basin of Task Vectors	May 28, 2025	Re-basin	CodeCode Available	1
FALCON: An ML Framework for Fully Automated Layout-Constrained Analog Circuit Design	May 28, 2025	Graph Neural Network	CodeCode Available	1
SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem	May 28, 2025	Benchmarking	CodeCode Available	1
Pre-Training Curriculum for Multi-Token Prediction in Language Models	May 28, 2025	Prediction	CodeCode Available	1
ChatVLA-2: Vision-Language-Action Model with Open-World Embodied Reasoning from Pretrained Knowledge	May 28, 2025	Imitation LearningMath	CodeCode Available	1
Self-orthogonalizing attractor neural networks emerging from the free energy principle	May 28, 2025		CodeCode Available	1
ChatCFD: an End-to-End CFD Agent with Domain-specific Structured Thinking	May 28, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Training Language Models to Generate Quality Code with Program Analysis Feedback	May 28, 2025	Code Generation	CodeCode Available	1
GeoLLaVA-8K: Scaling Remote-Sensing Multimodal Large Language Models to 8K Resolution	May 27, 2025	8kAvg	CodeCode Available	1
See through the Dark: Learning Illumination-affined Representations for Nighttime Occupancy Prediction	May 27, 2025	Image EnhancementLow-Light Image Enhancement	CodeCode Available	1
DeSocial: Blockchain-based Decentralized Social Networks	May 27, 2025	Model SelectionPrediction	CodeCode Available	1
R1-Code-Interpreter: Training LLMs to Reason with Code via Supervised and Reinforcement Learning	May 27, 2025	Code GenerationReinforcement Learning (RL)	CodeCode Available	1
MedSentry: Understanding and Mitigating Safety Risks in Medical LLM Multi-Agent Systems	May 27, 2025		CodeCode Available	1
Empowering Vector Graphics with Consistently Arbitrary Viewing and View-dependent Visibility	May 27, 2025	3DGSScheduling	CodeCode Available	1
REAL-Prover: Retrieval Augmented Lean Prover for Mathematical Reasoning	May 27, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
RefAV: Towards Planning-Centric Scenario Mining	May 27, 2025	Autonomous VehiclesMotion Planning	CodeCode Available	1
Breaking the Ceiling: Exploring the Potential of Jailbreak Attacks through Expanding Strategy Space	May 27, 2025	Prompt Engineering	CodeCode Available	1
ConText-CIR: Learning from Concepts in Text for Composed Image Retrieval	May 27, 2025	Image RetrievalRetrieval	CodeCode Available	1
Explainability of Large Language Models using SMILE: Statistical Model-agnostic Interpretability with Local Explanations	May 27, 2025		CodeCode Available	1
MUSEG: Reinforcing Video Temporal Understanding via Timestamp-Aware Multi-Segment Grounding	May 27, 2025	Reinforcement Learning (RL)Video Understanding	CodeCode Available	1
CogniBench: A Legal-inspired Framework and Dataset for Assessing Cognitive Faithfulness of Large Language Models	May 27, 2025	HallucinationLanguage Modeling	CodeCode Available	1
LPOI: Listwise Preference Optimization for Vision Language Models	May 27, 2025	Object	CodeCode Available	1
AgriFM: A Multi-source Temporal Remote Sensing Foundation Model for Crop Mapping	May 27, 2025		CodeCode Available	1
Inverse Virtual Try-On: Generating Multi-Category Product-Style Images from Clothed Individuals	May 27, 2025	Virtual Try-OffVirtual Try-on	CodeCode Available	1
Taylor expansion-based Kolmogorov-Arnold network for blind image quality assessment	May 27, 2025	Blind Image Quality AssessmentComputational Efficiency	CodeCode Available	1
Minute-Long Videos with Dual Parallelisms	May 27, 2025	DenoisingGPU	CodeCode Available	1
Bencher: Simple and Reproducible Benchmarking for Black-Box Optimization	May 27, 2025	Benchmarking	CodeCode Available	1
FinTagging: An LLM-ready Benchmark for Extracting and Structuring Financial Information	May 27, 2025	Concept AlignmentMulti-class Classification	CodeCode Available	1
Dual-Polarization Stacked Intelligent Metasurfaces for Holographic MIMO	May 27, 2025		CodeCode Available	1
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone Navigation	May 27, 2025	BenchmarkingDecision Making	CodeCode Available	1
Scaling External Knowledge Input Beyond Context Windows of LLMs via Multi-Agent Collaboration	May 27, 2025	Multi-hop Question AnsweringQuestion Answering	CodeCode Available	1
AutoReproduce: Automatic AI Experiment Reproduction with Paper Lineage	May 27, 2025		CodeCode Available	1
Cross from Left to Right Brain: Adaptive Text Dreamer for Vision-and-Language Navigation	May 27, 2025	Large Language ModelLogical Reasoning	CodeCode Available	1
DiMoSR: Feature Modulation via Multi-Branch Dilated Convolutions for Efficient Image Super-Resolution	May 27, 2025	Computational EfficiencyImage Super-Resolution	CodeCode Available	1
RoBiS: Robust Binary Segmentation for High-Resolution Industrial Images	May 27, 2025	Anomaly DetectionBinarization	CodeCode Available	1
FastFace: Tuning Identity Preservation in Distilled Diffusion via Guidance and Attention	May 27, 2025		CodeCode Available	1
Pretraining Language Models to Ponder in Continuous Space	May 27, 2025	Language ModelingLanguage Modelling	CodeCode Available	1
Music Source Restoration	May 27, 2025	Music Source Separation	CodeCode Available	1
FlowCut: Rethinking Redundancy via Information Flow for Efficient Vision-Language Models	May 26, 2025	Token Reduction	CodeCode Available	1
OB3D: A New Dataset for Benchmarking Omnidirectional 3D Reconstruction Using Blender	May 26, 2025	3DGS3D Reconstruction	CodeCode Available	1
Efficient Multi-modal Long Context Learning for Training-free Adaptation	May 26, 2025		CodeCode Available	1
Lifelong Safety Alignment for Language Models	May 26, 2025	Safety Alignment	CodeCode Available	1
REARANK: Reasoning Re-ranking Agent via Reinforcement Learning	May 26, 2025	Data AugmentationInformation Retrieval	CodeCode Available	1
Win Fast or Lose Slow: Balancing Speed and Accuracy in Latency-Sensitive Decisions of LLMs	May 26, 2025	Code GenerationRecommendation Systems	CodeCode Available	1