The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

661,570 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3651–3700 of 661570 papers

Title	Date	Tasks	Status	Hype
Impact of architecture on robustness and interpretability of multispectral deep neural networks	Sep 21, 2023	Deep Learning	CodeCode Available	3
Are Language Models Actually Useful for Time Series Forecasting?	Jun 22, 2024	Time SeriesTime Series Forecasting	CodeCode Available	3
PDEBENCH: An Extensive Benchmark for Scientific Machine Learning	Oct 13, 2022		CodeCode Available	3
Activating More Pixels in Image Super-Resolution Transformer	May 9, 2022	Image Super-ResolutionSuper-Resolution	CodeCode Available	3
The First Competition on Resource-Limited Infrared Small Target Detection Challenge: Methods and Results	Aug 18, 2024		CodeCode Available	3
ELIZA Reanimated: The world's first chatbot restored on the world's first time sharing system	Jan 12, 2025	Chatbot	CodeCode Available	3
The Manga Whisperer: Automatically Generating Transcriptions for Comics	Jan 18, 2024		CodeCode Available	3
Lighting Every Darkness with 3DGS: Fast Training and Real-Time Rendering for HDR View Synthesis	Jun 10, 2024	2k3DGS	CodeCode Available	3
Speedy-Splat: Fast 3D Gaussian Splatting with Sparse Pixels and Sparse Primitives	Nov 30, 2024	3D Scene ReconstructionNeRF	CodeCode Available	3
Dispelling the Mirage of Progress in Offline MARL through Standardised Baselines and Evaluation	Jun 13, 2024	Multi-agent Reinforcement Learning	CodeCode Available	3
Deep Neural Networks for Rank-Consistent Ordinal Regression Based On Conditional Probabilities	Nov 17, 2021	regression	CodeCode Available	3
Channel Permutations for N:M Sparsity	Dec 1, 2021		CodeCode Available	3
PP-MSVSR: Multi-Stage Video Super-Resolution	Dec 6, 2021	Image Super-ResolutionSuper-Resolution	CodeCode Available	3
QOC: Quantum On-Chip Training with Parameter Shift and Gradient Pruning	Feb 26, 2022	image-classificationImage Classification	CodeCode Available	3
Pastiche Master: Exemplar-Based High-Resolution Portrait Style Transfer	Mar 24, 2022	Style TransferTransfer Learning	CodeCode Available	3
Min-Max Similarity: A Contrastive Semi-Supervised Deep Learning Network for Surgical Tools Segmentation	Mar 29, 2022	Contrastive LearningSegmentation	CodeCode Available	3
WikiChat: Stopping the Hallucination of Large Language Model Chatbots by Few-Shot Grounding on Wikipedia	May 23, 2023	ChatbotHallucination	CodeCode Available	3
Deep Learning for Trajectory Data Management and Mining: A Survey and Beyond	Mar 21, 2024	Anomaly DetectionDeep Learning	CodeCode Available	3
DeepCAVE: An Interactive Analysis Tool for Automated Machine Learning	Jun 7, 2022	AutoMLBIG-bench Machine Learning	CodeCode Available	3
Plotly-Resampler: Effective Visual Analytics for Large Time Series	Jun 17, 2022	Data VisualizationTime Series	CodeCode Available	3
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-Making	Apr 22, 2024	Decision MakingMedical Diagnosis	CodeCode Available	3
The Common Core Ontologies	Apr 27, 2024		CodeCode Available	3
PARTNR: A Benchmark for Planning and Reasoning in Embodied Multi-agent Tasks	Oct 31, 2024		CodeCode Available	3
PANGAEA: A Global and Inclusive Benchmark for Geospatial Foundation Models	Dec 5, 2024	Earth Observation	CodeCode Available	3
RLVER: Reinforcement Learning with Verifiable Emotion Rewards for Empathetic Agents	Jul 3, 2025	Emotional Intelligencereinforcement-learning	CodeCode Available	3
SEED-Bench: Benchmarking Multimodal Large Language Models	Jan 1, 2024	BenchmarkingImage Generation	CodeCode Available	3
Step-DPO: Step-wise Preference Optimization for Long-chain Reasoning of LLMs	Jun 26, 2024	Arithmetic ReasoningGSM8K	CodeCode Available	3
Reasoning with Language Model Prompting: A Survey	Dec 19, 2022	Arithmetic ReasoningCommon Sense Reasoning	CodeCode Available	3
How to build the best medical image segmentation algorithm using foundation models: a comprehensive empirical study with Segment Anything Model	Apr 15, 2024	DecoderImage Segmentation	CodeCode Available	3
Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning	Feb 26, 2024	GPUMinecraft	CodeCode Available	3
ThoughtSource: A central hub for large language model reasoning data	Jan 27, 2023	Language ModelingLanguage Modelling	CodeCode Available	3
VoxFormer: Sparse Voxel Transformer for Camera-based 3D Semantic Scene Completion	Feb 23, 2023	3D geometry3D Semantic Scene Completion	CodeCode Available	3
Foundation Models for Music: A Survey	Aug 26, 2024	In-Context LearningRepresentation Learning	CodeCode Available	3
Bridging Evolutionary Algorithms and Reinforcement Learning: A Comprehensive Survey on Hybrid Algorithms	Jan 22, 2024	Evolutionary Algorithmsreinforcement-learning	CodeCode Available	3
GaussianTalker: Real-Time High-Fidelity Talking Head Synthesis with Audio-Driven 3D Gaussian Splatting	Apr 24, 2024	3DGSAttribute	CodeCode Available	3
Visual Causal Scene Refinement for Video Question Answering	May 7, 2023	Contrastive LearningQuestion Answering	CodeCode Available	3
TimeMachine: A Time Series is Worth 4 Mambas for Long-term Forecasting	Mar 14, 2024	Computational EfficiencyMamba	CodeCode Available	3
Caravan MultiMet: Extending Caravan with Multiple Weather Nowcasts and Forecasts	Nov 14, 2024	Benchmarking	CodeCode Available	3
Monkey: Image Resolution and Text Label Are Important Things for Large Multi-modal Models	Nov 11, 2023	Image CaptioningMMR total	CodeCode Available	3
Conceptual Framework for Autonomous Cognitive Entities	Oct 3, 2023		CodeCode Available	3
NoMaD: Goal Masked Diffusion Policies for Navigation and Exploration	Oct 11, 2023		CodeCode Available	3
One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization	Jun 29, 2023	3D ReconstructionImage to 3D	CodeCode Available	3
Sequential Modeling Enables Scalable Learning for Large Vision Models	Dec 1, 2023	Diversity	CodeCode Available	3
UniGS: Unified Representation for Image Generation and Segmentation	Dec 4, 2023	Image GenerationSegmentation	CodeCode Available	3
Physical Symbolic Optimization	Dec 6, 2023	regressionreinforcement-learning	CodeCode Available	3
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library	Dec 25, 2023	CPUDeep Reinforcement Learning	CodeCode Available	3
Universal Time-Series Representation Learning: A Survey	Jan 8, 2024	Feature EngineeringRepresentation Learning	CodeCode Available	3
Small LLMs Are Weak Tool Learners: A Multi-LLM Agent	Jan 14, 2024	Language ModellingLarge Language Model	CodeCode Available	3
Large Language Models are Superpositions of All Characters: Attaining Arbitrary Role-play via Self-Alignment	Jan 23, 2024	AllInstruction Following	CodeCode Available	3
Marabou 2.0: A Versatile Formal Analyzer of Neural Networks	Jan 25, 2024		CodeCode Available	3