The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

659,983 papers248,104 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 3101–3150 of 659983 papers

Title	Date	Tasks	Status	Hype
CAX: Cellular Automata Accelerated in JAX	Oct 3, 2024	ARCArtificial Life	CodeCode Available	3
Diffusion Models are Evolutionary Algorithms	Oct 3, 2024	DenoisingEvolutionary Algorithms	CodeCode Available	3
How to Train Long-Context Language Models (Effectively)	Oct 3, 2024		CodeCode Available	3
Agent Security Bench (ASB): Formalizing and Benchmarking Attacks and Defenses in LLM-based Agents	Oct 3, 2024	Autonomous DrivingBackdoor Attack	CodeCode Available	3
FAN: Fourier Analysis Networks	Oct 3, 2024	Language ModelingLanguage Modelling	CodeCode Available	3
Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding	Oct 2, 2024	Image GenerationText to Image Generation	CodeCode Available	3
OmniGenBench: Automating Large-scale in-silico Benchmarking for Genomic Foundation Models	Oct 2, 2024	Benchmarking	CodeCode Available	3
ImageFolder: Autoregressive Image Generation with Folded Tokens	Oct 2, 2024	Image GenerationImage Reconstruction	CodeCode Available	3
Deep Learning Alternatives of the Kolmogorov Superposition Theorem	Oct 2, 2024	Deep LearningKolmogorov-Arnold Networks	CodeCode Available	3
MMFNet: Multi-Scale Frequency Masking Neural Network for Multivariate Time Series Forecasting	Oct 2, 2024	Multivariate Time Series ForecastingMultivariate Time Series Forecastingm	CodeCode Available	3
SonicSim: A customizable simulation platform for speech processing in moving sound source scenarios	Oct 2, 2024	Speech EnhancementSpeech Separation	CodeCode Available	3
SegEarth-OV: Towards Training-Free Open-Vocabulary Segmentation for Remote Sensing Images	Oct 2, 2024	Open Vocabulary Semantic SegmentationOpen-Vocabulary Semantic Segmentation	CodeCode Available	3
MVGS: Multi-view-regulated Gaussian Splatting for Novel View Synthesis	Oct 2, 2024	3DGSNeRF	CodeCode Available	3
MixLinear: Extreme Low Resource Multivariate Time Series Forecasting with 0.1K Parameters	Oct 2, 2024	Multivariate Time Series ForecastingTime Series	CodeCode Available	3
LayerKV: Optimizing Large Language Model Serving with Layer-wise KV Cache Management	Oct 1, 2024	GPULanguage Modeling	CodeCode Available	3
Simple and Fast Distillation of Diffusion Models	Sep 29, 2024	GPUImage Generation	CodeCode Available	3
PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation	Sep 27, 2024	Image to Video GenerationVideo Generation	CodeCode Available	3
Emu3: Next-Token Prediction is All You Need	Sep 27, 2024	All	CodeCode Available	3
DANA: Domain-Aware Neurosymbolic Agents for Consistency and Accuracy	Sep 27, 2024	Financial Analysis	CodeCode Available	3
CycleNet: Enhancing Time Series Forecasting through Modeling Periodic Patterns	Sep 27, 2024	Time SeriesTime Series Forecasting	CodeCode Available	3
Does End-to-End Autonomous Driving Really Need Perception Tasks?	Sep 26, 2024	Autonomous Driving	CodeCode Available	3
The Elephant in the Room: Towards A Reliable Time-Series Anomaly Detection Benchmark	Sep 26, 2024	Anomaly DetectionBenchmarking	CodeCode Available	3
Harmful Fine-tuning Attacks and Defenses for Large Language Models: A Survey	Sep 26, 2024	Safety Alignment	CodeCode Available	3
Generative Modeling of Molecular Dynamics Trajectories	Sep 26, 2024		CodeCode Available	3
Cascade Prompt Learning for Vision-Language Model Adaptation	Sep 26, 2024	General Knowledgeimage-classification	CodeCode Available	3
Programming Every Example: Lifting Pre-training Data Quality like Experts at Scale	Sep 25, 2024	Large Language Model	CodeCode Available	3
Text2CAD: Generating Sequential CAD Models from Beginner-to-Expert Level Text Prompts	Sep 25, 2024	CAD ReconstructionText to 3D	CodeCode Available	3
Degradation-Guided One-Step Image Super-Resolution with Diffusion Priors	Sep 25, 2024	Image Super-ResolutionSuper-Resolution	CodeCode Available	3
Results of the Big ANN: NeurIPS'23 competition	Sep 25, 2024	Diversity	CodeCode Available	3
TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control	Sep 24, 2024	ClusteringLanguage Modelling	CodeCode Available	3
Language-based Audio Moment Retrieval	Sep 24, 2024	audio moment retrievalMoment Retrieval	CodeCode Available	3
WeSep: A Scalable and Flexible Toolkit Towards Generalizable Target Speaker Extraction	Sep 24, 2024	Managementspeech-recognition	CodeCode Available	3
ESPnet-Codec: Comprehensive Training and Evaluation of Neural Codecs for Audio, Music, and Speech	Sep 24, 2024	Audio Generation	CodeCode Available	3
MCTrack: A Unified 3D Multi-Object Tracking Framework for Autonomous Driving	Sep 23, 2024	3D Multi-Object TrackingAutonomous Driving	CodeCode Available	3
Addressing Emotion Bias in Music Emotion Recognition and Generation with Frechet Audio Distance	Sep 23, 2024	Emotion RecognitionFAD	CodeCode Available	3
PixWizard: Versatile Image-to-Image Visual Assistant with Open-Language Instructions	Sep 23, 2024	Image GenerationImage Restoration	CodeCode Available	3
ReMEmbR: Building and Reasoning Over Long-Horizon Spatio-Temporal Memory for Robot Navigation	Sep 20, 2024	DescriptiveQuestion Answering	CodeCode Available	3
Data Augmentation for Sequential Recommendation: A Survey	Sep 20, 2024	Data AugmentationRecommendation Systems	CodeCode Available	3
Colorful Diffuse Intrinsic Image Decomposition in the Wild	Sep 20, 2024	Color ConstancyIntrinsic Image Decomposition	CodeCode Available	3
GTSinger: A Global Multi-Technique Singing Corpus with Realistic Music Scores for All Singing Tasks	Sep 20, 2024	AllSinging Voice Synthesis	CodeCode Available	3
Fact, Fetch, and Reason: A Unified Evaluation of Retrieval-Augmented Generation	Sep 19, 2024	RAGRetrieval	CodeCode Available	3
DrivingForward: Feed-forward 3D Gaussian Splatting for Driving Scene Reconstruction from Flexible Surround-view Input	Sep 19, 2024		CodeCode Available	3
Oryx MLLM: On-Demand Spatial-Temporal Understanding at Arbitrary Resolution	Sep 19, 2024	document understandingVideo Question Answering	CodeCode Available	3
3DGS-LM: Faster Gaussian-Splatting Optimization with Levenberg-Marquardt	Sep 19, 2024	3DGSGPU	CodeCode Available	3
WiLoR: End-to-end 3D Hand Localization and Reconstruction in-the-wild	Sep 18, 2024	3D Hand Pose EstimationHand Detection	CodeCode Available	3
SOAP: Improving and Stabilizing Shampoo using Adam	Sep 17, 2024	Computational Efficiency	CodeCode Available	3
CORE-Bench: Fostering the Credibility of Published Research Through a Computational Reproducibility Agent Benchmark	Sep 17, 2024		CodeCode Available	3
Deep Graph Anomaly Detection: A Survey and New Perspectives	Sep 16, 2024	Anomaly DetectionGraph Anomaly Detection	CodeCode Available	3
Playground v3: Improving Text-to-Image Alignment with Deep-Fusion Large Language Models	Sep 16, 2024	DecoderDiversity	CodeCode Available	3
Towards Kinetic Manipulation of the Latent Space	Sep 15, 2024		CodeCode Available	3