The Open Verification Layer for ML Research

Community benchmark tracking and reproducibility verification. Built for researchers and autonomous research agents.

474,278 papers248,326 code links4,818 tasks

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 8301–8325 of 474278 papers

Title	Date	Tasks	Status	Hype
PUP 3D-GS: Principled Uncertainty Pruning for 3D Gaussian Splatting	Jun 14, 2024	NeRFNovel View Synthesis	CodeCode Available	2
SuperSVG: Superpixel-based Scalable Vector Graphics Synthesis	Jun 14, 2024	SuperpixelsVector Graphics	CodeCode Available	2
ControlVAR: Exploring Controllable Visual Autoregressive Modeling	Jun 14, 2024	Image Generation	CodeCode Available	2
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs	Jun 14, 2024	Memorization	CodeCode Available	2
DurLAR: A High-fidelity 128-channel LiDAR Dataset with Panoramic Ambient and Reflectivity Imagery for Multi-modal Autonomous Driving Applications	Jun 14, 2024	Autonomous DrivingDepth Estimation	CodeCode Available	2
Consistency-diversity-realism Pareto fronts of conditional image generative models	Jun 14, 2024	Diversity	CodeCode Available	2
ChartMimic: Evaluating LMM's Cross-Modal Reasoning Capability via Chart-to-Code Generation	Jun 14, 2024	Code Generation	CodeCode Available	2
SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages	Jun 14, 2024	Diversity	CodeCode Available	2
Sim-to-Real Transfer via 3D Feature Fields for Vision-and-Language Navigation	Jun 14, 2024	NavigateVision and Language Navigation	CodeCode Available	2
QQQ: Quality Quattuor-Bit Quantization for Large Language Models	Jun 14, 2024	Quantization	CodeCode Available	2
RaNeuS: Ray-adaptive Neural Surface Reconstruction	Jun 14, 2024	NeRFNovel View Synthesis	CodeCode Available	2
BEACON: Benchmark for Comprehensive RNA Tasks and Language Models	Jun 14, 2024	Language Modelling	CodeCode Available	2
PianoMotion10M: Dataset and Benchmark for Hand Motion Generation in Piano Performance	Jun 13, 2024	Motion GenerationPosition	CodeCode Available	2
JailbreakEval: An Integrated Toolkit for Evaluating Jailbreak Attempts Against Large Language Models	Jun 13, 2024		CodeCode Available	2
Are We There Yet? A Brief Survey of Music Emotion Prediction Datasets, Models and Outstanding Challenges	Jun 13, 2024	Emotion RecognitionMusic Emotion Recognition	CodeCode Available	2
Yo'LLaVA: Your Personalized Language and Vision Assistant	Jun 13, 2024	Image CaptioningQuestion Answering	CodeCode Available	2
An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records	Jun 13, 2024	Adversarial RobustnessExplainable Artificial Intelligence (XAI)	CodeCode Available	2
Fredformer: Frequency Debiased Transformer for Time Series Forecasting	Jun 13, 2024	Time SeriesTime Series Forecasting	CodeCode Available	2
BEVSpread: Spread Voxel Pooling for Bird's-Eye-View Representation in Vision-based Roadside 3D Object Detection	Jun 13, 2024	3D Object DetectionAutonomous Driving	CodeCode Available	2
DSL-FIQA: Assessing Facial Image Quality via Dual-Set Degradation Learning and Landmark-Guided Transformer	Jun 13, 2024	Face Image QualityFace Image Quality Assessment	CodeCode Available	2
Understanding Hallucinations in Diffusion Models through Mode Interpolation	Jun 13, 2024	HallucinationImage Generation	CodeCode Available	2
An Initial Investigation of Language Adaptation for TTS Systems under Low-resource Scenarios	Jun 13, 2024	Language IdentificationSelf-Supervised Learning	CodeCode Available	2
Delta-CoMe: Training-Free Delta-Compression with Mixed-Precision for Large Language Models	Jun 13, 2024	MathQuantization	CodeCode Available	2
Classic GNNs are Strong Baselines: Reassessing GNNs for Node Classification	Jun 13, 2024	Node ClassificationNode Property Prediction	CodeCode Available	2
Navigating the Shadows: Unveiling Effective Disturbances for Modern AI Content Detectors	Jun 13, 2024	Data AugmentationText Detection	CodeCode Available	2